Ways to parse NCSA combined based log files

Posted by Kyle on Server Fault See other posts from Server Fault or by Kyle
Published on 2010-12-21T18:47:42Z Indexed on 2010/12/21 18:55 UTC

I've done a bit of site: searching with Google on Server Fault, Super User and Stack Overflow. I also checked non-site-specific results and didn't really see a question like this, so here goes...

I did spot this question, related to grep and awk, which has some great knowledge, but I don't feel the text qualification challenge was addressed. This question also broadens the scope to any platform and any program.

I've got Squid or Apache logs based on the NCSA combined format. By "based" I mean that the first n columns in the file follow the NCSA combined standard, and there might be more columns with custom stuff after them.

Here is an example line from a squid combined log:

1.1.1.1 - - [11/Dec/2010:03:41:46 -0500] "GET http://yourdomain.com:8080/en/some-page.html HTTP/1.1" 200 2142 "-" "Mozilla/5.0 (Windows; U; Windows NT 6.1; C) AppleWebKit/532.4 (KHTML, like Gecko)" TCP_MEM_HIT:NONE

I'd like to be able to parse n logs and output specific columns for sorting, counting, finding unique values, etc.
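To make the goal concrete, here is a rough Python sketch of the kind of output meant here: counting the values of one column across several lines. It assumes the standard combined columns come first and treats anything after the user agent as a single "extra" column. The names COMBINED, SAMPLE and count_column are made up for illustration, and the second and third sample lines are invented variations of the example line above, not real log data.

```python
import re
from collections import Counter

# Standard NCSA combined columns; anything after the user agent is
# captured as one "extra" column (an assumption for custom fields).
COMBINED = re.compile(
    r'(?P<host>\S+) (?P<ident>\S+) (?P<user>\S+) '
    r'\[(?P<time>[^\]]+)\] '
    r'"(?P<request>[^"]*)" (?P<status>\d{3}) (?P<bytes>\S+) '
    r'"(?P<referer>[^"]*)" "(?P<agent>[^"]*)"'
    r'(?: (?P<extra>.*))?$'
)

# First line is the example from the question; the other two are invented.
SAMPLE = [
    '1.1.1.1 - - [11/Dec/2010:03:41:46 -0500] '
    '"GET http://yourdomain.com:8080/en/some-page.html HTTP/1.1" 200 2142 "-" '
    '"Mozilla/5.0 (Windows; U; Windows NT 6.1; C) AppleWebKit/532.4 '
    '(KHTML, like Gecko)" TCP_MEM_HIT:NONE',
    '2.2.2.2 - - [11/Dec/2010:03:41:47 -0500] '
    '"GET http://yourdomain.com:8080/en/other-page.html HTTP/1.1" 404 512 "-" '
    '"curl/7.21.0" TCP_MISS:DIRECT',
    '1.1.1.1 - - [11/Dec/2010:03:41:48 -0500] '
    '"GET http://yourdomain.com:8080/en/some-page.html HTTP/1.1" 200 2142 "-" '
    '"curl/7.21.0" TCP_MEM_HIT:NONE',
]


def count_column(lines, column):
    """Count how often each value of the named column occurs."""
    counts = Counter()
    for line in lines:
        match = COMBINED.match(line.strip())
        if match:  # lines that do not fit the pattern are skipped
            counts[match.group(column)] += 1
    return counts


print(count_column(SAMPLE, 'status').most_common())  # [('200', 2), ('404', 1)]
print(sorted(count_column(SAMPLE, 'extra')))         # unique values of the custom column
```

Since count_column accepts any iterable of lines, it could be fed an open file or the standard library's fileinput.input() to cover n logs in one pass.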

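The main challenge, which makes this a little tricky and is also why I feel this question hasn't yet been asked or answered, is the text qualification conundrum: some fields are wrapped in double quotes (the request, referer and user agent) or square brackets (the timestamp) and contain spaces themselves, so simply splitting on whitespace breaks them apart.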
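The text qualification problem can be seen with a small Python sketch that compares a plain whitespace split of the example line with a tokenizer that respects the qualifiers. This is only one possible approach. The names TOKEN and split_qualified are hypothetical, and the backslash-escape handling inside quotes is an assumption about how embedded quotes are written in the log.

```python
import re

# The example line from the question.
LINE = (
    '1.1.1.1 - - [11/Dec/2010:03:41:46 -0500] '
    '"GET http://yourdomain.com:8080/en/some-page.html HTTP/1.1" 200 2142 "-" '
    '"Mozilla/5.0 (Windows; U; Windows NT 6.1; C) AppleWebKit/532.4 '
    '(KHTML, like Gecko)" TCP_MEM_HIT:NONE'
)

# A token is a [bracketed] run, a "quoted" run (backslash escapes allowed),
# or otherwise a bare run of non-space characters.
TOKEN = re.compile(r'\[[^\]]*\]|"(?:[^"\\]|\\.)*"|\S+')


def split_qualified(line):
    """Split a log line into columns, keeping qualified fields whole."""
    fields = []
    for token in TOKEN.findall(line):
        # Drop the surrounding qualifiers so only the field value remains.
        if len(token) >= 2 and token[0] + token[-1] in ('[]', '""'):
            token = token[1:-1]
        fields.append(token)
    return fields


naive = LINE.split()
fields = split_qualified(LINE)

print(len(naive))   # 23: the timestamp, request and user agent are broken up
print(len(fields))  # 10: nine combined columns plus the custom squid column
print(fields[4])    # GET http://yourdomain.com:8080/en/some-page.html HTTP/1.1
```

Because the tokenizer does not care how many columns there are, any custom columns after the user agent simply show up as additional list entries.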

When I spotted asql in the grep/awk question, I was very excited, but then realised that it didn't support the combined format out of the box; that's something I'll look at extending, I guess.

Looking forward to answers, and learning new stuff! Answers don't have to be limited to a particular platform or program/language. For context, the platforms I use the most are Linux and OS X.

Cheers

Filed under: apache, logging, squid, parsing, ncsa
