Class | Description |
---|---|
CombinedLogLoader |
CombinedLogLoader is used to load logs based on Apache's combined log format, based on a format like
LogFormat "%h %l %u %t \"%r\" %>s %b \"%{Referer}i\" \"%{User-Agent}i\"" combined
The log filename ends up being access_log from a line like
CustomLog logs/combined_log combined
Example:
raw = LOAD 'combined_log' USING org.apache.pig.piggybank.storage.apachelog.CombinedLogLoader AS
(remoteAddr, remoteLogname, user, time, method, uri, proto, status, bytes, referer, userAgent);
|
CommonLogLoader |
CommonLogLoader is used to load logs based on Apache's common log format, based on a format like
LogFormat "%h %l %u %t \"%r\" %>s %b" common
The log filename ends up being access_log from a line like
CustomLog logs/access_log common
Example:
raw = LOAD 'access_log' USING org.apache.pig.piggybank.storage.apachelog.CommongLogLoader AS (remoteAddr,
remoteLogname, user, time, method, uri, proto, bytes);
|
LogFormatLoader |
This is a pig loader that can load Apache HTTPD access logs written in (almost) any
Apache HTTPD LogFormat.
Basic usage: Simply feed the loader your (custom) logformat specification and it will tell you which fields can be extracted from this logformat. For example: |
Copyright © 2007-2017 The Apache Software Foundation