public class CombinedLogLoader
extends RegExLoader
CombinedLogLoader is used to load logs based on Apache's combined log format, based on a format like
LogFormat "%h %l %u %t \"%r\" %>s %b \"%{Referer}i\" \"%{User-Agent}i\"" combined
The log filename ends up being access_log from a line like
CustomLog logs/combined_log combined
Example:
raw = LOAD 'combined_log' USING org.apache.pig.piggybank.storage.apachelog.CombinedLogLoader AS
(remoteAddr, remoteLogname, user, time, method, uri, proto, status, bytes, referer, userAgent);