Package | Description |
---|---|
nl.basjes.pig.input.apachehttpdlog | |
org.apache.pig |
Public interfaces and classes for Pig.
|
org.apache.pig.builtin |
This package contains builtin Pig UDFs.
|
org.apache.pig.builtin.mock | |
org.apache.pig.impl.io | |
org.apache.pig.piggybank.storage | |
org.apache.pig.piggybank.storage.apachelog |
Modifier and Type | Class and Description |
---|---|
class |
nl.basjes.pig.input.apachehttpdlog.Loader |
Modifier and Type | Class and Description |
---|---|
class |
LoadFuncMetadataWrapper
Convenience class to extend when decorating a class that extends LoadFunc and
implements LoadMetadata.
|
Modifier and Type | Method and Description |
---|---|
protected void |
LoadFuncMetadataWrapper.setLoadFunc(LoadMetadata loadFunc)
The wrapped LoadMetadata object must be set before method calls are made on this object.
|
Modifier and Type | Class and Description |
---|---|
class |
AvroStorage
Pig UDF for reading and writing Avro data.
|
class |
BinStorage
Load and store data in a binary format.
|
class |
JsonLoader
A loader for data stored using
JsonStorage . |
class |
JsonMetadata
Reads and Writes metadata using JSON in metafiles next to the data.
|
class |
OrcStorage
A load function and store function for ORC file.
|
class |
ParquetLoader
Wrapper class which will delegate calls to parquet.pig.ParquetLoader
|
class |
PigStorage
A load function that parses a line of input into fields using a character delimiter.
|
class |
TrevniStorage
Pig Store/Load Function for Trevni.
|
Modifier and Type | Class and Description |
---|---|
class |
Storage
A convenient mock Storage for unit tests
|
Modifier and Type | Class and Description |
---|---|
class |
InterStorage
LOAD FUNCTION FOR PIG INTERNAL USE ONLY!
This load function is used for storing intermediate data between MR jobs of
a pig query.
|
class |
ReadToEndLoader
This is wrapper Loader which wraps a real LoadFunc underneath and allows
to read a file completely starting a given split (indicated by a split index
which is used to look in the List
|
class |
SequenceFileInterStorage
Store tuples (BinSedesTuples, specifically) using sequence files to leverage
sequence file's compression features.
|
class |
TFileStorage
LOAD FUNCTION FOR PIG INTERNAL USE ONLY! This load function is used for
storing intermediate data between MR jobs of a pig query.
|
Modifier and Type | Class and Description |
---|---|
class |
AllLoader
The AllLoader provides the ability to point pig at a folder that contains
files in multiple formats e.g.
|
class |
CSVExcelStorage
CSV loading and storing with support for multi-line fields,
and escaping of delimiters and double quotes within fields;
uses CSV conventions of Excel 2007.
|
class |
FixedWidthLoader
A fixed-width file loader.
|
class |
HiveColumnarLoader
Loader for Hive RC Columnar files.
Supports the following types: * Hive Type Pig Type from DataType string CHARARRAY int INTEGER bigint or long LONG float float double DOUBLE boolean BOOLEAN byte BYTE array TUPLE map MAP Partitions The input paths are scanned by the loader for [partition name]=[value] patterns in the subdirectories. If detected these partitions are appended to the table schema. For example if you have the directory structure: |
class |
HiveColumnarStorage |
class |
IndexedStorage
IndexedStorage is a form of PigStorage that supports a
per record seek. |
class |
PigStorageSchema
Deprecated.
Use PigStorage with a -schema option instead
|
Modifier and Type | Class and Description |
---|---|
class |
LogFormatLoader
This is a pig loader that can load Apache HTTPD access logs written in (almost) any
Apache HTTPD LogFormat.
Basic usage: Simply feed the loader your (custom) logformat specification and it will tell you which fields can be extracted from this logformat. For example: |
Copyright © 2007-2017 The Apache Software Foundation