Package org.apache.pig.piggybank.storage

Class Summary
CSVLoader A load function based on PigStorage that implements part of the CSV "standard". This loader properly supports double-quoted fields that contain commas and other double-quotes escaped with backslashes.
DBStorage  
HadoopJobHistoryLoader  
HadoopJobHistoryLoader.HadoopJobHistoryInputFormat  
HadoopJobHistoryLoader.HadoopJobHistoryReader  
HadoopJobHistoryLoader.JobHistoryPathFilter  
HadoopJobHistoryLoader.MRJobInfo  
HiveColumnarLoader Loader for Hive RC Columnar files.
Supports the following types:
Hive Type         Pig Type (from DataType)
string            CHARARRAY
int               INTEGER
bigint or long    LONG
float             FLOAT
double            DOUBLE
boolean           BOOLEAN
byte              BYTE
array             TUPLE
map               MAP

Partitions
The input paths are scanned by the loader for [partition name]=[value] patterns in the subdirectories.
If detected, these partitions are appended to the table schema.
For example, if you have the directory structure:

JsonMetadata Reads and writes metadata using JSON in metafiles next to the data.
MultiStorage This UDF splits the output data into multiple directories and files dynamically, based on a user-specified key field in the output tuple.
MultiStorage.MultiStorageOutputFormat  
MultiStorage.MultiStorageOutputFormat.MyLineRecordWriter  
MyRegExLoader  
PigStorageSchema This Load/Store Func reads and writes metafiles that allow the schema and aliases to be determined at load time, so that schemas for Pig-generated datasets do not have to be entered manually.
RegExLoader RegExLoader is an abstract class used to parse logs based on a regular expression.
SequenceFileLoader A Loader for Hadoop-Standard SequenceFiles.
XMLLoader The load function to load the XML file. This implements the LoadFunc interface, which is used to parse records from a dataset.
XMLLoader.XMLFileInputFormat  
XMLLoader.XMLFileRecordReader  
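
The partition detection described for HiveColumnarLoader above (scanning input paths for [partition name]=[value] patterns) can be sketched in plain Java. This is a minimal illustration under stated assumptions, not the loader's actual implementation: the class name PartitionPathSketch, the method extractPartitions, the regular expression, and the sample path are all hypothetical and were chosen only for this example.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class PartitionPathSketch {

    // Matches a single path segment of the form name=value.
    // Hypothetical pattern, not taken from the HiveColumnarLoader source.
    private static final Pattern PARTITION_SEGMENT =
            Pattern.compile("([^/=]+)=([^/]*)");

    /**
     * Returns the partition names found in the path, mapped to their values,
     * in the order in which they appear in the path.
     */
    public static Map<String, String> extractPartitions(String path) {
        Map<String, String> partitions = new LinkedHashMap<>();
        for (String segment : path.split("/")) {
            Matcher m = PARTITION_SEGMENT.matcher(segment);
            // Only segments that are entirely name=value count as partitions.
            if (m.matches()) {
                partitions.put(m.group(1), m.group(2));
            }
        }
        return partitions;
    }

    public static void main(String[] args) {
        // Hypothetical input path, used only for illustration.
        String path = "/user/hive/warehouse/logs/year=2010/month=05/part-00000";
        System.out.println(extractPartitions(path)); // prints {year=2010, month=05}
    }
}
```

In this sketch each detected name would correspond to an extra column appended to the table schema, with the value taken from the directory name. How the real loader types and orders those columns is not covered here.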
 

Enum Summary
HadoopJobHistoryLoader.JobKeys Job Keys
 



Copyright © The Apache Software Foundation