Package | Description |
---|---|
org.apache.pig.builtin |
This package contains builtin Pig UDFs.
|
org.apache.pig.impl.io | |
org.apache.pig.impl.util | |
org.apache.pig.piggybank.storage | |
org.apache.pig.piggybank.storage.avro |
Modifier and Type | Class and Description |
---|---|
class |
BinStorage
Load and store data in a binary format.
|
class |
PigStorage
A load function that parses a line of input into fields using a character delimiter.
|
Modifier and Type | Class and Description |
---|---|
class |
InterStorage
LOAD FUNCTION FOR PIG INTERNAL USE ONLY!
This load function is used for storing intermediate data between MR jobs of
a pig query.
|
class |
SequenceFileInterStorage
Store tuples (BinSedesTuples, specifically) using sequence files to leverage
sequence file's compression features.
|
class |
TFileStorage
LOAD FUNCTION FOR PIG INTERNAL USE ONLY! This load function is used for
storing intermediate data between MR jobs of a pig query.
|
Modifier and Type | Method and Description |
---|---|
static FileInputLoadFunc |
Utils.getTmpFileStorageObject(org.apache.hadoop.conf.Configuration conf) |
Modifier and Type | Method and Description |
---|---|
static java.lang.Class<? extends FileInputLoadFunc> |
Utils.getTmpFileStorageClass(java.util.Properties properties) |
Modifier and Type | Class and Description |
---|---|
class |
AllLoader
The AllLoader provides the ability to point pig at a folder that contains
files in multiple formats e.g.
|
class |
CSVExcelStorage
CSV loading and storing with support for multi-line fields,
and escaping of delimiters and double quotes within fields;
uses CSV conventions of Excel 2007.
|
class |
CSVLoader
A load function based on PigStorage that implements part of the CSV "standard"
This loader properly supports double-quoted fields that contain commas and other
double-quotes escaped with backslashes.
|
class |
HiveColumnarLoader
Loader for Hive RC Columnar files.
Supports the following types: * Hive Type Pig Type from DataType string CHARARRAY int INTEGER bigint or long LONG float float double DOUBLE boolean BOOLEAN byte BYTE array TUPLE map MAP Partitions The input paths are scanned by the loader for [partition name]=[value] patterns in the subdirectories. If detected these partitions are appended to the table schema. For example if you have the directory structure: |
class |
HiveColumnarStorage |
class |
IndexedStorage
IndexedStorage is a form of PigStorage that supports a
per record seek. |
class |
PigStorageSchema
Deprecated.
Use PigStorage with a -schema option instead
|
class |
SequenceFileLoader
A Loader for Hadoop-Standard SequenceFiles.
|
Modifier and Type | Class and Description |
---|---|
class |
AvroStorage
AvroStorage is used to load/store Avro data
Document can be found here |
Copyright © 2007-2012 The Apache Software Foundation