@InterfaceAudience.Public @InterfaceStability.Evolving public interface LoadMetadata
| Modifier and Type | Method and Description |
|---|---|
java.lang.String[] |
getPartitionKeys(java.lang.String location,
org.apache.hadoop.mapreduce.Job job)
Find what columns are partition keys for this input.
|
ResourceSchema |
getSchema(java.lang.String location,
org.apache.hadoop.mapreduce.Job job)
Get a schema for the data to be loaded.
|
ResourceStatistics |
getStatistics(java.lang.String location,
org.apache.hadoop.mapreduce.Job job)
Get statistics about the data to be loaded.
|
void |
setPartitionFilter(Expression partitionFilter)
Set the filter for partitioning.
|
ResourceSchema getSchema(java.lang.String location, org.apache.hadoop.mapreduce.Job job) throws java.io.IOException
location - Location as returned by
LoadFunc.relativeToAbsolutePath(String, org.apache.hadoop.fs.Path)job - The Job object - this should be used only to obtain
cluster properties through JobContextImpl.getConfiguration() and not to set/query
any runtime job information.java.io.IOException - if an exception occurs while determining the schemaResourceStatistics getStatistics(java.lang.String location, org.apache.hadoop.mapreduce.Job job) throws java.io.IOException
LoadFunc, then LoadFunc.setLocation(String, org.apache.hadoop.mapreduce.Job)
is guaranteed to be called before this method.location - Location as returned by
LoadFunc.relativeToAbsolutePath(String, org.apache.hadoop.fs.Path)job - The Job object - this should be used only to obtain
cluster properties through JobContextImpl.getConfiguration() and not to set/query
any runtime job information.java.io.IOException - if an exception occurs while retrieving statisticsjava.lang.String[] getPartitionKeys(java.lang.String location,
org.apache.hadoop.mapreduce.Job job)
throws java.io.IOException
location - Location as returned by
LoadFunc.relativeToAbsolutePath(String, org.apache.hadoop.fs.Path)job - The Job object - this should be used only to obtain
cluster properties through JobContextImpl.getConfiguration() and not to set/query
any runtime job information.java.io.IOException - if an exception occurs while retrieving partition keysvoid setPartitionFilter(Expression partitionFilter) throws java.io.IOException
getPartitionKeys(String, Job), then this method is not
called by Pig runtime. This method is also not called by the Pig runtime
if there are no partition filter conditions.partitionFilter - that describes filter for partitioningjava.io.IOException - if the filter is not compatible with the storage
mechanism or contains non-partition fields.Copyright © 2007-2025 The Apache Software Foundation