org.apache.pig.piggybank.evaluation.util
Class SearchQuery

java.lang.Object
  extended by org.apache.pig.EvalFunc<String>
      extended by org.apache.pig.piggybank.evaluation.util.SearchQuery

public class SearchQuery
extends EvalFunc<String>

This small UDF takes a search engine URL (Google/Yahoo/AOL/Live) containing the search query and extracts it. The URL is assumed to be encoded. The query is normalized, converting it to lower-case, removing punctuations, removing extra spaces.


Field Summary
 
Fields inherited from class org.apache.pig.EvalFunc
log, pigLogger, reporter, returnType
 
Constructor Summary
SearchQuery()
           
 
Method Summary
 String exec(Tuple tuple)
          This callback method must be implemented by all subclasses.
 List<FuncSpec> getArgToFuncMapping()
           
 Schema outputSchema(Schema input)
           
 
Methods inherited from class org.apache.pig.EvalFunc
finish, getLogger, getPigLogger, getReporter, getReturnType, getSchemaName, isAsynchronous, progress, setPigLogger, setReporter, warn
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

SearchQuery

public SearchQuery()
Method Detail

exec

public String exec(Tuple tuple)
            throws IOException
Description copied from class: EvalFunc
This callback method must be implemented by all subclasses. This is the method that will be invoked on every Tuple of a given dataset. Since the dataset may be divided up in a variety of ways the programmer should not make assumptions about state that is maintained between invocations of this method.

Specified by:
exec in class EvalFunc<String>
Parameters:
tuple - the Tuple to be processed.
Returns:
result, of type T.
Throws:
IOException

getArgToFuncMapping

public List<FuncSpec> getArgToFuncMapping()
                                   throws FrontendException
Overrides:
getArgToFuncMapping in class EvalFunc<String>
Returns:
A List containing FuncSpec objects representing the Function class which can handle the inputs corresponding to the schema in the objects
Throws:
FrontendException

outputSchema

public Schema outputSchema(Schema input)
Overrides:
outputSchema in class EvalFunc<String>
Parameters:
input - Schema of the input
Returns:
Schema of the output


Copyright © ${year} The Apache Software Foundation