|
submit(script,
output_data_name,
iterations=1,
supporting_file_list=None,
reduction_script=None,
arguments='
' ,
debug=False)
Submits script non-blocking job to a MapReduce cluster and collects
output in output_data_name. |
source code
|
|
|
make_frame(script,
arguments='
' ,
iterations=1,
debug=False)
Generates a basic python frame for running a batch job on a MapReduce
cluster. |
source code
|
|
|
build_generic_hadoop_call(mapper,
reducer,
input,
output,
supporting_file_list=None,
num_mappers=None,
num_reducers=None,
key_comparator=None)
Builds a call array suitable for subprocess.Popen which submits a
streaming job to the configured MapReduce instance. |
source code
|
|
|
execute(hadoop_call)
Nonblocking execution of the given call array |
source code
|
|
|
execute_and_wait(hadoop_call)
Blocking execution of the given call array |
source code
|
|
|
submit_checkpoint(script,
output_data_name,
file_list,
reduction_script=None,
arguments='
' ,
debug=False) |
source code
|
|