Unable to find files in Hadoop streaming
I have an issue similar to "Hadoop streaming - unable to find file error". None of the solutions presented there work for me.
My command line is:
hadoop jar /mnt/shared/hadoop-streaming-1.0.3.jar \
    -input /user/cloudera/mz_paf/batch_sk=1234 \
    -output /user/cloudera/mz_paf/out \
    -file /mnt/shared/java/paf-rules.properties \
    -file /mnt/shared/java/pafvalid.py \
    -mapper "pafvalid.py paf-rules.properties 10"
This results in:
java.io.IOException: Cannot run program "pafvalid.py": java.io.IOException: error=2, No such file or directory
    at java.lang.ProcessBuilder.start(ProcessBuilder.java:460)
    at org.apache.hadoop.streaming.PipeMapRed.configure(PipeMapRed.java:214)
    at org.apache.hadoop.streaming.PipeMapper.configure(PipeMapper.java:66)
I have tried several other approaches mentioned in other Stack Overflow questions, including using HDFS locations for the input files, and I still get the same file-not-found error. I have also tried keeping the files in the local directory to avoid path issues. Still no dice.
I have not passed arguments directly on the command line with Hadoop streaming myself. If you need them, try putting the arguments inside the Python script first.
Then change the mapper option in your command line from
-mapper "pafvalid.py paf-rules.properties 10"
to
-mapper /mnt/shared/java/pafvalid.py
Let me know if you figure it out.
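As a rough illustration of "putting the arguments inside the script", here is a minimal sketch of what pafvalid.py could look like. Everything in it is an assumption: the original script is not shown, so the constant names RULES_FILE and LIMIT, the helpers load_rules and run_mapper, the key=value parsing, and the pass-through mapper logic are all hypothetical placeholders. The shebang line is there so the task can launch the script directly as a program; the file also needs to be executable.

```python
#!/usr/bin/env python
"""Hypothetical sketch of pafvalid.py with its arguments moved into the script."""
import sys

# Assumed defaults replacing the command-line arguments "paf-rules.properties 10".
# The rules file is shipped with -file, so it is opened by bare name here.
RULES_FILE = "paf-rules.properties"
# The post does not say what "10" means; it is kept as an opaque setting.
LIMIT = 10


def load_rules(path):
    """Parse simple key=value lines, skipping blank lines and # comments."""
    rules = {}
    with open(path) as handle:
        for line in handle:
            line = line.strip()
            if not line or line.startswith("#") or "=" not in line:
                continue
            key, value = line.split("=", 1)
            rules[key.strip()] = value.strip()
    return rules


def run_mapper(lines, rules, limit):
    """Placeholder mapper: passes each record through unchanged.

    The real validation logic would use `rules` and `limit` here.
    """
    for line in lines:
        yield line.rstrip("\n")


if __name__ == "__main__":
    loaded = load_rules(RULES_FILE)
    for record in run_mapper(sys.stdin, loaded, LIMIT):
        sys.stdout.write(record + "\n")
```

With the arguments inside the script, the -mapper option no longer needs a quoted string with parameters, which removes one possible source of the "Cannot run program" error.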