Unable to find files in hadoop streaming -


i have similar issue hadoop streaming - unable find file error . none of solutions presented there working.

my command line is:

hadoop jar /mnt/shared/hadoop-streaming-1.0.3.jar -input /user/cloudera/mz_paf/batch_sk=1234 \ -output /user/cloudera/mz_paf/out \ -file /mnt/shared/java/paf-rules.properties -file /mnt/shared/java/pafvalid.py  \ -mapper "pafvalid.py paf-rules.properties 10" 

this results in

java.io.ioexception: cannot run program "pafvalid.py": java.io.ioexception: error=2, no such file or directory     @ java.lang.processbuilder.start(processbuilder.java:460)     @ org.apache.hadoop.streaming.pipemapred.configure(pipemapred.java:214)     @ org.apache.hadoop.streaming.pipemapper.configure(pipemapper.java:66) 

i have tried several other approaches mentioned in other sof issue including using hdfs locations input files: still same error of file not found. have tried having files in local directory avoid path issues. still no dice.

i have not passed in argument directly command line using hadoop streaming. if you, try put arguments inside python script first.

then change command line here

-mapper "pafvalid.py paf-rules.properties 10" 

to

-mapper /mnt/shared/java/pafvalid.py 

correct me if figure out


Comments

Popular posts from this blog

java - Run a .jar on Heroku -

java - Jtable duplicate Rows -

validation - How to pass paramaters like unix into windows batch file -