Sorting problems
I'm trying to sort the job outputs. I followed the directions, but I keep getting "sort: No match".
I created a directory in my local Hadoop folder called output.
I think I downloaded the outputs to the local Hadoop folder using the command => bin/hadoop dfs -copyToLocal output output
Then I ran the command "sort -m /output/output/* > outputMerged.txt" from inside my local Hadoop folder. This is where I get the "sort: No match" message.
If someone could help me out, I'd appreciate it. Thanks
-
I think that if you used bin/hadoop dfs -copyToLocal output output, all your "part-xxxxx" files would be in the output folder, not output/output. Check inside the output folder to see where those files actually are.
Also, even if it is output/output, I think the extra / in front makes sort -m look in /output/output/* (starting from the filesystem root) instead of /extra/ugrad_space/jdauz/hadoop.../output/output/*. Drop the leading / so the path is relative to your Hadoop folder.
And don't forget to move any directories, such as _logs, out of there before running sort.
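The checks above can be sketched in the shell roughly like this. It is only an illustration under stated assumptions: the scratch directory and the two sample part files are hypothetical stand-ins for whatever copyToLocal actually produced, and it is written for sh/bash. The idea is to list the folder first, then give sort a relative path that matches only the part files.

```shell
# Hypothetical stand-in for the copied job output: two pre-sorted part files
# plus a _logs directory, created in a scratch directory for illustration.
workdir=$(mktemp -d)
cd "$workdir"
mkdir -p output/_logs
printf 'apple\t1\ncherry\t3\n' > output/part-00000
printf 'banana\t2\ndate\t4\n' > output/part-00001

# Check where the part files actually ended up.
ls output

# Relative path (no leading /), matching only the part files,
# so directories such as _logs are never handed to sort.
sort -m output/part-* > outputMerged.txt
cat outputMerged.txt
```

Matching on part-* is an alternative to moving _logs out of the way first. If your files turn out to be in output/output, the same pattern applies with output/output/part-*.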
-
Thanks Mike, I figured it out.