Get your own customer support community
 

Answering Cloudera configurator questions with NFS directories

Using cloudera configurator. It is not clear when I have to using NFS directories, versus local directories . It is a pain to copy the configuration file into all of my 32 data nodes. Simpler
option is to store the configuration file and executable on a NFS.

It is not clear which of the following are NFS, local, hdfs directories
dfs.data.dir
dfs.client.buffer.dir
dfs.name.dir
hadoop.tmp.dir
mapred.local.dir

for example if I assigne /tmp to hadoop.tmp.dir , the /tmp gets filled up especially when doing crawlers. All my local directories are small .
If I assign /hdfs directory to hadoop.tmp.dir it is wrong
if I assign a NFS directory (available in large amounts), there is a sbtle problem When the jobs
run on different nodes, they end up deleting and overwriting each others temporary files.

It will be nice if someone could draw a table 5 rows by 3 columns with OK/notOk in each

LOCAL NFS HDFS
dfs.client.buffer.dir
dfs.name.dir
hadoop.tmp.dir OK NOTOK NOTOK
mapred.local.dir
Inappropriate?
1 person has this question

User_default_medium