Why are logs for EC2 defaulting to the small root partition?
In the continuing theme of "Using the EC2 scripts for long-running clusters".
With the default set up, all the hadoop logs are going to /var/log/hadoop, which is part of the root partition, which is quite small (rather than /mnt, which is plenty big). As a result, with my cluster, I fill up that partition within about a week or two, bringing the system to a halt.
So:
1- Should /var/log/hadoop be on /mnt (aka /dev/sda2)?
2- If so, can the scripts/images be configured that way by default?
3- Also, what's the best way for me to adjust this?
Also, just to make sure I understand -- the default config is for those job files to be kept forever, correct? I can adjust that in conf/log4j.properties, but by default they are never removed?
Thanks,
-Dan
With the default set up, all the hadoop logs are going to /var/log/hadoop, which is part of the root partition, which is quite small (rather than /mnt, which is plenty big). As a result, with my cluster, I fill up that partition within about a week or two, bringing the system to a halt.
So:
1- Should /var/log/hadoop be on /mnt (aka /dev/sda2)?
2- If so, can the scripts/images be configured that way by default?
3- Also, what's the best way for me to adjust this?
Also, just to make sure I understand -- the default config is for those job files to be kept forever, correct? I can adjust that in conf/log4j.properties, but by default they are never removed?
Thanks,
-Dan
3
people have this question
I have this question, too!
Tell me when someone answers.
The more people who ask this question, the more it gets noticed.
The more people who ask this question, the more it gets noticed.
-
Inappropriate?Hi Dan,
Yes, this is wrong, and should be fixed as you suggest. We're working on fixing it in the scripts. In the meantime, you can move your existing logs to somewhere on /mnt and then change /var/log/hadoop to be a symbolic link.
You are right that the log files are never removed by default.
Cheers,
Tom
2 people say
this answers the question
-
Should I / do I need to also do the moving/relinking on all the slaves machines? I would guess yes, but am hoping no ;-)
Loading Profile...



EMPLOYEE