[SPARK-8338] Use AMI preinstalled ganglia #133
base: branch-1.4
Couple of points
Sure. BTW, Git will keep the previous version for us.
Well, I don't like the latter: it would add more complexity, like the problem we already have with the version-specific Ganglia and httpd configs. While installing and uninstalling the ganglia and httpd packages, I noticed that rpm moves their configuration files. That would be too difficult to maintain in the future.
@shivaram
Well, it could happen. I just like the scripts to be a bit more robust and to install the versions of software required by the config files we maintain (at some level spark-ec2 is about configuring things to work). This is what we do with Spark and Hadoop, and I don't see why ganglia should be any different. This is partly why I favor the approach of uninstalling anything on the AMI and installing the things we want. BTW, there are other ways of solving this problem: instead of having a hard-coded config file that we copy over, we could have a diff file that we patch onto the config file. As far as I know, the only things we need to patch from the default version are the port number [1] and the permissions [2].
[1] https://github.com/mesos/spark-ec2/blob/branch-1.4/templates/etc/httpd/conf/httpd.conf#L136
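To make the "patch instead of copy" idea concrete, here is a minimal sketch in Python. It is not code from this PR: the helper name `set_directive`, the sample config text, and the port value `5080` are all assumptions chosen for illustration. It rewrites a single directive in whatever httpd.conf the installed package ships, rather than replacing the whole file.

```python
import re


def set_directive(conf_text, name, value):
    """Set the first uncommented `name ...` line to `name value`.

    Hypothetical helper for illustration. If the directive is absent,
    it is appended at the end of the text.
    """
    # Match only lines that start with the directive (commented lines start with '#').
    pattern = re.compile(r"^([ \t]*)" + re.escape(name) + r"\b.*$", re.MULTILINE)
    new_text, count = pattern.subn(
        lambda m: m.group(1) + name + " " + value, conf_text, count=1
    )
    if count == 0:
        if new_text and not new_text.endswith("\n"):
            new_text += "\n"
        new_text += name + " " + value + "\n"
    return new_text


# Sample text standing in for a stock httpd.conf (made up for this example).
stock_conf = "# Listen 12.34.56.78:80\nListen 80\nUser apache\n"
patched_conf = set_directive(stock_conf, "Listen", "5080")  # 5080 is a placeholder port
print(patched_conf)
```

Because only the named directive changes, the rest of the file stays whatever the installed httpd version expects, which is the property the diff-file suggestion is after.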
I created a Python script to check and install packages using the yum API (somewhat similar to Chef). https://github.com/smartkiwi/spark-ec2/blob/branch-1.4-no_ganglia_install_check/ganglia/check_packages.py Some package versions available in the AMI (e.g. ganglia 3.3) are not available in the Amazon rpm repository; only ganglia 3.6 is there. This is another reason why we should always use the ganglia packages preinstalled with the AMI. I suggest merging my changes as they are now and building a new AMI with the latest version of ganglia and its dependencies in the near future. PS. Spark and Hadoop are installed from repositories other than the Amazon rpm repository, and we do not support old versions of Spark if they are removed from those locations.
OK. I've finished the Python script that uses the yum API to check the installed httpd and ganglia versions and installs the correct version if the wrong one is installed.
Thanks @smartkiwi -- I'll try this out today. cc @nchammas, who might be a better reviewer for the Python code.
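For readers who have not opened the linked script, the check-then-install decision it describes can be sketched roughly as below. This is an illustration only, not the contents of check_packages.py: the function name `plan_package_actions` and the httpd version are hypothetical, and the real script queries yum, which this sketch replaces with plain dictionaries.

```python
def plan_package_actions(required, installed):
    """Decide what to do for each required package.

    required:  {package name: version we want}
    installed: {package name: version currently present}; missing key = not installed
    Returns a list of (action, name, wanted_version) tuples.
    """
    actions = []
    for name in sorted(required):
        want = required[name]
        have = installed.get(name)
        if have is None:
            actions.append(("install", name, want))
        elif have != want:
            # Wrong version present: remove it and install the wanted one.
            actions.append(("replace", name, want))
    return actions


# ganglia 3.3 and 3.6 are the versions mentioned in this thread; httpd 2.2 is a placeholder.
required = {"ganglia": "3.3", "httpd": "2.2"}
installed = {"ganglia": "3.6"}
plan = plan_package_actions(required, installed)
print(plan)
```

Keeping the decision separate from the yum calls makes the logic easy to test without touching a real package database.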
BTW
Haven't had time to test this yet, but a few questions for @smartkiwi:
@smartkiwi I tried this out today. My first run hit an error, but it seemed to work when I re-ran it. Thinking more about it, though, it is a hassle to keep relying on these old versions. So I tried an alternate method where we get rid of the httpd.conf and only use
@nchammas I would be interested in hearing which strategy sounds better to you as well.
The root cause was old ganglia/init.sh code left over from the time when ganglia was not preinstalled.