[BUG] Profiler takes 60 minutes to process and generate output on integration tests eventlog #1382
Comments
I ran the profiler locally. This eventlog is relatively large compared to the others in our testing environments. I explored some G1GC tuning, but I don't think that is going to be optimal.
Then I experimented without G1GC. My conclusion is that the Xms setting could be a factor in the performance and could be used to work around this corner case.
This issue needs two parts:
Signed-off-by: Ahmed Hussein <ahussein@nvidia.com>

Fixes NVIDIA#1382

Upon investigation, it was revealed that the min heap size could impact the runtime significantly. This code change aims at setting the xms java argument to 50% of the max heap size.

- pass xms to the java cmd
- update the runtime report to list jvm info along with jvm arguments related to heap:
  - `runtime.jvm.*`
  - `runtime.jvm.arg*`
* user-tools should add xms argument to java cmd

  Signed-off-by: Ahmed Hussein <ahussein@nvidia.com>

  Fixes #1382

  Upon investigation, it was revealed that the min heap size could impact the runtime significantly. This code change aims at setting the xms java argument to 50% of the max heap size.

  - pass xms to the java cmd
  - update the runtime report to list jvm info along with jvm arguments related to heap:
    - `runtime.jvm.*`
    - `runtime.jvm.arg*`

* remove debugging code

  Signed-off-by: Ahmed Hussein <ahussein@nvidia.com>

---------

Signed-off-by: Ahmed Hussein <ahussein@nvidia.com>
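The heap sizing described in the commit message can be illustrated with a small sketch. This is not the actual user-tools code: the function names `build_java_heap_args` and `build_java_cmd`, the gigabyte granularity, and the jar and main-class values in the usage note are all assumptions made for illustration. Only the rule itself (set `-Xms` to 50% of the max heap and pass it to the java command) comes from the description above.

```python
from typing import List


def build_java_heap_args(max_heap_gb: int, xms_fraction: float = 0.5) -> List[str]:
    """Return -Xms/-Xmx arguments, with the min heap set to a fraction of the max.

    Hypothetical helper: the 50% default mirrors the commit description,
    but the name and the GB units are assumptions.
    """
    if max_heap_gb < 1:
        raise ValueError("max_heap_gb must be at least 1")
    if not 0 < xms_fraction <= 1:
        raise ValueError("xms_fraction must be in (0, 1]")
    # Round down, but never go below 1 GB for the initial heap.
    min_heap_gb = max(1, int(max_heap_gb * xms_fraction))
    return [f"-Xms{min_heap_gb}g", f"-Xmx{max_heap_gb}g"]


def build_java_cmd(jar_path: str, main_class: str, max_heap_gb: int,
                   tool_args: List[str]) -> List[str]:
    """Assemble a java command line with the heap arguments placed before -cp.

    Hypothetical helper: jar_path and main_class are supplied by the caller.
    """
    return ["java", *build_java_heap_args(max_heap_gb),
            "-cp", jar_path, main_class, *tool_args]
```

For example, `build_java_cmd("tools.jar", "example.ProfileMain", 24, ["eventlog"])` (placeholder jar and class names) yields a command that starts with `java -Xms12g -Xmx24g`. Starting the JVM with a larger initial heap avoids repeated heap growth while a large eventlog is being processed, which is the effect the investigation above points to.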
Describe the bug
The CI/CD integration tests can take up to 60 minutes to run the profiler on a single eventlog. The eventlog is generated from the spark-rapids integration tests. We should investigate why processing that eventlog takes so long.
See our CI/CD logs for how to get that file.