Record log URI in spark job runs #477

robhudson · 2017-05-18T19:33:56Z

Recording the log URI makes it easier to associate logs with a specific run.

robhudson · 2017-06-14T23:16:46Z

See #520 for details on how the EMR logs are different from the spark job logs being created in the batch script.

In the meeting today we decided it would be best to update the batch.sh file to create the log files with a more deterministic name that we can use on the Python side. One idea was the job name + cluster job ID, if possible.

@maurodoglio Would the above cause any problems that you know of? Is it possible to get the cluster job ID into the batch script?
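As a rough illustration of the "job name + cluster job ID" idea, here is a minimal Python sketch of how both sides could derive the same log file name. The function name `build_log_name`, the `.log` suffix, and the sanitizing rule are assumptions made for this example; nothing here reflects what batch.sh or the Python code actually does.

```python
import re


def build_log_name(job_name: str, jobflow_id: str, suffix: str = ".log") -> str:
    """Build a deterministic log file name from a job name and a jobflow ID.

    Hypothetical helper: the naming scheme is only one possible choice.
    """
    # Collapse anything other than letters, digits, '.', '_' and '-' into a
    # single dash so the result is safe to use in a file name or object key.
    safe_name = re.sub(r"[^A-Za-z0-9._-]+", "-", job_name).strip("-")
    return f"{safe_name}.{jobflow_id}{suffix}"


if __name__ == "__main__":
    print(build_log_name("my daily job", "j-ABC123"))
```

Because the name depends only on values that both the batch script and the Python side know, the Python side could reconstruct the log location without listing the bucket.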

maurodoglio · 2017-06-22T21:54:47Z

> Would the above cause any problems that you know of?

I don't think so.

> Is it possible to get the cluster job ID into the batch script?

I think so. You can probably use the AWS CLI and filter the list of running jobs by some attribute accessible from the machine (maybe the hostname?).

robhudson · 2017-06-29T19:02:28Z

Here's an example of pulling out the jobflow ID from the running cluster:
https://gist.github.com/robotblake/7b08526b7a411739cd4c344476dd0860

This could be inserted into the job flow steps before batch.sh, so that the jobflow_id can be passed to it.
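For readers who cannot open the gist, here is a hedged Python sketch of one way to read the jobflow ID on a cluster node. It assumes the node exposes its metadata in `/mnt/var/lib/info/job-flow.json` with a `jobFlowId` key; that path and the helper name `read_jobflow_id` are assumptions for this example, and the gist may take a different approach.

```python
import json
from pathlib import Path

# Assumed location of the job flow metadata on an EMR node.
DEFAULT_INFO_PATH = "/mnt/var/lib/info/job-flow.json"


def read_jobflow_id(info_path: str = DEFAULT_INFO_PATH) -> str:
    """Return the jobflow ID stored in the node's job flow metadata file.

    Hypothetical helper: raises FileNotFoundError if the file is missing
    and KeyError if the expected key is absent.
    """
    with Path(info_path).open() as f:
        info = json.load(f)
    return info["jobFlowId"]


if __name__ == "__main__":
    print(read_jobflow_id())
```

A step that runs before batch.sh could print this value and hand it to the script as an argument, which would keep the lookup out of the batch script itself.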

rafrombrc added this to the m3 milestone May 24, 2017

rafrombrc assigned robhudson May 24, 2017

rafrombrc added the ready label May 24, 2017

robhudson added a commit that referenced this issue May 24, 2017

Fix #477: Record LogUri with job history

5f31918

robhudson added review and removed ready labels May 24, 2017

robhudson mentioned this issue May 31, 2017

Backfill SparkJobRun.log_uri for old runs #520

Closed

rafrombrc added in progress and removed review labels Jun 14, 2017

rafrombrc modified the milestones: m4, m3 Jun 22, 2017

rafrombrc modified the milestones: m5, m4 Aug 23, 2017

rafrombrc added ready and removed in progress labels Sep 20, 2017

jezdez added the low hanging fruit label Sep 27, 2017
