Simplify monitor tracking API #42816

dcramer · 2023-01-05T16:52:53Z

Problem Statement

Currently triggering a monitor check-in requires two calls:

-> POST .../monitors/{monitor_id}/checkins/

-> PUT .../monitors/{monitor_id}/checkins/{checkin_in}/ {"status": "ok"}

This creates a minor hurdle as it requires you to track the checkin_id returned in the first step, to publish the second step (to mark a job as finished, for example).

Solution Brainstorm

Instead we could provide a simplified API along the lines of:

-> PUT .../monitors/{monitor_id}/checkins/latest/

This request would modify the most recent unfinished check-in, meaning a few behaviors could be implemented:

With no checkins, this returns a 404 (same as invalid ID).
With no unfinished checkin, this returns a 404 (same as invalid ID).
With one unfinished checkin, this would modify that checkin.
With multiple unfinished checkins, this would modify the most recent checkin. Meaning likely the previous checkin would time out and fail.

Refs GH-42283

modernben · 2023-01-05T16:57:13Z

Yes, this would be great. We could also let the user guide the process based on the request body similar to how it is functioning now.

i.e.

PUT /monitors/{monitor_id}/checkins {"status": "in_progress"}
PUT /monitors/{monitor_id}/checkins {"status": "completed"}

If the body is empty, then you can run the determineIntent() logic

dcramer · 2023-01-05T16:57:26Z

From a documentation angle, we'd need to be clear that if you're using a single monitor to track jobs that might overlap or run in parallel you should not use this API.

dcramer · 2023-01-05T17:01:13Z

One other option is to support "latest" for the checkin identifier, which would be a bit less of an overload of the main endpoint.

Start a checkin:

POST /monitors/{monitor_id}/checkins/

Complete a checkin:

PUT /monitors/{monitor_id}/checkins/latest/

modernben · 2023-01-05T17:03:22Z

Yeah I could be on board with that. Then at least there's some intent from the user's perspective.

dcramer · 2023-01-05T17:06:18Z

One last thought, leaning towards the latest magic identifier, as you can already one-line register a result:

run-my-command.sh && curl -x POST .../monitors/my-cool-identifier/checkins/ -d '{"status": "finished"}'

We should just document that case (and caveat its not ideal)

abkfenris · 2023-01-05T17:12:02Z

At least the few other tools that I've used and explored have started from the 'ping when completed' endpoint, and then layered additional functionality on top of that.

By going that route, it might be easier to convince folks to migrate and then they can progressively take advantage of additional features.

modernben · 2023-01-05T17:19:43Z

At least the few other tools that I've used and explored have started from the 'ping when completed' endpoint, and then layered additional functionality on top of that.

By going that route, it might be easier to convince folks to migrate and then they can progressively take advantage of additional features.

I like the addition of the /start, /stop, and /fail endpoints from them. Then I don't even have to fuss with a request body.
POST .../monitors/my-cool-identifier/checkins/start
POST .../monitors/my-cool-identifier/checkins/finish

modernben · 2023-01-05T17:24:13Z

One last thought, leaning towards the latest magic identifier, as you can already one-line register a result:

run-my-command.sh && curl -x POST .../monitors/my-cool-identifier/checkins/ -d '{"status": "finished"}'

We should just document that case (and caveat its not ideal)

@dcramer Would your one-liner need to start a checkin as well?

curl -x POST .../monitors/my-cool-identifier/checkins/ && run-my-command.sh && curl -x POST .../monitors/my-cool-identifier/checkins/ -d '{"status": "finished"}'

dcramer · 2023-01-05T17:27:03Z

At least the few other tools that I've used and explored have started from the 'ping when completed' endpoint, and then layered additional functionality on top of that.
By going that route, it might be easier to convince folks to migrate and then they can progressively take advantage of additional features.

I like the addition of the /start, /stop, and /fail endpoints from them. Then I don't even have to fuss with a request body. POST .../monitors/my-cool-identifier/checkins/start POST .../monitors/my-cool-identifier/checkins/finish

Somethign we could easily add as we've done elsewhere to proxy over. e.g. have start just forward a payload to the other endpoint, asme with finish. I'll start w/ the basic PUT endpoint and we can decide from there.

One last thought, leaning towards the latest magic identifier, as you can already one-line register a result:
run-my-command.sh && curl -x POST .../monitors/my-cool-identifier/checkins/ -d '{"status": "finished"}'
We should just document that case (and caveat its not ideal)

@dcramer Would your one-liner need to start a checkin as well?

curl -x POST .../monitors/my-cool-identifier/checkins/ && run-my-command.sh && curl -x POST .../monitors/my-cool-identifier/checkins/ -d '{"status": "finished"}'

Yeah it'd create the checkin and immediately mark it as finished. I'm not sure we have test cases covering desired outcome here so its possible it would not cascade all metadata appropriately but we can verify that.

dcramer · 2023-01-05T17:31:12Z

Actually we already have branch coverage for creating a passing/failing checkin:

    def test_passing(self):
        self.login_as(self.user)

        for path_func in self._get_path_functions():
            monitor = self._create_monitor()
            path = path_func(monitor)

            resp = self.client.post(path, {"status": "ok"})
            assert resp.status_code == 201, resp.content

            checkin = MonitorCheckIn.objects.get(guid=resp.data["id"])
            assert checkin.status == CheckInStatus.OK

            monitor = Monitor.objects.get(id=monitor.id)
            assert monitor.status == MonitorStatus.OK
            assert monitor.last_checkin == checkin.date_added
            assert monitor.next_checkin == monitor.get_next_scheduled_checkin(checkin.date_added)

mitsuhiko · 2023-01-05T18:15:20Z

From a documentation angle, we'd need to be clear that if you're using a single monitor to track jobs that might overlap or run in parallel you should not use this API.

A simple improvement to that could be to allow a client to send along a hostname and use that as disambiguation key of concurrent monitors but I'm not even sure if that currently makes a lot of sense. If you have a cron job that is supposed to run hourly on all machines (eg: temp dir clean up etc.) that's not easily trackable in Sentry anyways unless you make a monitor for each of those.

But if there was, having a host name as key would solve a very common case, and it would be easy enough to supply as a user.

Add the ability to target `latest` as the `checkin_id` parameter, which will pull the most recent (creation date) check-in that is not yet marked as finished. Fixes GH-42816

modernben · 2023-01-06T00:08:10Z

Love the addition of /latest. I went ahead and created a Laravel Package for it.

https://packagist.org/packages/modernmcguire/sentry-cron-monitoring-laravel

dcramer mentioned this issue Jan 5, 2023

feat: Support latest identifier on checkins #42819

Merged

dcramer added a commit that referenced this issue Jan 5, 2023

feat: Support latest identifier on checkins

e7858e0

Add the ability to target `latest` as the `checkin_id` parameter, which will pull the most recent (creation date) check-in that is not yet marked as finished. Fixes GH-42816

dcramer closed this as completed in #42819 Jan 5, 2023

github-actions bot locked and limited conversation to collaborators Jan 21, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Simplify monitor tracking API #42816

Simplify monitor tracking API #42816

dcramer commented Jan 5, 2023 •

edited

Loading

modernben commented Jan 5, 2023

dcramer commented Jan 5, 2023

dcramer commented Jan 5, 2023 •

edited

Loading

modernben commented Jan 5, 2023

dcramer commented Jan 5, 2023

abkfenris commented Jan 5, 2023

modernben commented Jan 5, 2023

modernben commented Jan 5, 2023

dcramer commented Jan 5, 2023

dcramer commented Jan 5, 2023

mitsuhiko commented Jan 5, 2023

modernben commented Jan 6, 2023

Simplify monitor tracking API #42816

Simplify monitor tracking API #42816

Comments

dcramer commented Jan 5, 2023 • edited Loading

Problem Statement

Solution Brainstorm

modernben commented Jan 5, 2023

dcramer commented Jan 5, 2023

dcramer commented Jan 5, 2023 • edited Loading

modernben commented Jan 5, 2023

dcramer commented Jan 5, 2023

abkfenris commented Jan 5, 2023

modernben commented Jan 5, 2023

modernben commented Jan 5, 2023

dcramer commented Jan 5, 2023

dcramer commented Jan 5, 2023

mitsuhiko commented Jan 5, 2023

modernben commented Jan 6, 2023

dcramer commented Jan 5, 2023 •

edited

Loading

dcramer commented Jan 5, 2023 •

edited

Loading