
Run background task in master process? #3401

Closed · pe224 opened this issue Nov 20, 2018 · 6 comments

pe224 commented Nov 20, 2018

I deploy aiohttp behind gunicorn using multiple workers with the --preload option.
Currently, I can run a background task in the master process by giving it its own thread:

import threading
import time

from aiohttp import web

async def handle(request):
    return web.Response(text='OK')

def my_job():
    # stand-in for the real background work
    while True:
        print('Still here')
        time.sleep(3)

app = web.Application()
app.router.add_get('/', handle)

# module level: with --preload this runs exactly once, in the gunicorn
# master, before any workers are forked
threading.Thread(target=my_job, daemon=True).start()
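(For context, this module would be started with something along the lines of gunicorn mymodule:app --workers 4 --preload --worker-class aiohttp.GunicornWebWorker, where the module name mymodule is assumed; because of --preload the module, and therefore the thread, is imported once in the master before the workers are forked.)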

It feels a bit dirty to use threads when I should be able to just hook into the event loop of the master process.
However, using app.on_startup does not work, as the task is then duplicated in every worker.

Is there a way to create a task only in the event loop of the master process, or to otherwise avoid the duplication across workers?
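For reference, the documented aiohttp background-task pattern looks roughly like this (helper names are illustrative). Under gunicorn, every worker executes on_startup, so every worker ends up with its own copy of my_job, which is exactly the duplication described above:

import asyncio

from aiohttp import web

async def my_job(app):
    while True:
        print('Still here')
        await asyncio.sleep(3)

async def start_job(app):
    # runs once per process, i.e. once in every gunicorn worker
    app['job'] = asyncio.ensure_future(my_job(app))

async def stop_job(app):
    app['job'].cancel()
    try:
        await app['job']
    except asyncio.CancelledError:
        pass

app = web.Application()
app.on_startup.append(start_job)
app.on_cleanup.append(stop_job)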

aio-libs-bot commented:

GitMate.io thinks the contributor most likely able to help you is @asvetlov.

Possibly related issues are #1964 (The documentation on "Background tasks" is wrong), #1104 (Rewrite doc section about background tasks), #1921 (Run tasks (fetch urls) inside separate thread), #2745 (Running out of memory), and #1092 (Allow to register application background tasks within an event loop).

asvetlov (Member) commented Nov 20, 2018

With --preload you are relying on a hack: the thread is started in the main gunicorn process, and the worker processes that actually serve the aiohttp application are forked afterwards. on_startup, in turn, is called from the worker code.

The main question is what memory you expect my_job to have: memory shared with the master process, memory belonging to one of the forked workers, or something independent (a new process)?

I suspect you want to do something more complex than printing a line infinitely.
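To make the distinction concrete, here is a toy, POSIX-only illustration (not part of the original discussion): after gunicorn forks, the master's writes land in copy-on-write pages that the workers never observe.

import os
import time

state = {'value': 'before fork'}

pid = os.fork()
if pid == 0:
    # child ("worker"): sleeps past the parent's write, yet still
    # sees the pre-fork value because its memory is a private copy
    time.sleep(1)
    print('worker sees:', state['value'])  # prints 'before fork'
else:
    # parent ("master"): mutates its own copy only
    state['value'] = 'after fork'
    os.waitpid(pid, 0)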

pe224 (Author) commented Nov 20, 2018

Right, sorry for the missing info.
Ideally I would like to write to memory shared with all workers, with the workers having read-only access.
In a pinch, it could be made almost as simple as printing a line by using the file system as temporary storage.
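A minimal sketch of that file-based fallback, keeping the thread-in-master setup from the issue (the path and payload shape are made up for illustration):

import json
import os
import tempfile
import threading
import time

from aiohttp import web

STATE_PATH = '/tmp/app_state.json'  # assumed location

def my_job():
    while True:
        state = {'updated_at': time.time()}  # whatever the job computes
        # write to a temp file, then rename atomically, so a worker
        # never reads a half-written file
        fd, tmp = tempfile.mkstemp(dir=os.path.dirname(STATE_PATH))
        with os.fdopen(fd, 'w') as f:
            json.dump(state, f)
        os.replace(tmp, STATE_PATH)
        time.sleep(3)

async def handle(request):
    with open(STATE_PATH) as f:  # workers only ever read
        return web.Response(text=f.read(), content_type='application/json')

app = web.Application()
app.router.add_get('/', handle)
threading.Thread(target=my_job, daemon=True).start()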

asvetlov (Member) commented:

If you don't care about scaling and performance, you don't need gunicorn at all.
If you do, you need to think about deploying on multiple nodes anyway.
Running dedicated worker processes connected through a message broker (Redis pub/sub, RabbitMQ, Kafka, etc.) sounds like the proper solution.
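A rough sketch of that layout using redis-py's asyncio client (the library choice, channel name, and helper names are assumptions, not a recommendation of specifics):

# job_process.py: the background job as its own process, outside gunicorn
import asyncio

import redis.asyncio as redis  # assumes redis-py >= 4.2 and a local Redis

async def my_job():
    r = redis.Redis()
    while True:
        await r.publish('updates', 'Still here')  # fan out to all web workers
        await asyncio.sleep(3)

if __name__ == '__main__':
    asyncio.run(my_job())

# in the aiohttp app: every worker subscribes on startup and keeps its
# own read-only copy of the latest payload
async def listen(app):
    r = redis.Redis()
    pubsub = r.pubsub()
    await pubsub.subscribe('updates')
    async for msg in pubsub.listen():
        if msg['type'] == 'message':
            app['latest'] = msg['data']

async def start_listener(app):
    app['listener'] = asyncio.ensure_future(listen(app))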

pe224 (Author) commented Nov 23, 2018

Alright, thanks for the big-picture view.
I'll probably have to rethink the architecture, since what I had in mind might work but would be hacky.

lock bot commented Nov 23, 2019

This thread has been automatically locked since there has not been
any recent activity after it was closed. Please open a new issue for
related bugs.

If you feel there are important points made in this discussion,
please include those excerpts in the new issue.
