
A fair transcoding jobs list #1192

Closed
Jorropo opened this issue Oct 4, 2018 · 15 comments · Fixed by #3637

Comments

@Jorropo
Contributor

Jorropo commented Oct 4, 2018

Currently the transcoding job list is a basic FIFO queue. That works, but it causes problems.
On my instance one user uploaded over 2 days' worth of transcoding work. That is long, and now when another user uploads a video they have to wait 2 days before it gets transcoded.
So why don't I use the per-day upload limit? Because I run a small instance and most of the time it isn't transcoding anything. A per-day limit also doesn't provide enough control: if the instance is doing nothing, why block users?

So something that could work is the following (assume jobs arrive in this order):
job 1 for user A
job 2 for user A
job 3 for user A
job 4 for user B
job 5 for user B
job 6 for user C
job 7 for user A

First job 1, because it is first in the queue.
Then job 4, because user A already had a job executed.
Then job 6, because users A and B already had a job executed.
Then job 2, because we reached the end of the list, so we return to the start.
Then job 5, because user A already had a job executed.
Then job 3, because we reached the end of the list, so we return to the start.
Then job 7, because we reached the end of the list, so we return to the start.

Here is a Python 3 implementation; assume we have a function transcode that transcodes a video.
(This is just to illustrate the idea.)

listOfAlreadyTreatedUser = []
listOfJobs = [
    {"payloads": "some payload", "user": "A"},
    {"payloads": "some payload", "user": "A"},
    {"payloads": "some payload", "user": "B"},
    {"payloads": "some payload", "user": "B"},
    {"payloads": "some payload", "user": "C"},
]

def whatToTranscode():
    global listOfAlreadyTreatedUser
    # Pick the oldest job from a user that has not been served this round.
    for i, job in enumerate(listOfJobs):
        if job["user"] not in listOfAlreadyTreatedUser:
            listOfAlreadyTreatedUser.append(job["user"])
            del listOfJobs[i]
            return job
    # Every queued user was served this round: start a new round
    # with the oldest remaining job.
    listOfAlreadyTreatedUser = []
    job = listOfJobs.pop(0)
    listOfAlreadyTreatedUser.append(job["user"])
    return job

while len(listOfJobs) > 0:
    transcode(whatToTranscode())
@ghost

ghost commented Oct 6, 2018

I'd love to have other options for transcode queueing too. As I understand it, part of the technical challenge in implementing this right now is that the job queueing is handled by a generic library rather than logic that's been written specifically for video transcoding.

@Chocobozzz
Owner

Proposal:

  • Every time you create a transcoding job for a specific user, check how many videos they uploaded in the last 24 hours
  • Create the transcoding job with a priority of Math.max(100 - (10 * uploadedInTheLast24Hours), 0)
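For illustration, the proposed formula could be sketched like this in Python (the actual queue, Bull, is a Node.js library; this only shows how the priority value scales, and the function name is made up for this sketch):

```python
def transcoding_priority(uploaded_in_the_last_24_hours: int) -> int:
    """Priority value per the proposal:
    Math.max(100 - (10 * uploadedInTheLast24Hours), 0).
    The value shrinks by 10 per video uploaded in the last 24 h,
    floored at 0."""
    return max(100 - 10 * uploaded_in_the_last_24_hours, 0)
```

So a user with no recent uploads gets 100, and anyone with 10 or more recent uploads bottoms out at 0.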

@ghost

ghost commented Oct 8, 2018

@Chocobozzz, I'm not sure what "priority" would concretely mean in your suggestion, or what the reason would be to hardcode 10 videos per day.

As an aside, anything using "number of videos" as a metric will be quite bad. Duration would be better, or accumulated transcode CPU time would be best. Please remember that videos can be arbitrarily complex and/or long.

@rigelk
Collaborator

rigelk commented Oct 8, 2018

@scanlime I guess the "10 videos per day" was just to illustrate. More importantly, transcode CPU time is hard to guess too (even though that's exactly what would be required to run the algorithm in a fair way), as shown with #799.

Duration as a metric would be a good middle ground.

@ghost

ghost commented Oct 8, 2018 via email

@Chocobozzz
Owner

> I'm not sure what "priority" would concretely mean in your suggestion,

https://github.com/OptimalBits/bull/blob/master/REFERENCE.md#queueadd

> what the reason would be to hardcode 10 videos per day.

It's just an example... the duration or file sizes could be interesting too 👍

@Jorropo
Contributor Author

Jorropo commented Oct 10, 2018

I think priority could be good (and needs less work), but you can't use videos uploaded in the last day; that is not precise enough.
Maybe we can estimate transcoding time by transcoding 10 seconds of the video and multiplying by the total video duration in seconds divided by 10.
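The estimate described above is a simple linear extrapolation: time a short sample, then scale to the full duration. A minimal sketch (the function name is hypothetical, and as noted elsewhere in this thread, real transcode cost varies with content complexity, so this is only a rough guess):

```python
def estimate_transcode_seconds(sample_len: float,
                               sample_cost: float,
                               total_len: float) -> float:
    """If `sample_len` seconds of video took `sample_cost` seconds
    to transcode, scale linearly to the full `total_len` duration."""
    return sample_cost * (total_len / sample_len)
```

For example, if a 10 s sample took 25 s to transcode, a 600 s video would be estimated at 1500 s.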

@ghost

ghost commented Oct 10, 2018 via email

@vincib

vincib commented Oct 30, 2018

We could also have a script that could be launched to run either a particular job, any job for a specific video, or any job at all, and so "spread the load of transcoding"?
(maybe from anywhere, if we use NFS or a SAN to access the files? ;) )

@rigelk
Collaborator

rigelk commented Oct 30, 2018

@vincib the problem is that a "job" doesn't just do transcoding. It also means modifying entries in the database to change the hash, and potentially sending updates or chaining actions in response to the video transcoding.

@vincib

vincib commented Oct 30, 2018

sure, the remote job execution process could access the PostgreSQL database and the storage filesystem to do everything it needs to do. (just ensure the job is properly locked and can be relaunched in case of a crash...)
for bigger PeerTube instances, that could be very useful (transcoding is heavy CPU-wise...)

@kyrahabattoir

Round robin transcoding queue would certainly be nice.
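A per-user round-robin queue, a close cousin of the "already treated users" list sketched earlier in this thread, could look like this minimal sketch (illustrative names, not PeerTube code):

```python
from collections import OrderedDict, deque

class RoundRobinQueue:
    """One FIFO per user; pop cycles through users in arrival order."""

    def __init__(self):
        self._queues = OrderedDict()  # user -> deque of that user's jobs

    def push(self, user, job):
        self._queues.setdefault(user, deque()).append(job)

    def pop(self):
        if not self._queues:
            raise IndexError("pop from empty queue")
        # Take the oldest job of the user at the front of the rotation.
        user, jobs = next(iter(self._queues.items()))
        job = jobs.popleft()
        # Move the user to the back of the rotation (drop them if idle).
        del self._queues[user]
        if jobs:
            self._queues[user] = jobs
        return job
```

Pushing the seven jobs from the example at the top of this issue and popping repeatedly yields 1, 4, 6, 2, 5, 3, 7 — the same order as the fair-list walk-through.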

@emansom
Contributor

emansom commented Aug 5, 2022

Is a manual override on this job list possible? e.g. if I want to push a video to the front of the queue?

My usecase would be a video that has a fixed release schedule on social media.

Currently my instance is backpressured by some 200+ transcode jobs, most of them 3+ hour videos that each get four resolutions (360, 480, 720 and 1080), from importing a whole YouTube channel.

While that's going on, the YouTube channel in question is facing misused DMCA claims (patent trolls) on its videos and has to rely on PeerTube for sharing its latest video to subscribers, which is now backpressured by about three weeks of transcode jobs. Not ideal.

@Chocobozzz
Owner

@emansom you may be interested in #4771 and #4968

@vid-bin

vid-bin commented Sep 3, 2022

Posting under my new account now. Here's hoping 4.3.0 fixes #4968, because I'm in the same boat as @emansom.

I think this could be solved by having the new-resolution-hls jobs have a higher priority or having a separate job queue for them entirely.
