Async glReadPixels() in HeadlessView? #477

mikemorris · 2014-10-03T20:58:57Z

Does this make sense or seem worthwhile @kkaefer?

https://www.opengl.org/discussion_boards/showthread.php/144238-asynchronous-readback-that-call-glReadPixels%28%29-with-PBO?highlight=multithread%2A

kkaefer · 2014-10-08T08:36:38Z

Yes, we should eventually switch to that since it's likely a lot faster. I've skipped implementing that since it was not immediately needed for tests back then.

adam-mapbox · 2015-12-10T00:52:58Z

https://github.com/mapbox/mapbox-gl-native/tree/adam_477_pbo_read_pixels

adam-mapbox · 2015-12-10T00:53:50Z

After talking with @mikemorris , it's not clear that doing readStillImage asynchronously is a win for us. The current context where this is used is already "asynchronous" in a sense; we don't have useful work to do while we wait for the image that we're getting on that thread.

tmpsantos · 2016-08-25T20:19:21Z

/sub

mikemorris · 2016-08-26T21:29:15Z

Tried reimplementing this over in https://github.com/mapbox/mapbox-gl-native/compare/pbo-read-pixels in hopes of alleviating the 1000ms+ GPU hang we've been seeing in glReadPixels calls in the Node.js bindings that permanently stalls all renders currently in progress, even on other processes.

This does successfully change glReadPixels to happen asynchronously, but the hang is still present - deferred to glClientWaitSync if a fence is inserted before glReadPixels, or at some other point inside HeadlessView::readStillImage if not (didn't have proper logging in place to detect precisely when).

Not sure if this is a hardware or GPU driver issue or if we're just stalling the pipeline somehow. We've been consistently reproducing this on a GRID K520 GPU running version 367.27 of the NVIDIA Linux drivers.

tiagovignatti · 2016-08-29T14:56:46Z

chiming in... are you also seeing the stalls on Intel hardware, @mikemorris? Do you have an easy way to trigger these stalls, so I can take a look myself?

tiagovignatti · 2016-08-30T15:16:16Z

I tried to track down what's going on but it will require a bit more work. If I switch on debug in Intel Mesa driver it captures the pipeline stall with the following output:
Flushing before mapping a referenced bo.
CPU mapping a busy miptree BO stalled and took 0.126 ms.

That's definitely a problem cause if we have to flush the batchbuffer early that has implications on performance. I'm not seeing other programs dumping anything, so I have to imply that it's a result of a bad stream that mbgl is sending to the hardware.

That said I don't think that changing to use PBO will help anything in this case. Somehow the GPU is stalled on a busy buffer object and another technique of downloading pixels won't help out.

mikemorris · 2016-08-30T17:53:21Z

@tiagovignatti Are you seeing the same ~1 second hang I'm describing above, or is this a possible cause that manifests differently on Intel hardware?

I'm actually not too familiar with the GPU pipeline - is a stall something that could break unrelated in-progress renders or drop previously queued GL calls?

The idea with switch to the async glReadPixels was to eliminate the implicit flush - it sounds like there's still a flush happening though, which I think could be from glXMakeContextCurrent at

mapbox-gl-native/platform/default/headless_view_glx.cpp

Lines 116 to 126 in 5588935

    
           void HeadlessView::activateContext() { 
        
               if (!glXMakeContextCurrent(xDisplay, glxPbuffer, glxPbuffer, glContext)) { 
        
                   throw std::runtime_error("Switching OpenGL context failed.\n"); 
        
               } 
        
           } 
        
           void HeadlessView::deactivateContext() { 
        
               if (!glXMakeContextCurrent(xDisplay, 0, 0, nullptr)) { 
        
                   throw std::runtime_error("Removing OpenGL context failed.\n"); 
        
               } 
        
           }

tiagovignatti · 2016-08-30T18:35:21Z

I don't think this would drop/break previously GL calls. What I am seeing on Intel is subtle stalls in many subtests of mbgl-test, but none of them takes more than 0.3 ms. Still, It might be that we're stepping in the same problem, cause I'd guess that when using discrete GPUs (your case, right?) it takes longer to synchronize the states.

This bug called my attention cause I am seeing really huge stalls (of several seconds!) when running mbgl-test with software rasterizer, but that seems orthogonal to all of this.

stale · 2018-11-20T13:49:46Z

This issue has been automatically detected as stale because it has not had recent activity and will be archived. Thank you for your contributions.

jfirebaugh mentioned this issue Oct 8, 2014

Add HeadlessView::readPixels #480

Closed

mikemorris added the Node.js node-mapbox-gl-native label Sep 18, 2015

mikemorris assigned adam-mapbox Dec 8, 2015

mikemorris mentioned this issue Aug 30, 2016

Render to a single context in HeadlessView #6212

Closed

This was referenced Oct 5, 2016

Use Pixel Buffer Object for async glReadPixels #6598

Closed

Refactor GL context creation and headless rendering #6596

Merged

stale bot added the archived Archived because of inactivity label Nov 18, 2018

stale bot closed this as completed Nov 20, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Async glReadPixels() in HeadlessView? #477

Async glReadPixels() in HeadlessView? #477

mikemorris commented Oct 3, 2014

kkaefer commented Oct 8, 2014

adam-mapbox commented Dec 10, 2015

adam-mapbox commented Dec 10, 2015

tmpsantos commented Aug 25, 2016

mikemorris commented Aug 26, 2016 •

edited

Loading

tiagovignatti commented Aug 29, 2016 •

edited

Loading

tiagovignatti commented Aug 30, 2016

mikemorris commented Aug 30, 2016

tiagovignatti commented Aug 30, 2016

stale bot commented Nov 20, 2018

Async glReadPixels() in HeadlessView? #477

Async glReadPixels() in HeadlessView? #477

Comments

mikemorris commented Oct 3, 2014

kkaefer commented Oct 8, 2014

adam-mapbox commented Dec 10, 2015

adam-mapbox commented Dec 10, 2015

tmpsantos commented Aug 25, 2016

mikemorris commented Aug 26, 2016 • edited Loading

tiagovignatti commented Aug 29, 2016 • edited Loading

tiagovignatti commented Aug 30, 2016

mikemorris commented Aug 30, 2016

tiagovignatti commented Aug 30, 2016

stale bot commented Nov 20, 2018

mikemorris commented Aug 26, 2016 •

edited

Loading

tiagovignatti commented Aug 29, 2016 •

edited

Loading