Fix performance related to dynamic scheduler scaling #2751

dipinhora · 2018-06-06T14:51:12Z

As part of performance testing Wallaroo using multiple workers,
@JONBRWN discovered a regression in both throughput and latency.
He tracked the issue down the commit that re-enabled dynamic
scheduler scaling (fc80968).
NOTE: This performance issue did not exist for singler worker
runs of Wallaroo.

Some head scratching and testing led to the current commit to
resolve the multi-worker performance issue. My best guess is that
before this change the steal loop was dependent on a memory
access to determine if dynamic scheduler scaling needed to
suspend a thread or not as its initial check. This would lead to
somewhat erratic behavior where some times the steal loop would
take long while other times it wouldn't depending on how long the
memory load took. This had a follow-on impact on actor execution
because of ASIO messages because they wouldn't be picked up off
of the queue for work as quickly as they could be due to the
extra memory accesses.

This commit changes the ordering of some operations to ensure
that there is more consistent memory accesses for the loop
resulting in more consistent actor actor execution for ASIO
messages resolving the multi-worker performance issue that
@JONBRWN discovered.

@JONBRWN

As part of performance testing Wallaroo using multiple workers, @JONBRWN discovered a regression in both throughput and latency. He tracked the issue down the commit that re-enabled dynamic scheduler scaling (fc80968). NOTE: This performance issue did not exist for singler worker runs of Wallaroo. Some head scratching and testing led to the current commit to resolve the multi-worker performance issue. My best guess is that before this change the `steal` loop was dependent on a memory access to determine if dynamic scheduler scaling needed to suspend a thread or not as its initial check. This would lead to somewhat erratic behavior where some times the `steal` loop would take long while other times it wouldn't depending on how long the memory load took. This had a follow-on impact on actor execution because of ASIO messages because they wouldn't be picked up off of the queue for work as quickly as they could be due to the extra memory accesses. This commit changes the ordering of some operations to ensure that there is more consistent memory accesses for the loop resulting in more consistent actor actor execution for ASIO messages resolving the multi-worker performance issue that @JONBRWN discovered.

SeanTAllen changed the title ~~Fix multi-worker performance related to dynamic scheduler scaling~~ Fix performance related to dynamic scheduler scaling Jun 6, 2018

SeanTAllen added the changelog - fixed Automatically add "Fixed" CHANGELOG entry on merge label Jun 6, 2018

sylvanc approved these changes Jun 6, 2018

View reviewed changes

mfelsche approved these changes Jun 6, 2018

View reviewed changes

SeanTAllen merged commit 5e35604 into ponylang:master Jun 6, 2018

ponylang-main added a commit that referenced this pull request Jun 6, 2018

Update CHANGELOG for PR #2751 [skip ci]

b205caf

SeanTAllen mentioned this pull request Jun 6, 2018

Release 0.22.6 #2758

Closed

dipinhora pushed a commit to dipinhora/ponyc that referenced this pull request Jun 7, 2018

Update CHANGELOG for PR ponylang#2751 [skip ci]

03143a2

dipinhora pushed a commit to dipinhora/ponyc that referenced this pull request Jun 8, 2018

Update CHANGELOG for PR ponylang#2751 [skip ci]

783d762

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix performance related to dynamic scheduler scaling #2751

Fix performance related to dynamic scheduler scaling #2751

dipinhora commented Jun 6, 2018

Fix performance related to dynamic scheduler scaling #2751

Fix performance related to dynamic scheduler scaling #2751

Conversation

dipinhora commented Jun 6, 2018