Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[arm64] Tune memory copy performance #7559

Closed
jkotas opened this issue Mar 4, 2017 · 2 comments
Closed

[arm64] Tune memory copy performance #7559

jkotas opened this issue Mar 4, 2017 · 2 comments
Assignees
Labels
arch-arm64 tenet-performance Performance related issue
Milestone

Comments

@jkotas
Copy link
Member

jkotas commented Mar 4, 2017

Check what was done for x86/x64 in dotnet/coreclr#9786

@sdmaclea
Copy link
Contributor

@jkotas @jashook Please assign this to me.

If I look at the benchmark which was used to optimize dotnet/coreclr#9786, it looks flawed. All the measurements are made with a fairly small bucket. This will train the branch predictors and result in excessively branchy code. I think that is why the "optimized" Buffer.cs code is so complicated. I will add some additional tests which will hopefully detect the branch prediction penalties.

@jkotas
Copy link
Member Author

jkotas commented Aug 31, 2017

cc @vkvenkat

@msftgits msftgits transferred this issue from dotnet/coreclr Jan 31, 2020
@msftgits msftgits added this to the Future milestone Jan 31, 2020
@ghost ghost locked as resolved and limited conversation to collaborators Dec 25, 2020
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
arch-arm64 tenet-performance Performance related issue
Projects
None yet
Development

No branches or pull requests

3 participants