Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Teddy macros unrolling - initial PR to test in CI #294

Merged
merged 72 commits into from
Jun 26, 2024

Conversation

isildur-g
Copy link

@isildur-g isildur-g commented May 27, 2024

unrolling the complicated macros in fdr/teddy.c which are opaque to debugging.
There's reason to believe there is some legacy cruft hiding in parts of this but for now
the functionization has been kept pretty much a direct copy.
While the objective of this PR is maintainability and manageability of the codebase,
there have been a few positive side-effects on performance. specifically performance in SSE2 , AVX512 , and ppc
environments has seen a very slight boost (between 0.1% - 0.5% improvement). performance in other environments is essentially unchanged.

@markos markos added this to the 5.4.12 milestone Jun 3, 2024
@markos markos linked an issue Jun 3, 2024 that may be closed by this pull request
@markos markos added the enhancement New feature or request label Jun 3, 2024
@markos markos merged commit aa832db into VectorCamp:develop Jun 26, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Optimize/Refactor FDR & Teddy
2 participants