Option to use pre-shaped result rows; fixes #3042 #3043

koenfaro90 · 2023-08-11T15:57:51Z

Adds option usePrebuiltEmptyResultObjects on the Query class; this results in generating pre-shaped result row objects instead of dynamically generated ones - massively increasing performance.

I had some difficulties running all tests; also not quite sure about style/backward compatibility demands, so let me know what needs adjusting before LGTM!

… generates pre-shaped result rows

koenfaro90 · 2023-08-11T15:59:09Z

More info; #3042

brianc · 2023-08-11T20:19:05Z

ohhhh that's super interesting & doesn't seem to introduce any unwanted weird behavior or dynamic class generation or memory leaks or anything. Exciting! I'll run this in CI & see what we can see. 😄

koenfaro90 · 2023-08-11T20:23:44Z

I managed to run all tests at one point and everything passed, found one issue with multiple results which had an obvious point to fix.

…

On Fri, 11 Aug 2023 at 22:19, Brian C ***@***.***> wrote: ohhhh that's super interesting & doesn't seem to introduce any unwanted weird behavior or dynamic class generation or memory leaks or anything. Exciting! I'll run this in CI & see what we can see. 😄 — Reply to this email directly, view it on GitHub <#3043 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ADR27XOROD3TAVEZWJ37JPDXU2HUJANCNFSM6AAAAAA3NDNEBU> . You are receiving this because you authored the thread.Message ID: ***@***.***>

brianc · 2023-08-11T20:26:08Z

All tests pass on all versions of node! FUN! I'm going to pull down your branch and run my horribly crude benchmark against it and current release & i'll report back.

packages/pg/lib/result.js

charmander · 2023-08-11T20:39:13Z

I’d also be interested in how Object.fromEntries (+ Object.defineProperty polyfill for Node 10) benchmarks, since it avoids the (niche for pg, but uncomfortable) __proto__ hazard and is enabled by this change, but that’s for a future PR anyway!

koenfaro90 · 2023-08-11T20:58:10Z

I’d also be interested in how Object.fromEntries (+ Object.defineProperty polyfill for Node 10) benchmarks, since it avoids the (niche for pg, but uncomfortable) __proto__ hazard and is enabled by this change, but that’s for a future PR anyway!

You mean for constructing the initial cloneable-object? As that's called only once I would estimate that will be negligible in terms of performance regardless, but might indeed provide protection against a polluted prototype; however, that same risk was present in the current code, so indeed another PR for another day!

koenfaro90 · 2023-08-11T21:04:33Z

All tests pass on all versions of node! FUN! I'm going to pull down your branch and run my horribly crude benchmark against it and current release & I'll report back.

Does your benchmark include manipulating/using the result-rows? That's where the major gain would be, not so much in result construction I think, maybe a little. If you cannot quantify it, then I will write a small benchmark for that next week.

koenfaro90 · 2023-08-12T06:11:05Z

Seems the CI had a hickup, or is something actually failing now?

koenfaro90 · 2023-08-12T11:48:30Z

I wrote a little bench quickly; https://github.com/koenfaro90/node-postgres-bench/tree/master; there seems to be a tiny performance regression SOMEWHERE else, the actual operations seem faster, but somewhere else we seem to loose a little ms - I will investigate on Monday where the regression is caused.

8.11.2
< 8.11.2; 9952ms
pg-performance-pull
< pg-performance-pull; 10278ms
AccessCase { '8.11.2': 39.66, 'pg-performance-pull': 11.47 }
CloneUsingAssignCase { '8.11.2': 164.18, 'pg-performance-pull': 18.52 }
CloneUsingSpreadCase { '8.11.2': 179.6, 'pg-performance-pull': 3.2 }
KeysCase { '8.11.2': 17.12, 'pg-performance-pull': 0.83 }
ValuesCase { '8.11.2': 76.62, 'pg-performance-pull': 8.97 }

brianc · 2023-08-12T15:52:11Z

Nice - I'm assuming on that output above a lower number is better? Should I be paying attention to the AccessCase and CloneUsingAssignCase etc or only the diff in numbers between 8.11.2 and pg-performance-pull?

It's always kinda cat and mouse game trying to benchmark the driver since postgres may take slightly longer or shorter to do a few queries, node might do a GC pause here or there, network, disk, etc all come into play. I'd recommend trying to run the bench like 10 times & seeing if you steadily see more or less performance from your branch, because in my experience it's not extremely steady numbers....but if actually accessing the row values by name is actually faster, and that speed up is somewhat linearly related to the number of rows returned, this is gonna be yuge!

koenfaro90 · 2023-08-12T16:48:13Z

So, the times you see are all in ms, but the total numbers include setup (generating rows, inserting them), so the actual row usage is waaay faster indeed. Both cloning and simply accessing. But the total time including setup, inserting, querying and executing the specific is slower, perhaps cloning of the row accounts for this, or addFields is called more than once. I will research what accounts for this on Monday. Cases run in a seperate node instance to prevent any internal optimalization or GC impacting the results. So for now I would hold back on the merge until this is resolved.

…

On Sat, 12 Aug 2023 at 17:52, Brian C ***@***.***> wrote: Nice - I'm assuming on that output above a lower number is better? Should I be paying attention to the AccessCase and CloneUsingAssignCase etc or only the diff in numbers between 8.11.2 and pg-performance-pull? It's always kinda cat and mouse game trying to benchmark the driver since postgres may take slightly longer or shorter to do a few queries, node might do a GC pause here or there, network, disk, etc all come into play. I'd recommend trying to run the bench like 10 times & seeing if you steadily see more or less performance from your branch, because in my experience it's not extremely steady numbers....but if actually accessing the row values by name is actually faster, and that speed up is somewhat linearly related to the number of rows returned, this is gonna be yuge! — Reply to this email directly, view it on GitHub <#3043 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ADR27XIGM5T43G54CIMNNPLXU6RDNANCNFSM6AAAAAA3NDNEBU> . You are receiving this because you authored the thread.Message ID: ***@***.***>

koenfaro90 · 2023-08-13T09:42:15Z

Found the issue; needed to ensure that the clone-base record was properly shaped; now the new version is always considerably faster on all fronts (9600ms vs 10000ms for the suite).

abenhamdine · 2023-08-14T12:02:42Z

amazing work @koenfaro90 !
by the way, if you are interesting in perf optimizations of this library, perhaps you would want to work on #2706, it's a POC that I can't dedicate time, but it would probably improve perfs significantly.

koenfaro90 · 2023-08-14T12:17:49Z

I just messaged a related idea to one of my colleagues, focusing on being able to perform all JS side preparation/serialization while a previous query is being executed, most gains would be in bulk inserts with a certain chunk size. But will point him to your issue too.

…

On Mon, 14 Aug 2023 at 14:02, Arnaud Benhamdine ***@***.***> wrote: amazing work @koenfaro90 <https://github.com/koenfaro90> ! by the way, if you are interesting in perf optimizations of this library, perhaps you would want to work on #2706 <#2706>, it's a POC that I can't dedicate time, but it would probably improve perfs significantly. — Reply to this email directly, view it on GitHub <#3043 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ADR27XJH332GVK2ABR5TSFDXVIHW5ANCNFSM6AAAAAA3NDNEBU> . You are receiving this because you were mentioned.Message ID: ***@***.***>

brianc · 2023-08-15T03:24:06Z

Weirdly tests failed on a few versions of node...rerunning them. Looking forward to merging this!!

Add property usePrebuiltEmptyResultObjects to Query constructor which…

11955a0

… generates pre-shaped result rows

koenfaro90 mentioned this pull request Aug 11, 2023

Result rows are dynamically shaped; causing massive performance degradations when used #3042

Closed

charmander added the reformat during merge label Aug 11, 2023

koenfaro90 changed the title ~~Option to use pre-shared result rows; fixes #3042~~ Option to use pre-shaped result rows; fixes #3042 Aug 11, 2023

HZ111 / Dev2 added 2 commits August 11, 2023 22:15

Remove option and test for prebuiltEmptyResultObject

c8abc18

Remove errorneously added newline

2172d4e

charmander requested changes Aug 11, 2023

View reviewed changes

packages/pg/lib/result.js Show resolved Hide resolved

HZ111 / Dev2 added 2 commits August 11, 2023 22:43

Move all logic for prebuilding objects to Result

cfff0b9

Move prebuilding to addFields

2de4a4e

charmander approved these changes Aug 11, 2023

View reviewed changes

Use a clone as clone-base

41533fd

brianc merged commit b5c5e52 into brianc:master Aug 15, 2023

wannabehero mentioned this pull request Sep 1, 2023

Parsing single result with large amount of columns takes more time in 8.11.3 #3055

Open

charmander mentioned this pull request Sep 1, 2023

Remove 1 loop on rowDescription event #3056

Merged

Tol1 mentioned this pull request Sep 7, 2023

With duplicated column names, result row gets value from rightmost non-null column (or default null) #3062

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Option to use pre-shaped result rows; fixes #3042 #3043

Option to use pre-shaped result rows; fixes #3042 #3043

koenfaro90 commented Aug 11, 2023

koenfaro90 commented Aug 11, 2023

brianc commented Aug 11, 2023

koenfaro90 commented Aug 11, 2023 via email

brianc commented Aug 11, 2023

charmander commented Aug 11, 2023

koenfaro90 commented Aug 11, 2023

koenfaro90 commented Aug 11, 2023

koenfaro90 commented Aug 12, 2023

koenfaro90 commented Aug 12, 2023

brianc commented Aug 12, 2023

koenfaro90 commented Aug 12, 2023 via email •

edited

Loading

koenfaro90 commented Aug 13, 2023

abenhamdine commented Aug 14, 2023

koenfaro90 commented Aug 14, 2023 via email

brianc commented Aug 15, 2023

Option to use pre-shaped result rows; fixes #3042 #3043

Option to use pre-shaped result rows; fixes #3042 #3043

Conversation

koenfaro90 commented Aug 11, 2023

koenfaro90 commented Aug 11, 2023

brianc commented Aug 11, 2023

koenfaro90 commented Aug 11, 2023 via email

brianc commented Aug 11, 2023

charmander commented Aug 11, 2023

koenfaro90 commented Aug 11, 2023

koenfaro90 commented Aug 11, 2023

koenfaro90 commented Aug 12, 2023

koenfaro90 commented Aug 12, 2023

brianc commented Aug 12, 2023

koenfaro90 commented Aug 12, 2023 via email • edited Loading

koenfaro90 commented Aug 13, 2023

abenhamdine commented Aug 14, 2023

koenfaro90 commented Aug 14, 2023 via email

brianc commented Aug 15, 2023

koenfaro90 commented Aug 12, 2023 via email •

edited

Loading