Add `sort` in memory to Arrays library #4846

Amxx · 2024-01-18T23:18:01Z

Replaces #3520, which is outdated. This comment remains relevant.

Fixes #3490
Fixes LIB-1189

IMO the approach should be to provide a simple, straightforward implementation that works. We can refine or replace it down the line if we find a more efficient approach. Backward compatibility should not cause issues.

PR Checklist

Tests
Documentation
Changeset entry (run npx changeset add)

changeset-bot · 2024-01-18T23:18:05Z

🦋 Changeset detected

Latest commit: 4336e2e

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 1 package

Name	Type
openzeppelin-solidity	Minor

contracts/utils/Arrays.sol

Co-authored-by: Ernesto García <ernestognw@gmail.com>

contracts/utils/Arrays.sol

Co-authored-by: Hadrien Croubois <hadrien.croubois@gmail.com>

ernestognw

LGTM

ernestognw · 2024-02-03T01:12:53Z

Before approving I want to clarify why we're picking the first element as the pivot.

I still need to dig deeper into this comment, but I'd like to know whether you reached a conclusion based on a previous discussion.

Amxx · 2024-02-05T09:16:51Z

Before approving I want to clarify why we're picking the first element as the pivot.

There are a few parts to the answer, but the bottom line is: it's simpler that way, and there is no reason to think it's a bad choice.

Let's get into the issue:

At each step, the subroutine sorts a subarray by choosing a pivot, putting all the values that are smaller on one side and all the values that are bigger on the other, and then running recursively on both "sides". The pivot is guaranteed to end up in its correct place. The other values are "grouped" to be sorted. Mathematically, the best pivot is one that divides the array into two subarrays of equal size. In other words, the best pivot is the median.
But if we wanted to get the median ... we'd basically need to sort the array, which is our goal.
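
As an illustration of the step described above, here is a minimal Python sketch (not the Solidity code from this PR). It partitions a list around its first element and recurses on both sides. The function names `partition_step` and `quick_sort` are hypothetical, and sending values equal to the pivot to the "larger" side is an arbitrary choice made for the sketch.

```python
def partition_step(values):
    """Split `values` around its first element (the pivot).

    Returns (smaller, pivot, larger). Values equal to the pivot go to
    `larger`, an arbitrary choice made for this sketch.
    """
    pivot = values[0]
    rest = values[1:]
    smaller = [v for v in rest if v < pivot]
    larger = [v for v in rest if v >= pivot]
    return smaller, pivot, larger


def quick_sort(values):
    """Return a sorted copy: partition once, then recurse on both sides."""
    if len(values) <= 1:
        return list(values)
    smaller, pivot, larger = partition_step(values)
    return quick_sort(smaller) + [pivot] + quick_sort(larger)


smaller, pivot, larger = partition_step([5, 2, 8, 1, 9, 3])
# The pivot's final index equals the number of smaller values (3 here).
print(smaller, pivot, larger)
print(quick_sort([5, 2, 8, 1, 9, 3]))
```

The closer `len(smaller)` and `len(larger)` are to each other, the fewer levels of recursion are needed, which is why the median would be the ideal pivot.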

We have three options:

Try to be smart, and choose the pivot based on some algorithm
Choose the pivot randomly
Take a specific element (first one/last one/...)
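
To make the three options concrete, here is a hedged Python sketch of one possible pivot chooser per option. Median-of-three is used as the example of a "smart" algorithm; it is a common heuristic picked for illustration, not something proposed in this thread. All function names are hypothetical, and each function returns an index into the half-open range `[begin, end)`.

```python
import random


def pivot_median_of_three(values, begin, end):
    """Option 1 ("smart"): index of the median of the first, middle and last values.

    This reads three array elements before any sorting work starts.
    """
    mid = begin + (end - begin) // 2
    last = end - 1
    return sorted([begin, mid, last], key=lambda i: values[i])[1]


def pivot_random(values, begin, end, rng=random):
    """Option 2: a uniformly random index in [begin, end)."""
    return rng.randrange(begin, end)


def pivot_first(values, begin, end):
    """Option 3: always the first index; nothing is read from the array."""
    return begin


data = [7, 1, 4, 9, 3]
print(pivot_median_of_three(data, 0, len(data)))  # index of the value 4
print(pivot_first(data, 0, len(data)))
```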

The first option requires reading the values in a loop in order to make some decision. It means reading the array, which comes at a cost. I'm not convinced that choosing a "good" pivot would save more gas (thanks to the nice "split" it would produce) than it costs to choose that pivot.

If the array is shuffled, there is no reason to think that option 2 would be better than option 3, or the other way around. You have the same probability of getting a good or bad pivot. So it comes down to practicality.

Since you don't want to have the pivot "in the way", a common approach is to set it aside, sort the rest of the array (into the 2 sections discussed earlier), and then put the pivot in the right place. The easiest way to set the pivot aside is to put it at the beginning (or at the end), and then swap it with the last element smaller than the pivot (or the first element bigger than the pivot). If you take your pivot randomly, putting it aside costs you one swap operation. If you choose the first (or the last), it's already placed somewhere nice.
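
The "set the pivot aside" approach described above can be sketched in Python as follows. This is an illustrative sketch only, not the `_quickSort` code merged in this PR; the function name and the half-open `[begin, end)` convention are assumptions.

```python
def quick_sort_in_place(values, begin=0, end=None):
    """Sort values[begin:end] in place, using values[begin] as the pivot."""
    if end is None:
        end = len(values)
    if end - begin < 2:
        return  # zero or one element: already sorted

    # The pivot is the first element, so it is already "set aside" at `begin`.
    pivot = values[begin]

    # `pos` is the index of the last value known to be smaller than the pivot.
    pos = begin
    for i in range(begin + 1, end):
        if values[i] < pivot:
            pos += 1
            values[pos], values[i] = values[i], values[pos]

    # Swap the pivot with the last smaller value: the pivot is now in its final place.
    values[begin], values[pos] = values[pos], values[begin]

    quick_sort_in_place(values, begin, pos)    # values smaller than the pivot
    quick_sort_in_place(values, pos + 1, end)  # values greater than or equal to it


data = [5, 2, 8, 1, 9, 3]
quick_sort_in_place(data)
print(data)
```

With a randomly chosen pivot, the only change would be one extra swap at the top to move the chosen element to `values[begin]`, which is the additional cost mentioned above.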

So the bottom line is:

using the first element is easy in terms of code
it's statistically not a better or worse choice than any other randomly chosen pivot
you could do better by choosing a good pivot, but making this choice has a cost.

contracts/utils/Arrays.sol

ernestognw · 2024-02-06T16:55:47Z

So the bottom line is:

using the first element is easy in terms of code

it's statistically not a better or worse choice than any other randomly chosen pivot

you could do better by choosing a good pivot, but making this choice has a cost.

Great, I agree with these conclusions. I know there are other strategies to heuristically determine which element to use as the pivot.

I think it's worth keeping in mind that there may be use cases where specifying the pivot as part of the params of _quickSort makes sense. As you mentioned, choosing it requires reading the values in the array, which is costly, but surely there may be a threshold at which it still pays off (e.g. if the array is guaranteed to have 20 elements, reading all 20 may be better than applying the quicksort right away).
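
One way to read this suggestion is a sort that takes the pivot choice as a parameter. The Python sketch below is hypothetical: `quick_sort_with`, `choose_pivot` and `middle_index` are names invented for illustration, and nothing like this exists in the merged code, where `_quickSort` always uses the first element.

```python
def quick_sort_with(values, choose_pivot=None, begin=0, end=None):
    """In-place quicksort whose pivot index comes from `choose_pivot`.

    `choose_pivot(values, begin, end)` must return an index in [begin, end).
    When it is omitted, the first element is used as the pivot.
    """
    if end is None:
        end = len(values)
    if end - begin < 2:
        return

    if choose_pivot is not None:
        p = choose_pivot(values, begin, end)
        # Extra swap: move the chosen pivot to the front, out of the way.
        values[begin], values[p] = values[p], values[begin]

    pivot = values[begin]
    pos = begin  # index of the last value smaller than the pivot
    for i in range(begin + 1, end):
        if values[i] < pivot:
            pos += 1
            values[pos], values[i] = values[i], values[pos]
    values[begin], values[pos] = values[pos], values[begin]

    quick_sort_with(values, choose_pivot, begin, pos)
    quick_sort_with(values, choose_pivot, pos + 1, end)


def middle_index(values, begin, end):
    """Example chooser (an assumption for this sketch): the middle index."""
    return begin + (end - begin) // 2


data = [5, 2, 8, 1, 9, 3]
quick_sort_with(data, middle_index)
print(data)
```

Whether a smarter chooser pays for its extra reads and the extra swap would have to be measured; the sketch only shows where such a parameter would plug in.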

Right now it's fine since the function is private anyway.

RenanSouza2 and others added 11 commits November 12, 2023 17:16

Migrate 'arrays'

d78ae17

fix findUpperBound and add findLowerBound

52b3bd8

add memory variants

9a1411b

Merge branch 'master' into feature/array-bound-with-duplicates

c4726c8

fix merge

a6ec616

fix lint

23e8db9

minimize change

c72591f

add changeset

9162e42

Apply suggestions from code review

ed1de5b

add Arrays.sort

4c1c7f4

add sort test

abd07b6

Amxx added feature New contracts, functions, or helpers. Datastructures labels Jan 18, 2024

Amxx added this to the 5.1 milestone Jan 18, 2024

Amxx mentioned this pull request Jan 18, 2024

Add Array.sort #3520

Closed

3 tasks

Amxx added 3 commits January 19, 2024 00:19

fix lint

1e89815

codespell

b73989d

add fuzzing tests for Arrays.sort

79bf367

Amxx requested a review from ernestognw January 22, 2024 12:31

add unsafeMemoryAccess tests

f2d49ef

Amxx mentioned this pull request Jan 29, 2024

Procedurally generate Arrays.sol #4859

Merged

3 tasks

Amxx added 3 commits January 29, 2024 21:53

Merge branch 'master' into feature/quicksort

c043453

fix lint

c75de32

lint

f823bee

ernestognw reviewed Feb 1, 2024

Amxx and others added 2 commits February 2, 2024 19:01

Update contracts/utils/Arrays.sol

c90f12b

Co-authored-by: Ernesto García <ernestognw@gmail.com>

Update contracts/utils/Arrays.sol

180a969

Co-authored-by: Ernesto García <ernestognw@gmail.com>

Amxx commented Feb 2, 2024

contracts/utils/Arrays.sol (outdated, resolved)

Apply suggestions from code review

708972f

Amxx and others added 3 commits February 2, 2024 19:03

Merge branch 'master' into feature/quicksort

533e6cd

Update contracts/utils/Arrays.sol

3f1f0a5

Co-authored-by: Hadrien Croubois <hadrien.croubois@gmail.com>

Add comments to _quickSort

5a0ad7f

ernestognw previously approved these changes Feb 3, 2024

Lint

a8e6f54

ernestognw dismissed their stale review via a8e6f54 February 3, 2024 01:00

ernestognw changed the title ~~Arrays sorting (in memory)~~ Add sort in memory to Arrays library Feb 3, 2024

Amxx commented Feb 5, 2024

contracts/utils/Arrays.sol (outdated, resolved)

Amxx added 3 commits February 5, 2024 10:20

Update contracts/utils/Arrays.sol

7600291

Update contracts/utils/Arrays.sol

6f163d2

cache the pivot and improve doc

8983066

Amxx commented Feb 5, 2024

contracts/utils/Arrays.sol (outdated, resolved)

Apply suggestions from code review

8704763

Amxx requested a review from ernestognw February 6, 2024 15:23

ernestognw approved these changes Feb 6, 2024

ernestognw enabled auto-merge (squash) February 6, 2024 16:59

Merge branch 'master' into feature/quicksort

4336e2e

ernestognw merged commit 0a757ec into OpenZeppelin:master Feb 6, 2024
20 checks passed

Amxx deleted the feature/quicksort branch February 6, 2024 21:01

This was referenced Nov 8, 2024

[Snyk] Upgrade @openzeppelin/contracts from 5.0.0 to 5.1.0 RomulousApollo/v3-core#4

Open

[Snyk] Upgrade @openzeppelin/contracts-upgradeable from 5.0.0 to 5.1.0 RomulousApollo/v3-core#5

Open

This was referenced Nov 9, 2024

[Snyk] Upgrade @openzeppelin/contracts from 5.0.0 to 5.1.0 doperiddle/stl-contracts#5

Merged

[Snyk] Upgrade @openzeppelin/contracts-upgradeable from 5.0.0 to 5.1.0 doperiddle/stl-contracts#7

Closed
