Add cardinality control for groupby benchs with flat types #15134

PointKernel · 2024-02-24T00:35:17Z

Description

Contributes to #15114

This PR adds cardinality control to group_max, group_nunique and group_rank benchmarks.

Checklist

I am familiar with the Contributing Guidelines.
New or existing tests cover these changes.
The documentation is up to date with these changes.

GregoryKimball · 2024-02-28T17:22:06Z

Thanks for kicking this off! You might also consider adding nvbench axes for cardinality for the existing benchmarks. If we started with a default value of {0} then this would let us vary cardinality for engineering studies without changing the automated benchmark runtime.

ttnghia · 2024-03-05T19:16:49Z

cpp/benchmarks/groupby/group_nunique.cpp

+    data_profile profile =
+      data_profile_builder()
+        .cardinality(cardinality)
+        .no_validity()
+        .distribution(cudf::type_to_id<int32_t>(), distribution_id::UNIFORM, 0, size);


The code that creates keys and values here seems very similar to (if not the same as) the code for groupby max. Can we extract it into a common function, such as create_keys_values()?

PointKernel · Mar 5, 2024

The duplicated part is just the data profile construction. We repeat those 5 lines of code each time, but it's a single API call, so IMO it's not worth creating a helper function that wraps one call.
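For readers unfamiliar with what the cardinality setting in the snippet above controls, here is a minimal host-side sketch in plain C++. It is not libcudf's generator and does not use data_profile_builder: make_keys and count_distinct are hypothetical names, and treating a cardinality of 0 as "no limit" is an assumption of this sketch, chosen to match the default value of {0} suggested earlier in this thread.

```cpp
#include <cstddef>
#include <cstdint>
#include <random>
#include <unordered_set>
#include <vector>

// Hypothetical stand-in for the benchmark's key generation (not libcudf code).
// Keys are drawn uniformly from [0, num_rows]. When `cardinality` is nonzero,
// at most `cardinality` distinct values can appear in the result.
// Assumption of this sketch: cardinality == 0 means "no limit".
std::vector<std::int32_t> make_keys(std::size_t num_rows, std::size_t cardinality, unsigned seed)
{
  std::mt19937 rng(seed);
  std::uniform_int_distribution<std::int32_t> value_dist(0, static_cast<std::int32_t>(num_rows));

  std::vector<std::int32_t> keys(num_rows);
  if (cardinality == 0) {
    // No cap: every key is an independent draw from the full range.
    for (auto& k : keys) { k = value_dist(rng); }
    return keys;
  }

  // Capped: build a pool of `cardinality` candidate values, then sample keys from it.
  std::vector<std::int32_t> pool(cardinality);
  for (auto& v : pool) { v = value_dist(rng); }
  std::uniform_int_distribution<std::size_t> index_dist(0, cardinality - 1);
  for (auto& k : keys) { k = pool[index_dist(rng)]; }
  return keys;
}

// Number of distinct keys, i.e. the number of groups a groupby would produce.
std::size_t count_distinct(std::vector<std::int32_t> const& keys)
{
  return std::unordered_set<std::int32_t>(keys.begin(), keys.end()).size();
}
```

With this model, count_distinct(make_keys(1000, 10, 42)) can never exceed 10, so the number of groups is bounded by the cardinality parameter rather than by the row count.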

PointKernel · 2024-03-08T23:36:46Z

/merge

PointKernel added 2 commits February 23, 2024 16:10

Add groupby_max_cardinality bench

29d3e23

Minor cleanups

45f8073

PointKernel added the "feature request", "2 - In Progress", "libcudf", and "non-breaking" labels Feb 24, 2024

PointKernel self-assigned this Feb 24, 2024

PointKernel added 3 commits February 29, 2024 01:10

Merge remote-tracking branch 'upstream/branch-24.04' into groupby-bench-cardinality

5893f22

Merge remote-tracking branch 'upstream/branch-24.04' into groupby-bench-cardinality

f6a13f4

Add cardinality axis to the normal groupby max bench

77e53cb

PointKernel changed the title ~~Add groupby benchmarks varying cardinalities~~ Add cardinality control for groupby max bench Mar 4, 2024

PointKernel added 2 commits March 4, 2024 10:59

Add cardinality control for group_rank and group_nunique

30f53e8

Merge remote-tracking branch 'upstream/branch-24.04' into groupby-bench-cardinality

da8477a

PointKernel changed the title ~~Add cardinality control for groupby max bench~~ Add cardinality control for groupby benchs with flat types Mar 4, 2024

PointKernel marked this pull request as ready for review March 4, 2024 19:42

PointKernel requested a review from a team as a code owner March 4, 2024 19:42

PointKernel requested review from vyasr and ttnghia March 4, 2024 19:42

PointKernel added the "3 - Ready for Review" label and removed the "2 - In Progress" label Mar 4, 2024

Merge branch 'branch-24.04' into groupby-bench-cardinality

99138c7

ttnghia reviewed Mar 5, 2024

ttnghia approved these changes Mar 5, 2024

GregoryKimball mentioned this pull request Mar 8, 2024

[FEA] Add shared memory hash map for low-cardinality aggregations #15262

Open

vyasr approved these changes Mar 8, 2024

rapids-bot bot merged commit b08dd9b into rapidsai:branch-24.04 Mar 8, 2024
73 checks passed

PointKernel deleted the groupby-bench-cardinality branch March 8, 2024 23:36

GregoryKimball mentioned this pull request Apr 18, 2024

[FEA] Add cardinality axis to nvbench for groupby aggregations #15114

Closed
