Skip to content

Apache Release 5.0.0

Compare
Choose a tag to compare
@leerho leerho released this 09 Dec 01:23
· 346 commits to master since this release
  • A new Example Partitioner Tool is useable in its own right for partitioning medium sized data sets up to about 1E9 items. But the same algorithm could be used in a parallel environment for partitioning data sets many orders-of-magnitude larger.
  • Lots of internal cleanup and a few API improvements for consistency across the different quantile sketches, for example. These changes in the API, although relatively minor, were the reason to move to a major release.
  • Fixed an integer overflow bug caught by Karan Kumar (via Druid), where very large partitioning datasets using the classic quantiles DoublesSketch::getPartitionBoundaries() would fail.