Add `partitioned(by:)` #152

mdznr · 2021-07-15T19:58:08Z

Description

Adds a partitioned(by:) algorithm. This is very similar to filter(_:), but instead of just getting an array of the elements that do match a given predicate, also get a second array for the elements that did not match the same predicate.

This is more performant than calling filter(_:) twice on the same input with mutually-exclusive predicates since:

It only requires a single pass of the elements
If the input has a known number of elements, the cumulative space for both returned arrays is known and can avoid array buffer resizing.

let cast = ["Vivien", "Marlon", "Kim", "Karl"]
let (longNames , shortNames) = cast.partitioned(by: { $0.count < 5 })
print(longNames)
// Prints "["Vivien", "Marlon"]"
print(shortNames)
// Prints "["Kim", "Karl"]"

Detailed Design

extension Sequence {
  @inlinable
  public func partitioned(
    by predicate: (Element) throws -> Bool
  ) rethrows -> (falseElements: [Element], trueElements: [Element])
}

Naming

At a high-level, this acts similarly to the partition family of functions in that it separates all the elements in a given collection in two parts, those that do and do not match a given predicate. Thanks, @timvermeulen for help with naming!

Documentation Plan

Inline documentation for each new function
Comments in the implementation
Updated README.md
Added to Guides/Partitioned.md

Test Plan

Adds unit tests for example given in documentation
Adds unit tests for various inputs
Adds unit tests for empty input

Source Impact

This is purely additive

Checklist

I've added at least one test that validates that my change is working, if appropriate
I've followed the code style of the rest of the project
I've read the Contribution Guidelines
I've updated the documentation if necessary

mdznr · 2021-07-15T20:06:36Z

I made the closure belongsInSecondCollection, which is consistent with the other partitioned functions, but in terms of the return values, feels backwards ((second, first)).

mdznr · 2021-07-27T01:02:35Z

I ran some benchmarks using the awesome swift-collections-benchmark package, as suggested by @timvermeulen:

The output does confirm that using partitioned(_:) is faster than calling filter(_:) twice.

Using the Collection-based implementation with the fixed buffer size is faster than the Sequence-base implementation. However, the function’s overhead makes it slightly slower for collections fewer than 8 elements. For that reason, we could check count < 8 in the start of the Collection implementation and conditionally run the Sequence-based approach, which is slightly faster.

I was initially surprised partitioned(_:) wasn’t significantly faster than calling filter(_:) twice, though. With more experimentation, I learned how much the overall cost depends on the cost of the closure’s evaluation. The more costly the closure, the more valuable it was to avoid calling the closure twice (obviously). In very simple cases, the cost of evaluating the closure is extremely insignificant compared to the cost of adding each element to the output collection. However, in all cases, it is still faster to use partitioned(_:) than calling filter(_:) twice.

Using a slighty more expensive closure yielded these results:

Benchmarking code/details

Simple closure test:

benchmark.addSimple(
  title: "Filter × 2",
  input: [Int].self
) { input in
  blackHole(input.filter({ $0.isMultiple(of: 3) }))
  blackHole(input.filter({ !$0.isMultiple(of: 3) }))
}

benchmark.addSimple(
  title: "Partitioned (Sequence)",
  input: [Int].self
) { input in
  blackHole(input._partitioned({ $0.isMultiple(of: 3) }))
}

benchmark.addSimple(
  title: "Partitioned (Collection)",
  input: [Int].self
) { input in
  blackHole(input.partitioned({ $0.isMultiple(of: 3) }))
}

More expensive closure test:

let multiples: [Int] = [1, 3, 5, 7]

benchmark.addSimple(
  title: "Filter × 2",
  input: [Int].self
) { input in
  blackHole(input.__filter({ int in multiples.allSatisfy({ int.isMultiple(of: $0) }) }))
  blackHole(input.__filter({ int in !multiples.allSatisfy({ int.isMultiple(of: $0) }) }))
}

benchmark.addSimple(
  title: "Partitioned (Sequence)",
  input: [Int].self
) { input in
  blackHole(input._partitioned({ int in multiples.allSatisfy({ int.isMultiple(of: $0) }) }))
}

benchmark.addSimple(
  title: "Partitioned (Collection)",
  input: [Int].self
) { input in
  blackHole(input.partitioned({ int in multiples.allSatisfy({ int.isMultiple(of: $0) }) }))
}

All tests run on iMac Pro 3.2 GHz 8-Core Intel Xeon W; 32 GB 2666 MHz DDR4; macOS 11.3 (20E232); Apple Swift version 5.4.2 (swiftlang-1205.0.28.2 clang-1205.0.19.57)

…implementation This constant was determined using benchmarking. More information: apple#152 (comment)

Sources/Algorithms/Partition.swift

Guides/Partition.md

README.md

…implementation This constant was determined using benchmarking. More information: apple#152 (comment)

timvermeulen

Those graphs look good! It's nice to see that the added complexity seems to be worth it for most sizes. I think it'd be useful to benchmark another version that returns a (ArraySlice, ArraySlice) pair (or even (ArraySlice, ReversedCollection<ArraySlice>)) just so we can see how much performance we're missing out on by allocating two new arrays.

Sources/Algorithms/Partition.swift

…implementation This constant was determined using benchmarking. More information: apple#152 (comment)

mdznr · 2021-08-18T19:37:06Z

Those graphs look good! It's nice to see that the added complexity seems to be worth it for most sizes. I think it'd be useful to benchmark another version that returns a (ArraySlice, ArraySlice) pair (or even (ArraySlice, ReversedCollection<ArraySlice>)) just so we can see how much performance we're missing out on by allocating two new arrays.

Closeup of the two slice variants

Code

extension Collection {
  @inlinable
  public func partitionedA(
    _ belongsInSecondCollection: (Element) throws -> Bool
  ) rethrows -> ([Element], [Element]) {
    guard !self.isEmpty else {
      return ([], [])
    }
    
    // Since `RandomAccessCollection`s have known sizes (access to `count` is
    // constant time, O(1)), we can allocate one array of size `self.count`,
    // then insert items at the beginning or end of that contiguous block. This
    // way, we don’t have to do any dynamic array resizing. Since we insert the
    // right elements on the right side in reverse order, we need to reverse
    // them back to the original order at the end.
    
    let count = self.count
    
    // Inside of the `initializer` closure, we set what the actual mid-point is.
    // We will use this to partitioned the single array into two in constant time.
    var midPoint: Int = 0
    
    let elements = try [Element](
      unsafeUninitializedCapacity: count,
      initializingWith: { buffer, initializedCount in
        var lhs = buffer.baseAddress!
        var rhs = lhs + buffer.count
        do {
          for element in self {
            if try belongsInSecondCollection(element) {
              rhs -= 1
              rhs.initialize(to: element)
            } else {
              lhs.initialize(to: element)
              lhs += 1
            }
          }
          
          let rhsIndex = rhs - buffer.baseAddress!
          buffer[rhsIndex...].reverse()
          initializedCount = buffer.count
          
          midPoint = rhsIndex
        } catch {
          let lhsCount = lhs - buffer.baseAddress!
          let rhsCount = (buffer.baseAddress! + buffer.count) - rhs
          buffer.baseAddress!.deinitialize(count: lhsCount)
          rhs.deinitialize(count: rhsCount)
          throw error
        }
      })
    
    let lhs = elements[..<midPoint]
    let rhs = elements[midPoint...]
    return (
      Array(lhs),
      Array(rhs)
    )
  }
}

extension Collection {
  @inlinable
  public func partitionedB(
    _ belongsInSecondCollection: (Element) throws -> Bool
  ) rethrows -> (ArraySlice<Element>, ArraySlice<Element>) {
    guard !self.isEmpty else {
      return ([], [])
    }
    
    // Since `RandomAccessCollection`s have known sizes (access to `count` is
    // constant time, O(1)), we can allocate one array of size `self.count`,
    // then insert items at the beginning or end of that contiguous block. This
    // way, we don’t have to do any dynamic array resizing. Since we insert the
    // right elements on the right side in reverse order, we need to reverse
    // them back to the original order at the end.
    
    let count = self.count
    
    // Inside of the `initializer` closure, we set what the actual mid-point is.
    // We will use this to partitioned the single array into two in constant time.
    var midPoint: Int = 0
    
    let elements = try [Element](
      unsafeUninitializedCapacity: count,
      initializingWith: { buffer, initializedCount in
        var lhs = buffer.baseAddress!
        var rhs = lhs + buffer.count
        do {
          for element in self {
            if try belongsInSecondCollection(element) {
              rhs -= 1
              rhs.initialize(to: element)
            } else {
              lhs.initialize(to: element)
              lhs += 1
            }
          }
          
          let rhsIndex = rhs - buffer.baseAddress!
          buffer[rhsIndex...].reverse()
          initializedCount = buffer.count
          
          midPoint = rhsIndex
        } catch {
          let lhsCount = lhs - buffer.baseAddress!
          let rhsCount = (buffer.baseAddress! + buffer.count) - rhs
          buffer.baseAddress!.deinitialize(count: lhsCount)
          rhs.deinitialize(count: rhsCount)
          throw error
        }
      })
    
    let lhs = elements[..<midPoint]
    let rhs = elements[midPoint...]
    return (lhs, rhs)
  }
}

extension Collection {
  @inlinable
  public func partitionedC(
    _ belongsInSecondCollection: (Element) throws -> Bool
  ) rethrows -> (ArraySlice<Element>, ReversedCollection<ArraySlice<Element>>) {
    guard !self.isEmpty else {
      let emptyArraySlice = [Element]()[0...]
      return (
        emptyArraySlice,
        emptyArraySlice.reversed()
      )
    }
    
    // Since `RandomAccessCollection`s have known sizes (access to `count` is
    // constant time, O(1)), we can allocate one array of size `self.count`,
    // then insert items at the beginning or end of that contiguous block. This
    // way, we don’t have to do any dynamic array resizing. Since we insert the
    // right elements on the right side in reverse order, we need to reverse
    // them back to the original order at the end.
    
    let count = self.count
    
    // Inside of the `initializer` closure, we set what the actual mid-point is.
    // We will use this to partitioned the single array into two in constant time.
    var midPoint: Int = 0
    
    let elements = try [Element](
      unsafeUninitializedCapacity: count,
      initializingWith: { buffer, initializedCount in
        var lhs = buffer.baseAddress!
        var rhs = lhs + buffer.count
        do {
          for element in self {
            if try belongsInSecondCollection(element) {
              rhs -= 1
              rhs.initialize(to: element)
            } else {
              lhs.initialize(to: element)
              lhs += 1
            }
          }
          
          let rhsIndex = rhs - buffer.baseAddress!
          initializedCount = buffer.count
          
          midPoint = rhsIndex
        } catch {
          let lhsCount = lhs - buffer.baseAddress!
          let rhsCount = (buffer.baseAddress! + buffer.count) - rhs
          buffer.baseAddress!.deinitialize(count: lhsCount)
          rhs.deinitialize(count: rhsCount)
          throw error
        }
      })
    
    let lhs = elements[..<midPoint]
    let rhs = elements[midPoint...]
    return (lhs, rhs.reversed())
  }
}

benchmark.addSimple(
  title: "Array, Array",
  input: [Int].self
) { input in
  blackHole(input.partitionedA({
    $0.isMultiple(of: 2)
  }))
}

benchmark.addSimple(
  title: "ArraySlice, ArraySlice",
  input: [Int].self
) { input in
  blackHole(input.partitionedB({
    $0.isMultiple(of: 2)
  }))
}

benchmark.addSimple(
  title: "ArraySlice, ReversedCollection<ArraySlice>",
  input: [Int].self
) { input in
  blackHole(input.partitionedC({
    $0.isMultiple(of: 2)
  }))
}

I’m a bit surprised that the (Array, Array) implementation was actually faster in many cases (from a few hundred to a couple hundred thousand elements). Thinking it was a fluke, I’ve re-run this several times and continue to get similar results. I’m not sure yet why that could be.

I wish there were a clear best implementation from a performance point of view. However, since the performance for the non-Array return values weren’t better in all cases, I would say the tradeoff of having non-Array types as the return values aren’t worth it, as it does expose some implementation details and would make it difficult to change the implementation (and possibly the signature) later without it being a non-backwards-compatible breaking change.

timvermeulen · 2021-08-18T20:38:51Z

I’m a bit surprised that the (Array, Array) implementation was actually faster in many cases (from a few hundred to a couple hundred thousand elements). Thinking it was a fluke, I’ve re-run this several times and continue to get similar results. I’m not sure yet why that could be.

That's interesting and indeed surprising.

I wish there were a clear best implementation from a performance point of view. However, since the performance for the non-Array return values weren’t better in all cases, I would say the tradeoff of having non-Array types as the return values aren’t worth it, as it does expose some implementation details and would make it difficult to change the implementation (and possibly the signature) later without it being a non-backwards-compatible breaking change.

I completely agree with your conclusions here. I'd still be interested to see how returning (Array(lhs), Array(rhs.reversed())) rather than reversing the right side in-place could improve the (Array, Array) version even more, but judging by the tiny difference between the two slice versions it probably won't make a huge difference.

Sources/Algorithms/Partition.swift

mdznr · 2021-08-18T23:19:11Z

I wish there were a clear best implementation from a performance point of view. However, since the performance for the non-Array return values weren’t better in all cases, I would say the tradeoff of having non-Array types as the return values aren’t worth it, as it does expose some implementation details and would make it difficult to change the implementation (and possibly the signature) later without it being a non-backwards-compatible breaking change.

I completely agree with your conclusions here. I'd still be interested to see how returning (Array(lhs), Array(rhs.reversed())) rather than reversing the right side in-place could improve the (Array, Array) version even more, but judging by the tiny difference between the two slice versions it probably won't make a huge difference.

I think if I’m following you correctly, that should be the same as the test I ran earlier:

When would the reversal happen? If removing line 315 here and instead reversing the ArraySlice right before its conversion to an Array, it makes it slower.

timvermeulen · 2021-08-19T12:48:48Z

I think if I’m following you correctly, that should be the same as the test I ran earlier:

I missed that, my bad. Looks like ReversedCollection is just too slow to make this worthwhile.

…implementation This constant was determined using benchmarking. More information: apple#152 (comment)

Guides/Partition.md

Sources/Algorithms/Partition.swift

`partitioned(_:)` works like `filter(_:)`, but also returns the excluded elements by returning a tuple of two `Array`s

…implementation This constant was determined using benchmarking. More information: apple#152 (comment)

… the `Collection` implementation

Co-authored-by: Xiaodi Wu <13952+xwu@users.noreply.github.com>

The parameter name was potentially confusing. Unlike the other `partition` functions, this function can rely on its named tuple to clarify its behavior.

timvermeulen · 2021-10-08T17:19:05Z

@swift-ci Please test

natecook1000

Looks good! Mostly documentation nits, and then I think this is ready to merge 👍🏻

Sources/Algorithms/Partition.swift

Co-authored-by: Nate Cook <natecook@apple.com>

…r of actual elements found while iterating

natecook1000

natecook1000 · 2021-10-20T18:05:45Z

@swift-ci Please test

mdznr · 2021-10-21T18:30:33Z

Thank you @timvermeulen, @natecook1000, @xwu, @LucianoPAlmeida, @fedeci, and @CTMacUser for helping me get this function integrated into swift-algorithms!

mdznr mentioned this pull request Jul 15, 2021

Add bifurcate(_:) #151

Closed

4 tasks

mdznr marked this pull request as draft July 15, 2021 20:07

mdznr added a commit to mdznr/swift-algorithms that referenced this pull request Jul 27, 2021

For collections with fewer than 8 elements, use the Sequence-based …

13ff3c2

…implementation This constant was determined using benchmarking. More information: apple#152 (comment)

mdznr force-pushed the partitioned branch from 25cf83d to 13ff3c2 Compare July 27, 2021 01:08

mdznr commented Jul 27, 2021

View reviewed changes

Sources/Algorithms/Partition.swift Show resolved Hide resolved

mdznr commented Jul 27, 2021

View reviewed changes

Sources/Algorithms/Partition.swift Outdated Show resolved Hide resolved

mdznr commented Jul 27, 2021

View reviewed changes

Sources/Algorithms/Partition.swift Outdated Show resolved Hide resolved

mdznr marked this pull request as ready for review July 27, 2021 01:15

CTMacUser reviewed Jul 27, 2021

View reviewed changes

Guides/Partition.md Outdated Show resolved Hide resolved

fedeci reviewed Jul 27, 2021

View reviewed changes

Guides/Partition.md Outdated Show resolved Hide resolved

Guides/Partition.md Outdated Show resolved Hide resolved

Guides/Partition.md Outdated Show resolved Hide resolved

README.md Outdated Show resolved Hide resolved

mdznr added a commit to mdznr/swift-algorithms that referenced this pull request Jul 27, 2021

For collections with fewer than 8 elements, use the Sequence-based …

c43cd40

…implementation This constant was determined using benchmarking. More information: apple#152 (comment)

mdznr force-pushed the partitioned branch from 13ff3c2 to c43cd40 Compare July 27, 2021 16:13

timvermeulen reviewed Aug 6, 2021

View reviewed changes

mdznr added a commit to mdznr/swift-algorithms that referenced this pull request Aug 18, 2021

For collections with fewer than 8 elements, use the Sequence-based …

3cecd50

…implementation This constant was determined using benchmarking. More information: apple#152 (comment)

mdznr force-pushed the partitioned branch from c43cd40 to e0324f9 Compare August 18, 2021 19:00

LucianoPAlmeida reviewed Aug 18, 2021

View reviewed changes

Sources/Algorithms/Partition.swift Outdated Show resolved Hide resolved

mdznr added a commit to mdznr/swift-algorithms that referenced this pull request Sep 8, 2021

For collections with fewer than 8 elements, use the Sequence-based …

5ad7810

…implementation This constant was determined using benchmarking. More information: apple#152 (comment)

mdznr force-pushed the partitioned branch from 875f6a5 to 60054ff Compare September 8, 2021 22:48

mdznr added a commit to mdznr/swift-algorithms that referenced this pull request Sep 9, 2021

For collections with fewer than 8 elements, use the Sequence-based …

17f5bf2

…implementation This constant was determined using benchmarking. More information: apple#152 (comment)

mdznr force-pushed the partitioned branch from 60054ff to b738ced Compare September 9, 2021 16:36

xwu reviewed Sep 12, 2021

View reviewed changes

Guides/Partition.md Outdated Show resolved Hide resolved

xwu reviewed Sep 12, 2021

View reviewed changes

Sources/Algorithms/Partition.swift Outdated Show resolved Hide resolved

mdznr added 2 commits September 29, 2021 09:39

Add partitioned(_:)

629546a

`partitioned(_:)` works like `filter(_:)`, but also returns the excluded elements by returning a tuple of two `Array`s

For collections with fewer than 8 elements, use the Sequence-based …

9ca2969

…implementation This constant was determined using benchmarking. More information: apple#152 (comment)

mdznr and others added 10 commits September 29, 2021 09:39

Remove check for collections fewer than 8 elements

9d70c19

Make _partitioned internal

b45bc76

Prefer Array over ContiguousArray

1b24cac

Document partitioned(_:) on Collection

8014849

Remove partitioned(upTo:)

da8185e

Remove _tupleMap

86c1abf

Remove _partitioned and use it inline (since it’s no longer used by…

7fe99cb

… the `Collection` implementation

Remove unnecessary conversation of Array to Array

afc6a3a

Correct indentation

9ac7a20

Co-authored-by: Xiaodi Wu <13952+xwu@users.noreply.github.com>

Consistent syntax

a254f37

Co-authored-by: Xiaodi Wu <13952+xwu@users.noreply.github.com>

mdznr force-pushed the partitioned branch from d3c7d2a to a254f37 Compare September 29, 2021 16:39

mdznr added 2 commits September 30, 2021 10:01

Add an external by: label to partitioned

a66018e

Add labels to returned tuple falseElements, trueElements

ecc07f0

mdznr changed the title ~~Add partitioned(_:)~~ Add partitioned(by:) Sep 30, 2021

mdznr added 2 commits October 8, 2021 09:34

Correct function signature

439f0f2

Rename belongsInSecondCollection parameter name to simply predicate

1c12067

The parameter name was potentially confusing. Unlike the other `partition` functions, this function can rely on its named tuple to clarify its behavior.

natecook1000 reviewed Oct 12, 2021

View reviewed changes

mdznr and others added 4 commits October 12, 2021 09:21

Update copyright information

3d5d91c

Co-authored-by: Nate Cook <natecook@apple.com>

Update documentation

5189d07

Co-authored-by: Nate Cook <natecook@apple.com>

Update comment

2740675

Add a precondition to ensure that the count matches up with the numbe…

51a3c6b

…r of actual elements found while iterating

natecook1000 approved these changes Oct 20, 2021

View reviewed changes

timvermeulen merged commit e2fa131 into apple:main Oct 20, 2021

mdznr deleted the partitioned branch October 21, 2021 18:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add `partitioned(by:)` #152

Add `partitioned(by:)` #152

mdznr commented Jul 15, 2021 •

edited

Loading

mdznr commented Jul 15, 2021

mdznr commented Jul 27, 2021

timvermeulen left a comment

mdznr commented Aug 18, 2021

timvermeulen commented Aug 18, 2021

mdznr commented Aug 18, 2021

timvermeulen commented Aug 19, 2021

timvermeulen commented Oct 8, 2021

natecook1000 left a comment

natecook1000 left a comment

natecook1000 commented Oct 20, 2021

mdznr commented Oct 21, 2021

Add partitioned(by:) #152

Add partitioned(by:) #152

Conversation

mdznr commented Jul 15, 2021 • edited Loading

Description

Detailed Design

Naming

Documentation Plan

Test Plan

Source Impact

Checklist

mdznr commented Jul 15, 2021

mdznr commented Jul 27, 2021

timvermeulen left a comment

Choose a reason for hiding this comment

mdznr commented Aug 18, 2021

timvermeulen commented Aug 18, 2021

mdznr commented Aug 18, 2021

timvermeulen commented Aug 19, 2021

timvermeulen commented Oct 8, 2021

natecook1000 left a comment

Choose a reason for hiding this comment

natecook1000 left a comment

Choose a reason for hiding this comment

natecook1000 commented Oct 20, 2021

mdznr commented Oct 21, 2021

Add `partitioned(by:)` #152

Add `partitioned(by:)` #152

mdznr commented Jul 15, 2021 •

edited

Loading