Add `Iterator::for_each` #42782

cuviper · 2017-06-20T22:39:04Z

This works like a for loop in functional style, applying a closure to
every item in the Iterator. It doesn't allow break/continue like
a for loop, nor any other control flow outside the closure, but it may
be a more legible style for tying up the end of a long iterator chain.
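
As a rough illustration of the description above, the two functions below are meant to be equivalent. The function names are invented for this sketch, and `for_each` is assumed to be available (on a 2017 nightly that would mean enabling the `iterator_for_each` feature gate).

```rust
/// Collects the squares of the even numbers in `0..n` with a `for` loop.
/// The whole chain has to sit inside the loop header.
fn squares_of_evens_loop(n: u32) -> Vec<u32> {
    let mut out = Vec::new();
    for x in (0..n).filter(|x| x % 2 == 0).map(|x| x * x) {
        out.push(x);
    }
    out
}

/// Same result, but the chain reads top to bottom and ends in `for_each`.
fn squares_of_evens_for_each(n: u32) -> Vec<u32> {
    let mut out = Vec::new();
    (0..n)
        .filter(|x| x % 2 == 0)
        .map(|x| x * x)
        .for_each(|x| out.push(x));
    out
}
```

Calling either function with `10` should give `[0, 4, 16, 36, 64]`. Neither form can `break` out early from inside the closure, which is the limitation the description mentions.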

This was tried before in #14911, but nobody made the case for using it
with longer iterators. There was also Iterator::advance at that time
which was more capable than for_each, but that no longer exists.

The itertools crate has Itertools::foreach with the same behavior,
but thankfully the names won't collide. The rayon crate also has a
ParallelIterator::for_each where simple for loops aren't possible.

> I really wish we had for_each on seq iterators. Having to use a
> dummy operation is annoying. - @nikomatsakis
> (https://github.com/nikomatsakis/rayon/pull/367#issuecomment-308455185)

rust-highfive · 2017-06-20T22:39:14Z

r? @alexcrichton

(rust_highfive has picked a reviewer for you, use r? to override)

cuviper · 2017-06-20T22:51:16Z

@bluss I'm curious about your "interesting reasons" to use fold in Itertools::foreach. What sort of optimizations come out of that?

alexcrichton · 2017-06-20T22:51:56Z

👍 I'm game!

@rust-lang/libs, any others have thoughts?

aturon · 2017-06-21T02:33:36Z

Works for me!

sfackler · 2017-06-21T03:34:44Z

I seem to remember there being philosophical objections to this back in the day, but I don't feel particularly strongly.

scottmcm · 2017-06-21T07:04:35Z

@cuviper I think this is the context: https://medium.com/@veedrac/rust-is-slow-and-i-am-the-cure-32facc0fdcb (also https://internals.rust-lang.org/t/pre-rfc-fold-ok-is-composable-internal-iteration/4434)

cuviper · 2017-06-21T09:03:49Z

Interesting - I'll have to play around with that. And if we do want this implemented on fold, that needs to happen before stabilization.

cuviper · 2017-06-21T18:48:25Z

Woah, for_each based on fold is a clear win over a for loop when chain is involved. I'll update it and see about adding benchmarks to show this benefit, and we should figure out how to express this in the documentation.

The benefit of using internal iteration is shown in new benchmarks:

test iter::bench_for_each_chain_fold     ... bench:     635,110 ns/iter (+/- 5,135)
test iter::bench_for_each_chain_loop     ... bench:   2,249,983 ns/iter (+/- 42,001)
test iter::bench_for_each_chain_ref_fold ... bench:   2,248,061 ns/iter (+/- 51,940)
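
A minimal sketch of what "for_each based on fold" could look like. The extension trait `ForEachViaFold` and the helper `chain_sum` are hypothetical names used only for illustration; the PR puts the logic in `Iterator::for_each` itself. The idea is that adapters such as `Chain` can supply their own `fold`, so routing through `fold` lets them run one tight loop per half instead of re-checking state on every `next` call.

```rust
/// Hypothetical extension trait, here only to show the shape of the idea.
trait ForEachViaFold: Iterator + Sized {
    fn for_each_via_fold<F>(self, mut f: F)
    where
        F: FnMut(Self::Item),
    {
        // A unit accumulator turns `fold` into "run `f` on every item".
        self.fold((), move |(), item| f(item));
    }
}

// Blanket impl so every iterator gets the method.
impl<I: Iterator> ForEachViaFold for I {}

/// Sums a chained range using internal iteration.
fn chain_sum(n: u64) -> u64 {
    let mut sum = 0;
    (0..n).chain(0..n).for_each_via_fold(|x| sum += x);
    sum
}
```

This only shows the mechanism; it makes no claim about reproducing the benchmark numbers above.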

frewsxcv · 2017-06-22T02:48:33Z

src/libcore/iter/iterator.rs

+    /// #![feature(iterator_for_each)]
+    ///
+    /// let mut v = vec![];
+    /// (0..5).for_each(|x| v.push(x * 100));


i'm not necessarily opposed to the current example you have written, but i find the current example slightly less idiomatic (subjectively) than something like:

let v: Vec<_> = (0..5).map(|x| x * 100).collect();

cuviper · 2017-06-22

Sure, I was just aiming for something simple and testable. I would definitely use collect or extend for that in real code. Any ideas for something more meaningful?

Similarly, the added benchmarks are just sums.

budziq · 2017-06-24

> Any ideas for something more meaningful?

New cookbook contributors usually have trouble consuming a functional flow they have built just for the sake of its side effects (when they do not need a value, as they would from fold or collect). Switching to an imperative for loop just to get the side effects does not feel idiomatic.

Some artificial examples that might not be any better 😸

let (tx, rx) = channel();
(0..5).map(|x| x * 2 + 1).for_each(|x| { tx.send(x).unwrap(); });

["1", "2", "lol", "baz", "55"]
    .iter()
    .filter_map(|s| s.parse::<u16>().ok())
    .map(|v| v * v)
    .for_each(|v| println!("{}", v));

cuviper · 2017-06-26

@budziq I'm glad to hear of more motivation for this!

Your additional examples are OK, but still not testing anything besides successfully compiling. Note that my first example has an assert_eq with the for-loop result, so we actually get some sanity check that it really works, as trivial as that is. Your channel example could read the rx side to check the result though.

budziq · 2017-06-27

@cuviper so would something like this be OK?

use std::sync::mpsc::channel;

let (tx, rx) = channel();
(0..5).map(|x| x * 2 + 1).for_each(|x| { tx.send(x).unwrap(); });
assert_eq!(vec![1, 3, 5, 7, 9], rx.iter().take(5).collect::<Vec<_>>());

cuviper · 2017-06-27

That would work. Do folks like that better? Or perhaps as one more example?

steveklabnik · 2017-06-27T17:11:09Z

I personally really wanted a foreach but yeah, it was rejected, what's changed since then?

cuviper · 2017-06-27T17:26:23Z

@steveklabnik I think at that time the main point was that short iterators are probably better off with a for loop and more control-flow options. It wasn't properly considered for use with longer iterator chains. When your iterator takes up multiple lines, a for loop either makes awkward formatting or requires saving to a local first, both annoying.

Also, the performance benefit of internal iteration via fold is pretty compelling, and we can implement that in for_each without users having to understand it.

steveklabnik · 2017-06-27T20:38:24Z

> It wasn't properly considered for use with longer iterator chains.

I must have done a bad job advocating back then, then. Oh well.

alexcrichton · 2017-06-27T22:25:23Z

Ok sounds like there's no reason to not experiment with this at this point, @cuviper if you want to update the example (which I think the comments indicate?) then I'll r+

steveklabnik · 2017-06-27T22:44:30Z

History time!

I would argue that rust-lang/rfcs#1064 (comment) by @nagisa is a decent summary:

> So far none of the points arguing against the RFC have become false:
>
> - There’s a more general version of this function already available in the standard library as well as various chains that produce the desired behaviour;
> - There’s the for-loop construct;
> - itertools package still provides a convenience wrapper for the for-loop.

The first bullet point has changed. The other ones are still true.

cuviper · 2017-06-27T23:12:48Z

Thanks for finding more context! My search-fu is apparently weak...

I don't think we have to falsify every point against -- only compare them to the points for:

- re for loops: It looks much worse when you're working with a chain of iterators.
- for_each is more succinct for using direct functions, e.g. values.for_each(my_log)
- internal iteration with fold can be much faster, as shown in the benchmarks here.
  - but it's easier for users to write .for_each(|item| f(item))
  - rather than writing their own .fold((), |(), item| f(item))
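
To make the second and third points concrete, here is a small sketch. `my_log` is the name used in the list above, but its body, the `TOTAL` counter, and the helper `log_both_ways` are assumptions invented for this example; a real logger would print or write somewhere.

```rust
use std::sync::atomic::{AtomicU32, Ordering};

// Stand-in for a real side effect: a running total that `my_log` updates.
static TOTAL: AtomicU32 = AtomicU32::new(0);

/// Hypothetical logging function (body invented for this sketch).
fn my_log(item: u32) {
    TOTAL.fetch_add(item, Ordering::SeqCst);
}

/// Logs every value twice, once per style, and returns how much each
/// pass added to the running total.
fn log_both_ways(values: &[u32]) -> (u32, u32) {
    let start = TOTAL.load(Ordering::SeqCst);

    // A function can be passed by name; no closure is needed.
    values.iter().copied().for_each(my_log);
    let middle = TOTAL.load(Ordering::SeqCst);

    // The hand-written alternative: `fold` with a dummy unit accumulator.
    values.iter().copied().fold((), |(), item| my_log(item));
    let end = TOTAL.load(Ordering::SeqCst);

    (middle - start, end - middle)
}
```

Both passes have the same effect; the `for_each` line simply states the intent without the `()` bookkeeping.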

cuviper · 2017-06-27T23:33:39Z

In general, I feel like there are a lot of people that do want this, and the people against are just "meh".

Anyway, I updated the first example like @budziq's suggestion.

nikomatsakis · 2017-06-29T01:59:31Z

Obviously I've already been quoted at the top, but I agree with @cuviper that I think the pros outweigh the cons. I think that there is indeed new information since the last time this was discussed:

- The performance benefits of "internal iteration" were not widely discussed at the time (unless I remember incorrectly; I confess I didn't bother to click all of @steveklabnik's links, I'm just going based on my memory).
- The fact that, for parallel iteration, for_each is necessary.

The two together mean that encouraging the use of for_each when it is convenient for sequential iteration will make code faster by default, and also facilitate the conversion to parallel execution. And it's more ergonomic to boot! Seems like a win-win to me.

The main argument against is basically "TMWTDI", which -- from my POV -- is a good enough reason to stop things without compelling advantages, but not an absolute blocker.

I also think one of the points made at the time is at least somewhat false:

> There’s a more general version of this function already available in the standard library as well as various chains that produce the desired behaviour;

I presume that this is referring to fold or all? I don't consider that a real alternative. It's true that one can model for_each this way, but it's misleading and makes the code harder to read -- you have to realize that the function is being abused for something other than its intended purpose, which imo starts to defeat the point of using iterators. Moreover, as a consequence of that, code written in this style cannot be parallelized as efficiently or as well (e.g., the all combinator will waste time propagating booleans and checking for shortcircuits; fold isn't even available).
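
For readers who have not seen the workaround being criticized, this is roughly what using `all` as a stand-in looks like next to `for_each`. The function names are made up for the comparison.

```rust
/// The workaround: `all` is called only for its side effect. The trailing
/// `true` exists purely to stop it from short-circuiting, and the returned
/// boolean is thrown away.
fn collect_with_all(n: u32) -> Vec<u32> {
    let mut seen = Vec::new();
    let _ = (0..n).all(|x| {
        seen.push(x);
        true
    });
    seen
}

/// The direct form: nothing to return, nothing to ignore.
fn collect_with_for_each(n: u32) -> Vec<u32> {
    let mut seen = Vec::new();
    (0..n).for_each(|x| seen.push(x));
    seen
}
```

Both functions visit every item, but the first one makes the reader work out that the predicate is not really a predicate.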

nikomatsakis · 2017-06-29T02:02:00Z

src/libcore/iter/iterator.rs

+    ///
+    /// let (tx, rx) = channel();
+    /// (0..5).map(|x| x * 2 + 1)
+    ///       .for_each(move |x| tx.send(x).unwrap());


Is the move necessary here? I would not expect so.

cuviper · 2017-06-29

Maybe it's too sneaky, but that lets tx drop automatically, and then rx won't block.
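
A short sketch of why the `move` matters, wrapping the snippet under review in a function (the name `send_odds` is invented here). Without `move`, `tx` would still be alive after the chain finishes, and draining `rx` with an unbounded `iter()` would wait forever for more messages.

```rust
use std::sync::mpsc::channel;

/// Sends five odd numbers through a channel and reads them all back.
fn send_odds() -> Vec<i32> {
    let (tx, rx) = channel();

    // `move` transfers ownership of `tx` into the closure. The closure is
    // consumed by `for_each`, so the sender is dropped when the call returns.
    (0..5).map(|x| x * 2 + 1)
          .for_each(move |x| tx.send(x).unwrap());

    // With the only sender gone, `rx.iter()` ends after the queued items
    // instead of blocking.
    rx.iter().collect()
}
```

This is why the version with `move` does not need the `take(5)` used in the earlier suggestion.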

alexcrichton · 2017-06-30T06:34:54Z

@bors: r+

bors · 2017-06-30T06:34:54Z

📌 Commit e72ee6e has been approved by alexcrichton

bors · 2017-06-30T09:15:27Z

⌛ Testing commit e72ee6e with merge 919c4a6...


bors · 2017-06-30T11:42:03Z

☀️ Test successful - status-appveyor, status-travis
Approved by: alexcrichton
Pushing 919c4a6 to master...

cuviper · 2017-06-30T16:29:45Z

Yay!

Now, since I left it unstable with issue = "0", do we need to open an issue and update that with a PR?

Mark-Simulacrum · 2017-06-30T16:43:06Z

Oh, yes, please do. We probably shouldn't have merged yet actually, but no worries!

cuviper · 2017-06-30T17:23:51Z

See #42987 for that update.

phimuemue · 2018-04-02T17:46:48Z

Hi, I stumbled upon this, but I wondered why for_each does not support break. I hope I don't miss something essential, but I imagined it would be no problem for for_each to take a function returning a value that determines whether to break or not.

This value could be - as it is now - a () (never causing the loop to break) or a bool indicating whether we want to break or an Option<T> indicating whether we want to "break with a certain item".

In code, this could be captured by a trait BreakIndicator as follows:

trait BreakIndicator {
    fn is_break(&self) -> bool;
    fn final_continue() -> Self;
}

impl BreakIndicator for () {
    // always continue
    fn is_break(&self) -> bool { false }
    fn final_continue() -> () {}
}

impl BreakIndicator for bool {
    // true means break; false means continue
    fn is_break(&self) -> bool { *self }
    fn final_continue() -> bool { false }
}

impl<T> BreakIndicator for Option<T> {
    // Some(v) means "break with value v"; None means continue
    fn is_break(&self) -> bool { self.is_some() }
    fn final_continue() -> Option<T> { None }
}

I implemented a fn breaking_for_each on top of Iterator to see how that would work out:

trait IteratorWithBreakingForEach : Iterator {
    fn breaking_for_each<F, BI>(self, f: F) -> BI
        where F: FnMut(Self::Item) -> BI,
              BI: BreakIndicator,
    ;
}

impl<I> IteratorWithBreakingForEach for I where I: Iterator {
    fn breaking_for_each<F, BI>(self, mut f: F) -> BI
        where F: FnMut(Self::Item) -> BI,
              BI: BreakIndicator,
    {
        for item in self {
            let break_indicator = f(item);
            if break_indicator.is_break() {
                return break_indicator;
            }
        }
        BI::final_continue()
    }
}

Usage could be as follows:

fn main() {
    (1..10).breaking_for_each(|i| {
        println!("{}", i) // does not break at all
    });
    (1..10).breaking_for_each(|i| {
        println!("{}", i);
        i>=5 // break if i>=5
    });
    let x = (1..10).breaking_for_each(|i| {
        println!("{}", i);
        if i>=5 {
            Some(i) // break with value
        } else {
            None // continue
        }
    });
    println!("{:?}", x);
}

Has this ever been thought about? And if so, why was it apparently rejected?

cuviper · 2018-04-02T17:54:10Z

@phimuemue There's a form of that built around the Try trait with try_for_each (docs). There's also some discussion in #42327 (comment) whether Try should be re-framed more like Break/Continue.
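
As a hedged illustration of the `try_for_each` approach, here is a sketch that stops at the first item that fails to parse. The function name `sum_until_error` is hypothetical, and the example assumes a toolchain where `try_for_each` is available.

```rust
use std::num::ParseIntError;

/// Adds up the inputs until one fails to parse. Returns the partial sum
/// together with the outcome of the iteration.
fn sum_until_error(inputs: &[&str]) -> (u32, Result<(), ParseIntError>) {
    let mut sum = 0;
    let result = inputs
        .iter()
        .try_for_each(|s| -> Result<(), ParseIntError> {
            // `?` returns the error from the closure, which makes
            // `try_for_each` stop: the equivalent of `break`.
            let n: u32 = s.parse()?;
            sum += n;
            Ok(()) // the equivalent of `continue`
        });
    (sum, result)
}
```

With `["1", "2", "x", "4"]` the sum should be 3 and the result an `Err`, since iteration stops at `"x"` and never reaches `"4"`.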

scottmcm · 2018-04-02T18:12:35Z

@phimuemue As an example of "break with a certain item", check out how find is implemented:

rust/src/libcore/iter/iterator.rs

Lines 1738 to 1746 in ab8b961

    fn find<P>(&mut self, mut predicate: P) -> Option<Self::Item> where
        Self: Sized,
        P: FnMut(&Self::Item) -> bool,
    {
        self.try_for_each(move |x| {
            if predicate(&x) { LoopState::Break(x) }
            else { LoopState::Continue(()) }
        }).break_value()
    }

(That LoopState type is currently internal, but personally I'd like Try to use it, as @cuviper said.)

If you wanted to do the same in your code today†, it can be done with Result like this:

    self.try_for_each(move |x| { 
        if predicate(&x) { Err(x) } 
        else { Ok(()) } 
    }).err()

† Well, once someone makes a stabilization PR for try_for_each...

scottmcm · 2022-04-19T15:46:51Z

Hello 4 years later! In case anyone else ends up here, the LoopState type mentioned in my previous comment is now available on stable as https://doc.rust-lang.org/stable/std/ops/enum.ControlFlow.html
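
For anyone landing here, a sketch of the earlier `find` example rewritten against the stable `ControlFlow` type. The free function `find_with_control_flow` is a made-up name, not the standard library's implementation, and it uses a plain `match` at the end in place of a helper method to stay conservative about which `ControlFlow` methods are stable.

```rust
use std::ops::ControlFlow;

/// Returns the first item matching `predicate`, using `try_for_each`
/// with `ControlFlow` to "break with a value".
fn find_with_control_flow<I, P>(mut iter: I, mut predicate: P) -> Option<I::Item>
where
    I: Iterator,
    P: FnMut(&I::Item) -> bool,
{
    let flow = iter.try_for_each(|x| {
        if predicate(&x) {
            ControlFlow::Break(x) // stop and hand the item back
        } else {
            ControlFlow::Continue(()) // keep going
        }
    });
    match flow {
        ControlFlow::Break(x) => Some(x),
        ControlFlow::Continue(()) => None,
    }
}
```

For example, `find_with_control_flow(1..10, |&i| i >= 5)` should return `Some(5)`, matching the "break with value" case from the `breaking_for_each` proposal above.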

rust-highfive assigned alexcrichton Jun 20, 2017

alexcrichton added the S-waiting-on-review and T-libs-api labels Jun 20, 2017

cuviper force-pushed the iterator_for_each branch from c5c238b to 4a8ddac Compare June 21, 2017 20:22

frewsxcv reviewed Jun 22, 2017


Use a little more compelling example of for_each

e72ee6e

nikomatsakis reviewed Jun 29, 2017


bors merged commit e72ee6e into rust-lang:master Jun 30, 2017

cuviper mentioned this pull request Jun 30, 2017

Tracking issue for the iterator_for_each library feature #42986

Closed

cuviper mentioned this pull request Jun 30, 2017

Consider adding a foreach-like iterator adapter rust-lang/rfcs#1070

Closed

arielb1 mentioned this pull request Aug 5, 2017

foreach-0.2.0 beta regression #43666

Closed

cuviper mentioned this pull request Apr 16, 2018

Add Iterator::exhaust #49990

Closed
