When journalists ask their audience for help, success creates a whole new problem: what do you do with thousands of tips?
Or what do you do with thousands of textual descriptions of … anything: potholes, disciplinary actions at prisons, aircraft safety incidents? There are too many to really read.
And any time you feel "there are too many to really read," that's when you should consider getting help from machine learning.
Here's how we did that. There's an IPython notebook in this repo; we also have a non-technical blog post you can read.