NLPA

Latest

moncho-mendez released this 31 Jul 10:04

· 85 commits to master since this release

1.0.0

3e49d87

NLPA is a framework designed to operate in conjuction with BDP4J (https://github.com/sing-group/bdp4j) and able to extract texts from Twitter, Youtube Comments, text files, raw email files (.eml) or WARC (Web Archive) files. The extracted text can be preprocessed into a Dataset using task (org.bdp4j.pipe.Pipe) definitions. This framework incorporates more than 30 preprocessing tasks to transform the text.

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

NLPA