Skip to content

Commit

Permalink
Adds a BERT feature extractor (#116)
Browse files Browse the repository at this point in the history
* Initial commit of BERTFeatureExtractor.

* Updates for the integrated version of WordpieceTokenizer.

* Repackaging org.tribuo.interop.onnx.bert to org.tribuo.interop.onnx.extractors.

* BERTFeatureExtractor now implements TextPipeline, so you can use it in a RowProcessor via TextFieldProcessor.

* Adding hugginface transformers to THIRD_PARTY_LICENSES.txt

* Adding copyright statements.

* Adding another constructor to BERTFeatureExtractor.

* Adding a test for example extraction.

* Relaxing the fp comparison.

* Fixing review comments.
  • Loading branch information
Craigacp authored Apr 2, 2021
1 parent 892afa1 commit 95ec1c1
Show file tree
Hide file tree
Showing 9 changed files with 61,253 additions and 0 deletions.
10 changes: 10 additions & 0 deletions Interop/ONNX/pom.xml
Original file line number Diff line number Diff line change
Expand Up @@ -52,6 +52,16 @@
<artifactId>tribuo-regression-core</artifactId>
<version>${project.version}</version>
</dependency>
<dependency>
<groupId>${project.groupId}</groupId>
<artifactId>tribuo-json</artifactId>
<version>${project.version}</version>
</dependency>
<dependency>
<groupId>${project.groupId}</groupId>
<artifactId>tribuo-util-tokenization</artifactId>
<version>${project.version}</version>
</dependency>
<dependency>
<groupId>com.microsoft.onnxruntime</groupId>
<artifactId>onnxruntime</artifactId>
Expand Down
Loading

0 comments on commit 95ec1c1

Please sign in to comment.