Skip to content

Commit

Permalink
[ASR]cherrypick change optimizer and fix import error, test=asr (#3049)
Browse files Browse the repository at this point in the history
* optional tokenizer and fix some doc

* cherry pick with pr:3040
  • Loading branch information
zxcd authored Mar 15, 2023
1 parent 4d1787d commit d103cb8
Show file tree
Hide file tree
Showing 4 changed files with 4 additions and 3 deletions.
1 change: 1 addition & 0 deletions examples/aishell/asr3/RESULT.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,6 +4,7 @@

* paddle version: develop (commit id: daea892c67e85da91906864de40ce9f6f1b893ae)
* paddlespeech version: develop (commit id: c14b4238b256693281e59605abff7c9435b3e2b2)
* paddlenlp version: 2.5.2

## Device
* python: 3.7
Expand Down
2 changes: 1 addition & 1 deletion examples/aishell/asr3/conf/train_with_wav2vec.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -83,7 +83,7 @@ dnn_neurons: 1024
freeze_wav2vec: False
dropout: 0.15

tokenizer: !apply:transformers.BertTokenizer.from_pretrained
tokenizer: !apply:paddlenlp.transformers.AutoTokenizer.from_pretrained
pretrained_model_name_or_path: bert-base-chinese
# bert-base-chinese tokens length
output_neurons: 21128
Expand Down
2 changes: 1 addition & 1 deletion examples/aishell/asr3/local/aishell_prepare.py
Original file line number Diff line number Diff line change
Expand Up @@ -21,7 +21,7 @@
import logging
import os

from paddlespeech.s2t.models.wav2vec2.io.dataio import read_audio
from paddlespeech.s2t.io.speechbrain.dataio import read_audio

logger = logging.getLogger(__name__)

Expand Down
2 changes: 1 addition & 1 deletion examples/aishell/asr3/local/data.sh
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@
#!/bin/bash

stage=-1
stop_stage=-1
stop_stage=3
dict_dir=data/lang_char

. ${MAIN_ROOT}/utils/parse_options.sh || exit -1;
Expand Down

0 comments on commit d103cb8

Please sign in to comment.