[cli/paraformer] ali-paraformer inference #2067

Mddct · 2023-10-20T07:04:57Z

TODO: (in this pr)

TODO：（future pr）

自动化导出（下载阿里模型）包含：词典，cmvn [paraformer］paraformer auto exported #2096
fintune or train from ali paraformer [paraformer] support fintune #2139

NOTE： streaming paraformer not in current plan

Mddct · 2023-10-24T10:37:36Z

cli work

Mddct · 2023-10-30T09:27:05Z

recognize.py works

Mddct · 2023-10-30T10:43:31Z

decode info: batch_size=100， beam_size=10
aishell:

model	greedy_search	beam_search
wenet-ali-paraformer	1.96	1.96

decode info: batch_size=1， beam_size=10
aishell:

model	greedy_search	beam_search
wenet-ali-paraformer	1.95	1.95
funasr-paraformer	1.95	/

Mddct · 2023-10-30T14:34:37Z

confidence in search.py works

TODO(next):

resolve conflict to merge main

Mddct · 2023-10-30T15:01:15Z

it works!

robin1001 · 2023-10-30T15:07:32Z

assets 是必须的吗？是否可以放到模型中？

wenet/cli/transcribe.py

Mddct · 2023-10-30T15:21:15Z

assets 是必须的吗？是否可以放到模型中？

目前cmvn 这些已经在模型里边力，不过funasr的conf和wenet conf 格式不一样

导出模型的时候，需要用assets里边的文件，目前是我脚本转的

可以等后边自动从funasr里边的conf力转成wenet格式，然后再删掉

donstang · 2024-04-29T07:16:04Z

streaming paraformer 有计划支持吗？

Mddct · 2024-04-29T09:48:12Z

streaming paraformer 有计划支持吗？

暂时没计划， streaming的paraformer指标上差了一些也不是理想中的流模型，

感兴趣的话可以请关注wenetspeech2.0的进展

TeaPoly · 2024-06-02T04:28:37Z

wenet/cli/paraformer_model.py

+    def transcribe(self, audio_file: str, tokens_info: bool = False) -> dict:
+        waveform, sample_rate = torchaudio.load(audio_file, normalize=False)
+        waveform = waveform.to(torch.float)
+        feats = kaldi.fbank(waveform,


The default window in the FunASR frontend is hamming. You can find more details here. However, the default window in kaldi.fbank is povey, as specified here. This different window maybe a little mismatch. As mentioned in line 44 of this document:

"povey" is a window I made to be similar to Hamming but to go to zero at the edges, it's pow((0.5 - 0.5cos(n/N2*pi)), 0.85) I just don't think the Hamming window makes sense as a windowing function.

[cli/paraformer] ali-paraformer load and infer work

000c7af

Mddct changed the title ~~[cli/paraformer] ali-paraformer load and infer work~~ [cli/paraformer] ali-paraformer load and infer work [WIP] Oct 20, 2023

fix lint

6967ef5

Mddct force-pushed the Mddct-cli-paraformer branch from 579e9ff to 6967ef5 Compare October 20, 2023 07:13

Mddct marked this pull request as ready for review October 20, 2023 07:14

robin1001 mentioned this pull request Oct 20, 2023

迷你项目：python cli(command line interface) and api(application programming interface) #2069

Closed

12 tasks

Mddct force-pushed the Mddct-cli-paraformer branch from c3ca2e8 to 87906d5 Compare October 23, 2023 10:29

export jit and load work

c8cccdc

Mddct force-pushed the Mddct-cli-paraformer branch from 87906d5 to c8cccdc Compare October 23, 2023 11:05

Mddct added 2 commits October 23, 2023 19:45

reuse init_model.py

852165c

mv the intermediate files to the assets directory

e8fa013

Mddct force-pushed the Mddct-cli-paraformer branch from 3151899 to e8fa013 Compare October 23, 2023 11:56

Mddct added 2 commits October 23, 2023 20:21

merge main

0a98847

model.decodde work && recognize.py work

e987b00

Mddct force-pushed the Mddct-cli-paraformer branch from 0658057 to e987b00 Compare October 23, 2023 12:29

rm positionwise_feed_forward.py/lfr.py

3d25e2e

Mddct force-pushed the Mddct-cli-paraformer branch from 7ed212b to 3d25e2e Compare October 23, 2023 16:34

Mddct added 5 commits October 24, 2023 14:42

Merge branch 'main' into Mddct-cli-paraformer

aedd43b

refactor search

d264297

merge main

3f45af7

merge main

30b5677

cli work

20146f4

fix lint

e3ec8e7

Mddct changed the title ~~[cli/paraformer] ali-paraformer load and infer work [WIP]~~ [cli/paraformer] ali-paraformer inference Oct 24, 2023

fix att mask && batch infer

b1b44df

search confidence works

cd9c659

Mddct added 2 commits October 30, 2023 22:42

merge main

0b0eea7

merge main

c11aefe

Mddct requested review from robin1001 and xingchensong October 30, 2023 15:02

fix linux dtype

daff617

robin1001 reviewed Oct 30, 2023

View reviewed changes

wenet/cli/transcribe.py Outdated Show resolved Hide resolved

fix label type

9e810d1

revert init_model.py and add init_model in export_jit

b765c12

Mddct force-pushed the Mddct-cli-paraformer branch from bd2ea92 to b765c12 Compare October 30, 2023 15:38

robin1001 approved these changes Oct 30, 2023

View reviewed changes

robin1001 merged commit af1315c into main Oct 30, 2023
5 of 6 checks passed

robin1001 deleted the Mddct-cli-paraformer branch October 30, 2023 15:39

xingchensong mentioned this pull request Nov 1, 2023

中文开源语音大模型计划 #2097

Open

14 tasks

Mddct mentioned this pull request Nov 18, 2023

［feats/llm］语音大模型背景下的llm集成 #2142

Open

16 tasks

Mddct mentioned this pull request Jan 18, 2024

[feats] 权重迁移计划 #2305

Closed

3 tasks

TeaPoly reviewed Jun 2, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[cli/paraformer] ali-paraformer inference #2067

[cli/paraformer] ali-paraformer inference #2067

Mddct commented Oct 20, 2023 •

edited

Loading

Mddct commented Oct 24, 2023

Mddct commented Oct 30, 2023 •

edited

Loading

Mddct commented Oct 30, 2023 •

edited

Loading

Mddct commented Oct 30, 2023 •

edited

Loading

Mddct commented Oct 30, 2023

robin1001 commented Oct 30, 2023

Mddct commented Oct 30, 2023 •

edited

Loading

donstang commented Apr 29, 2024

Mddct commented Apr 29, 2024

TeaPoly Jun 2, 2024

Mddct Jun 3, 2024

[cli/paraformer] ali-paraformer inference #2067

[cli/paraformer] ali-paraformer inference #2067

Conversation

Mddct commented Oct 20, 2023 • edited Loading

Mddct commented Oct 24, 2023

Mddct commented Oct 30, 2023 • edited Loading

Mddct commented Oct 30, 2023 • edited Loading

Mddct commented Oct 30, 2023 • edited Loading

Mddct commented Oct 30, 2023

robin1001 commented Oct 30, 2023

Mddct commented Oct 30, 2023 • edited Loading

donstang commented Apr 29, 2024

Mddct commented Apr 29, 2024

TeaPoly Jun 2, 2024

Choose a reason for hiding this comment

Mddct Jun 3, 2024

Choose a reason for hiding this comment

Mddct commented Oct 20, 2023 •

edited

Loading

Mddct commented Oct 30, 2023 •

edited

Loading

Mddct commented Oct 30, 2023 •

edited

Loading

Mddct commented Oct 30, 2023 •

edited

Loading

Mddct commented Oct 30, 2023 •

edited

Loading