Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CLAP] Add CLAP to the library #21370

Merged
merged 227 commits into from
Feb 16, 2023
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
227 commits
Select commit Hold shift + click to select a range
c6b06b5
add model like clip
ArthurZucker Jan 30, 2023
7547c82
update
ArthurZucker Jan 30, 2023
23c56ac
text model ok
younesbelkada Jan 30, 2023
405e653
Merge branch 'add-clap-model' of https://github.com/ArthurZucker/tran…
younesbelkada Jan 30, 2023
1771782
clap text works
younesbelkada Jan 30, 2023
9b276aa
some refactor
younesbelkada Jan 31, 2023
45c36ba
more refactor
younesbelkada Jan 31, 2023
abee382
more refactor
younesbelkada Jan 31, 2023
a7219ec
more refactor
younesbelkada Jan 31, 2023
c553315
correct fusion
younesbelkada Jan 31, 2023
4360623
more refactor
younesbelkada Jan 31, 2023
e3aff6f
new modules
younesbelkada Jan 31, 2023
00eb73b
add basic processor
ArthurZucker Feb 2, 2023
45e7ce9
fixup
ArthurZucker Feb 2, 2023
b5c483f
remove whisper copioed from
ArthurZucker Feb 2, 2023
4084369
Merge branch 'add-clap-model' of https://github.com/ArthurZucker/tran…
ArthurZucker Feb 2, 2023
fc0d323
audio logits match
younesbelkada Feb 2, 2023
8a27723
add doc
ArthurZucker Feb 2, 2023
27f133f
correct filters mel and add maxlength
ArthurZucker Feb 2, 2023
5ddc2f3
style
ArthurZucker Feb 2, 2023
fbb6124
Merge branch 'add-clap-model' of https://github.com/ArthurZucker/tran…
ArthurZucker Feb 2, 2023
815c5ce
few fixes
younesbelkada Feb 2, 2023
95c400b
forward passes
ArthurZucker Feb 2, 2023
a41ff1a
fixup
ArthurZucker Feb 2, 2023
38f7fe8
Merge branch 'add-clap-model' of https://github.com/ArthurZucker/tran…
ArthurZucker Feb 2, 2023
275633c
fixup
ArthurZucker Feb 2, 2023
1b3a820
some clean up
younesbelkada Feb 2, 2023
8fed2d0
remove mels form the dictionnary
ArthurZucker Feb 2, 2023
2684c70
Merge branch 'add-clap-model' of https://github.com/ArthurZucker/tran…
ArthurZucker Feb 2, 2023
6b9051c
pad after the repeat
ArthurZucker Feb 2, 2023
a468f8d
update padding when dsmaller
ArthurZucker Feb 2, 2023
844540b
fix padding
ArthurZucker Feb 2, 2023
aeb340e
style
ArthurZucker Feb 2, 2023
5eaa517
use swin patch merging
younesbelkada Feb 2, 2023
1182287
use copied from swin
younesbelkada Feb 2, 2023
1a4caac
processor with any tokenizer
ArthurZucker Feb 2, 2023
5e604a8
Merge branch 'add-clap-model' of https://github.com/ArthurZucker/tran…
ArthurZucker Feb 2, 2023
e94c9dd
more copied from
younesbelkada Feb 2, 2023
e9ff994
some clean up
younesbelkada Feb 2, 2023
7dec8bc
more refactor
younesbelkada Feb 3, 2023
c352948
fix mel when rand_trunc
ArthurZucker Feb 3, 2023
f2d6ffe
style
ArthurZucker Feb 3, 2023
e71749c
remove unused imports
ArthurZucker Feb 3, 2023
d44efd1
Merge branch 'add-clap-model' of https://github.com/ArthurZucker/tran…
ArthurZucker Feb 3, 2023
f1627c6
update processing
ArthurZucker Feb 3, 2023
612e46f
remove image processing tests
ArthurZucker Feb 3, 2023
c3042e2
add testing fiel
ArthurZucker Feb 3, 2023
68cbadb
fixmodeling issues
ArthurZucker Feb 3, 2023
c20c66e
replace with `is_longer`
younesbelkada Feb 3, 2023
9c03ac7
clap in serialization
ArthurZucker Feb 4, 2023
6717d72
more refactor
younesbelkada Feb 6, 2023
a38a016
`make fixup`
younesbelkada Feb 6, 2023
f958fa1
make fixup
younesbelkada Feb 6, 2023
c218e3c
fix feature extractor
ArthurZucker Feb 6, 2023
65cfca4
update test feature extractor
ArthurZucker Feb 6, 2023
6f759c5
Merge branch 'add-clap-model' of https://github.com/ArthurZucker/tran…
ArthurZucker Feb 6, 2023
50613fe
`make fixup`
younesbelkada Feb 6, 2023
b1f46dc
clean up config
younesbelkada Feb 6, 2023
5115057
more clean up
younesbelkada Feb 6, 2023
833c061
more cleanup
younesbelkada Feb 6, 2023
2f99f82
Merge branch 'main' into add-clap-model
younesbelkada Feb 6, 2023
8ee3051
update tests
ArthurZucker Feb 6, 2023
e472482
refactor tests and inits
ArthurZucker Feb 6, 2023
b0f63c2
removeCLAP vision config
ArthurZucker Feb 6, 2023
21b2b94
remove CLAP from image procssing auto and dummy vision objects
ArthurZucker Feb 6, 2023
506772e
update inits
ArthurZucker Feb 6, 2023
6efadbf
style
ArthurZucker Feb 6, 2023
972f45e
re order classes in modeling clap
ArthurZucker Feb 6, 2023
c7b6cb9
Use roberta tokenizer as the other weights are not open sourced
ArthurZucker Feb 7, 2023
13ab798
Merge branch 'main' of https://github.com/huggingface/transformers in…
ArthurZucker Feb 7, 2023
48b346a
small cleaup
ArthurZucker Feb 7, 2023
d299422
remove tokenization CLAP
ArthurZucker Feb 7, 2023
f627ab9
processor tokenizr is roberta
ArthurZucker Feb 7, 2023
fdcfed1
update feature extraction doc
ArthurZucker Feb 7, 2023
3dc78e8
remove vclap from model zero shot
ArthurZucker Feb 7, 2023
0fef96f
update f_min and f_max to frequency_xx
ArthurZucker Feb 7, 2023
0515a18
some changes
younesbelkada Feb 7, 2023
0dcff26
make fixup
younesbelkada Feb 7, 2023
48d611b
Merge branch 'add-clap-model' of https://github.com/ArthurZucker/tran…
ArthurZucker Feb 7, 2023
fe43210
consistent behavior ebtween rand_crop and fusion
ArthurZucker Feb 7, 2023
e44c75e
add numpy resize and bilinear and documentation
ArthurZucker Feb 7, 2023
43df206
move resizing to image utils
ArthurZucker Feb 7, 2023
5379926
clean feature extraction
ArthurZucker Feb 7, 2023
107c544
import resize from correct file
ArthurZucker Feb 7, 2023
7eb278d
resize in image transforms
ArthurZucker Feb 7, 2023
a4c1940
update
ArthurZucker Feb 7, 2023
9594cc5
style
ArthurZucker Feb 7, 2023
51d2b64
style
ArthurZucker Feb 7, 2023
ce79e64
nit
ArthurZucker Feb 7, 2023
1bfd7dd
remove unused arguments form the feature extractor
ArthurZucker Feb 7, 2023
6794f16
style
ArthurZucker Feb 7, 2023
1b7b39d
few fixes + make fixup
younesbelkada Feb 8, 2023
552aee2
oops
younesbelkada Feb 8, 2023
05674c7
fix more tests
younesbelkada Feb 8, 2023
b8a4e44
Merge branch 'add-clap-model' of https://github.com/ArthurZucker/tran…
ArthurZucker Feb 8, 2023
74d3c4f
add zero shot audio classification pipeline
ArthurZucker Feb 8, 2023
cad3f42
update zeroshot classification pipeline
ArthurZucker Feb 8, 2023
f0d2194
fixup
ArthurZucker Feb 8, 2023
754bed1
fix copies
ArthurZucker Feb 8, 2023
5f0e467
all CI tests pass
younesbelkada Feb 8, 2023
47f714f
make fixup + fix docs
younesbelkada Feb 8, 2023
7d3ede5
fix docs
younesbelkada Feb 8, 2023
4e5db4b
fix docs
younesbelkada Feb 8, 2023
5d23429
update tests pip;eline
ArthurZucker Feb 8, 2023
aaa68ce
update zero shot pipeline
ArthurZucker Feb 8, 2023
c8e9c73
update feature extraction clap
ArthurZucker Feb 8, 2023
c54fce3
update tokenization auto
ArthurZucker Feb 8, 2023
95f1352
Merge branch 'add-clap-model' of https://github.com/ArthurZucker/tran…
ArthurZucker Feb 8, 2023
4debc3d
use nested simplify
ArthurZucker Feb 8, 2023
0be1e66
update pipeline tests
ArthurZucker Feb 8, 2023
c2e207b
Apply suggestions from code review
younesbelkada Feb 8, 2023
e47c696
split in two lines
younesbelkada Feb 8, 2023
dbab96e
fixes
younesbelkada Feb 8, 2023
9c83ff7
refactor
younesbelkada Feb 8, 2023
b5b929a
clean up
younesbelkada Feb 8, 2023
5e3aa5f
add integration tests
younesbelkada Feb 9, 2023
2ea06b8
update config docstring
ArthurZucker Feb 9, 2023
bc20ff4
style
ArthurZucker Feb 9, 2023
d5fa7b1
Merge branch 'add-clap-model' of https://github.com/ArthurZucker/tran…
ArthurZucker Feb 9, 2023
312dd92
update processor
ArthurZucker Feb 9, 2023
3cd99bf
fix processor test
younesbelkada Feb 9, 2023
b6b1dd0
fix feat extractor tests
younesbelkada Feb 9, 2023
1ce1a25
update docs
ArthurZucker Feb 9, 2023
6b451f7
Merge branch 'add-clap-model' of https://github.com/ArthurZucker/tran…
ArthurZucker Feb 9, 2023
d3b2d94
Apply suggestions from code review
younesbelkada Feb 9, 2023
7ed1747
fix readmes
younesbelkada Feb 9, 2023
6b2a6db
fix tips
younesbelkada Feb 9, 2023
b6fa932
Update src/transformers/models/auto/configuration_auto.py
ArthurZucker Feb 9, 2023
62ffdf7
update doc and remove todo -> properly explained
ArthurZucker Feb 9, 2023
1593f3c
Merge branch 'add-clap-model' of https://github.com/ArthurZucker/tran…
ArthurZucker Feb 9, 2023
8868e3c
fix idx and typo
ArthurZucker Feb 9, 2023
9b9bd4b
typoe
ArthurZucker Feb 9, 2023
9460eaa
cleanup config
ArthurZucker Feb 9, 2023
82f4ede
cleanup tests, styles and doc
ArthurZucker Feb 9, 2023
d90a9f2
ignore docstyle on image transform
ArthurZucker Feb 9, 2023
ff25a45
add conversion script
younesbelkada Feb 9, 2023
a6c3cc2
remove the `clap` indx in favor of `CLAP`
ArthurZucker Feb 9, 2023
79e5845
Merge branch 'add-clap-model' of https://github.com/ArthurZucker/tra…
ArthurZucker Feb 9, 2023
01bb870
update __init
ArthurZucker Feb 9, 2023
1b2c514
nits
ArthurZucker Feb 9, 2023
c805bae
Update src/transformers/pipelines/__init__.py
ArthurZucker Feb 9, 2023
640f6f9
fix bug
younesbelkada Feb 9, 2023
2b8f466
Merge branch 'add-clap-model' of https://github.com/ArthurZucker/tran…
younesbelkada Feb 9, 2023
177a705
clarifiy config
younesbelkada Feb 9, 2023
57fd58c
fix copy
younesbelkada Feb 9, 2023
3a9f5ac
Merge branch 'main' into add-clap-model
ArthurZucker Feb 9, 2023
0874aba
fix init
younesbelkada Feb 9, 2023
31d3204
Apply suggestions from code review
younesbelkada Feb 10, 2023
948eb0a
fix model output
younesbelkada Feb 10, 2023
f75c02a
fix comment
younesbelkada Feb 10, 2023
7d7a000
make fixup
younesbelkada Feb 10, 2023
63d073a
make fixup
younesbelkada Feb 10, 2023
3bea427
rename to `Clap`
younesbelkada Feb 10, 2023
c20981d
replace to `Clap`
younesbelkada Feb 10, 2023
6c31509
replace to `Clap`
younesbelkada Feb 10, 2023
fb24c4e
repo consistency
younesbelkada Feb 10, 2023
bd29fec
again repo-consistency
younesbelkada Feb 10, 2023
5cf2fa8
make fixup
younesbelkada Feb 10, 2023
90e4c03
Apply suggestions from code review
ArthurZucker Feb 10, 2023
729fa51
add config
younesbelkada Feb 13, 2023
df70e2c
changes
younesbelkada Feb 13, 2023
7ebeacf
update conversion
younesbelkada Feb 13, 2023
3be208e
Apply suggestions from code review
younesbelkada Feb 13, 2023
505c623
remove unused function
younesbelkada Feb 13, 2023
efb526e
update based on code reviews
ArthurZucker Feb 13, 2023
7d1c2d3
Merge branch 'add-clap-model' of https://github.com/ArthurZucker/tran…
ArthurZucker Feb 13, 2023
2d57cfc
style
ArthurZucker Feb 13, 2023
441e247
more comments
younesbelkada Feb 13, 2023
1d82be5
Merge branch 'add-clap-model' of https://github.com/ArthurZucker/tran…
younesbelkada Feb 13, 2023
51b3dde
cleanup
younesbelkada Feb 13, 2023
8d5e5ad
clean up
younesbelkada Feb 13, 2023
38ce551
style
ArthurZucker Feb 13, 2023
7b38c76
Merge branch 'add-clap-model' of https://github.com/ArthurZucker/tran…
ArthurZucker Feb 13, 2023
aa85941
apply suggestions
younesbelkada Feb 13, 2023
a45b221
Empty commit
ArthurZucker Feb 13, 2023
b9ee298
pipeline will be added in a different PR
ArthurZucker Feb 13, 2023
a47f063
update calls to audio utils functions
ArthurZucker Feb 13, 2023
9eb1ab4
update pipeline init
ArthurZucker Feb 13, 2023
fd1957b
style
ArthurZucker Feb 13, 2023
773b225
style
ArthurZucker Feb 13, 2023
c7a7cf6
styling again
ArthurZucker Feb 13, 2023
c970399
use pad
younesbelkada Feb 13, 2023
21f60ce
fix repo-consistency
ArthurZucker Feb 13, 2023
9ba34cd
Merge branch 'add-clap-model' of https://github.com/ArthurZucker/tran…
ArthurZucker Feb 13, 2023
9ca1780
update utils and add doc for audio utils
ArthurZucker Feb 13, 2023
0e0990a
clean up resize by using torch. update inits accordingly
ArthurZucker Feb 13, 2023
1c82889
style
ArthurZucker Feb 13, 2023
626a664
CLap's tokenizer is RobertA
ArthurZucker Feb 13, 2023
a1471b6
add audio utils to internal toctreee
ArthurZucker Feb 13, 2023
7ed4f0f
update totctree
ArthurZucker Feb 13, 2023
c07ab1e
style
ArthurZucker Feb 13, 2023
291b8f8
update documentation and normalize naming accross audio utils and fea…
ArthurZucker Feb 13, 2023
80a6212
style
ArthurZucker Feb 13, 2023
b29ee04
clean up
younesbelkada Feb 13, 2023
56d2b00
update doc and typos
ArthurZucker Feb 13, 2023
556c86f
Merge branch 'add-clap-model' of https://github.com/ArthurZucker/tran…
ArthurZucker Feb 13, 2023
a5d7cd1
fix doctest
younesbelkada Feb 13, 2023
d5376ea
update modelin code, got rid of a lot of reshaping
ArthurZucker Feb 13, 2023
f4b0441
style on added doc audio utils
ArthurZucker Feb 13, 2023
48b98d0
Merge branch 'add-clap-model' of https://github.com/ArthurZucker/tran…
ArthurZucker Feb 13, 2023
ad82198
update modeling clap
ArthurZucker Feb 13, 2023
6ed76fa
style
ArthurZucker Feb 13, 2023
af97ade
Apply suggestions from code review
ArthurZucker Feb 14, 2023
c5749da
docstringvariables with CLAP
ArthurZucker Feb 14, 2023
fd0bd47
rename key
younesbelkada Feb 14, 2023
13f41eb
update modeling CLAP
ArthurZucker Feb 14, 2023
7865442
Merge branch 'add-clap-model' of https://github.com/ArthurZucker/tran…
ArthurZucker Feb 14, 2023
2655e13
update audio utils docstring
ArthurZucker Feb 14, 2023
dbe50a7
update processing clap
ArthurZucker Feb 14, 2023
34255aa
fix readmes
younesbelkada Feb 14, 2023
c4cdd97
Merge branch 'add-clap-model' of https://github.com/ArthurZucker/tran…
younesbelkada Feb 14, 2023
de162eb
fix toctree
younesbelkada Feb 14, 2023
83d0716
udpate configuration clap
ArthurZucker Feb 14, 2023
0a25e75
Merge branch 'add-clap-model' of https://github.com/ArthurZucker/tran…
ArthurZucker Feb 14, 2023
230b516
fix init
younesbelkada Feb 14, 2023
f379031
make fixup
younesbelkada Feb 14, 2023
fe1fbe3
fix
younesbelkada Feb 14, 2023
75171e3
fix
younesbelkada Feb 14, 2023
2fba86b
update naming
ArthurZucker Feb 14, 2023
d75a5f8
Merge branch 'add-clap-model' of https://github.com/ArthurZucker/tran…
ArthurZucker Feb 14, 2023
c221e1d
update
ArthurZucker Feb 16, 2023
25610ce
update checkpoint path
ArthurZucker Feb 14, 2023
6856ff0
Apply suggestions from code review
ArthurZucker Feb 16, 2023
1ce6363
Major refactoring
ArthurZucker Feb 16, 2023
a8dc9a4
Update src/transformers/models/clap/configuration_clap.py
ArthurZucker Feb 15, 2023
9b5b252
merge
ArthurZucker Feb 16, 2023
66b7dfe
Merge branch 'main' of https://github.com/huggingface/transformers in…
ArthurZucker Feb 16, 2023
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -295,6 +295,7 @@ Current number of checkpoints: ![](https://img.shields.io/endpoint?url=https://h
1. **[CamemBERT](https://huggingface.co/docs/transformers/model_doc/camembert)** (from Inria/Facebook/Sorbonne) released with the paper [CamemBERT: a Tasty French Language Model](https://arxiv.org/abs/1911.03894) by Louis Martin*, Benjamin Muller*, Pedro Javier Ortiz Suárez*, Yoann Dupont, Laurent Romary, Éric Villemonte de la Clergerie, Djamé Seddah and Benoît Sagot.
1. **[CANINE](https://huggingface.co/docs/transformers/model_doc/canine)** (from Google Research) released with the paper [CANINE: Pre-training an Efficient Tokenization-Free Encoder for Language Representation](https://arxiv.org/abs/2103.06874) by Jonathan H. Clark, Dan Garrette, Iulia Turc, John Wieting.
1. **[Chinese-CLIP](https://huggingface.co/docs/transformers/model_doc/chinese_clip)** (from OFA-Sys) released with the paper [Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese](https://arxiv.org/abs/2211.01335) by An Yang, Junshu Pan, Junyang Lin, Rui Men, Yichang Zhang, Jingren Zhou, Chang Zhou.
1. **[CLAP](https://huggingface.co/docs/transformers/main/model_doc/clap)** (from LAION-AI) released with the paper [Large-scale Contrastive Language-Audio Pretraining with Feature Fusion and Keyword-to-Caption Augmentation]https://arxiv.org/abs/2211.06687) by Yusong Wu, Ke Chen, Tianyu Zhang, Yuchen Hui, Taylor Berg-Kirkpatrick, Shlomo Dubnov.
1. **[CLIP](https://huggingface.co/docs/transformers/model_doc/clip)** (from OpenAI) released with the paper [Learning Transferable Visual Models From Natural Language Supervision](https://arxiv.org/abs/2103.00020) by Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, Ilya Sutskever.
1. **[CLIPSeg](https://huggingface.co/docs/transformers/model_doc/clipseg)** (from University of Göttingen) released with the paper [Image Segmentation Using Text and Image Prompts](https://arxiv.org/abs/2112.10003) by Timo Lüddecke and Alexander Ecker.
1. **[CodeGen](https://huggingface.co/docs/transformers/model_doc/codegen)** (from Salesforce) released with the paper [A Conversational Paradigm for Program Synthesis](https://arxiv.org/abs/2203.13474) by Erik Nijkamp, Bo Pang, Hiroaki Hayashi, Lifu Tu, Huan Wang, Yingbo Zhou, Silvio Savarese, Caiming Xiong.
Expand Down
1 change: 1 addition & 0 deletions README_es.md
Original file line number Diff line number Diff line change
Expand Up @@ -288,6 +288,7 @@ Número actual de puntos de control: ![](https://img.shields.io/endpoint?url=htt
1. **[CamemBERT](https://huggingface.co/docs/transformers/model_doc/camembert)** (from Inria/Facebook/Sorbonne) released with the paper [CamemBERT: a Tasty French Language Model](https://arxiv.org/abs/1911.03894) by Louis Martin*, Benjamin Muller*, Pedro Javier Ortiz Suárez*, Yoann Dupont, Laurent Romary, Éric Villemonte de la Clergerie, Djamé Seddah and Benoît Sagot.
1. **[CANINE](https://huggingface.co/docs/transformers/model_doc/canine)** (from Google Research) released with the paper [CANINE: Pre-training an Efficient Tokenization-Free Encoder for Language Representation](https://arxiv.org/abs/2103.06874) by Jonathan H. Clark, Dan Garrette, Iulia Turc, John Wieting.
1. **[Chinese-CLIP](https://huggingface.co/docs/transformers/model_doc/chinese_clip)** (from OFA-Sys) released with the paper [Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese](https://arxiv.org/abs/2211.01335) by An Yang, Junshu Pan, Junyang Lin, Rui Men, Yichang Zhang, Jingren Zhou, Chang Zhou.
1. **[CLAP](https://huggingface.co/docs/transformers/main/model_doc/clap)** (from LAION-AI) released with the paper [Large-scale Contrastive Language-Audio Pretraining with Feature Fusion and Keyword-to-Caption Augmentation]https://arxiv.org/abs/2211.06687) by Yusong Wu, Ke Chen, Tianyu Zhang, Yuchen Hui, Taylor Berg-Kirkpatrick, Shlomo Dubnov.
1. **[CLIP](https://huggingface.co/docs/transformers/model_doc/clip)** (from OpenAI) released with the paper [Learning Transferable Visual Models From Natural Language Supervision](https://arxiv.org/abs/2103.00020) by Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, Ilya Sutskever.
1. **[CLIPSeg](https://huggingface.co/docs/transformers/model_doc/clipseg)** (from University of Göttingen) released with the paper [Image Segmentation Using Text and Image Prompts](https://arxiv.org/abs/2112.10003) by Timo Lüddecke and Alexander Ecker.
1. **[CodeGen](https://huggingface.co/docs/transformers/model_doc/codegen)** (from Salesforce) released with the paper [A Conversational Paradigm for Program Synthesis](https://arxiv.org/abs/2203.13474) by Erik Nijkamp, Bo Pang, Hiroaki Hayashi, Lifu Tu, Huan Wang, Yingbo Zhou, Silvio Savarese, Caiming Xiong.
Expand Down
1 change: 1 addition & 0 deletions README_hd.md
Original file line number Diff line number Diff line change
Expand Up @@ -260,6 +260,7 @@ conda install -c huggingface transformers
1. **[CamemBERT](https://huggingface.co/docs/transformers/model_doc/camembert)** (इनरिया/फेसबुक/सोरबोन से) साथ में कागज [CamemBERT: एक टेस्टी फ्रेंच लैंग्वेज मॉडल](https:// arxiv.org/abs/1911.03894) लुई मार्टिन*, बेंजामिन मुलर*, पेड्रो जेवियर ऑर्टिज़ सुआरेज़*, योआन ड्यूपॉन्ट, लॉरेंट रोमरी, एरिक विलेमोन्टे डे ला क्लर्जरी, जैमे सेडाह और बेनोइट सगोट द्वारा।
1. **[CANINE](https://huggingface.co/docs/transformers/model_doc/canine)** (Google रिसर्च से) साथ में दिया गया पेपर [कैनाइन: प्री-ट्रेनिंग ए एफिशिएंट टोकनाइजेशन-फ्री एनकोडर फॉर लैंग्वेज रिप्रेजेंटेशन]( https://arxiv.org/abs/2103.06874) जोनाथन एच क्लार्क, डैन गैरेट, यूलिया टर्क, जॉन विएटिंग द्वारा।
1. **[Chinese-CLIP](https://huggingface.co/docs/transformers/model_doc/chinese_clip)** (from OFA-Sys) released with the paper [Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese](https://arxiv.org/abs/2211.01335) by An Yang, Junshu Pan, Junyang Lin, Rui Men, Yichang Zhang, Jingren Zhou, Chang Zhou.
1. **[CLAP](https://huggingface.co/docs/transformers/main/model_doc/clap)** (LAION-AI से) Yusong Wu, Ke Chen, Tianyu Zhang, Yuchen Hui, Taylor Berg-Kirkpatrick, Shlomo Dubnov. द्वाराअनुसंधान पत्र [Large-scale Contrastive Language-Audio Pretraining with Feature Fusion and Keyword-to-Caption Augmentation]https://arxiv.org/abs/2211.06687) के साथ जारी किया गया
1. **[CLIP](https://huggingface.co/docs/transformers/model_doc/clip)** (OpenAI से) साथ वाला पेपर [लर्निंग ट्रांसफरेबल विजुअल मॉडल फ्रॉम नेचुरल लैंग्वेज सुपरविजन](https://arxiv.org /abs/2103.00020) एलेक रैडफोर्ड, जोंग वूक किम, क्रिस हैलासी, आदित्य रमेश, गेब्रियल गोह, संध्या अग्रवाल, गिरीश शास्त्री, अमांडा एस्केल, पामेला मिश्किन, जैक क्लार्क, ग्रेचेन क्रुएगर, इल्या सुत्स्केवर द्वारा।
1. **[CLIPSeg](https://huggingface.co/docs/transformers/model_doc/clipseg)** (from University of Göttingen) released with the paper [Image Segmentation Using Text and Image Prompts](https://arxiv.org/abs/2112.10003) by Timo Lüddecke and Alexander Ecker.
1. **[CodeGen](https://huggingface.co/docs/transformers/model_doc/codegen)** (सेल्सफोर्स से) साथ में पेपर [प्रोग्राम सिंथेसिस के लिए एक संवादात्मक प्रतिमान](https://arxiv.org/abs/2203.13474) एरिक निजकैंप, बो पैंग, हिरोआकी हयाशी, लिफू तू, हुआन वांग, यिंगबो झोउ, सिल्वियो सावरेस, कैमिंग जिओंग रिलीज।
Expand Down
1 change: 1 addition & 0 deletions README_ja.md
Original file line number Diff line number Diff line change
Expand Up @@ -322,6 +322,7 @@ Flax、PyTorch、TensorFlowをcondaでインストールする方法は、それ
1. **[CamemBERT](https://huggingface.co/docs/transformers/model_doc/camembert)** (Inria/Facebook/Sorbonne から) Louis Martin*, Benjamin Muller*, Pedro Javier Ortiz Suárez*, Yoann Dupont, Laurent Romary, Éric Villemonte de la Clergerie, Djamé Seddah and Benoît Sagot から公開された研究論文: [CamemBERT: a Tasty French Language Model](https://arxiv.org/abs/1911.03894)
1. **[CANINE](https://huggingface.co/docs/transformers/model_doc/canine)** (Google Research から) Jonathan H. Clark, Dan Garrette, Iulia Turc, John Wieting から公開された研究論文: [CANINE: Pre-training an Efficient Tokenization-Free Encoder for Language Representation](https://arxiv.org/abs/2103.06874)
1. **[Chinese-CLIP](https://huggingface.co/docs/transformers/model_doc/chinese_clip)** (OFA-Sys から) An Yang, Junshu Pan, Junyang Lin, Rui Men, Yichang Zhang, Jingren Zhou, Chang Zhou から公開された研究論文: [Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese](https://arxiv.org/abs/2211.01335)
1. **[CLAP](https://huggingface.co/docs/transformers/main/model_doc/clap)** (LAION-AI から) Yusong Wu, Ke Chen, Tianyu Zhang, Yuchen Hui, Taylor Berg-Kirkpatrick, Shlomo Dubnov. から公開された研究論文 [Large-scale Contrastive Language-Audio Pretraining with Feature Fusion and Keyword-to-Caption Augmentation]https://arxiv.org/abs/2211.06687)
1. **[CLIP](https://huggingface.co/docs/transformers/model_doc/clip)** (OpenAI から) Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, Ilya Sutskever から公開された研究論文: [Learning Transferable Visual Models From Natural Language Supervision](https://arxiv.org/abs/2103.00020)
1. **[CLIPSeg](https://huggingface.co/docs/transformers/model_doc/clipseg)** (University of Göttingen から) Timo Lüddecke and Alexander Ecker から公開された研究論文: [Image Segmentation Using Text and Image Prompts](https://arxiv.org/abs/2112.10003)
1. **[CodeGen](https://huggingface.co/docs/transformers/model_doc/codegen)** (Salesforce から) Erik Nijkamp, Bo Pang, Hiroaki Hayashi, Lifu Tu, Huan Wang, Yingbo Zhou, Silvio Savarese, Caiming Xiong から公開された研究論文: [A Conversational Paradigm for Program Synthesis](https://arxiv.org/abs/2203.13474)
Expand Down
1 change: 1 addition & 0 deletions README_ko.md
Original file line number Diff line number Diff line change
Expand Up @@ -237,6 +237,7 @@ Flax, PyTorch, TensorFlow 설치 페이지에서 이들을 conda로 설치하는
1. **[CamemBERT](https://huggingface.co/docs/transformers/model_doc/camembert)** (Inria/Facebook/Sorbonne 에서) Louis Martin*, Benjamin Muller*, Pedro Javier Ortiz Suárez*, Yoann Dupont, Laurent Romary, Éric Villemonte de la Clergerie, Djamé Seddah and Benoît Sagot 의 [CamemBERT: a Tasty French Language Model](https://arxiv.org/abs/1911.03894) 논문과 함께 발표했습니다.
1. **[CANINE](https://huggingface.co/docs/transformers/model_doc/canine)** (Google Research 에서) Jonathan H. Clark, Dan Garrette, Iulia Turc, John Wieting 의 [CANINE: Pre-training an Efficient Tokenization-Free Encoder for Language Representation](https://arxiv.org/abs/2103.06874) 논문과 함께 발표했습니다.
1. **[Chinese-CLIP](https://huggingface.co/docs/transformers/model_doc/chinese_clip)** (OFA-Sys 에서) An Yang, Junshu Pan, Junyang Lin, Rui Men, Yichang Zhang, Jingren Zhou, Chang Zhou 의 [Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese](https://arxiv.org/abs/2211.01335) 논문과 함께 발표했습니다.
1. **[CLAP](https://huggingface.co/docs/transformers/main/model_doc/clap)** (LAION-AI 에서 제공)은 Yusong Wu, Ke Chen, Tianyu Zhang, Yuchen Hui, Taylor Berg-Kirkpatrick, Shlomo Dubnov.의 [Large-scale Contrastive Language-Audio Pretraining with Feature Fusion and Keyword-to-Caption Augmentation]https://arxiv.org/abs/2211.06687)논문과 함께 발표했습니다.
1. **[CLIP](https://huggingface.co/docs/transformers/model_doc/clip)** (OpenAI 에서) Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, Ilya Sutskever 의 [Learning Transferable Visual Models From Natural Language Supervision](https://arxiv.org/abs/2103.00020) 논문과 함께 발표했습니다.
1. **[CLIPSeg](https://huggingface.co/docs/transformers/model_doc/clipseg)** (University of Göttingen 에서) Timo Lüddecke and Alexander Ecker 의 [Image Segmentation Using Text and Image Prompts](https://arxiv.org/abs/2112.10003) 논문과 함께 발표했습니다.
1. **[CodeGen](https://huggingface.co/docs/transformers/model_doc/codegen)** (Salesforce 에서) Erik Nijkamp, Bo Pang, Hiroaki Hayashi, Lifu Tu, Huan Wang, Yingbo Zhou, Silvio Savarese, Caiming Xiong 의 [A Conversational Paradigm for Program Synthesis](https://arxiv.org/abs/2203.13474) 논문과 함께 발표했습니다.
Expand Down
1 change: 1 addition & 0 deletions README_zh-hans.md
Original file line number Diff line number Diff line change
Expand Up @@ -261,6 +261,7 @@ conda install -c huggingface transformers
1. **[CamemBERT](https://huggingface.co/docs/transformers/model_doc/camembert)** (来自 Inria/Facebook/Sorbonne) 伴随论文 [CamemBERT: a Tasty French Language Model](https://arxiv.org/abs/1911.03894) 由 Louis Martin*, Benjamin Muller*, Pedro Javier Ortiz Suárez*, Yoann Dupont, Laurent Romary, Éric Villemonte de la Clergerie, Djamé Seddah and Benoît Sagot 发布。
1. **[CANINE](https://huggingface.co/docs/transformers/model_doc/canine)** (来自 Google Research) 伴随论文 [CANINE: Pre-training an Efficient Tokenization-Free Encoder for Language Representation](https://arxiv.org/abs/2103.06874) 由 Jonathan H. Clark, Dan Garrette, Iulia Turc, John Wieting 发布。
1. **[Chinese-CLIP](https://huggingface.co/docs/transformers/model_doc/chinese_clip)** (来自 OFA-Sys) 伴随论文 [Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese](https://arxiv.org/abs/2211.01335) 由 An Yang, Junshu Pan, Junyang Lin, Rui Men, Yichang Zhang, Jingren Zhou, Chang Zhou 发布。
1. **[CLAP](https://huggingface.co/docs/transformers/main/model_doc/clap)** (来自 LAION-AI) 伴随论文 [Large-scale Contrastive Language-Audio Pretraining with Feature Fusion and Keyword-to-Caption Augmentation]https://arxiv.org/abs/2211.06687) 由 Yusong Wu, Ke Chen, Tianyu Zhang, Yuchen Hui, Taylor Berg-Kirkpatrick, Shlomo Dubnov 发布。
1. **[CLIP](https://huggingface.co/docs/transformers/model_doc/clip)** (来自 OpenAI) 伴随论文 [Learning Transferable Visual Models From Natural Language Supervision](https://arxiv.org/abs/2103.00020) 由 Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, Ilya Sutskever 发布。
1. **[CLIPSeg](https://huggingface.co/docs/transformers/model_doc/clipseg)** (来自 University of Göttingen) 伴随论文 [Image Segmentation Using Text and Image Prompts](https://arxiv.org/abs/2112.10003) 由 Timo Lüddecke and Alexander Ecker 发布。
1. **[CodeGen](https://huggingface.co/docs/transformers/model_doc/codegen)** (来自 Salesforce) 伴随论文 [A Conversational Paradigm for Program Synthesis](https://arxiv.org/abs/2203.13474) 由 Erik Nijkamp, Bo Pang, Hiroaki Hayashi, Lifu Tu, Huan Wang, Yingbo Zhou, Silvio Savarese, Caiming Xiong 发布。
Expand Down
1 change: 1 addition & 0 deletions README_zh-hant.md
Original file line number Diff line number Diff line change
Expand Up @@ -273,6 +273,7 @@ conda install -c huggingface transformers
1. **[CamemBERT](https://huggingface.co/docs/transformers/model_doc/camembert)** (from Inria/Facebook/Sorbonne) released with the paper [CamemBERT: a Tasty French Language Model](https://arxiv.org/abs/1911.03894) by Louis Martin*, Benjamin Muller*, Pedro Javier Ortiz Suárez*, Yoann Dupont, Laurent Romary, Éric Villemonte de la Clergerie, Djamé Seddah and Benoît Sagot.
1. **[CANINE](https://huggingface.co/docs/transformers/model_doc/canine)** (from Google Research) released with the paper [CANINE: Pre-training an Efficient Tokenization-Free Encoder for Language Representation](https://arxiv.org/abs/2103.06874) by Jonathan H. Clark, Dan Garrette, Iulia Turc, John Wieting.
1. **[Chinese-CLIP](https://huggingface.co/docs/transformers/model_doc/chinese_clip)** (from OFA-Sys) released with the paper [Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese](https://arxiv.org/abs/2211.01335) by An Yang, Junshu Pan, Junyang Lin, Rui Men, Yichang Zhang, Jingren Zhou, Chang Zhou.
1. **[CLAP](https://huggingface.co/docs/transformers/main/model_doc/clap)** (from LAION-AI) released with the paper [Large-scale Contrastive Language-Audio Pretraining with Feature Fusion and Keyword-to-Caption Augmentation]https://arxiv.org/abs/2211.06687) by Yusong Wu, Ke Chen, Tianyu Zhang, Yuchen Hui, Taylor Berg-Kirkpatrick, Shlomo Dubnov.
1. **[CLIP](https://huggingface.co/docs/transformers/model_doc/clip)** (from OpenAI) released with the paper [Learning Transferable Visual Models From Natural Language Supervision](https://arxiv.org/abs/2103.00020) by Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, Ilya Sutskever.
1. **[CLIPSeg](https://huggingface.co/docs/transformers/model_doc/clipseg)** (from University of Göttingen) released with the paper [Image Segmentation Using Text and Image Prompts](https://arxiv.org/abs/2112.10003) by Timo Lüddecke and Alexander Ecker.
1. **[CodeGen](https://huggingface.co/docs/transformers/model_doc/codegen)** (from Salesforce) released with the paper [A Conversational Paradigm for Program Synthesis](https://arxiv.org/abs/2203.13474) by Erik Nijkamp, Bo Pang, Hiroaki Hayashi, Lifu Tu, Huan Wang, Yingbo Zhou, Silvio Savarese, Caiming Xiong.
Expand Down
4 changes: 4 additions & 0 deletions docs/source/en/_toctree.yml
Original file line number Diff line number Diff line change
Expand Up @@ -495,6 +495,8 @@
sections:
- local: model_doc/audio-spectrogram-transformer
title: Audio Spectrogram Transformer
- local: model_doc/clap
title: CLAP
- local: model_doc/hubert
title: Hubert
- local: model_doc/mctct
Expand Down Expand Up @@ -622,6 +624,8 @@
title: Utilities for Generation
- local: internal/image_processing_utils
title: Utilities for Image Processors
- local: internal/audio_utils
title: Utilities for Audio processing
- local: internal/file_utils
title: General Utilities
title: Internal Helpers
Expand Down
Loading