-
Notifications
You must be signed in to change notification settings - Fork 435
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[datasets] Extend the range of public datasets supported in docTR #587
Comments
@fg-mindee PS: contains only english handwritten ! |
TextOCR dataset |
@fg-mindee |
Sure, you can go ahead and I see that you already opened a PR :) |
Char74k dataset |
TotalText dataset |
@fg-mindee |
If one dataset cannot be downloaded directly, we'll add instructions where to get it and change the constructor. But apart from some exceptions, we won't reupload public datasets 👍 |
@fg-mindee |
If the dataset is public, either:
|
Wdyt can we add a function For ICDAR2003 which task would we provide ? |
ICDAR 2019 Robust Reading Challenge on Multi-lingual scene text detection and recognition |
@felixdittrich92 sorry for the late reply!
|
@fg-mindee |
Additional maybe ?
Replace: I think the IMGUR5K dataset and IC19 would bring the most improvement 🤗 wdyt and can we update the upper list ? 😄 EDIT: for IMGUR5K would it be possible to upload a zip file which contains the annotations/img_urls and hashes ? (9,6MB) |
@fg-mindee we should then also revise the boxes for the records so that they all have relative coordinates (but i would open a extra PR if this one is complete) 😄 |
Hello @felixdittrich92 👋 Yes, apart from IMGUR5k, where I'll have to take a closer look at, I think we're good for a while now 👌 |
About IC19 replacing IC13, most research papers evaluate their perf on IC03, IC13 and IC15. |
Checking the ref, Imgur5K word images looks like a nice final addition for this round :) |
@fg-mindee We can definitly do this but i would prefer to discuss in front of any implementation |
Mmmh it looks more like the script to simply download the dataset 😅 And it's a real-world handwritten image dataset, not a synthetic one to the best of my understanding! |
@fg-mindee The repo provides some "lists" with urls, hashes and labels .. of course we can provide something as "live creation" but: Lets discuss this tomorrow in Slack short :) |
@fg-mindee Thaanks :) |
Currently, we support
FUNSD
,CORD
andSROIE
but we should look at extending the range of supported datasets. Among others, we could include handwritten, and in-the-wild situations.Here is a list of datasets you can usually find in OCR-related benchmarks:
Of course, the list goes on
The text was updated successfully, but these errors were encountered: