the best models case insensitive? #7

xiaoyubing · 2019-05-06T12:52:46Z

the best models case insensitive?can you give case sensitive best models?

ku21fan · 2019-05-09T02:43:46Z

Yes. the model is case-insensitive.
We will give case-sensitive models soon (maybe in 1 week?).

ku21fan · 2019-05-14T07:32:15Z

I just uploaded the case-sensitive model. please check here
But it could have low accuracy because the training data and validation data contains some case insensitive labeling.
In detail:

Some fonts always generate the upper case even though the ground truth label have lower cases. Some example of MJSynth or SynthText might have been generated by these fonts.
The training data of IIIT5k and SVT are used as the validation data. Their label is always the upper case, even though the ground truth label has the lower case. please check here

If you want a more accurate case sensitive model, fine-tuning this model with your own data or more cleaned case sensitive data.

Best.

xiaoyubing · 2019-05-15T02:17:32Z

thanks

ku21fan · 2019-05-15T13:27:03Z

I forgot to notify one thing.
use case-sensitive model with --sensitive option.

solitaire2015 · 2019-07-16T07:47:38Z

@ku21fan Thanks for your great job. I have a question:
I test the pre-trained model of case-sensitive on IIIT5K 3000 by filtering special characters and change to upper case, I got a good accuracy: 90%. I wonder if the model is trained only on MJSynth + SYNText? The performance is incredible. Could you please explain why?

Best.

ku21fan · 2019-07-16T08:51:04Z

@solitaire2015
Yes, the model is trained only on MJSynth + SynthText.
But, in case --sensitive mode, we used all of SynthText.

After validation checks for each bounding box,
[a] SynthText has about 5.5 million images which do not contain special characters (only alphanumeric).
[b] SynthText has about 1.5 million images which contain special characters.

Our model was trained on [a] with case insensitive mode,
and trained on [a] and [b] with --sensitive mode
For now, only [a] is uploaded in our google / baidu drive, and I will upload [b] soon.

Best.

solitaire2015 · 2019-07-16T09:48:25Z

@ku21fan That's great, thank you very much.

ku21fan · 2019-07-16T13:14:27Z

@solitaire2015 I just uploaded [b] as 'ST_spe.zip' at here
Download, unzip, and move it into the data_lmdb_release/training/
And then try to train with ST_spe :)

solitaire2015 · 2019-07-19T01:46:14Z

@ku21fan I trained the model on --sensitive mode with the [a] + [b]. here are the training accuracy and loss:
[272000/300000] Loss: 0.06758 elapsed_time: 135756.64656
acer , gt: acer , True
Endless , gt: ENDLESS , False
BBQ , gt: BBQ , True
The , gt: The , True
THE , gt: THE , True
[272000/300000] valid loss: 2.01074 accuracy: 64.745, norm_ED: 1728.75
best_accuracy: 66.390, best_norm_ED: 1609.72

I trained this model on two 1080Ti, selected the best combination, TPS + ResNet + BiLSTM + Attn, for some reasons I add one convolutional layer at the end of ResNet feature extractor to keep the output width to be 25(26 before), I test the model on IIIT5k 3000, after filtering special characters and converting lower case to upper case, I got total accuracy: 88.5%, a little bit lower than yours 90%.

I investigate some failure cases of IIIT5K, some special characters are treated as alphanumeric characters, I think it's the reason why the sensitive model is more accurate than the insensitive model.

Best.

alwc · 2019-07-20T09:52:46Z

Hi @solitaire2015, did you use grayscale or rgb mode for your input images?

solitaire2015 · 2019-07-23T02:18:20Z

@alwc I use grayscale images.

ku21fan closed this as completed May 14, 2019

solitaire2015 mentioned this issue Jul 26, 2019

Train --sensitive mode with the MJSynth + ST_spe #40

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

the best models case insensitive? #7

the best models case insensitive? #7

xiaoyubing commented May 6, 2019

ku21fan commented May 9, 2019

ku21fan commented May 14, 2019

xiaoyubing commented May 15, 2019

ku21fan commented May 15, 2019

solitaire2015 commented Jul 16, 2019

ku21fan commented Jul 16, 2019

solitaire2015 commented Jul 16, 2019

ku21fan commented Jul 16, 2019 •

edited

Loading

solitaire2015 commented Jul 19, 2019

alwc commented Jul 20, 2019

solitaire2015 commented Jul 23, 2019

the best models case insensitive? #7

the best models case insensitive? #7

Comments

xiaoyubing commented May 6, 2019

ku21fan commented May 9, 2019

ku21fan commented May 14, 2019

xiaoyubing commented May 15, 2019

ku21fan commented May 15, 2019

solitaire2015 commented Jul 16, 2019

ku21fan commented Jul 16, 2019

solitaire2015 commented Jul 16, 2019

ku21fan commented Jul 16, 2019 • edited Loading

solitaire2015 commented Jul 19, 2019

alwc commented Jul 20, 2019

solitaire2015 commented Jul 23, 2019

ku21fan commented Jul 16, 2019 •

edited

Loading