Can't find documentation #153

Closed
PedroBarcha opened this issue Nov 11, 2016 · 4 comments

Comments

@PedroBarcha

Hi there,
I've been studying ocropus for a while now, but I couldn't find the documentation for its thresholding, denoising, and character recognition (not CLSTM). Does anyone know where to find it?
Thanks a lot.

@zuphilip
Collaborator

This overlaps heavily with issue #136.

@PedroBarcha
Author

@zuphilip I had already seen the wiki, as suggested in #136, but it didn't help me at all. I was hoping to find at least references to the algorithms, as you gave in #118 when you mentioned the paper on layout analysis; that was extremely helpful. Do you know of any articles on the topics I'm interested in? Or at least any reference at all to the thresholding/denoising/classifier algorithms?

@zuphilip
Collaborator

You can try to look at the publications wiki page: https://github.com/tmbdev/ocropy/wiki/Publications

@amitdo
Contributor

amitdo commented Nov 12, 2016

Thresholding - See Binarization
Denoising - I think it just removes small connected components.
Character recognition - Bidirectional LSTM + CTC alignment. See the ICDAR 2013 LSTM Tutorial.
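
To illustrate the denoising point above, here is a minimal sketch of what "removing small connected components" could look like. This is not ocropy's actual code: the function name `remove_small_components`, the `min_size` parameter, and the choice of 4-connectivity are all assumptions made for the example, and the image is a plain list of 0/1 rows rather than a NumPy array.

```python
from collections import deque


def remove_small_components(image, min_size):
    """Return a copy of a binary image (list of rows of 0/1) in which every
    4-connected foreground component with fewer than min_size pixels is erased.

    Hypothetical helper for illustration only; not part of ocropy.
    """
    height = len(image)
    width = len(image[0]) if height else 0
    result = [row[:] for row in image]
    seen = [[False] * width for _ in range(height)]

    for y in range(height):
        for x in range(width):
            if not image[y][x] or seen[y][x]:
                continue
            # Flood-fill from this pixel to collect one connected component.
            component = []
            queue = deque([(y, x)])
            seen[y][x] = True
            while queue:
                cy, cx = queue.popleft()
                component.append((cy, cx))
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    inside = 0 <= ny < height and 0 <= nx < width
                    if inside and image[ny][nx] and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            # Components below the size threshold are treated as noise.
            if len(component) < min_size:
                for cy, cx in component:
                    result[cy][cx] = 0
    return result


noisy = [
    [1, 0, 0, 0],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
]
cleaned = remove_small_components(noisy, min_size=2)
```

In this example the isolated pixel in the top-left corner is erased and the 2x2 block is kept. On real page images one would more likely label components with something like `scipy.ndimage.label` and filter by size; the right threshold depends on resolution and font size.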

@zuphilip zuphilip closed this as completed Jan 2, 2017