Tesseract home made French spellcheck Dictionnary #9641
Eeel-12
started this conversation in
Show and tell
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi @niksedk,
Thank you for your software. As a hobby i'm writting a small spellckeck dictionnary parser. I thought i should share / contribute my dictionnary here so it can be usefull for everyone, my project will never be made public.
This is the result of a year+ of spellcheck with Gaupol on french subtitles from various sources: sup2srt, pgsTosrt, are command line OCR tools that use tesseract 5.5. And also subtitles from Opensubtitles, probably created using SE.
If you decide to merge it to your "fra_OCRFixReplaceList.xml", just send a reply here or whatever format that suit you, and i will post futur updates on consequent changes.
note: duplicates may exist.
Best.
Beta Was this translation helpful? Give feedback.
All reactions