Situation: I have a scanned book in Chinese that I’d like to read, and no translation is available. I really want to read it, because it would probably help a lot with my university project.
What I tried: I tried creating a version with OCR to get a text layer and then running a translation tool on it, but I found no way to make the OCR text visible. I also tried this tool, but the OCR didn’t work for me, and I found no way to use it with a local model.
Have any of you ever done a similar task? I’d appreciate any kind of suggestions and tips.
You can literally just feed the images into ChatGPT at this point.
I prefer open-source tools, but that’s good to know, thanks.
Every time I’ve done that, the results have been pretty bad. OCR is much better.
This doesn’t work once the PDF reaches a certain maximum size.
You could just break it up into chapters or something; it’s pretty easy to split a PDF.
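For what it's worth, here is a minimal sketch of that splitting step, assuming the open-source pypdf library is installed. It splits by a fixed page count rather than by chapter, since finding chapter boundaries would depend on the book having bookmarks. The function names, the output file naming, and the default chunk size of 20 pages are all made up for illustration.

```python
from pathlib import Path


def chunk_ranges(total_pages, chunk_size):
    """Return (start, end) page index pairs, end exclusive, covering all pages."""
    if chunk_size < 1:
        raise ValueError("chunk_size must be at least 1")
    return [
        (start, min(start + chunk_size, total_pages))
        for start in range(0, total_pages, chunk_size)
    ]


def split_pdf(src, out_dir, chunk_size=20):
    """Write src as several smaller PDFs of at most chunk_size pages each."""
    # Imported here so chunk_ranges can be used without pypdf installed.
    from pypdf import PdfReader, PdfWriter

    reader = PdfReader(src)
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for start, end in chunk_ranges(len(reader.pages), chunk_size):
        writer = PdfWriter()
        for index in range(start, end):
            writer.add_page(reader.pages[index])
        # Page numbers in the file name are 1-based for readability.
        target = out_dir / f"{Path(src).stem}_p{start + 1:04d}-{end:04d}.pdf"
        with open(target, "wb") as handle:
            writer.write(handle)
        written.append(target)
    return written
```

Calling something like split_pdf("book.pdf", "parts", chunk_size=20) should leave a set of smaller files in the parts folder, which can then be uploaded or OCR'd one at a time. The chunk size that works will depend on whatever size limit you are running into.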