Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

If the OCR is good then they're totally burying the lede, it's pricing is 100x cheaper than commercial OCR APIs.

It's potentially a game changer, plenty of industries have piles of scanned documents. Cheap OCR means this data suddenly becomes accessible even if the value per individual document is low (i.e. for input into machine learning).



Paper from a few years ago comparing Google's OCR system to commercially available benchmarks:

http://www.educatingsilicon.com/wp-content/uploads/2013/10/p...

A lot better for text in photographs. Comparison might be different on dense document text though.




Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: