OCR
Pull text out of images and scanned PDFs. 100+ languages, line and word boxes, layout-preserving export. Output text, hOCR, TSV, DOCX, or a fully searchable PDF. Powered by Tesseract.js, runs entirely in your browser.
100% in your browser
No upload
100+ languages
Searchable PDF
Input
Drop images, scans, or PDFs here
or click to browse. PNG, JPG, WEBP, BMP, TIFF, PDF
Settings
Idle
Output
Run OCR to detect tables.
Run OCR to view annotated image.