Document OCR & Data Extraction
Extract text from images and scanned documents using Tesseract OCR. Auto-detects emails, phone numbers, dates, and monetary amounts with structured output.
Try it Free
Upload an image and extract text with structured data in seconds.
Try Free
Result
Upload an image to extract text
API Documentation
Extract text from images and scanned documents using Tesseract OCR. Auto-detects emails, phone numbers, dates, and monetary amounts. Supports 20+ languages including Arabic.
Endpoint
Upload an image file or send base64-encoded image data. The API runs Tesseract OCR and returns extracted text with word-level confidence scores and structured data extraction.
Request Parameters
* Either image file or base64 is required.
Response Fields
text Full extracted textword_count Number of words extractedconfidence Average OCR confidence (0–100)structured.emails Auto-detected email addressesstructured.phones Auto-detected phone numbersstructured.dates Auto-detected datesstructured.amounts Auto-detected monetary amountsstructured.total Detected total amount (receipts/invoices)blocks Word-level bounding boxes with confidence scoresCode Examples
Error Codes
Rate Limits
Powered by Tesseract OCR v5. Max file size 10MB. Supports file upload or base64 input. No data is stored — images are processed and immediately discarded.