
DocExtract

Client-side document extraction. Upload an image, OCR it with Tesseract.js, and extract structured fields using a predefined schema.

How it works: OCR Only mode runs Tesseract.js for real, in your browser, on the image you upload. Vision mode returns simulated AI results. The "Use Sample Invoice" button skips the upload and demonstrates extraction against hardcoded sample text. This demo is a simplified version of the production IDP System's extraction pipeline.
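The three paths described above can be pictured with a rough TypeScript sketch. It is not the demo's actual source: `runExtraction`, `OcrEngine`, `Source`, and both placeholder strings are hypothetical names and values chosen for illustration. The OCR engine is passed in as a function so the sketch runs without a browser; on the real page, a thin wrapper around Tesseract.js's `recognize(image, "eng")`, whose result exposes the recognized text as `data.text`, could play that role.

```typescript
// Hypothetical types and names; the demo's real source is not shown on this page.
type Mode = "ocr" | "vision";

// Either the "Use Sample Invoice" button or an uploaded image (as a URL or data URL).
type Source = { kind: "sample" } | { kind: "upload"; image: string };

// Any function that turns an image into plain text. In the browser this could be
//   async (img) => (await Tesseract.recognize(img, "eng")).data.text
type OcrEngine = (image: string) => Promise<string>;

interface RawResult {
  text: string;
  ocrRan: boolean; // true only when real recognition was performed
}

// Placeholder values (assumptions): the page's actual hardcoded text is not shown.
const SAMPLE_INVOICE_TEXT = "Vendor Name: Example Co\nInvoice Number: INV-0001";
const SIMULATED_VISION_TEXT = "Vendor Name: Simulated Vendor";

async function runExtraction(
  mode: Mode,
  source: Source,
  ocr: OcrEngine
): Promise<RawResult> {
  if (source.kind === "sample") {
    // Sample button: no recognition, just the hardcoded text.
    return { text: SAMPLE_INVOICE_TEXT, ocrRan: false };
  }
  if (mode === "vision") {
    // Vision mode: canned output stands in for a model call.
    return { text: SIMULATED_VISION_TEXT, ocrRan: false };
  }
  // OCR Only mode: run the injected engine on the uploaded image.
  const text = await ocr(source.image);
  return { text, ocrRan: true };
}
```

Injecting the engine keeps the branching logic testable with a stub, while the browser build supplies the real recognizer.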

OCR MODE

EXTRACTION SCHEMA

Field            Type
Vendor Name      text
Invoice Number   text
Invoice Date     date
Total Amount     number
Currency         text
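As an illustration of how a schema like the one above might drive extraction from OCR text, here is a minimal TypeScript sketch. It assumes each field appears on its own line as "Label: value", which real invoices often do not, and it is not the demo's actual matching logic. `INVOICE_SCHEMA`, `extractFields`, and `coerce` are hypothetical names.

```typescript
type FieldType = "text" | "date" | "number";

interface SchemaField {
  name: string;
  type: FieldType;
}

// The five fields from the extraction schema.
const INVOICE_SCHEMA: SchemaField[] = [
  { name: "Vendor Name", type: "text" },
  { name: "Invoice Number", type: "text" },
  { name: "Invoice Date", type: "date" },
  { name: "Total Amount", type: "number" },
  { name: "Currency", type: "text" },
];

type Extracted = Record<string, string | number | null>;

// Escape regex metacharacters so a field label is matched literally.
function escapeRegExp(s: string): string {
  return s.replace(/[.*+?^${}()|[\]\\]/g, "\\$&");
}

// Convert the raw captured string to the schema's declared type.
function coerce(raw: string, type: FieldType): string | number | null {
  if (type === "number") {
    // Strip currency symbols and thousands separators; assumes "." is the decimal mark.
    const cleaned = raw.replace(/[^0-9.-]/g, "");
    if (cleaned === "") return null;
    const n = Number(cleaned);
    return Number.isFinite(n) ? n : null;
  }
  if (type === "date") {
    // This sketch accepts only ISO dates (YYYY-MM-DD); anything else becomes null.
    return /^\d{4}-\d{2}-\d{2}$/.test(raw) ? raw : null;
  }
  return raw;
}

// Assumed heuristic: find "Label: value" at the start of a line, case-insensitively.
function extractFields(text: string, schema: SchemaField[]): Extracted {
  const out: Extracted = {};
  for (const field of schema) {
    const pattern = "^[ \\t]*" + escapeRegExp(field.name) + "[ \\t]*:[ \\t]*(.+)$";
    const raw = new RegExp(pattern, "im").exec(text)?.[1];
    out[field.name] = raw !== undefined ? coerce(raw.trim(), field.type) : null;
  }
  return out;
}
```

Fields that are not found, or whose value cannot be coerced to the declared type, come back as `null`, so a caller can tell "missing" apart from an empty string.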

Drop an image here, or click to browse

Supports PNG, JPG, TIFF, BMP