Example of Data Extraction in ASP.NET Core Smart Data Extraction Library

This sample demonstrates how to extract structured data, such as text, form fields, and tables along with their bounding areas, from PDFs and images.

Data Extraction Options

Confidence Threshold, Page Range (start page to end page)

Note: Data extraction is performed based on the selected page range.
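As an illustration of the note above, the following is a minimal sketch of how an inclusive page range could be resolved to the pages that get processed. It is written in Python purely for illustration and is not the Syncfusion API; the function name `pages_in_range` and the choice to clamp the end of the range to the last page are assumptions.

```python
from typing import List


def pages_in_range(page_count: int, start: int, end: int) -> List[int]:
    """Return the 1-based page numbers selected by an inclusive page range.

    Hypothetical helper: the clamping behaviour is an assumption, not
    documented behaviour of the Smart Data Extractor.
    """
    if page_count < 1:
        raise ValueError("document has no pages")
    if start < 1 or end < start or start > page_count:
        raise ValueError(f"invalid page range: {start} to {end}")
    # Clamp the end of the range to the last page of the document.
    return list(range(start, min(end, page_count) + 1))
```

For a 10-page document, a range of 2 to 4 would select pages 2, 3, and 4 under these assumptions.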

The Smart Data Extractor detects and extracts structured data from uploaded documents according to the options you choose. Internally, it uses the Syncfusion OCR Processor to extract text from scanned images, with English set as the default language. The extracted data, including text, form fields, and tables, is shown in the PDF Viewer for review and interaction.
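To make the shape of the extracted data more concrete, here is a hedged sketch of how extracted elements and a confidence threshold might be modelled. Everything in it is an assumption for illustration: the `ExtractedElement` type, its field names, the 0.0 to 1.0 confidence scale, and the rule that elements below the threshold are discarded are not taken from the Syncfusion library.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class ExtractedElement:
    """Hypothetical record for one extracted item (not a Syncfusion type)."""
    kind: str                                   # "text", "form_field", or "table"
    page: int                                   # 1-based page number
    bounds: Tuple[float, float, float, float]   # x, y, width, height
    confidence: float                           # assumed scale: 0.0 to 1.0
    content: str


def filter_by_confidence(
    elements: List[ExtractedElement], threshold: float
) -> List[ExtractedElement]:
    """Keep only elements whose confidence meets the threshold (assumed rule)."""
    if not 0.0 <= threshold <= 1.0:
        raise ValueError("threshold must be between 0.0 and 1.0")
    return [e for e in elements if e.confidence >= threshold]
```

Under these assumptions, raising the threshold trades recall for precision: fewer elements are shown, but those that remain are the ones the extractor is more certain about.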

Use the Browse button to select any file of interest.
Alternatively, drag and drop a chosen file into the designated file pick area.
After selecting a valid file and configuring the desired options, tap the Extract Data button to start the extraction. The Smart Data Extractor then renders the processed output in the PDF Viewer below for user interaction.
The following file formats are supported:
- PDF - '.pdf'
- Image - '.jpeg','.jpg','.png'
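The format list above can be enforced with a simple extension check before a file is accepted. The sketch below is illustrative only; the names `SUPPORTED_EXTENSIONS` and `is_supported_file` are hypothetical, and the case-insensitive comparison is an assumption about how the demo treats names such as `scan.JPG`.

```python
from pathlib import Path

# Extensions taken from the supported-formats list above.
SUPPORTED_EXTENSIONS = {".pdf", ".jpeg", ".jpg", ".png"}


def is_supported_file(filename: str) -> bool:
    """Return True if the file extension is in the supported list.

    The comparison ignores case (an assumption for this sketch).
    """
    return Path(filename).suffix.lower() in SUPPORTED_EXTENSIONS
```

An extension check only filters by name; a production upload handler would typically also inspect the file content before processing it.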