Extract (Text + OCR) from PDF
Description
This activity extracts text or data from PDF files using OCR (Optical Character Recognition) and converts it into a structured format.
Input
PDF Files
Output
Extracted data or text
Configuration Fields
- Start page Specifies the starting page number from which the PDF extraction process will begin.
- End page Specifies the ending page number at which the PDF extraction process will stop.
Sample Input
Not Applicable
Sample Configuration
Sample Output
Extracted text or structured data from the specified pages of the PDF document.