Form
This function automatically extracts key values contained in documents.
The Intelligent Document Data extraction employs advanced Artificial Intelligence (AI) technology to streamline the extraction of data from documents, especially structured documents (forms), making it a powerful tool for enhancing efficiency and productivity.
It is important, however, to acknowledge the limitations of the AI in certain scenarios to manage expectations effectively. The success of the extraction process relies on the AI's ability to establish clear relationships between keys and values in the form. In instances where this relationship is unclear or ambiguous, the AI may struggle to provide accurate results.
Users should be mindful that the AI is a supportive tool rather than an infallible solution, and that not all forms may be processed with equal precision. The extraction can differ depending on the selected capture method, the scan settings, or other factors.
To ensure the best results, the extracted data should be reviewed for accuracy using the RSI LogicFlow validation feature, especially in cases where form layouts are unconventional or key/value relationships are ambiguous, for example when the value is not in the immediate vicinity of the key.
It is recommended to always enforce the RSI LogicFlow validation when the extracted data accuracy is critical.
To configure the Intelligent Document Data, the expected keyword should be entered. This is the information the AI will search for to extract relevant data, so it is important to match the keyword as it is written on the document. For each extracted data, it is possible to look for various keywords in order to support different document formats. The AI will select the best value found on the document.
It is possible to narrow down the data extraction to specific content within the document with the content filter and the page filter, ensuring you get the right data even if the key appears multiple times.
The data type should be selected and a unique Data ID should be assigned to the extracted information, serving as an identifier for subsequent workflow use and as a label in the validation interface.
Details of settings:
Category | Setting | Description | Options |
---|---|---|---|
Data to be Extracted | Search Label | Label to search in the document. | Any string. |
Content filter | Allows filtering key-value pairs. | Can filter using:
| |
Page filter | To specify a specific page to analyze | Must be a number. | |
Data Type | To specify the data type. | The type can be:
| |
Data ID | ID for using the extracted data in the input fields of other functions. You can leave it as the default or change it to an original ID that is easier to identify. | Any string except “%”. |