Page identification with text matching

Text matching uses the full page recognition results to identify pages.

You can identify pages by searching the recognition results for a string that is unique to each page type.

If all of the actions in a function return True, Datacap does not run any other functions in the current rule. For example, if you get a match on the word Car and can set the page type successfully. The PageID rule exits without running any of the other tests

Text matching uses the full page recognition results, so you must do full page OCR (or ICR) before you run any of the text matching actions. You can then use the WordFind action to determine whether a specific string is present and the SetPageType action to set the page type.

Library Action Description
Locate WordFind Locates the first (or next) occurrence of the specified word or phrase on the current page.
DCO SetPageType Assigns a page type to the current page in the runtime hierarchy.

The WordFind action is case-sensitive. Additionally, if different variants of a page type have different unique identifiers, you might need to use a more flexible matching technique. Such as regular expressions or keyword lists. For more information, see Regular expressions and Text matching with keyword lists.