Available from A2iA Corp
A2iA DocumentReader™ is a data extraction and document classification toolkit. It is the only software able to process all-types of paper documents and incoming mail, regardless of their structure or contents. By going further than existing OCR and ICR engines that are unable to recognize cursive handwriting, and other classification engines that use only layout to route a document, A2iA DocumentReader automates the entire incoming workflow.
A2iA DocumentReader first classifies the documents into “categories”, by analyzing both their geometry and content. The software examines the layout of the document’s elements. Then, using a general dictionary as well as a trade vocabulary, it carries out a literal transcription of the handwritten and/or typed areas to extract key words and expressions in order to determine the type of document. Next, the software synthesizes the information extracted from the different documents in order to identify the subject/document class. This unique feature minimizes, and can even eliminate, manual pre-sorting and classification steps.
All proprietary A2iA, A2iA DocumentReader contains no third-party technology. By operating the world’s largest research center dedicated to extracting information from paper, with a focus on handwritten and unstructured contents, A2iA’s R&D team is able to adapt the technology to meet the demands of the users.