Information Extraction from Business Documents
| Authors | |
|---|---|
| Year of publication | 2022 |
| Type | Article in Proceedings |
| Conference | Recent Advances in Slavonic Natural Language Processing (RASLAN 2022) |
| web | fulltext PDF |
| Keywords | OCR; Multi-modal learning; Information extraction; Transformers; Structured Documents |
| Description | Document AI is a relatively new research topic that refers to techniques for automatically reading, understanding, and analyzing business documents. Many companies still extract data from business documents through manual effort, which is time-consuming and expensive and requires manual customization or configuration. This paper describes techniques that address these problems, applies them to real-world data, and implements them in an end-to-end solution for automatic information extraction from business documents. |