Information Extraction from Business Documents
Authors | |
---|---|
Year of publication | 2022 |
Type | Article in Proceedings |
Conference | Recent Advances in Slavonic Natural Language Processing (RASLAN 2022) |
MU Faculty or unit | |
Citation | |
Web | fulltext PDF |
Keywords | OCR; Multi-modal learning; Information extraction; Transformers; Structured Documents |
Description | Document AI is a relatively new research topic that refers to techniques for automatically reading, understanding, and analyzing business documents. Nowadays, many companies extract data from business documents through manual efforts that are time-consuming and expensive, requiring manual customization or configuration. This paper describes techniques to address these problems, apply them to real-world data, and implement them to an end-to-end solution for automatic information extraction from business documents. |
Related projects: |