Large Scale Keyword Extraction using a Finite State Backend
Authors | |
---|---|
Year of publication | 2016 |
Type | Article in Proceedings |
Conference | Tenth Workshop on Recent Advances in Slavonic Natural Language Processing, RASLAN 2016 |
MU Faculty or unit | |
Citation | |
Web | https://nlp.fi.muni.cz/raslan/2016/paper17-Jakubicek_Smerk.pdf |
Field | Informatics |
Keywords | terminology extraction; keyword extraction; fsa; Sketch Engine |
Description | We present a novel method for performing fast keyword extraction from large text corpora using a finite state backend. The FSA3 package has been adopted for this purposes. We outline the basic approach and present a comparison with previous hash-based method as used in Sketch Engine. |
Related projects: |