Idiomatic Expressions in VerbaLex
Authors | |
---|---|
Year of publication | 2017 |
Type | Article in Proceedings |
Conference | Proceedings of the Eleventh Workshop on Recent Advances in Slavonic Natural Language Processing, RASLAN 2017 |
Web | http://nlp.fi.muni.cz/raslan/raslan17.pdf |
Field | Informatics |
Keywords | idioms; verb phrases; verb frames; valency lexicon; corpus |
Description | Idiomatic expressions are part of everyday language, so NLP applications that can "understand" idioms are desirable. Idioms are heterogeneous: they form classes that differ in many respects (e.g. syntactic structure, lexical and syntactic fixedness). Although dictionaries of idioms exist, they usually do not contain information about fixedness or frequency, since they are intended for human readers, not computer programs. In this work, we propose how to handle idioms in the valency lexicon VerbaLex using information automatically extracted from the largest dictionary of Czech idioms and from a web corpus. |