Competing Patterns for Language Engineering
Authors | |
---|---|
Year of publication | 2000 |
Type | Article in proceedings |
Conference | Proceedings of Third International Workshop on Text, Speech and Dialogue, TSD 2000 |
MU Faculty or unit | |
Citation | |
www | http://www.fi.muni.cz/tsd2000/tsdproc.html |
Field | Use of computers, robotics and its applications |
Keywords | patterns;finite automata;natural language processing;language engineering |
Description | This paper describes a method for handling linguistic data efficiently by means of covering and inhibiting patterns, that is, patterns that "compete" with each other. A methodology for developing such patterns is outlined, and applications in morphology, hyphenation and part-of-speech tagging are shown. This pattern-driven approach to language engineering allows linguists' expertise to be combined with data learned from corpora, giving a layering of knowledge. Looking up information in a pattern database (the dictionary problem) is fast: as with other finite-state approaches, it takes time linear in the length of the word being looked up. |
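
The idea of covering and inhibiting patterns can be illustrated with a minimal sketch. It assumes a Liang-style encoding as used for TeX hyphenation (odd digits mark a covering pattern that permits a split, even digits mark an inhibiting pattern that forbids it, and the highest digit at each position wins); the representation used in the paper itself may differ. The pattern set `TOY_PATTERNS` and the function names are hypothetical and chosen only for illustration.

```python
# Sketch of competing patterns stored in a trie (assumed Liang-style encoding).
# TOY_PATTERNS is a hypothetical, hand-written pattern set, not data from the paper.

def parse_pattern(pat):
    """Split 'hy3ph' into letters 'hyph' and inter-letter levels [0, 0, 3, 0, 0]."""
    letters, levels = [], [0]
    for ch in pat:
        if ch.isdigit():
            levels[-1] = int(ch)      # digit applies to the gap before the next letter
        else:
            letters.append(ch)
            levels.append(0)
    return "".join(letters), levels

def build_trie(patterns):
    """Store each pattern's levels at the trie node reached by its letters."""
    root = {}
    for pat in patterns:
        letters, levels = parse_pattern(pat)
        node = root
        for ch in letters:
            node = node.setdefault(ch, {})
        node[None] = levels           # None is the terminal key holding the levels
    return root

def split_points(word, trie):
    """Return indices k such that a split is allowed between word[k-1] and word[k]."""
    text = "." + word.lower() + "."   # '.' marks the word boundaries
    scores = [0] * (len(text) + 1)
    for start in range(len(text)):
        node = trie
        for ch in text[start:]:
            node = node.get(ch)
            if node is None:          # no pattern continues with this letter
                break
            levels = node.get(None)
            if levels is not None:    # a pattern ends here: let it compete
                for offset, level in enumerate(levels):
                    pos = start + offset
                    scores[pos] = max(scores[pos], level)
    # Odd winning level = covering pattern won; even = inhibited or no pattern.
    return [k for k in range(1, len(word)) if scores[k + 1] % 2 == 1]

def hyphenate(word, trie):
    """Join the pieces of the word at the allowed split points with '-'."""
    pieces, prev = [], 0
    for k in split_points(word, trie):
        pieces.append(word[prev:k])
        prev = k
    pieces.append(word[prev:])
    return "-".join(pieces)

TOY_PATTERNS = [
    "1ing",     # covering: allow a split before "ing"
    ".s2ing",   # inhibiting: but not in a word that starts with "sing"
]
trie = build_trie(TOY_PATTERNS)

print(hyphenate("walking", trie))
print(hyphenate("sing", trie))
print(hyphenate("singing", trie))
```

In this sketch each starting position follows at most one path through the trie, and that path is no longer than the longest pattern, so for a fixed pattern set the lookup time grows linearly with the length of the word. Layering further knowledge amounts to adding patterns with higher levels that override the ones already present.
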
Related projects: |