Grammar Development for Czech Syntactic Parser with Corpus-based Techniques

Kovář,  Vojtěch; Kadlec,  Vladimír; Horák,  Aleš

Authors	KOVÁŘ Vojtěch; KADLEC Vladimír; HORÁK Aleš
Year of publication	2006
Type	Article in Proceedings
Conference	Proceedings of Corpus Linguistic 2006
MU Faculty or unit	Faculty of Informatics
Field	Informatics
Keywords	parsing; grammar; Czech; corpus
Description	This paper describes synt, a Czech syntactic parser developed at the NLP laboratory of FI MU. The system is based on a meta-grammar formalism with a head-driven chart parser. The parsing technique provides fast analysis of the context-free backbone, followed by evaluation of the contextual constraints using a so-called "forest of values". The meta-grammar formalism makes it possible to capture complicated grammatical relations with a maintainable number of rules. Besides describing the synt system, we present the process of meta-grammar development, one of whose first phases is the construction of corpus data for testing. We demonstrate the use of this corpus in testing a method for selecting the "best analysis", and report the results of testing the synt analysis on a Czech corpus.
Related projects:	Translation of Czech Sentences to Transparent Intensional Logic Constructions; Intelligent methods for increasing the reliability of electrical networks; Centrum komputační lingvistiky