Project information
Pattern Recognition-based Statistically Enhanced MT
(PRESEMT)
- Project Identification
- 248307
- Project Period
- 1/2010 - 12/2012
- Investor / Pogramme / Project type
-
European Union
- 7th Specific RTD Programme
- Cooperation
- MU Faculty or unit
- Faculty of Informatics
- Cooperating Organization
-
Institute for Language and Speech Processing
- Responsible person George Tambouratzis
Norwegian University of Science and Technology
National Technical University of Athens
Lexical Computing Ltd.
This proposal describes PRESEMT, a flexible and adaptable MT system, based on a language-independent method, whose principles ensure easy portability to new language pairs. This method attempts to overcome well-known problems of other MT approaches, e.g. bilingual corpora compilation or creation of new rules per language pair. PRESEMT will address the issue of effectively managing multilingual content and is expected to suggest a language-independent machine-learning-based methodology. The key aspects of PRESEMT involve syntactic phrase-based modelling, pattern recognition approaches (such as extended clustering or neural networks) or game theory techniques towards the development of a language-independent analysis, evolutionary algorithms for system optimisation. It is intended to be of a hybrid nature, combining linguistic processing with the positive aspects of corpus-based approaches, such as SMT and EBMT.
Publications
Total number of publications: 14
2011
-
Syntactic Analysis Using Finite Patterns: A New Parsing System for Czech
Human Language Technology. Challenges for Computer Science and Linguistics, year: 2011
-
Time Dimension in the Dolphin Nick Knowledge Base Using Transparent Intensional Logic
Proceedings of 14th International Conference on Text, Speech, and Dialogue (TSD 2011), year: 2011
2010
-
Fast syntactic searching in very large corpora for many languages
PACLIC 24 Proceedings of the 24th Pacific Asia Conference on Language, Information and Computation, year: 2010
-
Through Low-Cost Annotation to Reliable Parsing Evaluation
PACLIC 24 Proceedings of the 24th Pacific Asia Conference on Language, Information and Computation, year: 2010