Building A Thesaurus Using LDA-Frames

Warning

This publication doesn't include Institute of Computer Science. It includes Faculty of Informatics. Official publication website can be found on muni.cz.
Authors

MATERNA Jiří

Year of publication 2012
Type Article in Proceedings
Conference 6th Workshop on Recent Advances in Slavonic Natural Language Processing
MU Faculty or unit

Faculty of Informatics

Citation
Web https://nlp.fi.muni.cz/raslan/2012/paper02.pdf
Field Informatics
Keywords LDA-frames; thesaurus
Description In this paper we present a new method for measuring semantic relatedness of lexical units, which can be used to generate a thesaurus automatically. The method is based on a comparison of probability distributions of semantic frames generated using the LDA-frames algorithm. The idea is evaluated by measuring the overlap of WordNet synsets and generated semantic clusters. The results show that the method outperforms another automatic approach used in the Sketch Engine project.
Related projects:

You are running an old browser version. We recommend updating your browser to its latest version.

More info