Topic Modelling of the Czech Supreme Court Decisions

Varování

Publikace nespadá pod Ústav výpočetní techniky, ale pod Právnickou fakultu. Oficiální stránka publikace je na webu muni.cz.
Autoři

NOVOTNÁ Tereza HARAŠTA Jakub KÓL Jakub

Rok publikování 2020
Druh Článek ve sborníku
Konference Proceedings of the Fourth Workshop on Automated Semantic Analysis of Information in Legal Text held online in conjunction with the 33rd International Conference on Legal Knowledge and Information Systems (JURIX 2020)
Fakulta / Pracoviště MU

Právnická fakulta

Citace
www Open access sborníku
Klíčová slova topic modelling; Latent Dirichlet Allocation; Non-negative Matrix Factorization; court decisions; coherence score
Popis The Czech Supreme Court produces significant amount of decisions totalling more than 130 000 decisions since 1993. The amount makes it difficult for law practitioners to research this case law. This work focuses on topic models for enhanced information retrieval through identification of case law approaching the same or similar issues. We provide initial quantitative evaluation of Latent Dirichlet Allocation (LDA) and Non-negative Matrix Factorization (NMF) models according to CV coherence score for different number of topics modelled n= {10, 20, ..., 90, 100}. Additionally, we provide qualitative evaluation for LDA and NMF models n= {20, 30} that will serve as a starting point for subsequent expert-user evaluation.
Související projekty:

Používáte starou verzi internetového prohlížeče. Doporučujeme aktualizovat Váš prohlížeč na nejnovější verzi.

Další info