Annotated Corpus of Czech Case Law for Reference Recognition Tasks
Authors | |
---|---|
Year of publication | 2018 |
Type | Article in Proceedings |
Conference | Text, Speech, and Dialogue: 21st International Conference |
DOI | http://dx.doi.org/10.1007/978-3-030-00794-2_26 |
Keywords | Reference recognition; dataset; legal texts; manual annotation |
Description | We describe an annotated corpus of 350 decisions of Czech top-tier courts, gathered for a project assessing the relevance of court decisions in Czech law. We describe two layers of processing of the corpus; every decision was annotated by two trained annotators and then manually adjudicated by one trained curator to resolve any disagreements between the annotators. The corpus was developed as training and testing material for reference recognition tasks, which will be further used in research on the assessment of legal importance. However, the overall shortage of available research corpora of annotated legal texts, particularly in the Czech language, leads us to believe that other research teams may find it useful as well. |