Medical Knowledge Resources for Text-Mining of Health Records in Czech, Polish, and Slovak

Anetta,  Krištof

Medical Knowledge Resources for Text-Mining of Health Records in Czech, Polish, and Slovak

Varování

Publikace nespadá pod Ústav výpočetní techniky, ale pod Fakultu informatiky. Oficiální stránka publikace je na webu muni.cz.

Autoři	ANETTA Krištof
Rok publikování	2022
Druh	Článek ve sborníku
Konference	Proceedings of the Sixteenth Workshop on Recent Advances in Slavonic Natural Languages Processing, RASLAN 2022
Fakulta / Pracoviště MU	Fakulta informatiky
Citace
www	Plný text Domovská stránka workshopu
Klíčová slova	EHR; electronic health records; healthcare text; UMLS; ICD10; SNOMED CT; MedDRA; MeSH; NLP; natural language processing; Slavic languages; Polish; Czech; Slovak
Popis	Knowledge extraction from medical text in small languages like Czech, Polish or Slovak is challenging due to the insufficiency of languagespecific medical resources (pretrained models, ontologies, dictionaries). This paper is a survey of noteworthy options for researchers targeting these languages, divided into two sections. First, since the UMLS Metathesaurus for English is by far the most extensive and detailed medical knowledge resource in Western medicine, appreciable results can be achieved by machine-translating the mined text to English – therefore, the relevant English components of UMLS are introduced. Second come the languagespecific resources for each language, detailing the publishing institutions, current website locations, contents, and file formats. The contribution of this paper is in collecting and pre-screening widely disparate sources needed for successful medical knowledge extraction in Central European Slavic languages.
Související projekty:	Interní grantová agentura Masarykovy univerzity AIcope - AI support for Clinical Oncology and Patient Empowerment New Horizons of Electronic Health Record Analysis using Deep Learning