Web Interface and Collection for Mathematical Retrieval : WebMIaS and MREC
Authors | |
---|---|
Year of publication | 2011 |
Type | Article in Proceedings |
Conference | DML 2011: Towards a Digital Mathematics Library |
MU Faculty or unit | |
Citation | |
Web | http://dml.cz/handle/10338.dmlcz/702604 |
Field | Informatics |
Keywords | math indexing and retrieval; mathematical digital libraries; information systems; information retrieval; mathematical content search; document ranking of mathematical papers; math text mining; WebMIaS; MIaS; Tralics; TeX; UMCL; Lucene |
Description | We demonstrate searching of mathematical expressions in technical digital libraries on a MREC collection of 439,423 real scientific documents with more than 158 million mathematical formulae. Our solution - the WebMIaS system - allows the retrieval of mathematical expressions written in TEX or MathML. TEX queries are converted on-the-fly into tree representations of Presentation MathML, which is used for indexing. WebMIaS allows complex queries composed of plain text and mathematical formulae, using MIaS (Math Indexer and Searcher), a math aware search engine based on the state-of-the-art system Lucene. MIaS implements proximity math indexing with a subformulae similarity search. |
Related projects: |