Association Analyzer Implementation: State of the Art: Deliverable 8.1 of project EuDML
Authors | |
---|---|
Year of publication | 2010 |
MU Faculty or unit | |
Citation | |
Description | This report focuses on two key technologies: Citation Indexing and Document Clustering. Citation Indexing concerns the automatic parsing and linking of citations to create a network of documents within the collection. This technology is well established in digital libraries and searchable archives such as CiteSeerX, Google Scholar, general projects as DRIVER, and mathematical specific digital libraries such as NUMDAM, DML-CZ or referative databases Zentralblatt MATH and Mathematical Reviews. Document Classification and Clustering are also established technologies within Information Retrieval but have not to date been widely used within digital libraries. In particular, there is very little previous work applying classification and clustering techniques to mathematical documents. However, initial research appears promising and we believe that the addition of these technologies will allow facilities beyond the current state of the art. |
Related projects: |