
Classifying Medical Literature Using k-Nearest-Neighbours Algorithm

  • The number of papers published each year has been increasing for decades. Libraries need to make these resources accessible and available, and classification is an important part of this process. This paper analyzes the prerequisites for and possibilities of automatic classification of medical literature. We explain the selection, preprocessing and analysis of data consisting of catalogue datasets from the library of the Hanover Medical School, Lower Saxony, Germany. In the present study, 19,348 documents, represented by notations from library classification systems such as the Dewey Decimal Classification (DDC), were classified into 514 different classes of the National Library of Medicine (NLM) classification system. The algorithm used was k-nearest-neighbours (kNN). A correct classification rate of 55.7% was achieved. To the best of our knowledge, this is not only the first study on the use of the NLM classification in automatic classification but also the first approach that exclusively considers notations already assigned from other classification systems for this purpose.
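
The idea summarized in the abstract, predicting an NLM class from notations a document already carries in other classification systems, can be illustrated with a minimal kNN sketch. This is not the authors' implementation: the abstract does not state the feature encoding, the similarity measure or the value of k, so Jaccard similarity over notation sets, majority voting and k=3 are assumptions made here. The records in `TRAINING` are invented for illustration, and the `OTHER:` prefix is a hypothetical placeholder for a second classification system.

```python
from collections import Counter


def jaccard(a, b):
    """Jaccard similarity between two sets of notations (assumed measure)."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def knn_classify(query, training, k=3):
    """Majority vote among the k training records most similar to `query`."""
    # sorted() is stable, so equally similar records keep their original order.
    ranked = sorted(training, key=lambda rec: jaccard(query, rec[0]), reverse=True)
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


def accuracy(test, training, k=3):
    """Share of test records whose predicted class equals the assigned class."""
    hits = sum(knn_classify(notations, training, k) == label
               for notations, label in test)
    return hits / len(test)


# Invented toy records: (notations from other systems, NLM class).
TRAINING = [
    ({"DDC:616.12", "OTHER:A1"}, "WG"),
    ({"DDC:616.12", "OTHER:A2"}, "WG"),
    ({"DDC:616.8", "OTHER:B1"}, "WL"),
    ({"DDC:616.8", "OTHER:B2"}, "WL"),
    ({"DDC:617.6"}, "WU"),
]

if __name__ == "__main__":
    print(knn_classify({"DDC:616.12"}, TRAINING))
    print(knn_classify({"DDC:616.8", "OTHER:B2"}, TRAINING))
```

A "correct classification rate" such as the one reported in the abstract corresponds to what `accuracy` computes on held-out records; the toy data here says nothing about the figure the paper reports.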

Metadata
Author:Andreas Lüschow, Christian Wartena
URN:urn:nbn:de:bsz:960-opus4-11464
URL:http://ceur-ws.org/Vol-1937/paper3.pdf
ISSN:1613-0073
Parent Title (English):NKOS 2017: 17th European Networked Knowledge Organization Systems (NKOS) Workshop ; Proceedings of the 17th European Networked Knowledge Organization Systems Workshop co-located with the 21st International Conference on Theory and Practice of Digital Libraries 2017 (TPDL 2017), Thessaloniki, Greece, September 21st, 2017
Document Type:Conference Proceeding
Language:English
Year of Completion:2017
Release Date:2017/09/25
GND Keyword:Algorithm; Classification; Medical library
First Page:26
Last Page:38
Institutes:Fakultät III - Medien, Information und Design
Dewey Decimal Classification:020 Library and information sciences
Licence (German):Copyright notice