Volltext-Downloads (blau) und Frontdoor-Views (grau)

Automatic Identification of Synonym Relations in the Dutch Parliament’s Thesaurus

  • For indexing archived documents the Dutch Parliament uses a specialized thesaurus. For good results for full text retrieval and automatic classification it turns out to be important to add more synonyms to the existing thesaurus terms. In the present work we investigate the possibilities to find synonyms for terms of the parliaments thesaurus automatically. We propose to use distributional similarity (DS). In an experiment with pairs of synonyms and non-synonyms we train and test a classifier using distributional similarity and string similarity. Using ten-fold cross validation we were able to classify 75% of the pairs of a set of 6000 word pairs correctly.

Download full text files

Export metadata

Additional Services

Search Google Scholar


Author:Rosa Tsegaye Aga, Christian WartenaORCiDGND, Otto Lange, Nelleke Aders
DOI original:https://doi.org/10.5445/KSP/1000058749/23
Parent Title (English):Archives of Data Science, Series A
Document Type:Article
Year of Completion:2017
Publishing Institution:Hochschule Hannover
Release Date:2017/09/20
GND Keyword:Synononym; Automatische Identifikation; Thesaurus
Link to catalogue:1751852393
Institutes:Fakultät III - Medien, Information und Design
DDC classes:020 Bibliotheks- und Informationswissenschaft
Licence (German):License LogoCreative Commons - CC BY-SA - Namensnennung - Weitergabe unter gleichen Bedingungen 4.0 International