Detecting Paraphrases of Standard Clause Titles in Insurance Contracts
- For the analysis of contract texts, validated model texts, such as model clauses, can be used to identify used contract clauses. This paper investigates how the similarity between titles of model clauses and headings extracted from contracts can be computed, and which similarity measure is most suitable for this. For the calculation of the similarities between title pairs we tested various variants of string similarity and token based similarity. We also compare two additional semantic similarity measures based on word embeddings using pre-trained embeddings and word embeddings trained on contract texts. The identification of the model clause title can be used as a starting point for the mapping of clauses found in contracts to verified clauses.
Author: | Frieda JosiORCiD, Christian WartenaORCiDGND, Ulrich Heid |
---|---|
URN: | urn:nbn:de:bsz:960-opus4-13375 |
URL: | https://www.aclweb.org/anthology/W19-0803 |
DOI: | https://doi.org/10.25968/opus-1337 |
ISBN: | 978-1-950737-22-2 |
Parent Title (English): | RELATIONS - Workshop on meaning relations between phrases and sentences (May 23, 2019, Gothenburg, Sweden) |
Document Type: | Conference Proceeding |
Language: | English |
Year of Completion: | 2019 |
Publishing Institution: | Hochschule Hannover |
Contributing Corporation: | Association for Computational Linguistics |
Release Date: | 2019/05/29 |
Tag: | Contract Analysis; Paraphrase Similarity; Similarity Measures; Text Similarity; Title Matching |
GND Keyword: | Paraphrase; Vertragsklausel; Ähnlichkeit; Versicherungsvertrag |
First Page: | 23 |
Last Page: | 33 |
Link to catalogue: | 1690547723 |
Institutes: | Fakultät III - Medien, Information und Design |
DDC classes: | 020 Bibliotheks- und Informationswissenschaft |
Licence (German): | Creative Commons - CC BY - Namensnennung 4.0 International |