TeCoPhy: A Text Corpus of German Physics Texts
- To learn a subject, the acquisition of the associated technical language is important. Despite this widely accepted importance of learning the technical language, hardly any studies are published that describe the characteristics of most technical languages that students are supposed to learn. This might largely be due to the absence of specialized text corpora to study such languages at lexical, syntactical and textual level. In the present paper we describe a corpus of German physics text that can be used to study the language used in physics. A large and a small variant are compiled. The small version of the corpus consists of 5.3 Million words and is available on request.
Author: | Vitor Lécio Lecarda FontanellaORCiD, Tom Bleckmann, Lukas Dieckhoff, Gunnar FriegeORCiD, Christian WartenaORCiDGND |
---|---|
URN: | urn:nbn:de:bsz:960-opus4-27964 |
URL: | https://cilc2023.wordpress.com/book-of-abstracts/ |
DOI: | https://doi.org/10.25968/opus-2796 |
Parent Title (English): | Corpus Linguistics in the Digital Era: Genres, Registers and Domains ; 14th International Conference on Corpus Linguistics - May 10 - 12, 2023 |
Document Type: | Conference Proceeding |
Language: | English |
Year of Completion: | 2023 |
Publishing Institution: | Hochschule Hannover |
Release Date: | 2023/05/31 |
Tag: | Corpus construction; German; Physics; Textbooks |
GND Keyword: | Korpus <Linguistik>; Physik; Deutsch |
First Page: | 122 |
Last Page: | 123 |
Link to catalogue: | 1853108790 |
Institutes: | Fakultät III - Medien, Information und Design |
DDC classes: | 410 Linguistik |
Licence (German): | Creative Commons - CC BY-SA - Namensnennung - Weitergabe unter gleichen Bedingungen 4.0 International |