TeCoPhy: A Text Corpus of German Physics Texts

  • Acquiring the associated technical language is an important part of learning a subject. Despite this widely accepted importance, hardly any published studies describe the characteristics of most technical languages that students are supposed to learn. This might largely be due to the absence of specialized text corpora for studying such languages at the lexical, syntactic, and textual levels. In the present paper we describe a corpus of German physics texts that can be used to study the language used in physics. A large and a small variant have been compiled. The small version of the corpus consists of 5.3 million words and is available on request.


Author: Vitor Lécio Lecarda Fontanella, Tom Bleckmann, Lukas Dieckhoff, Gunnar Friege, Christian Wartena
Parent Title (English): Corpus Linguistics in the Digital Era: Genres, Registers and Domains; 14th International Conference on Corpus Linguistics, May 10-12, 2023
Document Type: Conference Proceeding
Year of Completion: 2023
Publishing Institution: Hochschule Hannover
Release Date: 2023/05/31
Tag: Corpus construction; German; Physics; Textbooks
GND Keyword: Korpus <Linguistik>; Physik; Deutsch
First Page: 122
Last Page: 123
Link to catalogue: 1853108790
Institutes: Fakultät III - Medien, Information und Design
DDC classes: 410 Linguistics
Licence: Creative Commons - CC BY-SA - Attribution - ShareAlike 4.0 International