On the Harmonisation of Time Series Data for the Optimisation of Machine Learning Using the Example of Rejection Prediction After Kidney Transplantation
- A significant risk following a kidney transplantation is graft loss. The Screen Reject Project has developed a Clinical Data Warehouse (CDWH) as a foundation for a clinical decision support system designed to improve the diagnosis of graft rejections. The CDWH integrates patient data and event records of n = 141 kidney transplant patients. These data are not directly comparable within the cohort as they consist of irregular time series, particularly of laboratory values. Therefore, a pre-processing routine was developed which divides a relative time window before the last biopsy (the relevant end event of the reference period for subsequent machine learning procedures) into equal time intervals for each patient. For each of these intervals a representative value is calculated from the contained laboratory values. These representative values are used to train models for predicting kidney rejection. The comparison with an existing study from the project, in which a classification model was developed without considering the temporal dependencies, shows an improved sensitivity and specificity in predicting kidney rejection for the harmonised data using the same random forest model.
| Author: | Darian LiehrORCiD, Matthias KatzensteinerORCiD, Oliver J. BottORCiDGND |
|---|---|
| URN: | urn:nbn:de:bsz:960-opus4-36289 |
| DOI: | https://doi.org/10.25968/opus-3628 |
| DOI original: | https://doi.org/10.3233/SHTI250275 |
| ISBN: | 9781643685960 |
| ISSN: | 0926-9630 |
| Parent Title (English): | Intelligent Health Systems – From Technology to Data and Knowledge (Studies in Health Technology and Informatics ; 327) |
| Publisher: | IOS Press |
| Document Type: | Part of a Book |
| Language: | English |
| Year of Completion: | 2025 |
| Publishing Institution: | Hochschule Hannover |
| Release Date: | 2025/06/24 |
| Tag: | data harmonisation; data management; kidney transplant; machine learning; rejection diagnostics; secondary use; time series data |
| GND Keyword: | NierentransplantationGND; Maschinelles LernenGND; ZeitreihenanalyseGND |
| First Page: | 68 |
| Last Page: | 72 |
| Institutes: | Fakultät III - Medien, Information und Design |
| Data|H - Institute for Applied Data Science Hannover | |
| DDC classes: | 610 Medizin, Gesundheit |
| 004 Informatik | |
| Licence (German): | Creative Commons - CC BY-ND - Namensnennung - Keine Bearbeitungen 4.0 International |






