Threshold-learned CNN for multi-label text classification of electronic health records
Yang, Zhen; Emmert-Streib, Frank (2023-08-28)
Avaa tiedosto
Lataukset:
Yang, Zhen
Emmert-Streib, Frank
28.08.2023
IEEE Access
Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi:tuni-202309298534
https://urn.fi/URN:NBN:fi:tuni-202309298534
Kuvaus
Peer reviewed
Tiivistelmä
<p> Text data in the form of natural language is a valuable resource that contains domain-specific information applicable to various applications. An example are electronic health records (eHR) offering comprehensive insights into patients’ health histories, enabling knowledge extraction for clinical diagnosis and treatment. In this paper, we study multi-label text classification (MLTC) of eHR data by introducing two novel MLTC methods based on a threshold-learned convolutional neural network (CNN). We conduct comprehensive comparisons with other multi-label models and binary relevance (BR). Importantly, we do not only optimize the architecture of multi-label classifiers but also of the baseline BR model. As a result, our findings indicate that the adaptive-threshold CNN (AT-CNN) and implicit-threshold CNN (IT-CNN) provide a favorable approximation of a binary CNN (B-CNN) with the added benefit of improved runtime efficiency. The latter is crucial when the number of classes grows larger because the runtime of classifiers based on one-vs-rest mappings becomes increasingly prohibitive for such configurations.</p>
Kokoelmat
- TUNICRIS-julkaisut [20153]