Hyppää sisältöön
    • Suomeksi
    • In English
Trepo
  • Suomeksi
  • In English
  • Kirjaudu
Näytä viite 
  •   Etusivu
  • Trepo
  • Väitöskirjat
  • Näytä viite
  •   Etusivu
  • Trepo
  • Väitöskirjat
  • Näytä viite
JavaScript is disabled for your browser. Some features of this site may not work without it.

Multi-label Text Classification with Deep Learning Models

Yang, Zhen (2026)

 
Avaa tiedosto
978-952-03-4452-8.pdf (10.91Mt)
Lataukset: 



Yang, Zhen
Tampere University
2026

Tieto- ja sähkötekniikan tohtoriohjelma - Doctoral Programme in Computing and Electrical Engineering
Informaatioteknologian ja viestinnän tiedekunta - Faculty of Information Technology and Communication Sciences
This publication is copyrighted. You may download, display and print it for Your own personal use. Commercial use is prohibited.
Väitöspäivä
2026-03-30
Näytä kaikki kuvailutiedot
Julkaisun pysyvä osoite on
https://urn.fi/URN:ISBN:978-952-03-4452-8
Tiivistelmä
Multi-label text classification (ML TC) assigns multiple, potentially dependent labels to each document and underpins applications such as medical diagnosis, legal tag­ging, and news categorization. In general, ML TC must handle correlated and often hierarchical labels, severe class imbalance, and long-tailed distributions. Two main approaches exist: binary relevance (BR) and multi-label (ML) methods. While BR methods are simple and scalable, they ignore label dependencies. Conversely, ML methods lack general guidelines and can sometimes underperform BR in practice.

This thesis evaluates BR for multi-label text classification in modern deep learning. Across diverse datasets, a convolutional BR model (BR-CNN) outperforms traditional BR methods. We further test whether CNNs that model label depen­dencies can close this gap. Two variants-Implicit-Threshold CNN (IT-CNN) and Adaptive-Threshold CNN (AT-CNN) -are compared with a standard multi-label CNN (M-CNN) and BR-CNN. Results show that dependency-focused models are not always guaranteed to exceed BR-CNN performance on multi-label datasets.

Another contribution of this thesis is the Partial-Dependency Binary Relevance (PDBR-CNN) method, a binary relevance architecture that selectively encodes par­tial label dependencies while retaining the robustness and scalability of BR. Across benchmark datasets, PDBR-CNN achieves substantial improvements over classical BR variants (e.g., classifier chains, stacking) and delivers competitive, often superior, performance compared to recent deep multi-label models.

Overall, this thesis systematically examines the performance of deep learning based BR methods and introduces novel CNN-based architectures. The results show that BR can be extended to incorporate deep neural modeling and label-dependency information, achieving superior performance with reasonable runtime.
Kokoelmat
  • Väitöskirjat [5272]
Kalevantie 5
PL 617
33014 Tampereen yliopisto
oa[@]tuni.fi | Tietosuoja | Saavutettavuusseloste
 

 

Selaa kokoelmaa

TekijätNimekkeetTiedekunta (2019 -)Tiedekunta (- 2018)Tutkinto-ohjelmat ja opintosuunnatAvainsanatJulkaisuajatKokoelmat

Omat tiedot

Kirjaudu sisäänRekisteröidy
Kalevantie 5
PL 617
33014 Tampereen yliopisto
oa[@]tuni.fi | Tietosuoja | Saavutettavuusseloste