Probabilistic Dynamic Non-negative Group Factor Model for Multi-source Text Mining
Lu, Chien; Peltonen, Jaakko; Nummenmaa, Jyrki; Järvelin, Kalervo (2020-10)
Lu, Chien
Peltonen, Jaakko
Nummenmaa, Jyrki
Järvelin, Kalervo
ACM
10 / 2020
Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi:tuni-202012038459
https://urn.fi/URN:NBN:fi:tuni-202012038459
Kuvaus
Peer reviewed
Tiivistelmä
Nonnegative matrix factorization (NMF) is a popular approach to model data, however, most models are unable to flexibly take into account multiple matrices across sources and time or apply only to integer-valued data. We introduce a probabilistic, Gaussian Process-based, more inclusive NMF-based model which jointly analyzes nonnegative data such as text data word content from multiple sources in a temporal dynamic manner. The model collectively models observed matrix data, source-wise latent variables, and their dependencies and temporal evolution with a full-fledged hierarchical approach including flexible nonparametric temporal dynamics. Experiments on simulated data and real data show the model out-performs, comparable models. A case study on social media and news demonstrates the model discovers semantically meaningful topical factors and their evolution.
Kokoelmat
- TUNICRIS-julkaisut [19195]