AUEB Stats Seminars 24/3/2021: Improved estimation of partially-specified models by I. Kosmidis (Univ. of Warwick) από grstats
Tue 30 Mar 2021 - 12:52
AUEB Stats Seminar Series
Posterior summaries of topic models: an example from grocery retail baskets
Ioanna Manolopoulou, Department of Statistical, Science, UCL, UK
Joint work with: Mariflor Vega and Mirco Musolesi
Πέμπτη, Απρίλιος 1, 2021 - 12:30
Teams link: https://bit.ly/3opz2kK
ΠΕΡΙΛΗΨΗ
Understanding the shopping motivations behind market baskets is an important goal in the grocery retail industry. Analyzing shopping transactions demands techniques that can cope with the volume and complicated dependencies of grocery transactional data, while keeping interpretable outcomes. Latent Dirichlet Allocation (LDA) provides a natural framework to process grocery transactions and to discover a broad representation of customers' shopping motivations. However, summarising the posterior distribution of an LDA model is challenging, because LDA is inherently a mixture model and can exhibit substantial label-switching. Averaging across posterior draws (even after resolving label-switching) inevitably merges semantically different topics which may appear or disappear across draws, and whose average may be semantically meaningless. Moreover, a summary of corresponding uncertainty is not straightforwardly available. In this paper, we introduce clustering methodology that post-processes posterior LDA draws to summarise the entire posterior distribution and identify semantic modes represented as recurrent topics. We illustrate our methods on an example from a large UK supermarket chain.
Teams link: https://bit.ly/3opz2kK
Ημερομηνία Εκδήλωσης:
Πέμπτη, Απρίλιος 1, 2021 - 12:30
- AUEB Stats Seminars 24/3/2021: Improved estimation of partially-specified models by I. Kosmidis (Univ. of Warwick)
- AUEB Stats Seminars 24/6/2021: Nonparametric and high-dimensional functional graphical models by E. Solea
- AUEB Stats Seminars 10/6/2021: Monte-Carlo Statistical Methods for Parameter Estimation of the GreenLab Plant Growth Model by S. Trevezas
- AUEB Stats Seminars 26/11/2021: A new framework of semi-Markov processes for parameter estimation and Reliability Analysis by Andreas Makrides (University of the Aegean)
- AUEB Stats Seminars NEW DATE! 22/4/2021: Scalable inference for epidemic models with individual level data by P. Touloupou (U.of Birmingham)
Permissions in this forum:
You cannot reply to topics in this forum