Issue |
EAS Publications Series
Volume 77, 2016
Statistics for Astrophysics: Clustering and Classification
|
|
---|---|---|
Page(s) | 195 - 219 | |
DOI | https://doi.org/10.1051/eas/1677009 | |
Published online | 26 May 2016 |
D. Fraix-Burnet and S. Girard (eds)
EAS Publications Series, 77 (2016) 195-219
Modelling structured data with Probabilistic Graphical Models
INRIA Grenoble Rhône-Alpes, Mistis team, 38335 Montbonnot, France
Most clustering and classification methods are based on the assumption that the objects to be clustered are independent. However, in more and more modern applications, data are structured in a way that makes this assumption not realistic and potentially misleading. A typical example that can be viewed as a clustering task is image segmentation where the objects are the pixels on a regular grid and depend on neighbouring pixels on this grid. Also, when data are geographically located, it is of interest to cluster data with an underlying dependence structure accounting for some spatial localisation. These spatial interactions can be naturally encoded via a graph not necessarily regular as a grid. Data sets can then be modelled via Markov random fields and mixture models (e.g. the so-called MRF and Hidden MRF). More generally, probabilistic graphical models are tools that can be used to represent and manipulate data in a structured way while modeling uncertainty. This chapter introduces the basic concepts. The two main classes of probabilistic graphical models are considered: Bayesian networks and Markov networks. The key concept of conditional independence and its link to Markov properties is presented. The main problems that can be solved with such tools are described. Some illustrations are given associated with some practical work.
© EAS, EDP Sciences, 2016