jump to main area
:::
A- A A+

Seminars

Statistical Exploratory Data Analysis in the Modern Era

  • 2013-10-29 (Tue.), 10:30 AM
  • Recreation Hall, 2F, Institute of Statistical Science
  • Professor Susan Wilson
  • Australian National University and University of New South Wales

Abstract

Major challenges arising from today’s “data deluge” include how to handle the commonly occurring situation of different types of variables (say, continuous and categorical) being simultaneously measured, as well as how to assess the accompanying flood of questions. Based on information theory, a bias-corrected mutual information (BCMI) measure of association that is valid and estimable between all basic types of variables has been proposed. It has the advantage of being able to identify non-linear as well as linear relationships. Based on the BCMI measure, a novel exploratory approach to finding associations in data sets having a large number of variables of different types has been developed. These associations can be used as a basis for downstream analyses such as finding clusters and networks. The application of this exploratory approach is very general. Comparisons also will be made with other measures. Illustrative examples include exploring relationships (i) in clinical and genomic (say, gene expression and genotypic) data, and (ii) between social, economic, health and political indicators from the World Health Organisation.?

Update:
scroll to top