Inside the mixOmics package

The mixOmics R package is organised into three main parts: methods to analyse data, plotting functions and example data sets. Below is a non-exhaustive list of what you can find in the mixOmics package and their relevant publications.

  1. Statistical methods to analyse high throughput data

Note: The integrative and supervised methods in mixOmics are summarised and presented in our mixOmics article – Rohart et al. 2017

2. Plotting functions to display and interpret the results

  • 2D and 3D sample plots (with optional confidence ellipses)
  • Arrow plots (to visualise paired coordinates)
  • Relevance Network Graphs (to see associations between variables) – González et al. 2012
  • Clustered Image Maps (heatmaps for expression values or correlation) – González et al. 2012
  • Correlation circle plots (correlation of variables to latent components) – González et al. 2012
  • Circos plots for DIABLO analyses (see how datasets relate to each other) – Singh et al. 2019
  • Loading plots (to see variable importance) – Lê Cao et al. 2016

3. Example data sets

Each type of biological question can be answered with a specific method. This is why we provide in the package a whole range of case studies to illustrate each method. 

Single omics:

  • multidrug (ABC transporters and compounds data for 60 samples from different cell lines. Used in sPCA case study.) – Szakács et al., 2004
  • srbct (gene expression data for 63 samples grouped into tumour classes. Used in sPLS-DA case study) – Khan et al, 2001
  • breast.tumors (mRNA data for 47 samples, with missing data)
  • vac18 (gene expression data for 42 samples across different stimulation groups. Used in multilevel case study.) – Salmon-Céron et al. 2010
  • vac18.simulated (simulated time-course gene expression data for 48 samples across different stimulation groups)
  • linnerud (very small dataset with 3 physiological metrics and 3 exercise metrics measured across 20 participants. Used to illustrate key concepts)
  • yeast (metabolite data across 37 samples and different strains and conditions)

Multiple omics:

Multiple studies:

Microbiome data: