🚀 mixOmics v6.32.0 released on Bioconductor 3.21

mixOmics v6.32.0 is now available on Bioconductor 3.21, compatible with R 4.5.0. This update brings new features, bug fixes, and improvements based on your feedback.

What’s new since the last Bioconductor release:

🔬 New features and enhancements

plotLoadings() now supports ggplot2-style plots with fully customisable aesthetics
tune() now supports tuning components separately or alongside variables
New function perf.assess() evaluates final model performance

⚙️ Improved performance and reproducibility

tune() and perf() now support parallel processing using the BPPARAM argument and accept a seed argument to improve reproducibility
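
As a rough illustration of the two arguments mentioned above, the sketch below runs perf() twice with the same seed on a parallel backend and compares the results. The BPPARAM and seed arguments come from the release notes. The dataset (srbct), the model settings and the choice of SnowParam(workers = 2) are assumptions made for this example, and the exact behaviour should be checked against the package documentation for your installed version.

```r
library(mixOmics)
library(BiocParallel)

# Example data shipped with mixOmics (chosen here only for illustration)
data(srbct)
X <- srbct$gene
Y <- srbct$class

# A small sPLS-DA model; ncomp and keepX are arbitrary illustrative values
model <- splsda(X, Y, ncomp = 2, keepX = c(20, 20))

# Run repeated cross-validation on two workers with a fixed seed.
# Swap SnowParam(workers = 2) for SerialParam() to run on a single core.
run_perf <- function() {
  perf(model, validation = "Mfold", folds = 5, nrepeat = 3,
       BPPARAM = SnowParam(workers = 2),
       seed = 123)
}

perf.a <- run_perf()
perf.b <- run_perf()

# With the same seed, the two runs are expected to give the same error rates
all.equal(perf.a$error.rate, perf.b$error.rate)
```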

🧹 Bug fixes and usability improvements

plotIndiv() now correctly handles pch ordering and ellipse colours
Better error message in perf() when a class has only one sample
Streamlined multiblock functions by removing unused arguments

📦 Install this version:

if (!require("BiocManager", quietly = TRUE))
  install.packages("BiocManager") 
BiocManager::install("mixOmics")

🔍 For a full list of changes, visit the README on our GitHub repo.

New Performance Assessment & Parameter Tuning – Beta test now!

We’ve streamlined performance assessment and parameter tuning functions, available for beta testing before the next Bioconductor release in April!

What’s New?

New perf.assess(): Assesses only the final model’s performance, returning key metrics (no plots). PR #344
Enhanced tune(): Now supports tuning components separately or alongside variables across multiple model types. PR #348
New documentation pages: Explore new webpages explaining key concepts and usage of these functions.

Get Involved

Install the latest development version using
devtools::install_github("mixOmicsTeam/mixOmics", ref = "6.31.4")
Test the new functions on your models
Share feedback on the User Forum, or report any bugs you find on GitHub Issues

Try it out and help us refine these features before official release!

mixOmics website update

We’re pleased to share that the mixOmics website has undergone a redesign to enhance your browsing experience and make it easier to access our resources.

What’s New?

✨ Refreshed Design: A cleaner, more modern layout
📚 Expanded Getting Started Pages: Pages to help you get up and running with mixOmics
🧭 Reorganized Navigation: A more intuitive menu to quickly find key resources
🔗 Updated Social Links: Stay connected with the mixOmics community
💬 Direct Links to the User Forum: If you haven’t already, join our mixOmics user forum to connect with over 500 other users and experts
🧑‍💻 Updated About Pages: Learn more about the project and our team
📅 Streamlined Workshops, Webinars, and News Sections: Easier access to events and updates
🖥️ Embedded R Markdown Pages: Improved code presentation with syntax highlighting in our Methods, Plots, and Case Studies pages

We are continuing to make small improvements, so if you encounter any issues or have feedback, please feel free to contact us.

Thank you for your continued support of mixOmics.

The mixOmics Team

Page from R Markdown

All methodologies implemented in mixOmics can handle missing values.
In particular, (s)PLS, (s)PLS-DA and (s)PCA utilise the NIPALS
(Non-linear Iterative Partial Least Squares) algorithm as part of their
dimension reduction procedures. This algorithm is built to handle NAs [1].
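
To give a feel for why NIPALS tolerates NAs, here is a minimal base R sketch of the iteration. It is an illustration only, not the mixOmics source: the function name nipals_sketch() and its arguments are hypothetical, the scores it returns are not normalised, and it assumes every row and every column has at least one observed value. The key idea is that each score and loading is a regression coefficient computed over the observed cells only, so missing cells are simply left out of the sums.

```r
# Minimal NIPALS sketch in base R (illustrative; not the mixOmics implementation).
# Assumes each row and each column of X has at least one observed value.
nipals_sketch <- function(X, ncomp = 2, max.iter = 500, tol = 1e-9) {
  X <- scale(as.matrix(X), center = TRUE, scale = FALSE)  # column means skip NAs
  obs <- !is.na(X)          # TRUE where a value was observed
  W <- obs + 0              # numeric 0/1 version used as weights
  scores   <- matrix(0, nrow(X), ncomp)
  loadings <- matrix(0, ncol(X), ncomp)

  for (h in seq_len(ncomp)) {
    X0 <- X
    X0[!obs] <- 0           # zeros drop out of the cross-products below
    t.h <- X0[, which.max(colSums(X0^2))]  # start from the highest-variance column

    for (iter in seq_len(max.iter)) {
      # Loadings: regress each column on t.h, using observed rows only
      p.h <- crossprod(X0, t.h) / crossprod(W, t.h^2)
      p.h <- p.h / sqrt(sum(p.h^2))
      # Scores: regress each row on p.h, using observed columns only
      t.new <- (X0 %*% p.h) / (W %*% p.h^2)
      converged <- sum((t.new - t.h)^2) < tol
      t.h <- t.new
      if (converged) break
    }

    scores[, h]   <- t.h
    loadings[, h] <- p.h
    X <- X - tcrossprod(t.h, p.h)  # deflate; NA cells stay NA
  }
  list(t = scores, p = loadings)
}

# Usage on simulated data with a few missing cells
set.seed(42)
X.sim <- matrix(rnorm(200), nrow = 20, ncol = 10)
X.sim[cbind(sample(20, 5), sample(10, 5))] <- NA
res <- nipals_sketch(X.sim, ncomp = 2)
dim(res$t)  # one row of scores per sample, one column per component
```

On complete data this iteration converges to the same directions as an SVD of the centred matrix. With missing data, no imputation step is needed to extract the components.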

This is implemented through the nipals() function within
mixOmics. This function is called internally by the above methods but
can also be used manually, as can be seen below.

Usage in mixOmics

library(mixOmics)
data(liver.toxicity)
X <- liver.toxicity$gene[, 1:100] # a reduced size data set

## pretend there are 20 NA values in our data
na.row <- sample(1:nrow(X), 20, replace = TRUE)
na.col <- sample(1:ncol(X), 20, replace = TRUE)
X.na <- as.matrix(X)

## fill these NA values in X
X.na[cbind(na.row, na.col)] <- NA
sum(is.na(X.na)) # number of cells with NA

## [1] 20

# this might take some time depending on the size of the data set
nipals.tune = nipals(X.na, ncomp = 10)$eig
barplot(nipals.tune, xlab = 'Principal component', ylab = 'Explained variance')

FIGURE 1: Column graph of the explained variance of each Principal
Component.

If missing values need to be imputed, the package provides
impute.nipals() for this scenario. NIPALS
is used to decompose the dataset. The resulting components, singular
values and feature loadings are then used to reconstitute the original
dataset, with estimated values in place of the missing ones. A large
number of components is used (ncomp = 10) to allow for the best
estimation of the missing values.
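
In matrix terms, the reconstruction described above can be sketched as follows. The notation is an assumption introduced here for illustration: t_h denotes the h-th component (taken to have unit norm), delta_h its singular value, p_h its loading vector and H the number of components (10 in this example).

```latex
\hat{X} = \sum_{h=1}^{H} \delta_h \, t_h \, p_h^{\top},
\qquad
x^{\text{imputed}}_{ij} =
\begin{cases}
x_{ij}       & \text{if } x_{ij} \text{ is observed},\\
\hat{x}_{ij} & \text{if } x_{ij} \text{ is missing}.
\end{cases}
```

Observed cells keep their measured values. Only the missing cells are filled in from the low-rank reconstruction.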

X.impute <- impute.nipals(X = X.na, ncomp = 10)
sum(is.na(X.impute)) # number of cells with NA

## [1] 0

The difference between the imputed and real values can be checked.
Here are the original values:

id.na = is.na(X.na) # determine position of NAs in the matrix

X[id.na] # show original values

##  [1]  0.09041 -0.04070  0.03497 -0.01712  0.01309  0.00233 -0.04142  0.11104
##  [9] -0.01519 -0.17034 -0.01641  0.15964  0.00557 -0.06217  0.04131  0.02157
## [17]  0.01226 -0.00753  0.03038 -0.00783

The values estimated by the NIPALS algorithm:

X.impute[id.na] # show imputed values

##  [1]  0.0837747419 -0.0190061068  0.0004024897 -0.0180879247 -0.0094185656
##  [6] -0.0312362158 -0.0706920015  0.1400817774  0.0083359545 -0.1158255139
## [11]  0.0164817649  0.1007897385  0.0236184385  0.0191934144  0.0214240977
## [16]  0.0686280312 -0.0039198425  0.0085870558  0.0450234407  0.0013964758
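
One way to summarise how close the imputed values are to the originals is to compute a few error measures. The sketch below is self-contained: it copies the two printed vectors from the output above. The choice of summary statistics (RMSE, mean absolute error, correlation) is an assumption made for illustration and is not part of the original tutorial.

```r
# Values copied from the output printed above
original <- c( 0.09041, -0.04070,  0.03497, -0.01712,  0.01309,  0.00233,
              -0.04142,  0.11104, -0.01519, -0.17034, -0.01641,  0.15964,
               0.00557, -0.06217,  0.04131,  0.02157,  0.01226, -0.00753,
               0.03038, -0.00783)
imputed  <- c( 0.0837747419, -0.0190061068,  0.0004024897, -0.0180879247,
              -0.0094185656, -0.0312362158, -0.0706920015,  0.1400817774,
               0.0083359545, -0.1158255139,  0.0164817649,  0.1007897385,
               0.0236184385,  0.0191934144,  0.0214240977,  0.0686280312,
              -0.0039198425,  0.0085870558,  0.0450234407,  0.0013964758)

err  <- imputed - original
rmse <- sqrt(mean(err^2))       # typical size of the imputation error
mae  <- mean(abs(err))          # less sensitive to a single large miss
r    <- cor(original, imputed)  # do imputed values track the originals?

round(c(RMSE = rmse, MAE = mae, correlation = r), 4)
```

In an interactive session the same vectors can be taken directly from the objects above, as.numeric(X[id.na]) and X.impute[id.na], instead of being typed in.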

References

[1] Wold, H. (1973). Nonlinear Iterative Partial Least Squares (NIPALS) Modelling:
Some Current Developments. Multivariate Analysis–III, 383-407.
https://doi.org/10.1016/b978-0-12-426653-7.50032-6

Test Post from R

This is a test post created via the REST API using R. It supports HTML formatting!

mixOmics 6.30.0 on Bioconductor

At the end of October 2024, Bioconductor was updated to version 3.20, and with it came the latest version of mixOmics, 6.30.0. You can install this version of mixOmics from Bioconductor. This release runs on R version 4.4 and includes some minor bug fixes, updated code and updated unit tests. See our GitHub page for more details on these updates.

Webinar: PLS methods

This webinar was presented at a seminar for a group of quantitative researchers (mostly statisticians) at the University of Melbourne. The abstract is below.

Topics covered: context of data integration, PCA solved with NIPALS algorithm and SVD, sparse PCA, correlation circle plot interpretation, PLS algorithms and deflation modes, sparse PLS.

Technological improvements have allowed for the collection of data from different types of molecules (e.g. genes, proteins, metabolites, microorganisms), resulting in multiple ‘omics data (e.g. transcriptomics, proteomics, metabolomics, microbiome) measured from the same set of N biospecimens or individuals. In this talk I will introduce the statistical integration of these multi-omics data to shed more light on a biological system.

Integrating data involves numerous challenges: the data sets are complex and large, each with few samples (N < 50) and many molecules (P > 10,000), and are generated using different technologies. I will present PLS (Partial Least Squares / Projection to Latent Structures, developed by Wold in the 1980s) as an algorithm of choice for data integration of small N, large P problems. PLS and its variants form the basis of our comprehensive mixOmics R package for feature selection, dimension reduction and integration of omics data sets. This talk is targeted at a general audience with background knowledge in statistics and an interest in large data.

The webinar was re-recorded for the PLS section.

Author: Eva Hamrud
