[open] Beginner workshop: self-paced online Feb 23 – April 11 2026

This course is designed for:

Beginners looking for an introduction to mixOmics methods for single- and multi-omics analyses.
Current mixOmics users who want to deepen their understanding of the mixOmics methods.
Users who would like more guidance on analyzing their own data (we also provide exemplar datasets).

The workshop is self-paced and spans 7 weeks. There are 4 live Q&A sessions, and many opportunities to interact with the cohort and your instructor, Prof Kim-Anh Lê Cao, via Slack. BYO data is encouraged: we provide advice so that you can analyse your own data with mixOmics tools as part of your learning process. A good working knowledge of R programming (e.g. handling data frames, performing simple calculations and displaying simple graphical outputs) is essential to fully benefit from the course*.

According to our past participants, a time commitment of 5-8h/week was sufficient to feel that they were progressing. Here is some feedback from a previous course.

We provide a certificate of attendance or completion.

Register here, places are limited!

(make sure you enter the right discount code; see the tiered fees below)

Fees

Research Higher Degree students enrolled at a University: $550 AUD (incl. GST) [discount code: MIXO_RHD]

Staff and members from Universities & Not-for-profit organisations: $900 AUD (incl. GST) [discount code: MIXO_NFP_STAFF]

Other industries: $1,450 AUD (incl. GST)

Discounts of 5% for a group of 3-9 learners and 10% for 10+ learners are available; these require a single invoice per group.

These funds go towards the support of a software developer to maintain the package. If you need an invoice, contact Student Support at continuing-education[at]unimelb.edu.au

Teaching Period Dates

Teaching commences: Monday, 23 Feb 2026, 9:00 am AEST (note Australian time!)

Q&A live webinars are scheduled on Thursdays 6pm AEST / 8am CET during the first 4 weeks (26th Feb, 5th, 12th and 19th March).

An additional session might be added on Fridays 9am AEST ( = Thursdays 2pm PST / 5pm EST / 9pm CET)

Teaching concludes: Friday, 20 March 2026, 11:59 pm AEST (after 4 weeks)
(non marked) Assessment due: Friday 5 April 2026 (2 weeks prep)
Peer-review of assessment due: Friday 11 April 2026 (1 week prep)

The course is divided into theory (50%) and hands-on practice, with the opportunity to analyse your own data. The exercises and assignments are in R. Participants are encouraged to use RStudio and Rmarkdown (template and R code provided).

*Need an R refresher?

Learners who are not proficient in R do not get the full benefit of the course (based on their own, honest, feedback!). For those looking for an R refresher well ahead of the course:

https://monashdatafluency.github.io/r-intro-2/index.html

The R cheatsheets for reference: https://iqss.github.io/dss-workshops/R/Rintro/base-r-cheat-sheet.pdf

[EOI open] Advanced workshop, March 2026

We are planning an advanced, hands-on asynchronous workshop for researchers who have completed our beginner course. The program will focus on complex study designs, including batch effects and longitudinal and time-course data, and on practical strategies for analysis with multivariate approaches.

If this is of interest, please submit a short expression of interest so we can tailor the workshop to your needs. You will also be first to hear when registrations open.

Workshop details

Dates: 2 – 27 March 2026 (4 weeks)
Format: Weekly focus topic with online resources, live Q&A sessions, and dedicated Slack support. Topics will include batch-effect management, multi-study integration, and longitudinal/time-course designs.
Capacity: Limited places to ensure personalised support from our team
Fees:
• AUD $795 (~ €438, US$515) – Students
• AUD $1,350 (~ €755, US$875) – Academic / Non-profit staff
• AUD $2,250 (~ €1,260, US$1,458) – Industry participants
Payment available via invoice or credit card
Certificate: All participants will receive a certificate of attendance
Platform: You will be among the first to use our new mixOmics PRO platform!

We aim to open registrations at the start of December, and will notify you if you have submitted an EOI.

Learning outcomes

By the end of the course, learners will be able to:

Frame study design and data preparation (Module 1).
Define experimental design, anticipate batch risks, and prepare datasets for analysis.
Model and cluster time trajectories to extract structure (Module 2).
Apply modelling to capture temporal patterns and dimension reduction approaches to summarise time-varying behaviour across features and omics datasets.
Detect and mitigate batch effects (Module 3).
Diagnose unwanted variation and implement appropriate corrections, showing improvement in core diagnostics.
Extend to advanced longitudinal methods when warranted (Module 4).
Trial network or tensor approaches and judge when they add value beyond baseline clustering.
Report robust, reproducible, and interpretable findings (all modules).
Validate results, communicate limitations, and deliver a clean, reproducible analysis with a concise biological narrative.

Feedback from workshops

From our 3-day workshop in Lund, Sweden, Sep 2025

Twenty-one participants attended the workshop. The 16 who responded to the survey self-reported an increase in knowledge: their average rating of their “omics data analytics level” rose from 4.6/10 before the workshop to 6.8/10 after.

Our participants appreciated the combination of:

Clear lectures (breaking down complex concepts).
Hands-on exercises (applying the methods).
Direct interaction (debriefs and discussions with the teaching team).

Here is what they mentioned in their exit survey:

“It was really an eye opener workshop for my future PhD project and future endeavours to integrate omics data. Thank you for the nice workshop!”

“Kim-Anh was great at breaking all the concepts and analysis methods down and explain them in an easily understandable and visual way! I also really liked the debriefing periods, it helped a lot…”

“Overall, very beginner-friendly, considering that I had no background in ‘omics data!”

“This is an awesome tool. I am excited to use it more seriously with my work and have recommended it many times already in just the few weeks since the course.”

“I learnt there is a lot of extensions to traditional multivariate methods… that I wasn’t aware of. It was a perspective change…”

From the Feb – March 2025 workshop

Best aspects of the program

Course materials were exceptionally well done

The MixOmics Vignette; The R markdown template; Kim-Anh’s Teaching Style and approachability

the modules explained in details the main principles of statistical analyses and the webinars were clear and brought additional informations to better understand the online courses.

That we started from basics and went to more advanced analysis

The online content that supported the live webinars

Hands-on experience and being able to use own data. The quality of the material was excellent, and the teacher is very knowledgeable and helpful.

Practical approach, resources to learn more, and the videoconference Q&A.

The course was extremely well structured, the continuous building of knowledge in a layered approach was very effective. I see how the block integration was easy to understand only because of the pre-work in the previous weeks to fully understand PCA, PLS, PLS-DA. Also, the instructor was very flexible in incorporating all our wishes for the little extras we were interested in. The discussions on slack were also very helpful. All in all, the best course I have attended.

I found the assignment really good and challenging.

Assignment on own work, slack

The MixOmics course allows one to learn at their own pace given international schedules. The course material was stimulating and the teacher took time and care to address students’ questions.

The weekly webinar

The assignment and slack and working with own data.

Nice exposure of different methodologies of omics data analysis, including the mathematical reasoning behind them. The assignment was useful to apply and reinforce the knowledge of the theory. The Q&As and the Slack channel were also very good sources for topic discussion.

Very usefull course, many hands on options, it is great that you can work on your own data for the assignment, I liked the live tutorials a lot

The support was very helpful but the data for the assignments could work on my R version for which I cannot changed it because of my platform

Learning how to use all the tools of the mixomics package

The practical application of the different analyses which was taught.

Practical work mixed with theoretical part of the program.

Hands on activities

Additional comments

Genuinely a very well run and well taught course. I will be recommending it to people!

Professor Kim-Anh Lê Cao was amazing and did a very good job, teaching, organizing us and replying very fast to every question

Please make the access to online content available for 6 months after the course. Need to improve the login link for external users. I could only access by going to the link in the email sent by Melbourne uni with was time consuming. The link would not work if copied and pasted to online favourites.

The training proved to be exceptionally valuable. Despite its considerable challenges, especially given my initial lack of knowledge in the field, I acquired a wider knowledge in the area and very valuable new skills. This experience has significantly contributed to my scientific development. Committing to this training was a wise decision, as it provided an excellent introduction to mixomics and multi-omics analysis in general. I look forward to applying these new skills in my future research studies.

Kim-Anh was a great instructor, very smart, and always available to help and answer any questions.

Thank you to the teachers of the course for their availability. It was a much appreciated experience.

The connexion to the website as external was only accessible from the first e-mail sent.

Longer course, also exploring more options of mixOmix – or an additional course.

Prof. Kim-Anh was exceptionally kind and ready to help. Her teaching was highly appreciated.

Very well organized and structured workshop but I missed more theoretical part of the course so I would not be lost when started my hands-on.

[completed] 3-day mixOmics workshop in person, 22 – 24 Sept 2025, Lund University

We have a few spots left for an in-person mixOmics workshop, which we would like to open to our wider community!

Modern high-throughput technologies generate complex biological data that require powerful yet accessible tools for analysis. This beginner-level workshop introduces participants to data integration and multivariate analysis using the R package mixOmics.

Through a series of hands-on sessions, we will explore how multivariate methods can uncover biological patterns, identify key molecular features (or ‘markers’), and integrate multiple omics datasets. The approach is hypothesis-free, flexible, and does not rely on strict statistical assumptions.

By the end of the workshop, participants will be familiar with the core mixOmics workflows for exploratory and supervised analysis. There will also be an opportunity to apply the methods to your own dataset, with expert guidance throughout.

Pre-requisite: Basic proficiency in R is essential (e.g. working with data frames, basic calculations, simple plots). Participants without R experience have reported difficulty keeping up and gaining value from the course.

Instructor: Prof Kim-Anh Lê Cao, the University of Melbourne

WHERE: Mon 22 to Wed 24 Sept 2025, 9am – 5pm, Lund University (Room: Maskrosen (E121), Ekologihuset, Lunds Universitet, Kontaktvägen 10, Lund 22362, Sweden; Google Maps pin)

REGISTER: Request an invoice by emailing Maggie at MIG-EA [at] unimelb.edu.au, and we will follow up with pre-workshop instructions and the full schedule.

Fees

Research Higher Degree students enrolled at a University: 350 EUR
Staff and members from Universities & Not-for-profit organisations: 575 EUR
Other industries: 1200 EUR

Workshop schedule

Monday 22 Sept and Tuesday 23rd Sept: methods and hands-on.

The following broad topics will be covered.

A. Key methodologies in mixOmics and their variants

Basic processing of count data
Exploration of one data set and how to estimate missing values
Identification of molecular signature to discriminate different treatment groups
Integration of two data sets and identification of biomarkers
Introduction to repeated measurements or longitudinal studies analysis
Integration of more than two data sets to identify multi omics signatures
Integration of independent but related studies (optional)

B. Review of the graphical outputs implemented in mixOmics

Sample plot representation
Variable plot representation for data integration
Other useful graphical outputs

C. Case studies and applications

Several microbiome and omics studies will be analysed using the methods presented above.

Wednesday 24th Sept: bring your own data. Participants will be given the opportunity to analyse their own data under the guidance and the advice of the instructor. Participants can also work in a team. Some data sets will also be provided for those unable to bring their own data.

Statistical concepts

The following statistical concepts will be introduced: covariance and correlation, multiple linear regression, classification and prediction, cross-validation, selection of markers, penalised regressions. Each methodology will be illustrated on a case study (theory and application will alternate).
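
As a small illustration of the first concept listed above, the sketch below computes a sample covariance and the Pearson correlation derived from it. It is written in plain Python only so that it runs without any package; the workshop exercises themselves are in R, and this is not mixOmics code. The two feature vectors are hypothetical values chosen for easy arithmetic.

```python
from math import sqrt


def mean(values):
    """Arithmetic mean of a list of numbers."""
    return sum(values) / len(values)


def covariance(x, y):
    """Sample covariance of two equally long lists (divides by n - 1)."""
    mx, my = mean(x), mean(y)
    return sum((a - mx) * (b - my) for a, b in zip(x, y)) / (len(x) - 1)


def correlation(x, y):
    """Pearson correlation: covariance rescaled by both standard deviations."""
    return covariance(x, y) / sqrt(covariance(x, x) * covariance(y, y))


# Hypothetical measurements of two features across five samples.
feature_a = [2.0, 4.0, 6.0, 8.0, 10.0]
feature_b = [1.0, 3.0, 2.0, 5.0, 4.0]

cov_ab = covariance(feature_a, feature_b)
cor_ab = correlation(feature_a, feature_b)
print(f"covariance = {cov_ab:.2f}, correlation = {cor_ab:.2f}")
```

Covariance depends on the measurement units of both features, whereas correlation is unit-free and bounded between -1 and 1, which makes it the easier quantity to compare across features.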

Target group

The course is intended for computational biologists and biologists with some statistical knowledge and a good working knowledge of R. It will be particularly useful to those interested in:

Understanding and/or applying multivariate projection methodologies to large data sets.

Exploring data sets.

Selecting molecular / microbial features with methods implementing LASSO-based penalisations.

Using graphical techniques to better visualise data.

Anticipated outcomes

After completion of this workshop, participants will be able to:

Apply those methods to high-throughput microbiome studies, including their own studies.

Understand the fundamental principles of multivariate projection-based dimension reduction techniques.

Perform statistical integration and feature selection using recently developed multivariate methodologies.

Workshop registration cancellation policy

To confirm your place in this workshop, the registration fee is payable at the time of booking. This commitment helps us plan and deliver the workshop effectively for all participants.

Cancellations and Refunds: Refunds are only available if the workshop is cancelled or postponed by the organiser. In that case, a full refund (including any service fees) will be issued automatically.

No-Show Policy: If you do not attend the workshop, your registration fee will be non-refundable.

Illness or Exceptional Circumstances: We understand that unexpected situations can arise. If you are unable to attend due to illness or other exceptional circumstances, please contact us as soon as possible. While refunds cannot be issued, we will review your situation with care and may consider alternative options at the organiser’s discretion.

This policy is designed to ensure fairness to all participants and to support the smooth delivery of our workshops.

Webinar: Φ-Space ST: a platform-agnostic method to identify cell states in spatial transcriptomics studies

We have a sequel to Φ-Space: Φ-Space ST, developed by Dr Jiadong Mao for spatial transcriptomics studies! We are very excited about these new developments and the potential of Φ-Space for single cell annotation!

Φ-Space ST is:

A novel and fast approach for cell type composition analysis.
Platform-agnostic and scalable, as it works across multiple spatial transcriptomics (ST) platforms, including CosMx, Visium, and Stereo-seq.
Accurate and integrative, as it identifies cell states by leveraging multiple scRNA-seq references.
Segmentation-free and niche-driven, as it annotates cell states at subcellular resolution, uncovering niche-specific cell types and tumor-distinguishing patterns.

Φ-Space ST: a platform-agnostic method to identify cell states in spatial transcriptomics studies. Jiadong Mao, Jarny Choi, Kim-Anh Lê Cao. bioRxiv 2025.

Watch Jiadong’s latest seminar, presented at Melbourne Integrative Genomics on Friday 14th February 2025:

Abstract

We introduce Φ-Space ST, a platform-agnostic method to identify continuous cell states in spatial transcriptomics (ST) data using multiple scRNA-seq references. For ST with supercellular resolution, Φ-Space ST achieves interpretable cell type deconvolution with significantly faster computation. For subcellular resolution, Φ-Space ST annotates cell states without cell segmentation, leading to highly insightful spatial niche identification. Φ-Space ST harmonises annotations derived from multiple scRNA-seq references, and provides interpretable characterisations of disease cell states by leveraging healthy references. We validate Φ-Space ST in three case studies involving CosMx, Visium and Stereo-seq platforms for various cancer tissues. Our method revealed niche-specific enriched cell types and distinct cell type co-presence patterns that distinguish tumour from non-tumour tissue regions. These findings highlight the potential of Φ-Space ST as a robust and scalable tool for ST data analysis for understanding complex tissues and pathologies.

Webinar: Time-course multi-omics integration

I presented this talk for a group of statisticians at the Australian National University in Canberra. The abstract is below.

Topics covered: linear mixed model splines, multi-omics integration (PLS multiblock), correlation circle plot interpretation, timeOmics.

Longitudinal experiments are becoming increasingly popular in omics studies to monitor molecular changes following treatment or during disease progression. Integrating these data sets can give us some mechanistic insights into the different types of omics layers.

However, longitudinal omics data present numerous challenges including a small number of time points that may be unevenly spaced and unmatched between different data types, a small number of individuals, and a high individual variability. While current approaches have focused on differential expression across time or time profile clustering, the modelling of omics time profiles in a multivariate manner is critically lacking to understand longitudinal biological interactions.

I will present a statistical framework, timeOmics, to identify correlated profiles over time and between omics (transcriptomics, metabolomics, microbiome) to give insights into the molecular dynamics of biological systems and discuss future avenues of research in this expanding area.

Some key references

Straube J, Gorse AD, PROOF Centre of Excellence Team, Huang BE^& and Lê Cao K-A^& (2015). A linear mixed model spline framework for analysing time course ‘omics’ data. PLoS ONE 10(8): e0134540
A Bodein, O Chapleur, A Droit, K-A Lê Cao (2019). A Generic Multivariate Framework for the Integration of Microbiome Longitudinal Studies With Other Data Types, Frontiers in Genetics, 10,
A Bodein, M-P Scott-Boyer, O Perin, K-A Lê Cao, A Droit (2022). timeOmics: an R package for longitudinal multi-omics data integration, Bioinformatics, 38(2)

The timeOmics package

timeOmics is not currently available from within the mixOmics package; it is a separate R package hosted on Bioconductor. See the Bioconductor page for installation instructions.

[closed] Self-paced online course Feb 24 – April 11, 2025

Single and multi-omics analysis and integration with mixOmics

Our registrations are now closed! If you missed out, fill in this Expression of Interest form so that we can notify you of new workshops.

This course is designed for:

Beginners looking for an introduction to mixOmics methods for single- and multi-omics analyses.
Current mixOmics users who want to deepen their understanding of the mixOmics methods.
Users who would like more guidance on analyzing their own data (we also provide exemplar datasets).

According to our past participants, a time commitment of 5-8h/week was sufficient to feel that they were progressing. Here is some feedback from a previous course.

We provide a certificate of attendance or completion.

Register here, places are limited!

Fees

Research Higher Degree students enrolled at a University: $495 AUD (incl. GST) [discount code: MIXO_RHD]

Staff and members from Universities & Not-for-profit organisations: $825 AUD (incl. GST) [discount code: MIXO_NFP_STAFF]

Other industries: $1320 AUD (incl. GST)

Discounts of 5% for a group of 3-9 learners and 10% for 10+ learners are available; these require a single invoice per group.

These funds go towards the support of a software developer to maintain the package. If you need an invoice, contact Student Support at continuing-education[at]unimelb.edu.au

Teaching Period Dates

Teaching commences: Monday, 24 Feb 2025, 9:00 am AEST
- Q&A live webinars are scheduled on Thursdays 6pm AEST / 8am CET during the first 4 weeks (27th Feb, 6th, 13th and 20th March).
- An additional session might be added on Fridays 9am AEST ( = Thursdays 2pm PST / 5pm EST / 9pm CET)

Teaching concludes: Sunday, 23 March 2025, 11:59 pm AEST (after 4 weeks)

(non marked) Assessment due: Friday 4 April 2025 (2 weeks prep)

Peer-review of assessment due: Friday 11 April 2025 (1 week prep)

*Need an R refresher?

Learners who are not proficient in R do not get the full benefit of the course (based on their own, honest, feedback!). For those looking for an R refresher well ahead of the course:

The R cheatsheets for reference: https://iqss.github.io/dss-workshops/R/Rintro/base-r-cheat-sheet.pdf

https://monashdatafluency.github.io/r-intro-2/index.html

Webinar: Φ-Space for continuous phenotyping of single-cell multi-omics data

We have developed a new PLS method for cell type continuous annotation of single cells, now published in Genome Biology!

Φ-Space addresses numerous challenges faced by state-of-the-art automated annotation methods:
- to identify continuous and out-of-reference cell states,
- to deal with batch effects in reference,
- to utilise bulk references and multi-omic references.
Φ-Space uses soft classification to phenotype cells on a continuum. The continuous annotation, or phenotype space embedding is then used to reduce the dimensionality of the data for various downstream analyses.

Mao, J., Deng, Y. & Lê Cao, KA. Phi-Space: continuous phenotyping of single-cell multi-omics data. Genome Biol 26, 323 (2025). https://doi.org/10.1186/s13059-025-03755-8

View this 50-min video of Kim-Anh Lê Cao presenting Φ-Space at the WEHI Bioinformatics seminar:

Abstract.

Single-cell multi-omics technologies have empowered increasingly refined characterisation of the heterogeneity of cell populations. Automated cell type annotation methods have been developed to transfer cell type labels from well-annotated reference datasets to emerging query datasets. However, these methods suffer from some common caveats, including the failure to characterise transitional and novel cell states, sensitivity to batch effects and under-utilisation of phenotypic information other than cell types (e.g. sample source and disease conditions).

We developed Φ-Space, a computational framework for the continuous phenotyping of single-cell multi-omics data. In Φ-Space we adopt a highly versatile modelling strategy to continuously characterise query cell identity in a low-dimensional phenotype space, defined by reference phenotypes. The phenotype space embedding enables various downstream analyses, including insightful visualisations, clustering and cell type labelling.

We demonstrate through three case studies that Φ-Space (i) characterises developing and out-of-reference cell states; (ii) is robust against batch effects in both reference and query; (iii) adapts to annotation tasks involving multiple omics types; (iv) overcomes technical differences between reference and query.

The Φ-Space package

Φ-Space is not currently available from within the mixOmics package; it is a separate R package that can be installed from GitHub.

Webinar: PCA and PLS-DA

These two recordings were part of a presentation to WEHI for their postgraduate lecture series for a diverse audience.

In the PCA presentation (18 min), we explain the concept of linear combination of variables (components) and useful graphical outputs such as correlation circle plots and biplots.
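
To make the idea of a component as a linear combination of variables concrete, here is a minimal sketch in plain Python (chosen only so it runs without any package; it is not the mixOmics implementation, and the recordings themselves use R). It assumes a tiny hypothetical data set of three variables on five samples, finds the first loading vector by power iteration on the covariance matrix, and then builds each sample's score as a weighted sum of its centred variables.

```python
from math import sqrt


def centre(columns):
    """Subtract each variable's mean so the component describes variation."""
    return [[v - sum(col) / len(col) for v in col] for col in columns]


def covariance_matrix(columns):
    """Sample covariance matrix of centred variables (one list per variable)."""
    n = len(columns[0])
    return [[sum(a * b for a, b in zip(ci, cj)) / (n - 1) for cj in columns]
            for ci in columns]


def first_loading_vector(cov, iterations=200):
    """Power iteration: repeatedly multiply by cov and rescale to unit length.

    Converges to the leading eigenvector provided the starting vector is not
    orthogonal to it (true for the hypothetical data below).
    """
    w = [1.0] * len(cov)
    for _ in range(iterations):
        w = [sum(row[j] * w[j] for j in range(len(w))) for row in cov]
        norm = sqrt(sum(v * v for v in w))
        w = [v / norm for v in w]
    return w


# Hypothetical data: three variables measured on five samples.
variables = [
    [2.0, 4.0, 6.0, 8.0, 10.0],
    [1.0, 3.0, 2.0, 5.0, 4.0],
    [9.0, 7.0, 6.0, 3.0, 1.0],
]
centred = centre(variables)
cov = covariance_matrix(centred)
loadings = first_loading_vector(cov)

# Each sample's score on the component is a weighted sum (linear combination)
# of its centred variable values, with the loadings as weights.
scores = [sum(w * col[i] for w, col in zip(loadings, centred))
          for i in range(len(centred[0]))]

print("loadings:", [round(w, 3) for w in loadings])
print("scores:  ", [round(s, 3) for s in scores])
```

The loadings say how much each variable contributes to the component, and the scores are what a sample plot displays. Power iteration is used here only because it fits in a few lines; it is a stand-in for whatever decomposition a real PCA routine uses.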

In the PLS-DA presentation (7 min), we talk about the concept of multivariate signature.

If you want to know more about the actual algorithm under the hood, you can watch this webinar on PLS.

[closed] Self-paced online course Oct 21 – Dec 6 2024

Unfortunately we had to cancel the workshop as we did not receive a sufficient number of participants to justify running the workshop at this time. These workshops involve peer review and a cohort feel to provide the best experience to our learners.

Register your EOI here and we will let you know when the registration page is up. Our next intake is scheduled for February 2025.

Feedback from a previous iteration can be found here.

Key summary

The new course is open and will run for 7 weeks. This course is online, but at your own pace, meaning that you need to dedicate enough time (5-8h per week) to fully benefit from the program.
There are 4 weeks of asynchronous learning (you work at your own pace to cover the material each week).
There are 4 live webinars organised on the first 4 Thursdays at 5pm AEST (convert your time here) to summarise some key concepts and answer your questions (the webinars will be recorded, as daylight saving changes fall within this period).
You will have the opportunity to chat on Slack and ask your questions during the whole course.
You can analyse your own data for the assessment (due in week 6) or use the data provided. You will reinforce your learning by marking the assignments of 2-3 other learners.

Teaching Period Dates, asynchronous:
- Teaching commences: Monday, 21 Oct 2024, 9:00 am AEST
- Teaching concludes: Sunday, 17 Nov 2024, 11:59 pm AEST (after 4 weeks)
- (non marked) Assessment due: Friday 29 Nov 2024 (2 weeks prep)
- Peer-review of assessment due: Friday 6 Dec 2024 (1 week prep)

Fees vary for
- Research Higher Degree students enrolled at a University: $495 AUD (incl. GST) [discount code: MIXO_RHD]
- Staff and members from Universities & Not-for-profit organisations: $825 AUD (incl. GST) [discount code: MIXO_NFP_STAFF]
- Other industries: $1320 AUD (incl. GST)
- Discounts of 5% for a group of 3-9 learners and 10% for 10+ learners are available; these require a single invoice per group.

(these funds go towards the support of a software developer to maintain the package)

Information about the course and registration: https://study.unimelb.edu.au/find/short-courses/mixomics-r-essentials-for-biological-data-integration/

The number of places is limited, so it is first come, first served (this course runs once or twice a year).

What if I need an invoice? Contact Student Support at continuing-education[at]unimelb.edu.au

Prerequisites. A good working knowledge of R programming (e.g. handling data frames, performing simple calculations and displaying simple graphical outputs) is essential to fully benefit from the course*. The course is divided into theory (50%) and hands-on practice, with the opportunity to analyse your own data. The exercises and assignments are in R. Participants are encouraged to use RStudio and Rmarkdown (template and R code provided).

For those looking for an R refresher well ahead of the course:
- https://monashdatafluency.github.io/r-intro-2/index.html
- the R cheatsheets for reference: https://iqss.github.io/dss-workshops/R/Rintro/base-r-cheat-sheet.pdf

*Learners who are not proficient in R do not get the full benefit of the course (based on their own, honest, feedback!).