Knowledge-Driven Subspace Fusion and Gradient Coordination for Multi-modal Learning

Yupei Zhang, Xiaofei Wang, Fangliangzi Meng, Jin Tang, Chao Li (Lead / Corresponding author)

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

Multi-modal learning plays a crucial role in cancer diagnosis and prognosis. Current deep-learning-based multi-modal approaches are often limited in their ability to model the complex correlations between genomics and histopathology data and to address the intrinsic complexity of the tumour ecosystem, where both tumour and microenvironment contribute to malignancy. We propose a biologically interpretable and robust multi-modal learning framework that efficiently integrates histopathology images and genomics by decomposing the feature subspaces of histopathology images and genomics to reflect distinct tumour and microenvironment features. To enhance cross-modal interactions, we design a knowledge-driven subspace fusion scheme consisting of a cross-modal deformable attention module and a gene-guided consistency strategy. Additionally, to dynamically optimize the subspace knowledge, we propose a novel gradient coordination learning strategy. Extensive experiments demonstrate the effectiveness of the proposed method, which outperforms state-of-the-art techniques in three downstream tasks: glioma diagnosis, tumour grading, and survival analysis. Our code is available at https://github.com/helenypzhang/Subspace-Multimodal-Learning.

Original language: English
Title of host publication: Medical Image Computing and Computer Assisted Intervention
Subtitle of host publication: MICCAI 2024 - 27th International Conference, Proceedings
Editors: Marius George Linguraru, Qi Dou, Aasa Feragen, Stamatia Giannarou, Ben Glocker, Karim Lekadir, Julia A. Schnabel
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 263-273
Number of pages: 11
ISBN (Electronic): 9783031720833
ISBN (Print): 9783031720826
Publication status: Published - 14 Oct 2024
Event: 27th International Conference on Medical Image Computing and Computer-Assisted Intervention - Palmeraie Conference Centre, Marrakesh, Morocco
Duration: 6 Oct 2024 to 10 Oct 2024
Conference number: 27th
https://conferences.miccai.org/2024/en/ (Conference Website)

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 15004 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 27th International Conference on Medical Image Computing and Computer-Assisted Intervention
Abbreviated title: MICCAI 2024
Country/Territory: Morocco
City: Marrakesh
Period: 6/10/24 to 10/10/24

Keywords

  • Cancer diagnosis and prognosis
  • Molecular Pathology
  • Multi-modal learning

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science
