Welcome to the official repository for MR-RATE, a pioneering vision-language model and 3D medical imaging dataset for Magnetic Resonance Imaging.
Resources:
- Paper: coming soon
- Model Weights: coming soon
- Dataset: MR-RATE on Hugging Face
- Related Foundation Models: NV-Generate-MR-Brain by NVIDIA
MR-RATE is a unified framework for vision-language modeling in brain and spine MRI. It comprises (1) a large-scale 3D medical imaging dataset that uniquely pairs textual data with brain and spine MRI volumes, and (2) a contrastive pretraining pipeline that aligns multi-sequence MRI volumes with radiology reports using the VL-CABS loss.
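To illustrate the general shape of such a contrastive objective, the sketch below implements a standard symmetric InfoNCE (CLIP-style) loss over a batch of paired volume and report embeddings. This is a minimal illustration of the idea, not the actual VL-CABS formulation used in this repository, whose details differ.

```python
import numpy as np

def symmetric_info_nce(img_emb, txt_emb, temperature=0.07):
    """CLIP-style symmetric contrastive loss over a batch of
    (volume, report) embedding pairs. Illustrative only; the
    VL-CABS loss used in MR-RATE is not identical to this."""
    # L2-normalize so dot products are cosine similarities
    img = img_emb / np.linalg.norm(img_emb, axis=1, keepdims=True)
    txt = txt_emb / np.linalg.norm(txt_emb, axis=1, keepdims=True)
    logits = img @ txt.T / temperature          # (B, B) similarity matrix
    labels = np.arange(len(logits))             # matching pairs on the diagonal

    def cross_entropy(l, y):
        l = l - l.max(axis=1, keepdims=True)    # numerical stability
        log_probs = l - np.log(np.exp(l).sum(axis=1, keepdims=True))
        return -log_probs[np.arange(len(y)), y].mean()

    # Average of image-to-text and text-to-image cross-entropy
    return 0.5 * (cross_entropy(logits, labels) + cross_entropy(logits.T, labels))

# Toy check: perfectly matched pairs yield a much lower loss than random pairs
rng = np.random.default_rng(0)
emb = rng.normal(size=(4, 8))
loss_matched = symmetric_info_nce(emb, emb)
loss_random = symmetric_info_nce(emb, rng.normal(size=(4, 8)))
```

In the real pipeline the two embedding matrices would come from the trained image and text encoders rather than random numbers.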
```
MR-RATE/
│
├── data-preprocessing/        # Data preprocessing pipeline and dataset download scripts
│
├── contrastive-pretraining/   # Contrastive pretraining code for vision-language modeling
│
└── README.md
```
Each folder includes its own README.md detailing configuration, dependencies, and usage.
| Component | Description |
|---|---|
| Data Preprocessing | End-to-end pipeline converting raw DICOM exports into clean, anonymized, and spatially standardized NIfTI volumes. Covers DICOM-to-NIfTI conversion, PACS metadata filtering, modality classification, binary brain segmentation & defacing, co-registration to a shared T1w reference, atlas registration to MNI152 space, and multi-label anatomical segmentations. Also includes the radiology report preprocessing pipeline (anonymization, translation, structuring, QC) and LLM-based pathology classification producing binary labels for 37 SNOMED CT-grounded pathologies. Includes standalone scripts for downloading and merging all MR-RATE Hugging Face repositories. |
| Contrastive Pretraining | Contrastive vision-language model that aligns multi-sequence MRI volumes and radiology reports using VL-CABS loss. Uses a VJEPA2 (ViT-G) image encoder with LoRA fine-tuning and a BiomedVLP-CXR-BERT text encoder. Supports four multi-volume fusion modes (early, mid_cnn, late, late_attn) and three image spaces (native_space, coreg_space, atlas_space). Enables zero-shot brain MRI pathology classification at inference time. |
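The fusion modes above determine where information from the different MRI sequences is combined. As a rough illustration (assumed shapes and function names, not the repository's actual API), late fusion averages per-sequence embeddings, while an attention-based variant in the spirit of late_attn learns softmax weights over sequences:

```python
import numpy as np

def late_fusion(seq_embs):
    """'late'-style fusion: encode each MRI sequence separately, then average."""
    return seq_embs.mean(axis=0)

def late_attn_fusion(seq_embs, query):
    """'late_attn'-style fusion: attention-weighted combination, where a
    (normally learned) query scores each per-sequence embedding."""
    scores = seq_embs @ query                   # one score per sequence
    weights = np.exp(scores - scores.max())     # softmax over sequences
    weights /= weights.sum()
    return weights @ seq_embs

# Toy example: 3 sequences (e.g. T1, T2, FLAIR), 8-dim embeddings
rng = np.random.default_rng(0)
embs = rng.normal(size=(3, 8))
fused_mean = late_fusion(embs)
fused_attn = late_attn_fusion(embs, rng.normal(size=8))
```

Early and mid_cnn fusion would instead combine the sequences at the input or intermediate-feature level before a shared encoder.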
- Download preprocessed data directly from MR-RATE on Hugging Face (see data-preprocessing/ for preprocessing details)
- Train the contrastive model on (multi-sequence MRI, radiology report) pairs using contrastive-pretraining/
- Run zero-shot inference for brain MRI pathology classification using trained model weights and contrastive-pretraining/
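Zero-shot pathology classification follows the usual contrastive-retrieval recipe: embed the MRI volume, embed a "present" and an "absent" text prompt for each pathology, and pick whichever prompt the volume embedding is closer to. The sketch below shows only the mechanics with random stand-in embeddings and a hypothetical pathology name; the real pipeline uses the trained VJEPA2 image encoder and BiomedVLP-CXR-BERT text encoder.

```python
import numpy as np

def cosine(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def zero_shot_classify(volume_emb, prompt_embs):
    """For each pathology, compare the volume embedding against a
    'present' and an 'absent' prompt embedding and keep the closer one.
    prompt_embs: {pathology_name: (present_emb, absent_emb)}."""
    return {
        name: cosine(volume_emb, pos) > cosine(volume_emb, neg)
        for name, (pos, neg) in prompt_embs.items()
    }

# Toy demo with random stand-in embeddings ("hemorrhage" is a hypothetical label)
rng = np.random.default_rng(1)
dim = 16
pos = rng.normal(size=dim)
neg = rng.normal(size=dim)
volume = pos + 0.1 * rng.normal(size=dim)   # volume lies close to the 'present' prompt
preds = zero_shot_classify(volume, {"hemorrhage": (pos, neg)})
```

Repeating this over prompts for all 37 SNOMED CT-grounded pathologies yields the binary label vector described above.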
The MR-RATE dataset was used by NVIDIA to train NV-Generate-MR-Brain, a 3D latent diffusion model capable of generating high-quality, high-resolution synthetic brain MRI volumes for T1, Fluid Attenuated Inversion Recovery (FLAIR), T2, and Susceptibility Weighted Imaging (SWI). This model is available under the permissive NVIDIA Open Model License, enabling researchers and developers to adapt it for their own downstream medical AI applications.
If you use this repository, the dataset, or any of its components, please cite:
Coming soon
We are committed to fostering innovation and collaboration in the research community. All elements of the MR-RATE repository are released under the Creative Commons Attribution–NonCommercial–ShareAlike (CC BY-NC-SA) license.
This allows all elements to be freely used, modified, and shared for non-commercial research purposes, provided that the original work is properly cited and any derivative works are distributed under the same license.
For commercial inquiries related to MR-RATE, please contact: [email protected]
This project is conducted by Forithmus and the University of Zurich, in collaboration with NVIDIA and Istanbul Medipol University.
We are grateful to NVIDIA for their support, which made this work possible. We also sincerely thank Istanbul Medipol University for their support and for providing the data used in this project. High-performance computing resources were provided by NVIDIA and the University of Zurich ScienceCluster. We would also like to thank the following individuals from NVIDIA for their contributions to the development of MR-RATE: Marc Edgar, Daguang Xu, Dong Yang, Yucheng Tang, Can Zhao, Andriy Myronenko, and Pengfei Guo.
This collaboration represents an important step toward the long-term mission to make high-quality medical intelligence accessible worldwide.