FAQs.
General topics
Last updated March 24, 2025
-
Yes, you can access our data at data.midrc.org. In addition to the de-identified images, there are annotations available on subsets of images. Fully de-identified reports and labels will be available soon. Data collection is ongoing, so check back often for updates.
-
MIDRC is a collaborative initiative led by the medical imaging community, funded by the National Institute of Biomedical Imaging and Bioengineering (NIBIB) and hosted at the University of Chicago. It is co-led by the American College of Radiology® (ACR®), the Radiological Society of North America (RSNA), and the American Association of Physicists in Medicine (AAPM). MIDRC aims to accelerate innovation in medical imaging by providing researchers with a large-scale, de-identified and curated dataset of medical images and associated clinical data. This resource supports the development of effective and useful AI algorithms to improve disease characterization and guide clinical interventions.
-
MIDRC was created to provide a large, high-quality dataset of medical images to support AI research. Developing AI for medical imaging is often limited by small datasets, inconsistent image quality, and a lack of reliable reference data. MIDRC helps overcome these challenges by offering a standardized collection of curated images and clinical data.
-
MIDRC was established in 2020 as a collaboration between the American Association of Physicists in Medicine (AAPM), the American College of Radiology (ACR), and the Radiological Society of North America (RSNA). Initially funded by the National Institute of Biomedical Imaging and Bioengineering (NIBIB), MIDRC is hosted at the University of Chicago and benefits from contributions by over 20 research institutions. The data commons is built on the Gen3 platform.
Today, MIDRC receives funding from the Advanced Research Projects Agency for Health (ARPA-H), serving as the exclusive medical imaging performer for the Biomedical Data Fabric (BDF) program and advancing AI-driven medical imaging solutions. MIDRC was also selected to participate in the National Artificial Intelligence Research Resource (NAIRR) program, partnering with other federal and private organizations to build a shared research infrastructure that will strengthen access to the critical resources needed to power responsible AI discovery and innovation.
-
MIDRC brings together experts in medical imaging to create a comprehensive resource that integrates imaging data with clinical metadata. Unlike other datasets that primarily focus on general patient records, MIDRC prioritizes high-quality imaging data in the standardized DICOM format, ensuring researchers have access to rich, structured data that enhances AI model development.
What sets MIDRC apart is its ability to link imaging data with expansive clinical datasets (e.g., N3C, BioDataCatalyst) and aggregate disparate datasets from affiliated data enclaves (IDC, ACRDart, Stanford AIMI, TCIA) through the Biomedical Imaging Hub (BIH). This centralized approach allows researchers to efficiently index, display, and retrieve imaging and clinical data through a common portal, facilitating large-scale medical AI research.
Additionally, MIDRC uniquely supports a sequestered data commons designed for benchmarking and regulatory use, providing a controlled environment for validation testing and AI model evaluation. The platform also includes a robust set of AI research tools, making it a leading resource for advancing medical imaging innovations.
Data types
-
MIDRC uses the Digital Imaging and Communications in Medicine (DICOM) standard for medical images. DICOM is the international standard for transmitting and storing medical images and related information, providing rich information on acquisition and imaging protocol data as well as the images. The corresponding image annotations/labels are delivered in several formats including DICOM SR, DICOM SEG and JSON.
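Because annotations arrive in more than one format, it can help to sort downloaded files before processing them. The following is a minimal sketch, not an official MIDRC tool: it relies only on the DICOM file format's 128-byte preamble followed by the `DICM` marker, and on whether a file parses as JSON. The function name `classify_file` and the sample file names are hypothetical. DICOM SR and DICOM SEG objects are themselves DICOM files, so this check reports them as `dicom`; telling them apart from images would require reading their metadata with a DICOM parser such as pydicom.

```python
import json
import tempfile
from pathlib import Path

DICOM_PREAMBLE_LENGTH = 128  # bytes reserved before the magic marker
DICOM_MAGIC = b"DICM"        # marker defined by the DICOM file format


def classify_file(path):
    """Return 'dicom', 'json', or 'unknown' for a file (hypothetical helper)."""
    path = Path(path)
    with path.open("rb") as handle:
        header = handle.read(DICOM_PREAMBLE_LENGTH + len(DICOM_MAGIC))
    # A DICOM file carries the 4-byte marker right after the preamble.
    if header[DICOM_PREAMBLE_LENGTH:] == DICOM_MAGIC:
        return "dicom"
    # Otherwise, treat the file as JSON only if it parses cleanly.
    try:
        json.loads(path.read_text(encoding="utf-8"))
        return "json"
    except (UnicodeDecodeError, json.JSONDecodeError):
        return "unknown"


def demo():
    """Classify three synthetic files; no real MIDRC data is involved."""
    with tempfile.TemporaryDirectory() as tmp:
        tmp = Path(tmp)
        (tmp / "image.dcm").write_bytes(bytes(128) + DICOM_MAGIC + b"\x00" * 16)
        (tmp / "labels.json").write_text('{"label": "example"}', encoding="utf-8")
        (tmp / "notes.txt").write_text("not an annotation", encoding="utf-8")
        return {p.name: classify_file(p) for p in sorted(tmp.iterdir())}


if __name__ == "__main__":
    for name, kind in demo().items():
        print(f"{name}: {kind}")
```

The marker check is only a quick screen: it confirms that a file follows the DICOM file layout, not that its contents are valid or complete.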
-
Medical images follow standardized formats like DICOM, but other health data, such as electronic health records (EHRs), lab results, and patient demographics, can vary widely in structure and terminology. MIDRC is now accepting de-identified radiology reports with labels assigned by the data donor as well as by information extraction software that MIDRC has developed. MIDRC collaborates with federally funded initiatives to align imaging data with broader medical datasets, ensuring consistency and compatibility across different research projects. For example, MIDRC supports linkages to non-imaging clinical enclaves, including N3C, All of Us, and BioDataCatalyst, to provide a comprehensive record of patient metadata.
-
MIDRC began by collecting chest X-ray and chest CT examinations. It has since expanded to include other modalities (e.g., MRI, ultrasound, PET) and body parts, including the head, abdomen, pelvis, and long bones. In many instances, longitudinal image records are available to track changes over time and support research on long-term effects.
-
MIDRC is working to link imaging data with treatment details and patient outcomes, including long-term follow-ups when possible. Direct, secure linkages to repositories such as N3C, All of Us, and BioDataCatalyst enrich MIDRC with extensive clinical metadata corresponding to the imaging data held in the MIDRC enclave.
Accessing and using MIDRC data
-
No, data access is free. Access is subject to our data use agreements (one for non-commercial use and one for commercial use, see https://www.midrc.org/midrc-data-use-agreement). Available datasets can be found on data.midrc.org.
-
For technical support, contact MIDRC at midrc-support@gen3.org.
-
Users must register and agree to the data use policy. All research using MIDRC data must acknowledge MIDRC in publications (see https://www.midrc.org/midrc-acknowledgements).
-
Yes! MIDRC welcomes contributions from medical centers and hospitals. Visit the MIDRC data intake page for details (https://www.midrc.org/donate).
-
All submitted data must be de-identified before upload using approved tools. MIDRC reviews and processes all data to ensure no protected health information (PHI) remains. Secure transmission methods are available for data that may still contain PHI; such data remain embargoed until fully de-identified.
-
Most data is openly accessible, but part is reserved for our sequestered dataset for validation testing.
-
Yes, MIDRC data are available worldwide.
-
All research using MIDRC data must acknowledge MIDRC in publications (see https://www.midrc.org/midrc-acknowledgements).
Other
-
Attend our monthly seminars: https://us06web.zoom.us/webinar/register/WN_FxteE7VTRqOTtMTzojzeGw#/registration
Sign up for the MIDRC newsletter: https://www.midrc.org/register-to-receive-newsletter
Follow us on LinkedIn for updates: https://www.linkedin.com/in/midrc/
Subscribe to our YouTube channel for seminar recordings, town halls, and more: https://www.youtube.com/@MIDRC_
-
MIDRC does not provide funding, but researchers are encouraged to seek independent grants. We can provide letters of support for funding applications.
-
Visit our contact page for general inquiries. For technical support, email midrc-support@gen3.org.
Can’t find the answer to your question? Please go to our contact page or, for technical questions about data download and use, contact midrc-support@gen3.org.
If you are interested in contributing images/data to MIDRC, please visit our data contribution page.