Computer graphic of DNA strands

Microsoft Genomics

Powering genomic data analysis on Azure

Discover insights from the genome using open-source and open-standard–based solutions that take advantage of the performance and scalability of the global Azure infrastructure.

Go to our Genomics Community site to get started.

Workflow management

Design and orchestrate scalable workflows, and efficiently manage genomics analysis pipelines and data manipulation tasks using the power of Azure.

Cromwell on Azure

Cromwell is a scientific workflow management system developed by the Broad Institute. With Cromwell on Azure, researchers can now scale their genomics and data science pipelines using the hyperscale compute capabilities on Azure.

Learn more about Cromwell on Azure

Nextflow on Azure

Nextflow is a bioinformatics workflow manager that enables the development of portable and reproducible workflows. Using Nextflow, you can deploy workflows on a variety of execution platforms, including local Kubernetes clusters and on high-performance computing (HPC).

Learn more about Nextflow on Azure

Snakemake on Azure

Snakemake is a workflow management system that creates reproducible and scalable data analyses. Workflows can be seamlessly scaled to server, cluster, grid, and cloud environments, without the need to modify the workflow definition.

Learn more about Snakemake on Azure

Microsoft Genomics service

This turnkey Azure service enables secondary analysis of genomics data using Burrows-Wheeler Aligner (BWA) and the Genome Analysis Toolkit (GATK). Microsoft Genomics service provides on-demand scalability and easy-to-use API integration. It’s ISO-certified and covered by Microsoft Health Insurance Portability and Accountability Act (HIPAA) and Business Associate Agreement (BAA).

Learn more about Microsoft Genomics service

Datasets

Power your genomics analysis and machine learning models using curated public datasets easily accessible from the Genomics Data Lake and the Azure Open Dataset platform.

Genomics Data Catalog

Access public datasets at no egress cost and integrate them into your genomic analysis workflows and applications. Available datasets include genome sequences, variant information, and subject/sample metadata in BAM, FASTA, VCF, and CSV file formats.

Explore Datasets

LOREM IPSUM

Multi-modal analysis

Make genomics data actionable by analyzing and interpreting data generated by modern genomics technologies using open-source software, big data analytics, and machine learning services on Azure.

Genomics Notebooks

Jupyter Notebook is a great tool for data scientists who are working on genomics data analysis. We demonstrate the use of Azure Jupyter Notebooks for this type of analysis via GATK, Picard, Bioconductor, and Python libraries.

  • VCF to Parquet conversion on Azure
  • FHIR and genomics data integration
  • Radiogenomics on Azure
  • Genomics data science with Azure confidential virtual machines (VMs)

Get started with multi-modal analysis

NVIDIA Clara Parabricks on Azure

Parabricks is a software suite for performing secondary analysis of next-generation sequencing (NGS) DNA and RNA data. A major benefit of Parabricks is that it is designed to deliver results at blazing fast speeds and low cost.

Get started with NVIDIA Clara Parabricks on Azure

Genomics Data Science Virtual Machine

The Genomics Data Science Virtual Machine (DSVM) is a customized DSVM image on the Azure platform, built specifically for data science. It has many popular data science tools preinstalled and preconfigured to jump-start the building of intelligent applications for advanced analytics.

Get Started with the Genomics Data Science VM

Bioconductor on Azure

Bioconductor provides hundreds of R-based bioinformatics tools for the analysis and comprehension of high-throughput genomic data.

Get started with Bioconductor on Azure

OpenCRAVAT on Azure

OpenCRAVAT is a new, open-source, scalable decision support system to support variant and gene prioritization with a dynamic GUI.

Get started with OpenCRAVAT on Azure

Accelerated precision health

Terra on Microsoft Azure powers global biomedical research collaborations and precision medicine. Terra is a scalable, secure research platform, jointly developed by the Broad Institute of MIT and Harvard, Microsoft, and Verily.

Get started with Terra

Learn more about Terra

Genomics workspace on Azure

Azure overview

With Azure, you have everything you need to build your genomics workspace in the cloud providing a centralized and secure environment to upload, analyze, and share data within and across organizations.

Customers accelerating research with Microsoft Genomics solutions

Exterior view of Children's Hospital

PacBio and Children’s Mercy Hospital

“We are excited to partner with Microsoft to improve the value of SMRT Sequencing to the Clinical/Translational Research Community. We believe combining our efforts will not only allow us to develop innovative solutions to improve end user experience but ultimately transform lives.”

Jonas Korlach, Ph.D.
Chief Scientific Officer, PacBio

Visit

Exterior view of Belfast Trust building.

Belfast Trust

“Leveraging Azure CycleCloud for analysis, we were able to save analysis time by 6 to 10 times and reduce the cost of analysis by roughly three times. This will enable us to expand our capacity to undertake more analysis and testing.”

Shirley Heggarty, Ph.D., FRCPath
Director, Regional Genetics Laboratory, Belfast City Hospital

Visit

Biotia

“At Biotia, by using Cromwell on Azure to back our compute-intensive genomics workflows, we have achieved substantial parallelization for our novel next-generation sequencing [NGS-based] COVID-19 detection and characterization assay.”

Joe Barrows
Director of Software Engineering, Biotia

Visit

A child reclines in a hospital bed while in the foreground medical monitoring equipment shows digital readouts

St. Jude Children’s Research Hospital

“Access to high-quality clinical genomic data, generated leveraging Microsoft Genomics service and streamed to St. Jude Cloud, will help further research in precision medicine for childhood cancer and other diseases.”

Dr. Jinghui Zhang
Chair, Department of Computational Biology

Visit

Stay up to date on our news

Terra logo

Biomedical research platform Terra now available on Microsoft Azure

Visit
Gene helix

Genomics workflows on secure lockdown environment using Cromwell on AKS

Visit
Our future health - NHS

UK’s largest health research programme supported by Microsoft Cloud

Visit

Partners

Zetta Genomics

Seqera Labs

3 Cloud

Sentieon

Veritas Genetics

Biotia

PacBio

Sophia Genetics

Databricks

Nvidia

DNA Stack

DNAnexus

BC Platforms