# Supported Data Types

cBioPortal is a multimodal cancer data visualization tool and supports a variety of data types. For some data types, we have explicit support; for others, you can leverage generic assay or clinical data to assign arbitrary data to either a sample or patient. Beyond including data directly into cBioPortal, one can also link in additional external viewers specialized for particular data types, or even entirely different dashboards or portals. This combination of structured and flexible data formats, as well as the option to link out to other viewers, allows cBioPortal to be used for many different combinations of clinical data and profiling methods.

# Molecular Data

Assay Processed Data cBioPortal Data Type Example cBioPortal Study
Bulk DNA Sequencing Mutations Mutations MSK-IMPACT (Nat Med, 2017)
- Copy Number Alterations per Gene (CNA) CNA MSK-IMPACT (Nat Med, 2017)
- Arm Level CNA Data Generic Assay TCGA Pancan (Cell 2018)
- Copy Number Segmentation Segmented Data MSK-IMPACT (Nat Med, 2017)
- Structural Variants Structural Variants PCAWG (Nature, 2020)
- Mutational Signatures Generic Assay PCAWG (Nature, 2020)
- Genetic Ancestry Generic Assay TCGA Pancan (Cell, 2018)
Bulk RNA Sequencing Expression Expression TCGA Pancan (Cell, 2018)
Bulk DNA Methylation Methylation per probe Generic Assay TCGA Pancan (Cell, 2018)
Proteomics Protein Expression Protein Expression Cancer Cell Line Encyclopedia (Broad, 2019)
Other Assays Any sample-level and/or patient-level attributes (e.g. LHA types, microbiome) Generic Assay

# External Viewer Integration

There are several other data types for which there is no native support. However cBioPortal offers a generic way of linking in additional viewers using the Resource Data Format. Technically this is known as an iframe link. These viewers can be linked at the cohort, patient or individual sample level. Beyond just linking viewers, we also have several suggestions on what kind of derived data can be added from these assays directly into cBioPortal for multimodal analysis. This mainly leverages the Generic Assay Format.

Assay External Viewer Derived data to load into cBioPortal Example cBioPortal Study
scRNAseq CellxGene, Vitesssce, Broad Single Cell Portal Cell fractions per sample, pseudobulk expression per gene per sample HTAN Vanderbilt (Cell, 2021)
GeoMx Minerva RNA expression per Region of Interest Ovarian Cancer (Gray Foundation, Cancer Discov 2024)
CyCIF/ORION Minerva Cell marker density in tumor regions Colorectal Adenocarcinoma (DFCI/Orion, 2024)
H&E Minerva, CDSA MSK SPECTRUM (Nature, 2022)
CT OHIF Tumor volume, density, etc. TCGA Pancan (Cell, 2021)
scDNAseq CellxGene, Vitesssce, Broad Single Cell Portal
scATACseq CellxGene, Vitesssce, Broad Single Cell Portal
Tapestri TBD Protein expression per sample TBD
3D CyCIF TBD TBD TBD
Your custom assay Your custom viewer Any sample-level or patient-level data

# Clinical Data

The clinical data are free-form de-identified patient and sample attributes. We don't follow a specific ontology, but rely on the authors of the published studies to determine the organization of their clinical data. The only exception is the cancer type categorization which uses the Oncotree ontology. We do allow encoding of the data type for each column (e.g. numerical, text, etc). More details can be found in the Clinical Data File Formats section. We also have a timeline format for incorporating treatment information (see e.g. this AACR GENIE lung cancer case).