|
1
|
submitter_id
|
clinical
|
character
|
NA
|
Patient identifier assigned by the submitting institution (MMRF)
|
MMRF_0001 to MMRF_2149
|
<a href=“https://docs.gdc.cancer.gov/Data_Dictionary/viewer/#?view=table-definition-view&id=clinical” target=“_blank”>GDC</a>
|
|
2
|
project_id
|
clinical
|
character
|
NA
|
GDC project identifier
|
MMRF-COMMPASS
|
<a href=“https://docs.gdc.cancer.gov/Data_Dictionary/viewer/#?view=table-definition-view&id=clinical” target=“_blank”>GDC</a>
|
|
3
|
age_at_diagnosis
|
clinical
|
integer
|
days
|
Age at primary diagnosis in DAYS (divide by 365.25 for years). GDC stores all ages in days.
|
10000-35000 (approx 27-96 years)
|
<a href=“https://docs.gdc.cancer.gov/Data_Dictionary/viewer/#?view=table-definition-view&id=clinical” target=“_blank”>GDC</a>
|
|
4
|
gender
|
clinical
|
character
|
NA
|
Patient sex/gender
|
female, male, not reported
|
<a href=“https://docs.gdc.cancer.gov/Data_Dictionary/viewer/#?view=table-definition-view&id=clinical” target=“_blank”>GDC</a>
|
|
5
|
race
|
clinical
|
character
|
NA
|
Patient race category per NIH guidelines
|
white, black or african american, asian, not reported, other
|
<a href=“https://docs.gdc.cancer.gov/Data_Dictionary/viewer/#?view=table-definition-view&id=clinical” target=“_blank”>GDC</a>
|
|
6
|
ethnicity
|
clinical
|
character
|
NA
|
Patient ethnicity per NIH guidelines
|
not hispanic or latino, hispanic or latino, not reported
|
<a href=“https://docs.gdc.cancer.gov/Data_Dictionary/viewer/#?view=table-definition-view&id=clinical” target=“_blank”>GDC</a>
|
|
7
|
vital_status
|
clinical
|
character
|
NA
|
Patient vital status at last follow-up
|
Alive, Dead, Not Reported
|
<a href=“https://docs.gdc.cancer.gov/Data_Dictionary/viewer/#?view=table-definition-view&id=clinical” target=“_blank”>GDC</a>
|
|
8
|
days_to_death
|
clinical
|
integer
|
days
|
Number of days from diagnosis to death. NA if patient is alive.
|
0-5000+
|
<a href=“https://docs.gdc.cancer.gov/Data_Dictionary/viewer/#?view=table-definition-view&id=clinical” target=“_blank”>GDC</a>
|
|
9
|
days_to_last_follow_up
|
clinical
|
integer
|
days
|
Number of days from diagnosis to last follow-up. NA if patient is deceased.
|
0-5000+
|
<a href=“https://docs.gdc.cancer.gov/Data_Dictionary/viewer/#?view=table-definition-view&id=clinical” target=“_blank”>GDC</a>
|
|
10
|
primary_diagnosis
|
clinical
|
character
|
NA
|
ICD-O-3 morphology code description for the primary diagnosis
|
Plasma cell myeloma, Myeloma NOS
|
<a href=“https://docs.gdc.cancer.gov/Data_Dictionary/viewer/#?view=table-definition-view&id=clinical” target=“_blank”>GDC</a>
|
|
11
|
disease_type
|
clinical
|
character
|
NA
|
Type of disease studied
|
Multiple Myeloma
|
<a href=“https://docs.gdc.cancer.gov/Data_Dictionary/viewer/#?view=table-definition-view&id=clinical” target=“_blank”>GDC</a>
|
|
12
|
site_of_resection_or_biopsy
|
clinical
|
character
|
NA
|
Anatomic site of tissue sample collection
|
Bone marrow, Blood
|
<a href=“https://docs.gdc.cancer.gov/Data_Dictionary/viewer/#?view=table-definition-view&id=clinical” target=“_blank”>GDC</a>
|
|
13
|
tissue_or_organ_of_origin
|
clinical
|
character
|
NA
|
Anatomic site of the disease origin
|
Bone marrow
|
<a href=“https://docs.gdc.cancer.gov/Data_Dictionary/viewer/#?view=table-definition-view&id=clinical” target=“_blank”>GDC</a>
|
|
14
|
year_of_diagnosis
|
clinical
|
integer
|
year
|
Calendar year of primary diagnosis
|
2005-2020
|
<a href=“https://docs.gdc.cancer.gov/Data_Dictionary/viewer/#?view=table-definition-view&id=clinical” target=“_blank”>GDC</a>
|
|
15
|
classification_of_tumor
|
clinical
|
character
|
NA
|
Tumor classification
|
primary, recurrence, metastasis, not reported
|
<a href=“https://docs.gdc.cancer.gov/Data_Dictionary/viewer/#?view=table-definition-view&id=clinical” target=“_blank”>GDC</a>
|
|
16
|
prior_malignancy
|
clinical
|
character
|
NA
|
Whether patient had a prior malignancy
|
yes, no, not reported
|
<a href=“https://docs.gdc.cancer.gov/Data_Dictionary/viewer/#?view=table-definition-view&id=clinical” target=“_blank”>GDC</a>
|
|
17
|
prior_treatment
|
clinical
|
character
|
NA
|
Whether patient received prior treatment
|
yes, no, not reported
|
<a href=“https://docs.gdc.cancer.gov/Data_Dictionary/viewer/#?view=table-definition-view&id=clinical” target=“_blank”>GDC</a>
|
|
18
|
ajcc_staging_system_edition
|
clinical
|
character
|
NA
|
AJCC staging edition used
|
various editions
|
<a href=“https://docs.gdc.cancer.gov/Data_Dictionary/viewer/#?view=table-definition-view&id=clinical” target=“_blank”>GDC</a>
|
|
19
|
days_to_last_known_disease_status
|
clinical
|
integer
|
days
|
Days from diagnosis to last disease status assessment
|
0-5000+
|
<a href=“https://docs.gdc.cancer.gov/Data_Dictionary/viewer/#?view=table-definition-view&id=clinical” target=“_blank”>GDC</a>
|
|
20
|
sample_submitter_id
|
biospecimen
|
character
|
NA
|
Sample identifier assigned by submitting institution
|
MMRF_0001_1_BM, etc.
|
<a href=“https://docs.gdc.cancer.gov/Data_Dictionary/viewer/#?view=table-definition-view&id=sample” target=“_blank”>GDC</a>
|
|
21
|
sample_id
|
biospecimen
|
character
|
NA
|
GDC-assigned UUID for the sample
|
UUID format
|
<a href=“https://docs.gdc.cancer.gov/Data_Dictionary/viewer/#?view=table-definition-view&id=sample” target=“_blank”>GDC</a>
|
|
22
|
sample_type
|
biospecimen
|
character
|
NA
|
Type of sample collected
|
Primary Blood Derived Cancer - Bone Marrow, etc.
|
<a href=“https://docs.gdc.cancer.gov/Data_Dictionary/viewer/#?view=table-definition-view&id=sample” target=“_blank”>GDC</a>
|
|
23
|
sample_type_id
|
biospecimen
|
character
|
NA
|
Numeric code for sample type
|
01, 09, 10, etc.
|
<a href=“https://docs.gdc.cancer.gov/Data_Dictionary/viewer/#?view=table-definition-view&id=sample” target=“_blank”>GDC</a>
|
|
24
|
tissue_type
|
biospecimen
|
character
|
NA
|
Whether tissue is tumor or normal
|
Tumor, Normal
|
<a href=“https://docs.gdc.cancer.gov/Data_Dictionary/viewer/#?view=table-definition-view&id=sample” target=“_blank”>GDC</a>
|
|
25
|
preservation_method
|
biospecimen
|
character
|
NA
|
Method used to preserve the sample
|
FFPE, Frozen, etc.
|
<a href=“https://docs.gdc.cancer.gov/Data_Dictionary/viewer/#?view=table-definition-view&id=sample” target=“_blank”>GDC</a>
|
|
26
|
composition
|
biospecimen
|
character
|
NA
|
Sample composition category
|
Bone Marrow Components, Blood Derived
|
<a href=“https://docs.gdc.cancer.gov/Data_Dictionary/viewer/#?view=table-definition-view&id=sample” target=“_blank”>GDC</a>
|
|
27
|
gene_id
|
rnaseq_counts
|
character
|
NA
|
Ensembl gene identifier (ENSG with version)
|
ENSG00000000003.15
|
<a href=“https://docs.gdc.cancer.gov/Data/Bioinformatics_Pipelines/Expression_mRNA_Pipeline/” target=“_blank”>GDC</a>
|
|
28
|
unstranded
|
rnaseq_counts
|
integer
|
raw counts
|
Unstranded read counts from STAR aligner. Use for DESeq2/edgeR analysis.
|
0-1000000+
|
<a href=“https://docs.gdc.cancer.gov/Data/Bioinformatics_Pipelines/Expression_mRNA_Pipeline/” target=“_blank”>GDC</a>
|
|
29
|
stranded_first
|
rnaseq_counts
|
integer
|
raw counts
|
First-strand read counts (dUTP protocol)
|
0-1000000+
|
<a href=“https://docs.gdc.cancer.gov/Data/Bioinformatics_Pipelines/Expression_mRNA_Pipeline/” target=“_blank”>GDC</a>
|
|
30
|
stranded_second
|
rnaseq_counts
|
integer
|
raw counts
|
Second-strand read counts
|
0-1000000+
|
<a href=“https://docs.gdc.cancer.gov/Data/Bioinformatics_Pipelines/Expression_mRNA_Pipeline/” target=“_blank”>GDC</a>
|
|
31
|
tpm_unstranded
|
rnaseq_counts
|
numeric
|
TPM
|
Transcripts Per Million (unstranded). Normalized for gene length and sequencing depth.
|
0-100000+
|
<a href=“https://docs.gdc.cancer.gov/Data/Bioinformatics_Pipelines/Expression_mRNA_Pipeline/” target=“_blank”>GDC</a>
|
|
32
|
fpkm_unstranded
|
rnaseq_counts
|
numeric
|
FPKM
|
Fragments Per Kilobase of transcript per Million mapped reads (unstranded).
|
0-100000+
|
<a href=“https://docs.gdc.cancer.gov/Data/Bioinformatics_Pipelines/Expression_mRNA_Pipeline/” target=“_blank”>GDC</a>
|
|
33
|
fpkm_uq_unstranded
|
rnaseq_counts
|
numeric
|
FPKM-UQ
|
Upper quartile normalized FPKM (unstranded).
|
0-100000+
|
<a href=“https://docs.gdc.cancer.gov/Data/Bioinformatics_Pipelines/Expression_mRNA_Pipeline/” target=“_blank”>GDC</a>
|
|
34
|
gene_name
|
rnaseq_metadata
|
character
|
NA
|
HGNC gene symbol
|
TP53, KRAS, MYC, etc.
|
<a href=“https://docs.gdc.cancer.gov/Data/Bioinformatics_Pipelines/Expression_mRNA_Pipeline/” target=“_blank”>GDC</a>
|
|
35
|
gene_type
|
rnaseq_metadata
|
character
|
NA
|
Biotype from GENCODE annotation
|
protein_coding, lncRNA, miRNA, processed_pseudogene, etc.
|
<a href=“https://docs.gdc.cancer.gov/Data/Bioinformatics_Pipelines/Expression_mRNA_Pipeline/” target=“_blank”>GDC</a>
|
|
36
|
barcode
|
rnaseq_metadata
|
character
|
NA
|
TCGA-style barcode for the sample
|
MMRF-COMMPASS-XXXX-TBM-…
|
<a href=“https://docs.gdc.cancer.gov/Data/Bioinformatics_Pipelines/Expression_mRNA_Pipeline/” target=“_blank”>GDC</a>
|
|
37
|
patient
|
rnaseq_metadata
|
character
|
NA
|
Patient identifier extracted from barcode
|
MMRF-COMMPASS-XXXX
|
<a href=“https://docs.gdc.cancer.gov/Data/Bioinformatics_Pipelines/Expression_mRNA_Pipeline/” target=“_blank”>GDC</a>
|
|
38
|
sample_type
|
rnaseq_metadata
|
character
|
NA
|
Sample type from barcode decoding
|
Primary Blood Derived Cancer - Bone Marrow
|
<a href=“https://docs.gdc.cancer.gov/Data/Bioinformatics_Pipelines/Expression_mRNA_Pipeline/” target=“_blank”>GDC</a>
|