Annotates variants in MAF with OncoKB™ annotation. Supports both python2 and python3.
MafAnnotator.py: A Mutation Annotation Format (MAF) file is a tab-delimited text file that lists mutations. MAFAnnotator annotated genes from MAF file by OncoKB™ Level of Evidences rules.
When you run MafAnnotator.py, FusionAnnotator.py and CnaAnnotator.py, you need a token before accessing the OncoKB™ data via its web API. Please visit OncoKB™ Data Access Page for more information about how to register an account and get an OncoKB™ API token.
With the token listed under OncoKB™ Account Settings Page, you could use it in the following format.
When you type python MafAnnotator.py -h in terminal, you can see all Python command parameters as below. -i <input MAF file>, -o <output MAF file> and-b oncokb_api_bear_token are required.
MafAnnotator.py -i <input MAF file> -o <output MAF file> [-p previous results] [-c <input clinical file>] [-s sample list filter] [-t <default tumor type>] [-u oncokb-base-url] [-b oncokb_api_bear_token] [-a]
EssentialMAFcolumns (case insensitive):HUGO_SYMBOL:HugogenesymbolVARIANT_CLASSIFICATION:TranslationaleffectofvariantalleleTUMOR_SAMPLE_BARCODE:sampleIDAMINO_ACID_CHANGE:aminoacidchangePROTEIN_START:proteinstartPROTEIN_END:proteinendPROTEIN_POSITION:canbeusedinsteadofPROTEIN_STARTandPROTEIN_END (in theoutputofvcf2map)Essentialclinicalcolumns:SAMPLE_ID:sampleIDONCOTREE_CODE:tumortypecodefromoncotree (oncotree.mskcc.org)Cancertypewillbeassignedbasedonthefollowingpriority:1) ONCOTREE_CODE in clinical data file2) ONCOTREE_CODE exist in MAF3) default tumor type (-t)DefaultOncoKB™baseurlishttp://oncokb.org.Use-atoannotatemutationalhotspots
Fusion Annotator
When you type python FusionAnnotator.py -h in terminal, you can see all Python command parameters as below. The required parameters is the same with MAF Annotator.
FusionAnnotator.py -i <input Fusion file> -o <output Fusion file> [-p previous results] [-c <input clinical file>] [-s sample list filter] [-t <default tumor type>] [-u oncokb-base-url] [-b oncokb_api_bear_token]
EssentialFusioncolumns (case insensitive):HUGO_SYMBOL:HugogenesymbolVARIANT_CLASSIFICATION:TranslationaleffectofvariantalleleTUMOR_SAMPLE_BARCODE:sampleIDFUSION:aminoacidchange,e.g."TMPRSS2-ERG fusion"Essentialclinicalcolumns:SAMPLE_ID:sampleIDONCOTREE_CODE:tumortypecodefromoncotree (oncotree.mskcc.org)Cancertypewillbeassignedbasedonthefollowingpriority:1) ONCOTREE_CODE in clinical data file2) ONCOTREE_CODE exist in Fusion3) default tumor type (-t)DefaultOncoKB™baseurlishttp://oncokb.org
CNA Annotator
When you type python CnaAnnotator.py -h in terminal, you can see all Python command parameters as below. The required parameters is the same with MAF Annotator.
CnaAnnotator.py -i <input CNA file> -o <output CNA file> [-p previous results] [-c <input clinical file>] [-s sample list filter] [-t <default tumor type>] [-u oncokb-base-url] [-b oncokb_api_bear_token]
Input CNA file should follow the GISTIC output (https://cbioportal.readthedocs.io/en/latest/File-Formats.html#discrete-copy-number-data)
Essentialclinicalcolumns:SAMPLE_ID:sampleIDCancertypewillbeassignedbasedonthefollowingpriority:1) ONCOTREE_CODE in clinical data file2) ONCOTREE_CODE exist in MAF3) default tumor type (-t)DefaultOncoKB™baseurlishttp://oncokb.org
Clinical Data Annotator
When you type python ClinicalDataAnnotator.py -h in terminal, you can see all Python command parameters as below. -i <input clinical file>, -o <output clinical file> and-a <annotated alteration files, separate by ,> are required.
ClinicalDataAnnotator.py -i <input clinical file> -o <output clinical file> -a <annotated alteration files, separate by ,> [-s sample list filter]
Essentialclinicalcolumns:SAMPLE_ID:sampleID
OncoKB Plots
When you type python OncoKBPlots.py -h in terminal, you can see all Python command parameters as below. -i <input clinical file> and -o <output clinical file> are required.
OncoKBPlots.py -i <annotated clinical file> -o <output PDF file> [-c <categorization column, e.g. CANCER_TYPE>] [-s sample list filter] [-n threshold of # samples in a category] [-l comma separated levels to include]
Essentialclinicalcolumns:SAMPLE_ID:sampleIDHIGHEST_LEVEL:HighestOncoKBlevelsSupportedlevels (-l): LEVEL_1,LEVEL_2,LEVEL_3A,LEVEL_3B,LEVEL_4,ONCOGENIC,VUS