Yesterday the paper describing TCGA Pan-Cancer Project was published in Nature Genetics. We’ve had the opportunity to participate in this exciting project and here I would like to explain our experience and contribution to it.
We have been interested for quite a while in the study of patterns of genomics alterations in cancer across tumor types. Thus a project like the TCGA Pan-Cancer provided a unique opportunity to apply our tools and expertise to a unique collection of data.
In the past few years we have developed computational methodologies to identify cancer drivers by analyzing the patterns of somatic mutations across tumors (i.e OncodriveFM and OncodriveCLUST) as well as tools to facilitate the visual exploration of multidimensional cancer genomics datasets (i.e. Gitools, IntOGen, see our review on this topic if you are interested in this), we now had the opportunity to apply those tools to TCGA Pan-Cancer data.
On the quest to identify cancer driver genes, it has been observed that driver alterations that affect a pathway tend to be altered in a mutually exclusive manner. As far as we know this was first observed by Thomas et al., Nat Genet 2007, however from our experience this type of pattern can be observed in data from almost all cancer genomic projects. The rationale behind that observation is that once a gene involved in a particular critical pathway is altered, a second alteration affecting the same pathway does not confer a further selective advantage to the cancer cell. The concept of mutually exclusive alteration patterns has recently been exploited to identify cancer drivers (Ciriello et al, Genome Research 2011 and Vandin et al., Genome Research 2012).
We have published a new section in IntOGen in collaboration with Hautaniemi Lab, where you can do a gene correlation with the gene expression results of TCGA (The Cancer Genome Atlas) for a given subset of patients that share some specific clinical annotations. Then you will be able to browse your correlations next to global gene expression, SNP survival and DNA methylation analyses. The data analyzed so far includes 4 tumor types from TCGA: glioblastoma multiforme (GBM; 507 samples), ovarian serous adenocarcinoma (OV; 546 samples), breast invasive carcinoma (BRCA; 525 samples) and colon adenocarcinoma (COAD; 161 samples).