Cancer Genetics and Epigenetics 2016, Vol.4, No.2, 1-9
6
Figure 4 Survival curves of high risk group and low risk group
Discussion
Breast cancer is a heterogeneous cancer which is the world's highest incidence of women; it is a serious threat
to women's health. The occurrence of breast cancer is a complex biological process which a number of genes
involved and regulated. The various levels of gene expression in tumor cells of different individuals
determined the difference in treatment and prognosis
. So investigate characteristic changes
of breast cancer from the gene level and detection of breast cancer prognostic biomarkers will play important
roles in guiding breast cancer therapy.
With the development of high-throughput sequencing technology, making it easier for researchers understand
the mechanism of occurrence and development of disease from the genome -wide level. RNA-Seq
transcriptome sequencing technology is to sequence mRNA, smallRNA noncoding RNA and the like, to
reflect their level of expression. TCGA is a free public database resource includes many types of cancers and
a variety of data types. From which we can obtain a large number of RNA-Seq samples of breast cancer. We
obtained 1099 breast cancer tumor samples and 110 normal control samples of RNA-SeqV2 Level 3 gene
expression data from TCGA database, and analyzed the genetic characteristics of breast cancer on a
genome-wide level to find differentially expressed genes in breast cancer as well as molecular markers for
cancer prognosis.
0
100
200
300
400
0.0
0.2
0.4
0.6
0.8
1.0
Survival
month
Survival Probability
high risk
low risk
p=2.95e−05