Intl. J. of Mol. Evol. and Biodivers. 2012, Vol. 2, No.1, 1-7
http://ijmeb.sophiapublisher.com
6
Figure 3 The color matrix of
Streptococcus pneumonia
TIGR4
Note: The horizontal axis and the longitudinal axis represent
the kinds of combination of trinucleotide respectively
Figure 4 The joint histogram of trinucleotide transition
probability matrix of organisms
Note: (a) Represents the joint histogram for both of
Streptococcus pneumonia
TIGR4 itself; (b) represent the joint
histogram for
Streptococcus pneumonia
TIGR4 and
Streptococcus pneumonia
D39
The formula used for joint histogram divergence is
(2)
Where
i
and
j
represent the row number and column
number of matrix HIST respectively;
HIST[i,j]
represents the value of the
i
-th row, the
j-
th column
element in matrix
HIST
.
3.4 Estimation of the novel evolution analysis method
The Euclidean distance is usually used to calculate the
diversity between sequences based on oligonucleotide
frequencies. And the joint histogram divergences of
trimucleotide transition probability matrixes were
compared with the Euclidean distances to evaluate the
discrimination of the novel evolution analysis method.
The Euclidean distance based on oligonucleotide
transition probability matrixes of two genomes was
calculated as
(3)
N
1ji,
2
ji,
ji,
y x
D
Here
N
represents the row number and column
number of oligonucleotide transition probability
matrix;
x
i ,j
and
y
i ,j
represent the element value of the
i
-th row and the
j
-th column in two transition
probability matrixes respectively.
Additionally, the traditional method for phylogenic
inference based on single gene or several genes has
limited discrimination, and it cannot distinguish the
intraspecies organisms efficiently (Bohlin et al., 2008).
So the joint histogram divergence was used to
reconstruct the phylogeny tree of organisms that is
hard to be identified by single gene. Here, for 11
organisms, their sequence alignment of 16S rRNA
gene was completed by CLUSTAL X, and the single
gene-based tree was constructed by neighbor-joint
method. Meanwhile, the joint histogram divergence
matrixes of these genomes were created and the
phylogeny tree was constructed by PHYLIP. Finally,
we compared the topologies of 16S rRNA gene-based
tree with those of genomes’ joint histogram distance
matrix-based tree to examine their ability to
reconstruct the phylogenetic relationship among these
11 microbial organisms. The result would be critical
for test the validity of our new method in differentiating
microbial species in closely related organisms.
) ,(
) ,(
2
,
,
j i
ji
j i
HIST
j i
HIST
i j
HD
Authors’ Contributions
CTY collected and analyzed the data, drafted and modified the
manuscript; QSH wrote and analyzed part of data. FZ also
analyzed part of data; YF directed the whole project, drafted
and modified the manuscript. All authors have read and
approved the final manuscript.
Acknowledgment
This work was supported by National Science Foundation of
China grants 10972081 and 11072080.
References
Amoutzias G.D., Robertson D.L., Oliver S.G. and Bornberg-Bauer E., 2004,
Convergent evolution of gene networks by single-gene duplications in