Intl. J. of Mol. Evol. and Biodivers. 2012, Vol. 2, No.1, 1-7
              
            
            
              http://ijmeb.sophiapublisher.com
            
            
              4
            
            
              Table 1 The Euclidian distance of trinucleotide transition probability matrixes of bacterial genomes
            
            
              
                L. lac
              
            
            
              _MG
            
            
              
                L. lac
              
            
            
              _SK
            
            
              
                L. lac
              
            
            
              _I1
            
            
              
                S. pyo
              
            
            
              _M1
            
            
              
                P. pen
              
            
            
              _AT
            
            
              
                S. aur
              
            
            
              _US
            
            
              
                A. met
              
            
            
              _QY
            
            
              
                L. lac
              
            
            
              _MG
            
            
              0
            
            
              
                L. lac
              
            
            
              _SK
            
            
              0. 044
            
            
              0
            
            
              
                L. lac
              
            
            
              _I1
            
            
              0. 062
            
            
              0. 073
            
            
              0
            
            
              
                S. pyo
              
            
            
              _M1
            
            
              0. 291
            
            
              0. 295
            
            
              0. 308
            
            
              0
            
            
              
                P. pen_AT
              
            
            
              0. 321
            
            
              0. 324
            
            
              0. 324
            
            
              0. 305
            
            
              0
            
            
              
                S. aur
              
            
            
              _US
            
            
              0. 392
            
            
              0. 396
            
            
              0. 390
            
            
              0. 392
            
            
              0. 334
            
            
              0
            
            
              
                A. met
              
            
            
              _QY
            
            
              0. 382
            
            
              0. 388
            
            
              0. 397
            
            
              0. 320
            
            
              0. 349
            
            
              0. 372
            
            
              0
            
            
              Table 2 The joint histogram divergence of trinucleotide transition probability matrixes of bacterial genomes
            
            
              
                L. lac
              
            
            
              _MG
            
            
              
                L. lac
              
            
            
              _SK
            
            
              
                L. lac
              
            
            
              _I1
            
            
              
                S. pyo
              
            
            
              _M1
            
            
              
                P. pen
              
            
            
              _AT
            
            
              
                S. aur
              
            
            
              _US
            
            
              
                A. met
              
            
            
              _QY
            
            
              
                L. lac
              
            
            
              _MG
            
            
              0
            
            
              
                L. lac
              
            
            
              _SK
            
            
              35. 674
            
            
              0
            
            
              
                L. lac
              
            
            
              _I1
            
            
              87. 366
            
            
              89. 496
            
            
              0
            
            
              
                S. pyo
              
            
            
              _M1
            
            
              1
            
            
              
            
            
              020. 933
            
            
              1
            
            
              
            
            
              043. 886
            
            
              1
            
            
              
            
            
              202. 584
            
            
              0
            
            
              
                P. pen
              
            
            
              _AT
            
            
              1
            
            
              
            
            
              090. 625
            
            
              1
            
            
              
            
            
              111. 164
            
            
              1
            
            
              
            
            
              193. 150
            
            
              1
            
            
              
            
            
              230. 305
            
            
              0
            
            
              
                S. aur
              
            
            
              _US
            
            
              1
            
            
              
            
            
              291. 842
            
            
              1
            
            
              
            
            
              311. 779
            
            
              1
            
            
              
            
            
              324. 448
            
            
              1
            
            
              
            
            
              475. 062
            
            
              1
            
            
              
            
            
              062. 187
            
            
              0
            
            
              
                A. met
              
            
            
              _QY
            
            
              2
            
            
              
            
            
              305. 814
            
            
              2
            
            
              
            
            
              344. 976
            
            
              2
            
            
              
            
            
              570. 574
            
            
              1
            
            
              
            
            
              586. 869
            
            
              1
            
            
              
            
            
              916. 826
            
            
              1
            
            
              
            
            
              768. 208
            
            
              0
            
            
              amplification effect on the diversity of genomes
            
            
              comparing with other methods previous mentioned.
            
            
              So, this new method we proposed performed higher
            
            
              accurately and efficiently in estimating the evolutionary
            
            
              distance of organisms for both closely related and
            
            
              distantly related as our calculated results exhibited.
            
            
              Moreover, another important conclusion that we
            
            
              confirmed is the positive correlation between JHD and
            
            
              their corresponding evolution distances exists for most
            
            
              organisms, the closer relationship of organisms has,
            
            
              and the smaller JHD is. However, there are exceptions
            
            
              that JHD of close related organisms is larger than the
            
            
              one of distant related organisms. Sometimes, which
            
            
              may partly due to the universal occurrence of
            
            
              horizontal gene transfer that caused high similarity of
            
            
              genomic fragment of distant related organisms
            
            
              (Gogarten and Townsend, 2005). Another reason is
            
            
              probably the appearance of convergent evolution,
            
            
              which could lead to the genome of distant related
            
            
              organisms change analogously (Amoutzias et al., 2004).
            
            
              Comparing with single-gene-based tree, this tree
            
            
              based on image registration of oligonucleotide
            
            
              transition probability matrices of microbial genomes
            
            
              had clearer branched structure for taxa both above and
            
            
              under the family level. Although this new method
            
            
              might not be succeed to analyze the phylogenetic
            
            
              relationship for some organisms and still be far away
            
            
              from perfect, there is an extensive development
            
            
              prospect to become a useful assistant tool or
            
            
              enlightening mean in identification of unknown
            
            
              organisms or phylogenetic inference (Tyagi et al.,
            
            
              2010). It could determine the lineage of unknown
            
            
              organisms by comparing with the definite organisms
            
            
              using the image registration technology. Also, it might
            
            
              be applicable on the genomes that have not been
            
            
              annotated because there will be no need to obtain
            
            
              homologous gene sequences and their multiple
            
            
              alignments. Of course, this novel method for evolution
            
            
              analysis we introduced needs further improvement and
            
            
              development to become a really effective method for
            
            
              species identification in the future.
            
            
              
                3 Materials and Methods
              
            
            
              
                3.1 Genomic sequences
              
            
            
              We selected 100 bacterial genomes for this analysis.
            
            
              Sequences data were obtained from NCBI website
            
            
              (http://www.ncbi.nlm.nih.gov/genomes/). A list of full
            
            
              binominal nomenclature with strain tags of these
            
            
              organisms together with their NCBI accession
            
            
              numbers and lineage codes is given in Supplement 1.
            
            
              The “lineage code” given in the fourth column is
            
            
              explained below.
            
            
              As the taxons in NCBI only listed their names, we
            
            
              have generated an explicit numbering for all taxon
            
            
              ranking from phyla down to strains. For example,