CGG_2024v15n2

Cotton Genomics and Genetics 2024, Vol.15, No.2, 66-80 http://cropscipublisher.com/index.php/cgg 68 the speed and efficiency of genome sequencing while significantly reducing costs, making whole-genome sequencing feasible. These technological advancements have greatly propelled cotton genome research. With the advent of high-throughput sequencing technologies, scientists have been able to deeply sequence and assemble the entire cotton genome. Using platforms such as PacBio and Illumina, researchers have successfully completed the whole-genome sequencing of multiple cotton varieties and related species. These datasets have provided high-quality genome sequences and revealed complex structures within the cotton genome, such as repetitive sequences, gene family expansions, and gene recombination events. These findings are crucial for understanding the evolution and function of the cotton genome. The integration of multi-omics data is another significant advancement in cotton genome research. By combining genome sequencing with transcriptomics, proteomics, and metabolomics data, scientists can gain a comprehensive understanding of gene expression regulation and metabolic pathway dynamics. These studies have revealed gene expression patterns and regulatory mechanisms in cotton under various developmental stages and environmental conditions. For example, integrating transcriptomics and metabolomics data has identified key regulatory genes and metabolic pathways involved in fiber development and stress resistance (Li et al., 2021). 3 Technological Advances in Genome Sequencing 3.1 Development of high-throughput sequencing technologies High-throughput sequencing (HTS) technologies have revolutionized the field of genomics, allowing for the rapid and comprehensive sequencing of complex genomes such as those of cotton (Gossypium). These technologies have evolved significantly since the advent of Sanger sequencing, providing higher speed, accuracy, and cost-effectiveness. The development of HTS platforms like Illumina, PacBio, and Oxford Nanopore has enabled the detailed sequencing of both diploid and tetraploid cotton species, revealing intricate details of their genome structures (Wang et al., 2018). One of the landmark achievements in this domain was the sequencing of Gossypium hirsutum and Gossypium barbadense, which provided high-quality reference genomes. This was achieved by integrating single-molecule real-time sequencing, BioNano optical mapping, and high-throughput chromosome conformation capture techniques. These efforts have significantly improved the contiguity and completeness of the genome assemblies, especially in regions with high repeat content like centromeres. The assembly of the Gossypium hirsutum TM-1 genome, which integrated whole-genome shotgun reads, BAC-end sequences, and genetic maps, highlighted the asymmetric evolution between A and D subgenomes. This comprehensive sequencing effort provided critical insights into the genomic signatures of selection and domestication associated with fiber improvement and stress tolerance(Zhang et al., 2015). 3.2 Innovations in data analysis and bioinformatics The rapid advancements in sequencing technologies have been paralleled by significant innovations in data analysis and bioinformatics. The complexity and volume of data generated by HTS necessitate sophisticated tools and methods for data processing, analysis, and interpretation. Innovations in bioinformatics have enabled the efficient handling of large datasets, facilitating the assembly, annotation, and comparative analysis of cotton genomes. Bioinformatics platforms such as COTTONOMICS integrate vast amounts of genomic, transcriptomic, and epigenetic data, providing a comprehensive database for cotton research. This platform allows for the retrieval and analysis of data concerning cotton genomes, genomic variations, gene expression, and epigenetic regulation, thereby enabling researchers to decipher complex genetic traits and their regulatory networks (Dai et al., 2022). The development of novel computational pipelines such as IGIA for reconstructing accurate gene structures from integrated data has been pivotal. These pipelines facilitate the exploration of transcriptional landscapes in cotton, revealing dynamic gene expression patterns, alternative splicing events, and regulatory mechanisms involved in fiber development and stress responses (Wang et al., 2019).

RkJQdWJsaXNoZXIy MjQ4ODYzNQ==