MGG_2024v15n5

Maize Genomics and Genetics 2024, Vol.15, No.5, 247-256 http://cropscipublisher.com/index.php/mgg 248 sequencing of millions of small DNA fragments, which are then computationally assembled to reconstruct the original sequence. This approach contrasts with traditional Sanger sequencing, which sequences DNA fragments one at a time. HTS technologies have significantly reduced the cost and time required for sequencing, making it feasible to sequence entire genomes, transcriptomes, and other complex genetic materials (Dalca and Brudno, 2010; Reon and Dutta, 2016; Lee, 2023). Several HTS platforms are commonly used in genomic research, each with its unique advantages and limitations. Illumina sequencing, one of the most widely used platforms, is known for its high accuracy and throughput, making it suitable for a wide range of applications, including whole-genome sequencing and RNA sequencing. Pacific Biosciences (PacBio) sequencing offers long-read capabilities, which are beneficial for resolving complex genomic regions and structural variants. Oxford Nanopore Technologies (ONT) provides real-time sequencing and ultra-long reads, which are advantageous for de novo genome assembly and the detection of epigenetic modifications. Each of these platforms has contributed to the advancement of genomic research by providing diverse tools to address different scientific questions (Caspar et al., 2018; Pradhan et al., 2019; Lee, 2023). 2.2 The current status of maize genome sequencing The sequencing of the maize genome has undergone significant advancements over the past decade. Initial efforts focused on generating a reference genome for maize, which provided a foundation for subsequent research. The first draft of the maize genome was published in 2009, marking a milestone in plant genomics. Since then, continuous improvements in sequencing technologies and bioinformatics tools have enabled more detailed and accurate assemblies of the maize genome. These advancements have facilitated the identification of genetic variations and the understanding of complex traits in maize, contributing to crop improvement and precision breeding efforts (Pérez-Losada et al., 2020; Farooqi et al., 2022). Several reference genomes for maize have been completed, each providing valuable insights into the genetic architecture of this important crop. Notable projects include the B73 reference genome, which has been extensively used in maize research, and the more recent NAM (Nested Association Mapping) population genomes, which offer a broader representation of maize genetic diversity. These reference genomes have been instrumental in identifying quantitative trait loci (QTL), genes, and alleles associated with important agronomic traits. Research projects leveraging these reference genomes have advanced researchers understanding of maize biology and facilitated the development of improved maize varieties with enhanced yield, stress tolerance, and nutritional quality (Reon and Dutta, 2016; Farooqi et al., 2022). 2.3 Data processing and analysis Processing HTS data involves several critical steps to ensure accurate and reliable results. The initial step is quality control, where raw sequencing reads are assessed and filtered to remove low-quality data. This is followed by read alignment, where the filtered reads are mapped to a reference genome or assembled de novo. Subsequent steps include variant calling, where genetic variants such as single nucleotide polymorphisms (SNPs) and insertions/deletions (indels) are identified, and annotation, where the functional implications of these variants are determined. Each of these steps requires specialized bioinformatics tools and algorithms to handle the large volume of data generated by HTS (Dalca and Brudno, 2010; Altmann et al., 2012; Guo et al., 2017). Bioinformatics tools play a crucial role in the analysis of maize HTS data, enabling researchers to extract meaningful insights from complex datasets. Tools for read alignment, such as BWA and Bowtie, are used to map sequencing reads to the maize reference genome. Variant calling tools, such as GATK and SAMtools, identify genetic variants, while annotation tools, such as ANNOVAR and SnpEff, predict the functional impact of these variants. Additionally, specialized software for transcriptome analysis, such as Cufflinks and DESeq, are used to study gene expression patterns. The integration of these tools allows researchers to uncover the genetic basis of important traits in maize and apply this knowledge to precision breeding programs (Altmann et al., 2012; Pradhan et al., 2019; Quijada et al., 2020).

RkJQdWJsaXNoZXIy MjQ4ODYzNQ==