
Triticeae Genomics and Genetics 2025, Vol.16 http://cropscipublisher.com/index.php/tgg © 2025 CropSci Publisher, registered at the publishing platform that is operated by Sophia Publishing Group, founded in British Columbia of Canada. All Rights Reserved. CropSci Publisher is an international Open Access publishing specializing in Triticeae genome, trait-controlling, Triticeae gene expression and regulation at the publishing platform that is operated by Sophia Publishing Group (SPG), founded in British Columbia of Canada Publisher CropSci Publisher Edited by Editorial Team of Triticeae Genomics and Genetics Email: edit@tgg.cropscipublisher.com Website: http://cropscipublisher.com/index.php/tgg Address: 11388 Stevenston Hwy, PO Box 96016, Richmond, V7A 5J5, British Columbia Canada Triticeae Genomics and Genetics (ISSN 1925-203X) is an open access, peer reviewed journal published online by CropSci Publisher. The journal publishes original papers involving in all aspects of Triticeae sciences. Subject areas covered comprise classical genetics analysis, structural and functional analysis of Triticeae genome, gene expression and regulation, efficient breeding of improved varieties, as well as transgenic varieties. It is positioned to meet the needs of breeders, geneticists, molecular biologists, and anyone, worldwide, engaged in the field of Triticeae research. All the articles published in Triticeae Genomics and Genetics are Open Access, and are distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. CropSci Publisher uses CrossCheck service to identify academic plagiarism through the world’s leading plagiarism prevention tool, iParadigms, and to protect the original authors’ copyrights.


Triticeae Genomics and Genetics, 2025, Vol.16, No.2, 54-62
http://cropscipublisher.com/index.php/tgg

Research Insight Open Access

Pangenome Construction of Triticum aestivum and Its Implications for Genetic Diversity

Wenyu Yang, Chunxiang Ma
Modern Agricultural Research Center, Cuixi Academy of Biotechnology, Zhuji, 311800, Zhejiang, China
Corresponding email: chunxiang.ma@cuixi.org
Triticeae Genomics and Genetics, 2025, Vol.16, No.2 doi: 10.5376/tgg.2025.16.0006
Received: 08 Jan., 2025 Accepted: 20 Feb., 2025 Published: 05 Mar., 2025
Copyright © 2025 Yang and Ma. This is an open access article published under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Preferred citation for this article:
Yang W.Y., and Ma C.X., 2025, Pangenome construction of Triticum aestivum and its implications for genetic diversity, Triticeae Genomics and Genetics, 16(2): 54-62 (doi: 10.5376/tgg.2025.16.0006)

Abstract Common wheat (Triticum aestivum) is a globally important staple food crop. However, reliance on a single reference genome limits our understanding of its extensive genomic diversity. This study explores the construction of a wheat pan-genome, using advanced sequencing technology, graph-based genome representation, and integrated bioinformatics pipelines to capture core and dispensable gene content. We analyzed structural variations, gene presence-absence variations (PAVs), and copy number variations (CNVs), revealing significant genetic diversity in common wheat (T. aestivum). These findings matter for wheat breeding: they strengthen trait association research, genomic selection, and adaptation to climate change.
We further discuss the evolutionary insights gained from pan-genome data, including domestication events, population structure and gene family expansion, and highlight the key contributions of the "10+ Wheat Genome Project". A comprehensive understanding of the wheat genome underlines the need to keep developing inclusive and scalable pan-genomes, to integrate multi-omics data, and to cooperate internationally, with the ultimate aim of supporting sustainable agriculture and crop improvement.

Keywords Triticum aestivum; Pangenome; Genetic diversity; Wheat breeding; Structural variation

1 Introduction
The name Triticum aestivum might sound somewhat academic, but it is simply the bread wheat that people eat in their daily lives. Its importance needs no exaggeration. It supplies a large part of the daily calories and protein of billions of people (Cavalet-Giorsa et al., 2023). In many places, the stability of wheat production determines the stability of food security. The problem is that the population is still growing and the climate is becoming increasingly difficult to predict. The demands on wheat have risen accordingly: not only high yield, but also disease resistance, drought resistance, and preferably better nutrition (Bayer et al., 2022). These demands may sound self-evident, but the research foundation behind them has been less solid than one might assume (Huang et al., 2024).

Over the past few decades, most research on wheat genomes has revolved around a single reference genome. In other words, one "standard sample" has been used to represent all wheat (Zanini et al., 2021). This is convenient, but the problem is obvious: there are significant genetic differences among varieties, and even more between wheat and its wild relatives. Using one sample to infer the situation of an entire species is like using one person's height to represent the height distribution of all humankind.
This approach misses a great deal, such as whether certain genes are present at all, changes in gene structure, or novel genes closely tied to agronomic traits (Przewieslik-Allen et al., 2021).

This is where the idea of the "pan-genome" comes in. It does not focus on a single genome but brings together multiple genomes from different sources to show the complete genetic picture of a species. Genes present in all varieties are core genes, while those found only in some varieties are dispensable (accessory) genes. Regulatory elements and structural variations can also be captured in the pan-genome. Wheat pan-genome data and graph databases can now be explored visually. Researchers can use them to compare the genetic differences among varieties, and breeders can use the same comparisons to select more suitable materials (Barabaschi et al., 2025).
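The core versus dispensable distinction can be illustrated with a minimal Python sketch. It assumes a hypothetical presence-absence table; the gene and variety names are invented for illustration and do not come from any wheat dataset. A gene counts as core only if every variety in the panel carries it.

```python
# Hypothetical presence-absence table: gene -> set of varieties carrying it.
# All gene and variety names are invented for illustration only.
pav = {
    "geneA": {"Var1", "Var2", "Var3"},
    "geneB": {"Var1", "Var3"},
    "geneC": {"Var2"},
    "geneD": {"Var1", "Var2", "Var3"},
}
varieties = {"Var1", "Var2", "Var3"}


def classify_genes(pav, varieties):
    """Split genes into core (present in every variety) and dispensable."""
    # A gene is core when its carrier set covers the whole panel.
    core = sorted(g for g, carriers in pav.items() if carriers >= varieties)
    dispensable = sorted(g for g in pav if g not in core)
    return core, dispensable


core_genes, dispensable_genes = classify_genes(pav, varieties)
print("core:", core_genes)
print("dispensable:", dispensable_genes)
```

Note that the classification depends on the panel: adding more varieties can only move genes from the core set to the dispensable set, never the other way round.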

The starting point of this study is straightforward: by constructing the wheat pan-genome, the aim is to clarify which gene differences are related to important traits. Ancient subspecies are included alongside modern wheat. Comparing them can reveal gene loci and structural variations that went unnoticed in the past and uncover useful alleles, broadening the genetic base of wheat. Sequencing and analysis technologies are now far more mature than before, so pan-genome assemblies will become increasingly precise, which is an important step toward sustainable wheat production.

2 Advances in Pangenome Construction Technologies
2.1 Sequencing platforms and assembly methods
Ten years ago, the claim that all the genetic diversity of wheat could be fully characterized would have sounded like a fairy tale to most people. The wheat genome is large and complex, and bread wheat is a hexaploid carrying three subgenomes. In recent years, however, the situation has changed. Sequencing technology has advanced rapidly, costs are no longer prohibitive, and the conditions for conducting such research have become far less restrictive.

The three commonly used construction approaches did not exist from the start but evolved gradually through research and practice. The first is de novo assembly, which is completely independent of existing reference genomes and directly pieces together a genome for each variety. The second is reference-based iterative assembly, which uses an existing assembly as the foundation and then adds new segments from other varieties. The third, increasingly popular in recent years, is the graph-based pan-genome, which directly represents the differences among varieties as a graph (Hu et al., 2024). Having a method, however, does not solve the problem once and for all.
The wheat genome is full of repetitive sequences, interspersed with deletions and other sequence variation, and these regions are particularly difficult to assemble. In practice, the whole process still depends on high-throughput, high-accuracy sequencing platforms.

2.2 Data integration and graph-based pangenomes
Assembling the genomes of multiple varieties into one large graph sounds like building with blocks, but it is far from easy. The amount of information this approach yields is nevertheless striking, especially for gene presence-absence variations (PAVs) and structural variations, where the differences can be seen at a glance (Zanini et al., 2021). Some platforms have turned this idea into tools, such as Wheat Panache, which many researchers already use. Its advantage is intuitive operation: it can directly compare the genomic regions of different varieties, and even complex variations can be detected (Figure 1) (Bayer et al., 2022). Moreover, such a graph is not just for show. It can mark which regions are coding and which are regulatory, and it can integrate transcriptome and epigenome data. Put bluntly, this is no longer a simple jigsaw puzzle; it is more like opening a door to in-depth research.

2.3 Challenges in constructing a wheat pangenome
The difficulties have not gone away. The wheat genome is inherently large, and its high repeat content and polyploid structure make analysis hard. Some might say, "Isn't technology getting more and more advanced?" It is, but the volume of data is also expanding rapidly. Storage and management alone require substantial computing resources. The real challenge lies in precisely identifying variations from piles of sequences, integrating them, and then producing clear and usable visual results.
Moreover, not every researcher is proficient with complex data tools, so a lot of data ends up "useful but not applicable". For this reason, many teams now emphasize the need to build "user-friendly" analysis platforms. Only when more people can access and use these resources smoothly can pan-genome research be put into practice.

3 Genetic Diversity Uncovered by the Wheat Pangenome
3.1 Core and dispensable gene content
Wheat gene content is more than a question of present or absent. Pan-genome studies reveal that it consists of two components: core genes common to all wheat varieties, and a subset of genes found only in certain varieties, often referred to as dispensable genes or, more bluntly, "variable genes". To give specific numbers, the pan-genome of hexaploid bread wheat contains approximately 140,500 genes, of which approximately 81,070 are in the core portion. On average, each variety contains around 128,656 genes (Montenegro et al., 2017). These numbers are large, but what is more interesting is that the "dispensable" genes are a treasure trove of trait-associated genes. Drought resistance, disease resistance, and adaptability to diverse environments are often associated with these more variable genes.

Figure 1 Wheat Panache screenshot showing an Aegilops ventricosa introgression at the beginning of chromosome 2 in cultivars Stanley, Jagger, Mace, and SY Mattis. Black boxes were added to show the region missing in cultivars where the introgression replaced parts of chromosome 2A. The graph assembly started with the IWGSC v1 assembly leading to linearized regions following the same naming scheme as the IWGSC v1.0 assembly (chr1A_part1, chr1A_part2, chr2A_part1, …). CS, 'Chinese Spring'. Shown here is the beginning of the first part of chr2A. Black blocks are gene models. White regions correspond to regions that are present in the graph but contain no genes (Adopted from Bayer et al., 2022)

3.2 Structural variants (SVs) and their distribution
The differences between genomes are sometimes not minor edits but radical alterations. Pan-genome research shows that the number of structural variations among wheat varieties is striking. Chromosomal rearrangement is one type; another is the introgression of genomic fragments from wild relatives (Przewieslik-Allen et al., 2021). In other words, some varieties carry a portion of "wild" genes. The link between these structural variations and phenotypes is not loose: traits such as stress resistance and yield can often be traced to SVs. However, their distribution in the genome is not uniform.
Some regions show almost no variation, while in others variation is so frequent that they act as "hotspots". These highly variable regions are often linked to the adaptability and breeding history of wheat (Zanini et al., 2021). One cannot help but suspect that earlier research missed quite a few important positions.

3.3 Gene presence-absence variation (PAV) and copy number variation (CNV)
Not every wheat variety has the same genes. Some genes are completely absent in certain varieties but present in others. This is known as PAV (Bayer et al., 2022), and it is one of the important sources of genetic diversity in wheat. Among genes related to disease resistance and environmental stress response, the proportion showing PAV is even higher. In other words, these differences help wheat perform in a wide range of environments.

CNV, the variation in gene copy number, may look like a technical detail, but its impact is not small. The same gene may have only one copy in one variety and several copies in another. This difference in copy number can change the level of gene expression and thereby affect how traits are expressed. Of course, these differences only count if they can be seen.
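The difference between PAV and CNV can be made concrete with a small Python sketch. It assumes a hypothetical table of per-variety copy numbers (all names and values are invented) and uses a deliberately simple labeling rule; real pipelines infer copy number from assemblies or read depth and need thresholds that this sketch ignores.

```python
# Hypothetical copy-number table: gene -> {variety: number of copies}.
# Names and values are invented for illustration only.
copies = {
    "geneA": {"Var1": 1, "Var2": 1, "Var3": 1},
    "geneB": {"Var1": 0, "Var2": 2, "Var3": 1},
    "geneC": {"Var1": 1, "Var2": 3, "Var3": 1},
}


def describe_variation(copies):
    """Label each gene as 'PAV', 'CNV' or 'invariant' (simplified rule)."""
    labels = {}
    for gene, counts in copies.items():
        values = set(counts.values())
        if 0 in values and len(values) > 1:
            # Absent in at least one variety but present in another.
            labels[gene] = "PAV"
        elif len(values) > 1:
            # Present everywhere, but the number of copies differs.
            labels[gene] = "CNV"
        else:
            labels[gene] = "invariant"
    return labels


labels = describe_variation(copies)
for gene, label in labels.items():
    print(gene, label)
```

In this simplified rule a gene that is absent somewhere is labeled PAV even if its copy number also differs among the varieties that carry it, as with geneB here.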

Pan-genome databases such as Wheat Panache can now display and compare PAVs and CNVs visually. Researchers and breeders can query and compare them directly, which puts the study of genetic diversity and the targeted breeding of new varieties on firmer ground (Bayer and Edwards, 2023).

4 Implications for Wheat Breeding and Crop Improvement
4.1 Enhanced trait association and marker development
Some traits, such as drought resistance, salt tolerance, or grain taste, are easy to observe, but they often result from many genes acting together. The traditional reference genome has limits: some key variations cannot be found at all because they occur only in a few varieties (Tiwari et al., 2024). This is where the pan-genome comes into play. It can recover what was previously missed, such as a presence-absence variation (PAV) or a newly emerged allele. Many new genetic markers have been derived from these differences. With these markers, tools such as genome-wide association studies (GWAS) can locate trait loci far more directly (Montenegro et al., 2017). Current k-mer-based analyses can even directly identify key genes that affect grain protein content. With such targets in hand, quality improvement is no longer a matter of luck but has a clear direction.

4.2 Improving genomic selection and prediction accuracy
Knowing the destination of a breeding program earlier saves a great deal of time and resources, and the pan-genome helps us read the map in advance. Common genomic selection models have a drawback: they cannot cover all genes, especially those absent from the reference genome. The pan-genome can fill this gap and bring prediction results closer to reality (Zhang et al., 2024).
Machine learning is not a cure-all, but combined with the pan-genome it does handle complex traits better (Bayer et al., 2021). This combination helps avoid detours and screen promising materials in advance. The breeding cycle is shortened and selection accuracy improves. More importantly, the richer genetic information makes it possible to select materials that combine disease resistance, high quality and decent yield, with judgments backed by data rather than experience alone.

4.3 Facilitating adaptation to climate change and emerging threats
The climate is becoming increasingly unpredictable, pests and diseases are changing rapidly, and new problems in agriculture appear year after year. Wheat, as a staple food, is among the first crops to face the challenge. The pan-genome makes it possible to extract valuable genetic information from both modern cultivars and wild relatives. Many genes for drought tolerance, disease resistance and adaptation to extreme climates have long been hidden in overlooked varieties (Mangal et al., 2024). In the past, finding these genes might have taken several years. With the pan-genome as an "accelerator", the search is much faster and the application more precise. Under an unstable climate and growing threats, this ability is almost a necessity (Fernandez et al., 2021). Only by identifying and making good use of these "hidden" useful alleles can the stability of future wheat yield and quality be secured.

5 Evolutionary and Ecological Insights from the Wheat Pangenome
5.1 Understanding domestication and divergence
The more intensive the breeding, the narrower the genetic base: this is a common saying in the field, and the pan-genome data largely confirm it.
Elite varieties that have undergone long-term domestication and strict selection have indeed seen their genetic diversity narrowed significantly (Montenegro et al., 2017). However, this is not the whole truth. Considerable variation remains in the so-called dispensable gene regions. Not every variety carries these genes, but they are often closely related to adaptability or specific traits. Modern varieties alone cannot tell the whole story. When wild relatives and landraces are compared with them, more details emerge: some genes have been retained all the way, while others have quietly disappeared along the way. Behind almost every retention or loss, the imprint of human breeding selection can be found.
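The comparison of wild relatives, landraces and modern varieties described above can be sketched with set operations in Python. The gene identifiers and group memberships below are hypothetical, and absence from a small sample is of course not proof that a gene was lost through selection; the sketch only shows the bookkeeping.

```python
# Hypothetical gene sets observed in each group (identifiers are invented).
wild = {"g1", "g2", "g3", "g4"}
landrace = {"g1", "g2", "g3"}
modern = {"g1", "g2", "g5"}

# Genes carried by all three groups: retained throughout.
retained = wild & landrace & modern

# Genes seen in wild relatives or landraces but not in modern varieties.
lost_in_modern = (wild | landrace) - modern

# Genes seen only in modern varieties in this sample.
modern_only = modern - (wild | landrace)

print("retained:", sorted(retained))
print("lost in modern:", sorted(lost_in_modern))
print("modern only:", sorted(modern_only))
```

The genes in `lost_in_modern` are the kind of candidates that only become visible when wild relatives and landraces are included in the panel.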

5.2 Population structure and geographic differentiation
Differences in the appearance and performance of wheat across regions are not caused by the environment alone; genes clearly play a part. The pan-genome helps track these differences and check whether certain genes are present only in varieties from specific regions (Schreiber et al., 2024). PAV and SNP data record presence and absence, similarity and difference. Taken together, they can depict the relationships among wheat populations and how those populations gradually adapted to local conditions. However, this is not simply a matter of geographical grouping. Core genes and dispensable genes each play their own role in regional adaptation, and a picture lacking either type is incomplete. More importantly, the pan-genome not only helps review history but also monitors the present, making it possible to identify in advance the variations that may be especially important for future breeding.

5.3 Gene family expansion and functional innovation
If genetic diversity is a warehouse, then gene family expansion is like adding new stock to it. Pan-genome research has found that many gene families in wheat have diversified in function, especially those involved in stress responses. The changes are reflected not only in gene number but also in genes "changing jobs". Some genes that originally resided in organelles have entered the nucleus through polyploidization or gene transfer (Chen et al., 2023). These members, known as nuclear organelle genes (NOGs), have taken on new tasks, giving wheat more leeway under adverse conditions. These changes may not be obvious on the surface, but for breeding they mean more functional reserves to draw on. The environment changes and genes adjust accordingly.
The pan-genome can bring out precisely such details, making improvement more practicable.

6 Case Study
6.1 Significance of the project for pangenome construction
It was clear from the beginning that wheat is too complex for a single reference genome to suffice. Breaking through this limitation, however, required specific projects. The "10+ Wheat Genome Project" was launched in this context. It sequenced more than ten varieties, both widely grown and highly representative. Previous data were often "out of focus", especially when it came to global wheat variation. Through sequencing and assembly, this project has revealed the actual genome of each variety, covering everything from structural rearrangements to differences in gene number. Not all problems have been solved, but the foundation of the pan-genome has been laid more firmly, allowing researchers to understand the overall genetic structure of wheat more systematically.

6.2 Key discoveries and interpretations
As soon as the data came out, several details caught the attention of the research team. For instance, some structural variations are not ordinary genetic alterations: some are chromosomal rearrangements, and others result directly from introgression from wild relatives (Figure 2) (Walkowiak et al., 2020). This shows that the evolutionary path of wheat is far more tortuous and complex than previously imagined.

Some discoveries were quite unexpected at first hearing. The Chinese Spring reference genome lacks quite a few genes, while some varieties carry genes found nowhere else (White et al., 2024). These results not only helped clarify the boundary between core genes and variable genes but also filled gaps in the previous map. The team also specifically analyzed genes related to disease and pest resistance.
Genes such as Sm1 have been characterized in detail, and the results show that its presence brings real functional differences. At the expression level, the activity of the subgenomes is not uniform, and expression patterns also vary significantly among tissues and varieties. The transcriptome results are a reminder that wheat is a naturally "restless" crop.

6.3 Broader implications for research and breeding
These achievements are not simply a few more records in a database. They leave more room for maneuver in the search for trait associations, the development of new markers, and the exploration of superior alleles. Breeders can choose from a wider range of options and improve varieties in a more targeted manner, enabling higher yields and better resistance in different environments (Long et al., 2024). This is also a reminder not to focus on only a few major varieties. The true value of the pan-genome lies in the global diversity of wheat, and the outcome of this project tells us to broaden our horizons. Ultimately, this approach is not only feasible but can also accelerate breeding. It is an indispensable step toward discovering more functional genes and promoting molecular breeding.

Figure 2 Introgressions and large-scale structural variation in wheat (Adopted from Walkowiak et al., 2020)
Image caption: a-c, T. ponticum introgression on chromosome 3D in LongReach Lancer (a), T. timopheevi introgression on chromosome 2B in LongReach Lancer (b) and Ae. ventricosa introgression on chromosome 2A in Jagger (c). Track i, map of polymorphic RLC-Angela retrotransposon insertions (legend at bottom); track ii, density of projected gene annotations from Chinese Spring (blue bars, scaled to maximum value); track iii, per cent identity to Chinese Spring based on chromosome alignment (yellow; scale is 0%-100%); track iv, read depth of wheat wild relatives (blue-yellow heat map; legend at bottom). d, Dot plot alignment showing chromosome-level collinearity (black) with relative density of CENH3 ChIP-seq mapped to 100-kb bins for Chinese Spring (blue) and Julius (red); the arrow indicates a centromere shift. e, Robertsonian translocation between chromosomes 5B and 7B in ArinaLrFor. f, g, Cytology (f) and Hi-C (g) confirm the 5B/7B translocation in SY Mattis (left) compared with the non-carrier Norin 61 (right). In f, five independent cells were observed; the translocation was confirmed independently ten times. Scale bar, 10 μm (Adopted from Walkowiak et al., 2020)

7 Future Perspectives and Research Directions
7.1 Toward a more inclusive and scalable wheat pangenome
A considerable number of wheat genomes have been sequenced so far, but compared with the overall picture they are still just the tip of the iceberg. It is unrealistic to rely on a few major cultivars to represent all genetic diversity. The future pan-genome must be expanded: cultivars should be included, landraces should be added, and wild relatives should not be left out. Anything that supplements the information should be included. This is not easy, however. As data accumulate, they become a huge and structurally complex problem. Managing a single genome is straightforward, but dozens together can quickly get out of control. Supporting downstream analysis requires an extensible data architecture and a graph-based representation (Petereit et al., 2022). Another often overlooked issue is that some teams focus only on current data and fail to incorporate new assemblies in a timely manner, so resources become outdated over time. For the pan-genome to remain usable and worthwhile, the update mechanism must keep pace.

7.2 Integrating multi-omics for functional insights
Having a gene sequence does not mean understanding what the gene does. Whether it is expressed, when, and in which tissue are questions answered by other "omics" data. Transcriptome, epigenome and proteome data are best viewed together. Some traits may look similar while their regulatory mechanisms are completely different. Multi-omics integration can bring out such differences and identify the key regulatory loci that control traits (Badet et al., 2019).
For complex traits or gene-environment interactions in particular, it is much more reliable than any single omics layer. Its value lies not only in explaining existing traits but also in accelerating the discovery and verification of new ones. However complete the pan-genome is, its help to breeding is diminished if it is disconnected from functional information.

7.3 Ethical, technical, and policy considerations
No matter how much data there is, the questions of who can use it and how cannot be ignored. With the expansion of wheat pan-genome resources, ethical and policy discussions have been put on the agenda (Hossain et al., 2021). Not all countries have equal access to resources, and not every place that provides germplasm truly benefits. Technical issues also deserve attention. Inconsistent formats, incompatible tools, and complicated platforms can accumulate until data become a burden that is "visible but unusable". The biggest fear for researchers and breeders is having too much data and not knowing where to start. A database alone is therefore not enough; clear regulations and collaborative mechanisms are also necessary. Whether for global sharing or local application, the rules must be clear. Otherwise, no matter how much data there is, it will be difficult to serve the goals of food security and sustainable agricultural development.

8 Concluding Remarks
That wheat varieties differ significantly has long been recognized. For many years, however, research relied on a single reference genome, and many details were "covered up" as a result. Only once the pan-genome was established was the veil lifted, revealing the complete genetic picture. These differences are not merely the variations of a few genes.
Certain genes are simply absent in some varieties, while others have undergone structural changes. Among such variations, those related to disease resistance, stress tolerance or quality are not uncommon. The value of the pan-genome lies in its ability to bring out such deeply hidden differences, rather than focusing on a single "standard answer" as the traditional approach does.

Of course, elegant science is one thing and practical breeding is another. Genes from landraces or wild materials are often the key to breaking through bottlenecks. Using them in breeding can make wheat more resilient to drought, high temperatures, pests and diseases. Especially when the climate is becoming increasingly unstable and the pressure on agriculture keeps growing, this kind of genetic resource becomes particularly valuable. More practically, the pan-genome has significantly shortened the time from gene discovery to field application. Improving a trait used to take several years, but the breeding cycle is now being continuously compressed. The goal is straightforward: high yield, stable quality, and environmental friendliness.

In the final analysis, however, this is not something one research group can accomplish behind closed doors. The data need to be continuously supplemented, omics information should follow common standards, and analytical tools should not be too "aloof". Researchers, breeding institutions and policymakers need to reach a consensus in both concept and action. The data structure must keep up with demand, and access should not become "whoever grabs it uses it". Only then will the pan-genome move beyond paper and truly drive wheat breeding and sustainable agriculture forward.

Acknowledgments
We are very grateful to Ms. Huang for reviewing the first draft of the paper and providing suggestions for improving its logical coherence.

Conflict of Interest Disclosure
The authors affirm that this research was conducted without any commercial or financial relationships that could be construed as a potential conflict of interest.

References
Badet T., Oggenfuss U., Abraham L., McDonald B., and Croll D., 2019, A 19-isolate reference-quality global pangenome for the fungal wheat pathogen Zymoseptoria tritici, BMC Biology, 18: 12.
https://doi.org/10.1186/s12915-020-0744-3 Barabaschi D., Volante A., Faccioli P., Povesi A., Tagliaferri I., Mazzucotelli E., and Cattivelli L., 2025, Ancient diversity of Triticum aestivum subspecies as source of novel loci for bread wheat improvement, Frontiers in Plant Science, 16: 1536991. https://doi.org/10.3389/fpls.2025.1536991 Bayer P., and Edwards D., 2023, Investigating pangenome graphs using wheat panache, Methods in Molecular Biology, 2703: 23-29. https://doi.org/10.1007/978-1-0716-3389-2_2 Bayer P., Petereit J., Danilevicz M., Anderson R., Batley J., and Edwards D., 2021, The application of pangenomics and machine learning in genomic selection in plants, The Plant Genome, 14(3): e20112. https://doi.org/10.1002/tpg2.20112 Bayer P., Petereit J., Durant É., Monat C., Rouard M., Hu H., Chapman B., Li C., Cheng S., Batley J., and Edwards D., 2022, Wheat panache: a pangenome graph database representing presence-absence variation across sixteen bread wheat genomes, The Plant Genome, 15(3): e20221. https://doi.org/10.1002/tpg2.20221 Cavalet-Giorsa E., González-Muñoz A., Athiyannan N., Holden S., Salhi A., Gardener C., Quiroz-Chávez J., Rustamova S., Elkot A., Patpour M., Rasheed A., Mao L., Lagudah E., Periyannan S., Sharon A., Himmelbach A., Reif J., Knauft M., Mascher M., Stein N., Chayut N., Ghosh S., Perović D., Putra A., Perera A., Hu C., Yu G., Ahmed H., Laquai K., Rivera L., Chen R., Wang Y., Gao X., Liu S., Raupp W., Olson E., Lee J., Chhuneja P., Kaur S., Zhang P., Park R., Ding Y., Liu D., Li W., Nasyrova F., Dvořák J., Abbasi M., Li M., Kumar N., Meyer W., Boshoff W., Steffenson B., Matny O., Sharma P., Tiwari V., Grewal S., Pozniak C., Chawla H., Ens J., Dunning L., Kolmer J., Lazo G., Xu S., Gu Y., Xu X., Uauy C., Abrouk M., Bougouffa S., Brar G., Wulff B., and Krattinger S., 2023, Origin and evolution of the bread wheat D genome, Nature, 633: 848-855. 
https://doi.org/10.1038/s41586-024-07808-z Chen Y., Guo Y., Xie X., Wang Z., Miao L., Yang Z., Jiao Y., Xie C., Liu J., Hu Z., Xin M., Yao Y., Ni Z., Sun Q., Peng H., and Guo W., 2023, Pangenome-based trajectories of intracellular gene transfers in Poaceae unveil high cumulation in Triticeae, Plant Physiology, 193(1): 578-594. https://doi.org/10.1093/plphys/kiad319 Fernandez C., Marsh J., Danilevicz M., Mercé C., and Edwards D., 2021, Application of pangenomics for wheat molecular breeding, CABI Digital Library, 13: 236-246. https://doi.org/10.1079/9781789245431.0013 Hossain A., Skalický M., Brestič M., Maitra S., Alam M., Syed M., Hossain J., Sarkar S., Saha S., Bhadra P., Shankar T., Bhatt R., Chaki A., Sabagh A., and Islam T., 2021, Consequences and mitigation strategies of abiotic stresses in wheat (Triticum aestivum L.) under the changing climate, Agronomy, 11(2): 241. https://doi.org/10.3390/AGRONOMY11020241 Hu H., Li R., Zhao J., Batley J., and Edwards D., 2024, Technological development and advances for constructing and analyzing plant pangenomes, Genome Biology and Evolution, 16(4): evae081. https://doi.org/10.1093/gbe/evae081

Huang F., Du X.Y., Zou S.K., Wang L., and Han Y.L., 2024, Advancements in wheat hybridization: overcoming biological barriers, Bioscience Evidence, 14(5): 195-205. https://doi.org/10.5376/be.2024.14.0021 Long C.Y., Hua W., Zhu J.H., and Fan M., 2024, Developing disease-resistant wheat varieties through genomic approaches, Molecular Plant Breeding, 15(6): 403-416. http://dx.doi.org/10.5376/mpb.2024.15.0038 Mangal V., Verma L., Singh S., Saxena K., Roy A., Karn A., Rohit R., Kashyap S., Bhatt A., and Sood S., 2024, Triumphs of genomic-assisted breeding in crop improvement, Heliyon, 10(15): e35513. https://doi.org/10.1016/j.heliyon.2024.e35513 Montenegro J., Golicz A., Bayer P., Hurgobin B., Lee H., Chan C., Visendi P., Lai K., Doležel J., Batley J., and Edwards D., 2017, The pangenome of hexaploid bread wheat, The Plant Journal, 90: 1007-1013. https://doi.org/10.1111/tpj.13515 Petereit J., Bayer P., Thomas W., Fernandez C., Amas J., Zhang Y., Batley J., and Edwards D., 2022, Pangenomics and crop genome adaptation in a changing climate, Plants, 11(15): 1949. https://doi.org/10.3390/plants11151949 Przewieslik-Allen A., Wilkinson P., Burridge A., Winfield M., Dai X., Beaumont M., King J., Yang C., Griffiths S., Wingen L., Horsnell R., Bentley A., Shewry P., Barker G., and Edwards K., 2021, The role of gene flow and chromosomal instability in shaping the bread wheat genome, Nature Plants, 7: 172-183. https://doi.org/10.1038/s41477-020-00845-2 Schreiber M., Jayakodi M., Stein N., and Mascher M., 2024, Plant pangenomes for crop improvement, biodiversity and evolution, Nature Reviews Genetics, 25: 563-577. https://doi.org/10.1038/s41576-024-00691-4 Tiwari V., Saripalli G., Sharma P., and Poland J., 2024, Wheat genomics: genomes, pangenomes, and beyond, Trends in Genetics, 40(11): 982-992.
https://doi.org/10.1016/j.tig.2024.07.004 Walkowiak S., Gao L., Monat C., Haberer G., Kassa M., Brinton J., Ramirez-Gonzalez R., Kolodziej M., Delorean E., Thambugala D., Klymiuk V., Byrns B., Gundlach H., Bandi V., Siri J., Nilsen K., Aquino C., Himmelbach A., Copetti D., Ban T., Venturini L., Bevan M., Clavijo B., Koo D., Ens J., Wiebe K., N’Diaye A., Fritz A., Gutwin C., Fiebig A., Fosker C., Fu B., Accinelli G., Gardner K., Fradgley N., Gutierrez-Gonzalez J., Halstead-Nussloch G., Hatakeyama M., Koh C., Deek J., Costamagna A., Fobert P., Heavens D., Kanamori H., Kawaura K., Kobayashi F., Krasileva K., Kuo T., McKenzie N., Murata K., Nabeka Y., Paape T., Padmarasu S., Percival-Alwyn L., Kagale S., Scholz U., Sese J., Juliana P., Singh R., Shimizu‐Inatsugi R., Swarbreck D., Cockram J., Budak H., Tameshige T., Tanaka T., Tsuji H., Wright J., Wu J., Steuernagel B., Small I., Cloutier S., Keeble-Gagnère G., Muehlbauer G., Tibbets J., Nasuda S., Melonek J., Hucl P., Sharpe A., Clark M., Legg E., Bharti A., Langridge P., Hall A., Uauy C., Mascher M., Krattinger S., Handa H., Shimizu K., Distelfeld A., Chalmers K., Keller B., Mayer K., Poland J., Stein N., McCartney C., Spannagl M., Wicker T., and Pozniak C., 2020, Multiple wheat genomes reveal global variation in modern breeding, Nature, 588: 277-283. 
https://doi.org/10.1038/s41586-020-2961-x White B., Lux T., Rusholme-Pilcher R., Juhász A., Kaithakottil G., Duncan S., Simmonds J., Rees H., Wright J., Colmer J., Ward S., Joynson R., Coombes B., Irish N., Henderson S., Barker T., Chapman H., Catchpole L., Gharbi K., Bose U., Okada M., Handa H., Nasuda S., Shimizu K., Gundlach H., Lang D., Naamati G., Legg E., Bharti A., Colgrave M., Haerty W., Uauy C., Swarbreck D., Borrill P., Poland J., Krattinger S., Stein N., Mayer K., Pozniak C., 10+ Wheat Genome Project, Spannagl M., and Hall A., 2024, De novo annotation of the wheat pan-genome reveals complexity and diversity within the hexaploid wheat pan-transcriptome, bioRxiv, 574802: 1-29. https://doi.org/10.1101/2024.01.09.574802 Zanini S., Bayer P., Wells R., Snowdon R., Batley J., Varshney R., Nguyen H., Edwards D., and Golicz A., 2021, Pangenomics in crop improvement-from coding structural variations to finding regulatory variants with pangenome graphs, The Plant Genome, 15(1): e20177. https://doi.org/10.1002/tpg2.20177 Zhang Z., Liu D., Li B., Wang W., Zhang J., Xin M., Hu Z., Liu J., Du J., Peng H., Hao C., Zhang X., Ni Z., Sun Q., Guo W., and Yao Y., 2024, A k-mer-based pangenome approach for cataloging seed-storage-protein genes in wheat to facilitate genotype-to-phenotype prediction and improvement of end-use quality, Molecular Plant, 17(7): 1038-1053. https://doi.org/10.1016/j.molp.2024.05.006

Triticeae Genomics and Genetics, 2025, Vol.16, No.2, 63-71 http://cropscipublisher.com/index.php/tgg
Case Study Open Access
Genome-Wide Association Mapping of Salt Tolerance in Barley Germplasm
Jiamin Wang, Xian Zhang, Xuemei Liu
Hainan Provincial Key Laboratory of Crop Molecular Breeding, Sanya, 572025, Hainan, China
Corresponding email: xuemei.liu@hitar.org
Triticeae Genomics and Genetics, 2025, Vol.16, No.2 doi: 10.5376/tgg.2025.16.0007
Received: 16 Jan., 2025 Accepted: 28 Feb., 2025 Published: 20 Mar., 2025
Copyright © 2025 Wang et al., This is an open access article published under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Preferred citation for this article: Wang J.M., Zhang X., and Liu X.M., 2025, Genome-wide association mapping of salt tolerance in barley germplasm, Triticeae Genomics and Genetics, 16(2): 63-71 (doi: 10.5376/tgg.2025.16.0007)
Abstract Salt stress is a major abiotic factor limiting the productivity of barley (Hordeum vulgare L.) in saline-alkali regions worldwide. This study used a genome-wide association study (GWAS) to reveal the genetic basis of salt tolerance across diverse barley germplasm resources. We evaluated a core germplasm set under controlled salinity conditions and conducted high-resolution GWAS using single nucleotide polymorphism (SNP) markers. Our analysis identified several loci and candidate genes significantly associated with salt stress traits, including genes involved in ion homeostasis, osmotic regulation, and stress signaling pathways. A case study focusing on North African germplasm resources highlighted key salt tolerance genes, such as HvHKT1;5 and HvNHX1, underscoring their significance for breeding programs.
Despite the challenges posed by population structure and environmental variation, our results demonstrate the practicality of GWAS for dissecting complex traits and guiding marker-assisted selection (MAS). This study lays the foundation for breeding salt-tolerant barley varieties and emphasizes the value of integrating genomic tools into climate-adaptive breeding strategies.
Keywords Barley germplasm; Salt tolerance; GWAS; Candidate genes; Marker-assisted selection

1 Introduction
In some areas with high soil salinity, barley is often regarded as a "safe grain" for food security. This is not because its yield is necessarily the highest, but because it tolerates salt better than many other crops. Salinization is spreading worldwide, and the area of affected cultivated land is still expanding. Tolerance does not mean immunity, however: salt stress can disrupt the ionic balance of plants, reduce biomass, and impair germination, chlorophyll levels, and antioxidant defense processes. As a result, plant growth is often weakened and yield declines accordingly, sometimes with serious losses (Sonia et al., 2023; Thabet and Alqudah, 2023).

Enhancing salt tolerance through traditional breeding is not easy. The genetic architecture is complex, and the trait involves multiple physiological and molecular pathways. Genome-wide association studies (GWAS), by contrast, have proven useful for understanding the genetic basis of salt tolerance. They can identify quantitative trait loci (QTLs), single nucleotide polymorphisms (SNPs), and candidate genes related to key traits, including ion transport, antioxidant capacity, and stress-responsive gene expression (Việt et al., 2013; Mwando et al., 2020). Previous studies have mapped important QTLs on chromosomes 2H, 4H, 6H, and 7H, and have identified genes such as HKT1;5 that regulate sodium ion transport, as well as other genes involved in ion homeostasis and antioxidant defense (Huang et al., 2008; Mwando et al., 2021).

This study uses GWAS to screen for salt tolerance loci and candidate genes in diverse barley germplasm, and combines genomic, physiological and transcriptomic information to provide genetic markers and resources for breeding. This should improve the efficiency of marker-assisted selection and accelerate the breeding of salt-tolerant varieties. The results also support the sustainable development of agriculture on saline-alkali land and can provide a reference framework for similar research on other crops.

2 Overview of Salt Stress in Barley
2.1 Physiological and biochemical effects of salinity on barley growth
Under salt stress, the response of barley plants is readily visible: the biomass of both shoots and roots decreases, leaf water content drops, and the level of photosynthetic pigments declines (Figure 1) (Eldakkak and El-Shourbagy, 2023). Meanwhile, sodium ions (Na+) often accumulate excessively while the uptake of potassium ions (K+) is inhibited. The Na+/K+ ratio is disrupted, making it more difficult to maintain cellular homeostasis and metabolism (Sadiq et al., 2024). Not all varieties respond alike, however: some salt-tolerant barley can still maintain a relatively high K+/Na+ ratio and a relatively good water status.

From a biochemical perspective, salt stress promotes the accumulation of osmoregulatory substances such as proline and soluble sugars, and enhances the activity of antioxidant enzymes such as catalase, ascorbate peroxidase, and peroxidase. These responses can alleviate oxidative damage to some extent. In salt-sensitive types, the contents of malondialdehyde (MDA) and hydrogen peroxide (H2O2) tend to increase, indicating aggravated lipid peroxidation and oxidative stress (Yildiz and Acar, 2022).

Figure 1 Variation among barley genotypes under control conditions and with 100 mM and 200 mM NaCl treatments (Adopted from Sadiq et al., 2024)

2.2 Genetic complexity and heritability of salt tolerance traits
Salt tolerance is not a trait that can be explained by a single gene. It involves multiple physiological and molecular mechanisms (Ouertani et al., 2021). Barley genotypes differ markedly in component traits such as ion homeostasis, antioxidant capacity, and the accumulation of osmoprotectants, which usually show high heritability and are genotype-dependent. Recent studies have identified genomic loci and candidate genes strongly associated with salt tolerance, especially genes directly involved in ion transport and antioxidant defense. The underlying picture is not simple, however: genotype-by-environment interactions and the combination of additive and non-additive effects make the genetic basis of salt tolerance difficult to clarify fully (Abdelrady et al., 2024).

2.3 Conventional breeding limitations in improving salt stress resilience
Traditional breeding does not easily achieve breakthroughs in salt tolerance. Polygenic control, strong environmental influence, and complex phenotypes all make screening slow and cumbersome (Boussora et al., 2024). Coupled with a shortage of molecular markers and long breeding cycles, the release of salt-tolerant varieties is naturally slow (Alqudah et al., 2024). These realities explain why more advanced genomic approaches, such as genome-wide association studies (GWAS), are particularly necessary for dissecting the genetic architecture of salt tolerance and accelerating the breeding process.
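As a rough illustration of what a genome-wide association scan computes, the sketch below regresses a simulated phenotype on each marker in turn and keeps the markers that pass a Bonferroni-corrected threshold. Everything in it is an assumption made for illustration: the data are simulated, the function names (`marker_p_value`, `scan`, `simulate`) are hypothetical, the t-test is replaced by a normal approximation, and no correction for population structure or kinship is applied, which a real barley GWAS would typically require.

```python
import math
import random
from statistics import NormalDist, mean


def marker_p_value(genotypes, phenotypes):
    """Approximate two-sided p-value for phenotype ~ genotype at one SNP.

    Uses t = r * sqrt((n - 2) / (1 - r^2)) and a normal approximation
    to the t distribution, which is reasonable only for large n.
    """
    n = len(genotypes)
    g_bar, y_bar = mean(genotypes), mean(phenotypes)
    sxx = sum((g - g_bar) ** 2 for g in genotypes)
    syy = sum((y - y_bar) ** 2 for y in phenotypes)
    if sxx == 0 or syy == 0:
        return 1.0  # monomorphic marker or constant trait: nothing to test
    sxy = sum((g - g_bar) * (y - y_bar) for g, y in zip(genotypes, phenotypes))
    r = sxy / math.sqrt(sxx * syy)
    r = max(min(r, 0.999999), -0.999999)  # avoid dividing by zero below
    t = r * math.sqrt((n - 2) / (1 - r * r))
    return 2 * (1 - NormalDist().cdf(abs(t)))


def scan(genotype_matrix, phenotypes, alpha=0.05):
    """Return (marker_index, p_value) pairs below the Bonferroni threshold."""
    threshold = alpha / len(genotype_matrix)
    hits = []
    for idx, marker in enumerate(genotype_matrix):
        p = marker_p_value(marker, phenotypes)
        if p < threshold:
            hits.append((idx, p))
    return hits


def simulate(n_lines=200, n_markers=50, causal=7, effect=1.5, seed=1):
    """Simulated inbred lines: markers coded 0 or 2, one causal marker."""
    rng = random.Random(seed)
    geno = [[rng.choice((0, 2)) for _ in range(n_lines)]
            for _ in range(n_markers)]
    # Hypothetical trait (for example a K+/Na+ ratio) with unit-variance noise
    pheno = [effect * geno[causal][i] / 2 + rng.gauss(0, 1)
             for i in range(n_lines)]
    return geno, pheno


geno, pheno = simulate()
hits = scan(geno, pheno)
print(hits)
```

The Bonferroni threshold is deliberately conservative; published analyses often use mixed models and false discovery rate control instead, so this sketch should be read as the simplest possible instance of a marker-trait test rather than the method used in the studies cited here.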

3 GWAS as a Tool for Dissecting Salt Tolerance
3.1 Advantages of GWAS over traditional QTL mapping
Traditional QTL mapping is usually based on populations derived from specific parents; the intervals it detects are often large, and its resolution is limited (Li, 2020). GWAS takes a different approach. It directly exploits the natural genetic diversity of large germplasm collections, which gives finer resolution when detecting associations between markers and traits (Huang and Lin, 2024). In this way, single nucleotide polymorphisms (SNPs) significantly associated with salt tolerance traits can be identified across the entire genome, and candidate genes or alleles that traditional methods easily overlook may also be discovered.

3.2 Marker-trait association and identification of significant SNPs
In recent barley studies, GWAS has identified many SNPs related to key salt tolerance traits, such as Na+ and K+ contents, the Na+/K+ ratio, and root and stem biomass under salt stress (Xu et al., 2023). These loci are distributed on chromosomes 2H, 4H, 5H, 6H and 7H, and some "hotspot" regions are enriched in candidate genes related to ion transport, protein kinases and stress signal transduction. HKT1;5 is a typical example. It plays an important role in the transport and distribution of sodium and directly affects the salt tolerance mechanism of barley (Hazzouri et al., 2018). These marker-trait associations provide a valuable genetic basis for subsequent marker-assisted selection in breeding.

3.3 Integration of GWAS with other omics approaches
Although GWAS alone can identify associated loci, candidate genes are screened and verified more effectively when it is combined with transcriptomics and metabolomics (Gharaghanipor et al., 2022).
For instance, integrative analyses have found that, between salt-tolerant and salt-sensitive barley genotypes, differentially expressed genes (DEGs) are concentrated in pathways such as ion homeostasis, antioxidant defense, and metabolic regulation (Tu et al., 2021). Genes such as PGK2, BASS3, SINAT2, AQP and SYT3 were identified in this way and further verified by qRT-PCR. Such multi-omics integration not only gives a more comprehensive understanding of the molecular mechanism of salt tolerance, but also accelerates the breeding of stress-resistant varieties.

4 Germplasm Resources for GWAS in Barley
4.1 Diversity of global barley collections and gene banks
National and international gene banks around the world preserve a large number of barley accessions, including landraces, wild relatives, and elite cultivars. Together they constitute extremely rich genetic diversity, which serves as an important basis for identifying new alleles and loci related to salt tolerance in GWAS. In previous studies, some teams used hundreds or even thousands of accessions from around the world, covering different geographical and genetic backgrounds, to capture as many marker-trait association signals as possible.

4.2 Importance of landraces, wild relatives, and elite lines in salt tolerance research
For salt tolerance research, landraces and wild barley (Hordeum spontaneum) have unique value. They are often adapted to regions with harsher environmental conditions and carry alleles that are absent from modern cultivars (Wu et al., 2013). Wild barley in particular often outperforms cultivated varieties in osmotic adjustment capacity and compatible solute content. Elite lines and modern cultivars, although relatively limited in diversity, have a good agronomic background and can serve as reliable controls or references when evaluating new sources of salt tolerance.

4.3 Strategies for core set development and phenotypic screening under salinity
Faced with a vast amount of germplasm, researchers often first establish a core set or mini-core set that covers as much genetic and phenotypic variation as possible with fewer samples. These sets are then subjected to salt stress in the field or in controlled environments to measure ion content, biomass, yield and key physiological indicators (Xu et al., 2025). Combining these measurements with advanced phenotyping techniques such as ion flux measurement, and with transcriptome analysis, allows salt-tolerant genotypes and their underlying mechanisms to be identified more effectively. This approach makes GWAS more efficient and provides a shortcut for discovering salt tolerance candidate genes and breeding related varieties.
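One way to picture core set construction is a greedy max-min selection over marker profiles: start from the accession that is most distant from all others, then repeatedly add the accession whose nearest already-selected member is farthest away. The sketch below is only an illustration under stated assumptions. The accession names and marker calls are invented, the helper names (`hamming`, `select_core`) are hypothetical, Hamming distance stands in for whatever genetic distance a study would actually use, and the cited studies may have built their core sets differently.

```python
def hamming(a, b):
    """Number of marker positions at which two accessions differ."""
    return sum(x != y for x, y in zip(a, b))


def select_core(profiles, size):
    """Greedy max-min selection of `size` accessions.

    profiles: dict mapping accession name -> list of marker calls.
    Ties are broken by sorted accession name, so the result is deterministic.
    """
    names = sorted(profiles)
    if size <= 0:
        return []
    if size >= len(names):
        return names
    # Seed with the accession that has the largest total distance to the rest
    first = max(
        names,
        key=lambda n: sum(hamming(profiles[n], profiles[m]) for m in names),
    )
    core = [first]
    # Distance from every accession to its nearest core member
    nearest = {n: hamming(profiles[n], profiles[first]) for n in names}
    while len(core) < size:
        candidate = max(
            (n for n in names if n not in core), key=lambda n: nearest[n]
        )
        core.append(candidate)
        for n in names:
            d = hamming(profiles[n], profiles[candidate])
            nearest[n] = min(nearest[n], d)
    return core


# Invented example: three groups (A, B, C) of near-identical accessions,
# markers coded 0 or 2 as for homozygous inbred lines.
profiles = {
    "acc_A1": [0, 0, 0, 0, 0, 0, 0, 0, 0],
    "acc_A2": [0, 0, 0, 0, 0, 0, 0, 0, 2],
    "acc_B1": [2, 2, 2, 2, 0, 0, 0, 0, 0],
    "acc_B2": [2, 2, 2, 2, 0, 0, 0, 0, 2],
    "acc_C1": [0, 0, 0, 0, 2, 2, 2, 2, 0],
    "acc_C2": [0, 0, 0, 0, 2, 2, 2, 2, 2],
}

core = select_core(profiles, 3)
print(core)
```

With three well-separated groups, a core of size three picks one representative per group, which is the behavior a core set is meant to have: near-duplicates are dropped while distinct genetic backgrounds are retained for phenotyping.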
