Journal Search Engine
Search Advanced Search Adode Reader(link)
Download PDF Export Citaion korean bibliography PMC previewer
ISSN : 2287-5824(Print)
ISSN : 2287-5832(Online)
Journal of The Korean Society of Grassland and Forage Science Vol.45 No.3 pp.206-216
DOI : https://doi.org/10.5333/KGFS.2025.45.3.206

Next-Generation Sequencing–Based Development of Single Nucleotide Polymorphism Barcodes for Identification of the Cultivar ‘Kowinearly’ in Italian Ryegrass (Lolium multiflorum Lam.)

Chang-Woo Min1, Jae Hoon Woo1, Bo Ram Choi1, Yowook Song1, Jun Gyeong Choi2, Yun Ju Kang2, Ki-Won Lee1*
1Forages Production Systems Division, National Institute of Animal Science, RDA, Cheonan 31000, Republic of Korea
2BIOTO, 187, Techno 2-ro, Yuseong-gu, Daejeon 34015, Republic of Korea
* Corresponding author: Ki-Won Lee, Forages Production Systems Division, National Institute of Animal Science, RDA, Cheonan 31000, Republic of Korea. Tel: +82-41-580-6757, Fax: +82-41-580-6779, E-mail: kiwon@korea.kr
September 18, 2025 September 24, 2025 September 25, 2025

Abstract


Molecular markers have been widely applied in population genetics, diagnostic taxonomy, and genetic mapping, and they can also be used for classifying varieties of Italian ryegrass during field selection. In this study, genome-wide sequence information was generated for 10 Italian ryegrass cultivars (40 samples), including ‘Kowinearly’ (KW), using next-generation sequencing (NGS). Single nucleotide polymorphism (SNP) analysis revealed that only three SNP loci were sufficient to distinguish KW from the other cultivars. Furthermore, 21 alternative barcode sets, each consisting of three SNPs, were identified. These SNP barcode sets provide a reliable criterion for cultivar discrimination in Italian ryegrass and can contribute to the protection of domestic varieties and the advancement of the forage industry in Korea. More broadly, the development of distinguishing markers across Italian ryegrass cultivars will enhance genetic resource identification and support the breeding of high-quality new varieties.



초록


    Ⅰ. INTRODUCTION

    Italian ryegrass (Lolium multiflorum Lam.) is one of the most important winter forages worldwide due to its high productivity, quality, and palatability for livestock (Humphreys et al., 2010). In Korea, its rapid establishment, short growing period, vigorous regrowth, and tolerance to waterlogging have made it particularly suitable for paddy fields (Cojocariu et al., 2010), establishing it as a predominant winter forage in Korea (Oh et al., 2021). However, its limited cold tolerance restricts cultivation mainly to the southern regions (Kim and Sung, 2021). This limitation has driven breeding efforts for cold tolerance and early heading, resulting in the widespread use of the cultivar ‘Kowinearly’ (Choi et al., 2006;Ji et al., 2018).

    Recently, with tariff elimination and the increasing practice of domestic seed production abroad and reimporting them, the risk of admixture has grown, underscoring the need for accurate cultivar identification. Moreover, in forage crops, key morphological descriptors used for cultivar determination are strongly influenced by environmental conditions and observer effects, and the differences among cultivars are often small. As a result, morphology-based DUS testing often lacks sufficient discriminating power, making molecular marker–based evaluation essential (Pasquali et al., 2022). Indeed, approximately 700 Italian ryegrass cultivars were listed in the OECD catalogue as of 2025 (OECD, 2025). Furthermore, extensive hybridization and limited germplasm have reduced genetic diversity, further limiting the reliability of morphology alone for discrimination.

    Advances in next-generation sequencing (NGS) now enable rapid access to genomic information, and the development of molecular markers for cultivar discrimination has been reported in many crops (Chakravarthi and Naravaneni, 2006;Park et al., 2017). Molecular markers, which detect DNA polymorphisms, allow the assessment of genetic diversity and cultivar identity, providing a faster and more accurate alternative to morphology (Collard and Mackill, 2008).

    In Italian ryegrass, most studies have focused on the development of simple sequence repeat (SSR) markers. For example, Nie et al. (2019) applied 29 SSR markers to six tetraploid cultivars, whereas Kubik et al. (2001) demonstrated that at least 15 SSR markers could identify perennial ryegrass (Lolium perenne L.) cultivars with over 99% accuracy. Although SSR markers have been effectively used for cultivar discrimination in the genus Lolium, research on single nucleotide polymorphisms (SNPs) remains limited.

    SNPs, which are based on single base substitutions, offer higher marker density and more precise detection than SSRs. Combined with automated high-throughput platforms, they enable efficient large-scale analyses (Mammadov et al., 2012;Zhang et al., 2023). These advantages make SNPs particularly valuable for cultivar identification in Italian ryegrass, where genetic diversity is relatively narrow.

    Therefore, this study aimed to investigate SNP polymorphisms in major Korean cultivars using NGS and develop a molecular marker set capable of rapidly and accurately identifying the representative cultivar ‘Kowinearly’. The results are expected to improve the efficiency of cultivar identification and provide a solid foundation for breeding research and cultivar management, thereby strengthening the competitiveness of the Korean forage industry.

    Ⅱ. MATERIALS AND METHODS

    1. Plant materials

    This study analyzed 10 Italian ryegrass (Lolium multiflorum Lam.) cultivars, comprising 40 samples. These included nine domestic cultivars released in Korea—‘Kowinearly’ (KW), ‘Greenfarm’ (GF), ‘Greencall’ (GC), ‘Kospeed’ (KS), ‘Kogreen’ (KG), ‘Kowinmaster’ (KM), ‘IR605’ (605), ‘Kowinner’ (KWR), and ‘Hwasan104’ (HS104)—as well as one foreign reference cultivar, ‘Florida80’ (FLO). All plant materials were obtained from the National Institute of Animal Science (NIAS), Department of Animal Genetic Resources, Rural Development Administration (RDA), Republic of Korea. Young and healthy leaf tissues were collected at the vegetative growth stage from multiple individuals representing each cultivar. To preserve DNA integrity, harvested leaf samples were immediately frozen in liquid nitrogen and stored at −70°C in an ultra-low temperature freezer until genomic DNA extraction.

    2. DNA extraction and library preparation

    Genomic DNA was extracted from 100 mg of leaf tissue using the CTAB method (Kidwell and Osborn, 1992). DNA quality was assessed via electrophoresis of 3 μL DNA on a 1% agarose gel at 200 V for 25 min, with a 1 kb+ ladder used as a reference. For sequencing library construction, qualified DNA samples were digested with the restriction enzyme ApeKI at 75°C for 2 h, followed by adapter and barcode ligation and pooling. Purification was performed using the QIAquick PCR Purification Kit, and amplification was carried out for 15 cycles. Library quality was verified on a 1.5% agarose gel at 200 V for 30 min, with a 100 bp ladder used to confirm the expected fragment distribution. Sequencing was conducted on an Illumina HiSeq X platform (Illumina Inc., San Diego, CA, USA) using an S4 flow cell in paired-end mode (2 × 150 bp).

    3. Preprocessing of sequencing data and SNP detection

    After demultiplexing with barcodes, adapter sequences were removed using Cutadapt (v1.8.3) (Martin, 2011). Low-quality reads with a Phred score below 20 were trimmed using Trimmomatic (v0.39) (Bolger et al., 2014), yielding highquality cleaned reads. These cleaned reads were mapped to the Lolium multiflorum reference genome, consisting of 2,545 scaffolds (175,758,422 bp) reported by Knorst et al. (2019), using the BWA-MEM algorithm (v0.7.17) (Li, 2013). SNP calling was performed with SAMtools (v0.1.16) (Li et al., 2009), and the resulting SNP loci were arranged in a matrix format (SNVs), with loci as rows and cultivars as columns. SNP filtering was applied using the following thresholds: minor allele frequency (MAF) > 25%, missing data < 20%, and reference/alternative allele frequency > 25%.

    4. Selection of cultivar-specific SNP markers

    SNP loci were annotated based on the scaffold track information reported by Knorst et al. (2019). Each SNP was coded as “a” for the reference allele, “b” for the alternative allele, “h” for heterozygous, and “–” for missing data. To identify markers specific to KW, additional filtering was applied with a missing rate < 15%. Homozygous SNPs consistently observed in all KW individuals but polymorphic in other cultivars were retained as candidate loci. From these candidates, the minimal number of SNPs required to uniquely discriminate KW was determined, and the selected SNPs were designated as the cultivar-specific barcode marker set.

    Ⅲ. RESULTS AND DISCUSSION

    1. Sequencing data generation and preprocessing

    Sequencing of the 40 Italian ryegrass samples on the Illumina HiSeq X platform generated 558,852,406 paired-end reads (151 bp), corresponding to 84,386,713,306 bp in total. After quality trimming with Trimmomatic, 515,716,846 high-quality reads were retained, yielding 61,628,597,634 bp of cleaned data (Table 1). Mapping against the 2,545 scaffolds of Lolium multiflorum resulted in 88.45% alignment (457,253,563 reads), covering a total of 13,985,967 bp with an average depth of 15.26 (Table 2).

    2. SNP discovery through bioinformatic analysis

    Using the aligned reads as input, SAMTools identified 7,002,372 SNPs across the L. multiflorum scaffolds, which were organized into a single-nucleotide variant (SNV) dataset. Successive filtering reduced the dataset substantially: with a minor allele frequency (MAF) threshold of >25%, 1,239,685 loci were retained. A missing data threshold of <20% was applied to reduce the set to 7,908 loci and an additional reference/alternative allele frequency filter (>25%) yielded 1,671 high-confidence SNP loci (Table 3).

    The removal of most SNPs at the MAF >25% stage suggests that the Korean Italian ryegrass cultivars share a high proportion of common alleles, with most variation concentrated in rare alleles. Similar patterns have been observed in other low-diversity breeding panels. For example, in a U.S. maize inbred panel, 68% of SNPs had MAF <0.1 (Romay et al., 2013). In such low-diversity populations, increasing the MAF threshold excludes rare variants and emphasizes medium- to high-frequency loci, improving marker utility. Geibel et al. (2021) also recommended selection of high-MAF SNPs for reliable marker design in restricted breeding populations.

    This reduction in genetic diversity underscores the limitations of morphology- and physiology-based discrimination, which are strongly influenced by environment and genotype–environment interactions. As Pasquali et al. (2022) highlighted, reliance on phenotypic traits alone in DUS testing risks misclassification. The SNP-based molecular markers developed in this study thus provide a more reliable framework for cultivar identification.

    3. Phylogenetic tree and PCA analysis

    The final set of 1,671 SNP loci was used to investigate the genetic relationships among the 10 Italian ryegrass cultivars (40 samples) through phylogenetic tree construction and principal component analysis (PCA) (Figs. 1 and 2). Most cultivars clustered closely, reflecting their genetic relatedness. However, in the PCA, KW showed partial separation from other cultivars along PC2 (12.0%) and PC3 (9.7%). Although the variance explained by these components was modest, the separation suggests that KW may have a distinct genetic background compared with other major domestic cultivars.

    4. Development of SNP barcode sets for cultivar discrimination

    Among the 1,671 loci, 52 SNPs were consistently detected in all four KW individuals. After excluding 28 loci with high missing rates, 24 SNPs were retained with missing data <15%, ensuring higher reliability. These SNPs exhibited an average MAF of 46.3% (range: 38.6–50%) and an average polymorphism information content (PIC) of 0.372 (range: 0.362–0.375) (Table 5). Given that SNPs are bi-allelic markers with a maximum PIC value of 0.5, markers with PIC values above 0.3 are generally considered informative (Yang et al., 2016). Accordingly, the selected SNPs were validated as effective candidates for cultivar identification.

    From these 24 loci, a barcode system was developed using combinations of three SNPs. The minimum set of three SNPs was sufficient to unambiguously distinguish KW, and an additional 21 alternative SNP combinations were also identified. For instance, in barcode set 1, three loci (LM_4600405, LM_39418557, and LM_39440022) with genotypes T, C, and A were unique to KW (Table 6). Genotypes were coded as “a” (reference allele), “b” (alternative allele), “h” (heterozygous), or “–” (missing) and converted into barcode profiles (Table 7). In this system, the “bbb” genotype pattern uniquely identified KW, clearly distinguishing it from GF, GC, KS, KG, KM, 605, KWR, HS104, and FLO.

    In a previous study, RAD-seq–based SNP markers were shown to discriminate Italian ryegrass cultivars, reinforcing the validity of our SNP barcode system (Yu et al., 2022). Although simple sequence repeat (SSR) markers remain valuable for studying population structure and complementing classical DUS testing (Nam et al., 2025), the small number of SNPs required in this study offers practical advantages for rapid and cost-effective identification. However, as Yu et al. (2022) noted, the robustness of SNP-based discrimination depends on both sample size and genetic diversity. Therefore, further validation across a broader set of cultivars and individuals is necessary to establish robust applicability.

    Ⅳ. CONCLUSIONS

    In this study, NGS of 10 Italian ryegrass cultivars (40 samples) generated genome-wide sequence information, from which SNP analysis revealed that as few as three SNP loci were sufficient to discriminate the cultivar KW. Moreover, 21 alternative three-SNP barcode sets were developed. These SNP barcodes provide an effective and reliable tool for precise cultivar discrimination, supporting the protection of domestic varieties and the advancement of the forage industry. Beyond KW, the development of SNP-based marker sets for additional Italian ryegrass cultivars will further improve the resolution of genetic resource identification and contribute to the breeding of highquality new varieties.

    Although only a single foreign reference cultivar, FLO, was included, which constrains generalizability beyond cultivars distributed in Korea, immediate applicability is largely preserved because FLO represents the majority of foreign cultivars currently distributed in Korea. Future work will expand within-cultivar sampling and increase the number of cultivars analyzed to develop SNP markers capable of distinguishing all Italian ryegrass cultivars distributed in Korea. These markers will be implemented in field-deployable assays such as KASP and HRM to ensure robust and reproducible cultivar identification.

    Ⅴ. ACKNOWLEDGMENTS

    This study was supported by the Cooperative Research Program for Agriculture Science & Technology Development (Project No. PJ01669904) and the 2025 Postdoctoral Fellowship Program of the National Institute of Animal Science funded by RDA, Republic of Korea.

    Figure

    KGFS-45-3-206_F1.jpg

    Phylogenetic tree analysis of Italian ryegrass cultivars.

    KGFS-45-3-206_F2.jpg

    PCA analysis of Italian ryegrass cultivars.

    Table

    Statistic values after pre-processing

    1 Clean reads/Raw reads (%): (Total length of clean reads / Total length of raw reads) 100.

    Statistic values of read alignment

    SNP matrix record and filter of statistic values

    SNP matrix record and filter of statistic values

    1 Number of SNPs in the cultivar discrimination barcode set: constructed using three SNP markers in combination.
    2 Number of candidate barcode sets for cultivar discrimination: total number of alternative barcode set combinations.

    Nucleotide, MAF and PIC information for 24 SNPs

    SNP nucleotide sequences including barcode set

    Barcode set for identification of Kowinearly (KW) from other Italian ryegrass cultivars

    Reference

    1. Bolger, A.M., Lohse, M. and Usadel, B. 2014. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics. 30(15):2114- 2120.
    2. Chakravarthi, B.K. and Naravaneni, R. 2006. SSR marker based DNA fingerprinting and diversity study in rice (Oryza sativa L). African Journal of Biotechnology. 7(5):684-688.
    3. Choi, G.J., Lim, Y.C., Rim, Y.W., Sung, B.C., Kim, M.J., Kim, K.Y. and Seo, S. 2006. A cold-tolerant and early-maturing Italian ryegrass new variety, 'Kogreen'. Journal of the Korean Society of Grassland and Forage Science. 26(1):9-14.
    4. Cojocariu, L., Moisuc, A., Samfira, I., Horablaga, M., Radu, F. and Lalescu, D.V. 2010. The study of the main production characters depending on dose of cattle manure applied to Italian ryegrass. Research Journal of Agricultural Science. 42(1):390-397.
    5. Collard, B.C.Y. and Mackill, D.J. 2008. Marker-assisted selection: An approach for precision plant breeding in the twenty-first century. Philosophical Transactions of the Royal Society B: Biological Sciences. 363(1491):557-572.
    6. Geibel, J., Reimer, C., Weigend, S., Weigend, A., Pook, T. and Simianer, H. 2021. How array design creates SNP ascertainment bias. PLOS ONE. 16(3):e0245178.
    7. Humphreys, M., Feuerstein, U., Vandewalle, M. and Baert, J. 2010. Ryegrasses. In: B. Boller, U. Posselt and F. Veronesi (Eds.), Fodder crops and amenity grasses. Handbook of plant breeding. Springer. New York. USA. pp. 211-260.
    8. Ji, H.C., Whang, T.Y., Lee, K.W., Kim, W.H., Woo, J.H., Hong, K.H. and Choe, K.W. 2018. Growth characteristics and productivity of Italian ryegrass (Lolium multiflorum Lam.) new variety, 'Green Call'. Journal of the Korea Society of Grassland and Forage Science. 38(4):247-252.
    9. Kidwell, K.K. and Osborn, T.C. 1992. Simple plant DNA isolation procedures. In: J.S. Beckmann and T.C. Osborn (Eds.), Plant genomes: Methods for genetic and physical mapping. Springer. Dordrecht. Netherlands. pp. 1-13.
    10. Kim, M. and Sung, K. 2021. Impact of abnormal climate events on the production of Italian ryegrass as a season in the Republic of Korea. Journal of Animal Science and Technology. 63(1):77-90.
    11. Knorst, V., Yates, S., Byrne, S., Asp, T., Widmer, F., Studer, B. and Kölliker, B. 2019. First assembly of the gene-space of Lolium multiflorum and comparison to other Poaceae genomes. Grassland Science. 65(2):125-134.
    12. Kubik, C., Sawkins, M., Meyer, W.A. and Gaut, B.S. 2001. Genetic diversity in seven Perennial ryegrass (Lolium perenne L.) cultivars based on SSR markers. Crop Science. 41(5):1565-1572.
    13. Li, H. 2013. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv preprint arXiv:1303.3997.
    14. Li, H., Handsaker, B., Wysoker, A., Fennell, T., Ruan, J., Homer, N., Marth, G., Abecasis, G. and Durbin, R. 2009. The sequence alignment /map format and SAMtools. Bioinformatics. 25(16):2078-2079.
    15. Mammadov, J., Aggarwal, R., Buyyarapu, R. and Kumpatla, S. 2012. SNP markers and their impact on plant breeding. International Journal of Plant Genomics. 2012:728398.
    16. Martin, M. 2011. Cutadapt removes adapter sequences from highthroughput sequencing reads. EMBnet Journal. 17(1):10-12.
    17. Nam, D.G., Baek, E.S., Hwang, E.B., Gwak, S.C., Lee, Y.H., Cho, S.W., Yu, J.K. and Hwang, T.Y. 2005. Genetic diversity patterns within and among varieties of Korean Italian ryegrass (Lolium multiflorum) and Perennial ryegrass (Lolium perenne) based on simple sequence repetition. Agriculture. 15(3):244.
    18. Nie, G., Huang, T., Ma, X., Huang, L., Peng, Y., Yan, Y., Li, Z. Wang, X. and Zhang, X. 2019. Genetic variability evaluation and cultivar identification of tetraploid annual ryegrass using SSR markers. PeerJ. 7:e7742.
    19. OECD. 2025. List of varieties eligible for seed certification. Available from https://www.oecd.org/content/dam/oecd/en/topics/policy-subissues/seeds/full-variety-ist.pdf.
    20. Oh, M., Choi, B.R., Lee, S.Y., Jung, J.S., Park, H.S., Lee, B.H. and Kim, K.Y. 2021. Study on the forage cropping system of Italian ryegrass and summer forage crops at paddy field in middle region of Korea. Journal of the Korean Society of Grassland and Forage Science. 41(2):141-146.
    21. Park, K.C., Kim, Y.G., Hwangbo, K., Gil, J., Chung, H., Park, S.G., Hong, C.P. and Lee, Y. 2017. Development of simple sequence repeat markers from Adenophora triphylla var. japonica (Regel) H. Hara using next generation sequencing. Korean Journal of Medicinal Crop Science. 25(6):411-417.
    22. Pasquali, E., Palumbo, F. and Barcaccia, G. 2022. Assessment of the genetic distinctiveness and uniformity of pre-basic seed stocks of Italian ryegrass varieties. Genes. 13(11):2097.
    23. Romay, M.C., Millard, M.J., Glaubitz, J.C., Peiffer, J.A., Swarts, K.L., Casstevens, T.M., Elshire, R.J., Acharya, C.B., Mitchell, S.E., Flint-Garcia, S.A., McMullen, M.D., Holland, J.B., Buckler, E.S. and Gardner, C.A. 2013. Comprehensive genotyping of the USA national maize inbred seed bank. Genome Biology. 14:R55.
    24. Yang, X., Ren, R., Ray, R., Xu, J., Li, P., Zhang, M. and Kilian, A. 2016. Genetic diversity and population structure of core watermelon (Citrullus lanatus) genotypes using DArTseq-based SNPs. Plant Genetic Resources. 14(3):226-233.
    25. Yu, Q., Ling, Y., Xiong, Y., Zhao, W., Xiong, Y., Dong, Z., Yang, J., Zhao, J., Zhang, X. and Ma, X. 2022. RAD-seq as an effective strategy for heterogenous variety identification in plants-A case study in Italian ryegrass (Lolium multiflorum). BMC Plant Biology. 22:231.
    26. Zhang, J., Yang, J., Lv, Y., Zhang, X., Xia, C., Zhao, H. and Wen, C. 2023. Genetic diversity analysis and variety identification using SSR and SNP markers in melon. BMC Plant Biology. 23:39.