Abstract
In this paper, we have performed an in-depth study of the complete set of the satellite DNA (satDNA) families (i.e. the satellitomes) in the genome of two barley species of agronomic value in a breeding framework, H. chilense (H1 and H7 accessions) and H. vulgare (H106 accession), which can be useful tools for studying chromosome associations during meiosis. The study has led to the analysis of a total of 18 satDNA families in H. vulgare, 25 satDNA families in H. chilense (accession H1) and 27 satDNA families in H. chilense (accession H7) that constitute 46 different satDNA families forming 36 homology groups. Our study highlights different important contributions of evolutionary and applied interests. Thus, both barley species show very divergent satDNA profiles, which could be partly explained by the differential effects of domestication versus wildlife. Divergence derives from the differential amplification of different common ancestral satellites and the emergence of new satellites in H. chilense, usually from pre-existing ones but also random sequences. There are also differences between the two H. chilense accessions, which support genetically distinct groups. The fluorescence in situ hybridization (FISH) patterns of some satDNAs yield distinctive genetic markers for the identification of specific H. chilense or H. vulgare chromosomes. Some of the satellites have peculiar structures or are related to transposable elements which provide information about their origin and expansion. Among these, we discuss the existence of different (peri)centromeric satellites that supply this region with some plasticity important for centromere evolution. These peri(centromeric) satDNAs and the set of subtelomeric satDNAs (a total of 38 different families) are analyzed in the framework of breeding as the high diversity found in the subtelomeric regions might support their putative implication in chromosome recognition and pairing during meiosis, a key point in the production of addition/substitution lines and hybrids.
Key message
Several satDNAs have been identified in the subtelomeres of wild and cultivated barley species that might contribute to the specificity of these distal chromosome regions during homologous recognition and pairing in meiosis.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Cultivated barley, Hordeum vulgare ssp. vulgare L. (Baden and von Bothmer 1994; Bothmer et al. 1995) (2n = 2x = 14) is a member of the Poaceae family and one of the most important cereal crops in the world. Barley is a source of feed for livestock and of malt for brewing, with a production of more than 150 million tons (MT) in 2022, being the European Union the major grower with 65% of the global production (Faostat, https://www.fao.org/faostat/es/#data/QCL/visualize; consulted 04/03/2024). Cultivated barley was domesticated ~ 10,000 years ago from its progenitor H. vulgare subsp. spontaneum (K. Koch) Thell. (2n = 2x = 14) in the Fertile Crescent, a historically significant region for its early agricultural and human civilizations corresponding to part of the territories of the Mediterranean Levant and Mesopotamia (Brown et al. 2009), which is considered the primary centre of barley origin and domestication, although additional areas of domestication have been also proposed (Thormann et al. 2016; Blattner 2018). Understanding barley genome organization is essential to enable and facilitate the demand for better-adapted crops with higher yields, which must be implemented through breeding programs to ensure more efficient and sustainable production. Likewise, it is important to have information on the genomic organization of crop wild relatives which may provide new genes for crop improvement and adaptation. Such is the case, for example, of the wild barley species like H. chilense Roem. & Schult. (2n = 2x = 14), mainly distributed in Chile and Argentina (Blattner 2018), that contains desirable genes for wheat breeding (Forster et al. 1990; Martin et al. 1998; Rubiales et al. 2000; Martín et al. 2010; Calderón et al. 2012) or for the development of the wheat-barley amphiploids named tritordeum (Martín et al. 1999).
In the framework of breeding, it is important to elucidate how chromosomes associate and recombine during meiosis (the cellular process to generate the gametes in organisms with sexual reproduction), particularly in inter-specific genetic crosses, in which the success of recombination between chromosomes from different species is not high. In this context, subtelomeric sequences can be of particular importance during meiosis since they might be crucial for homologous chromosome recognition and association required for proper gamete segregation (Calderón et al. 2014; Naranjo 2015). We have previously found that the subtelomeric regions in barley and wheat are highly polymorphic and those polymorphisms could contribute to the specificity of the correct homologous pairing in both species (Aguilar and Prieto 2020, 2021; Serrano-León et al. 2023). Because tandemly repetitive DNA or satellite DNA (satDNA) is one of the main components of subtelomeric as well as (peri)centromeric regions (Garrido-Ramos 2015, 2017, 2021) we have recently proposed that satDNAs also contribute to the polymorphism that exists in the terminal region of the chromosomes (Gálvez-Galván et al. 2024).
The newly updated barley genome sequence assembly (H. vulgare subsp. vulgare, MorexV3; https://www.ncbi.nlm.nih.gov/datasets/genome/GCF_904849725.1/) has accelerated comparative genomics analyses of barley and other species (Mascher et al. 2021). Lamentably, it has been proven that known repeat arrays in telomeres, subtelomeres, centromeres, and 5S and 45S rDNA loci are not entirely represented in the current barley reference genome sequence assembly (Navrátilová et al. 2022; Serrano-León et al. 2023). Using a computational pipeline designed to identify and characterize repetitive DNA sequences in next-generation sequencing data (Novák et al. 2013, 2017; Ruiz-Ruano et al. 2016) we have screened Illumina sequencing data from H. vulgare subsp. vulgare and H. chilense H1 and H7 accessions to characterize the complete set of satDNA families (i.e., the satellitome) of both species and compared them with special emphasis on their genomic location and organization.
Materials and Methods
Plant material and growing conditions
Two different accessions (H1 and H7) from the wild barley H. chilense (Roem. et Schult.) (2n = 2x = 14; genome size = ~ 5.3 Gb; (Bennett and Smith 1976), and the domestic barley H. vulgare L. cv. Betzes (accession H106) (2n = 2x = 14; genome size = ~ 5 Gb; Doležel et al. 1998, 2018) were used in this work to perform both genomic and cytogenetic analyses. Seeds from the barley species were germinated and incubated in the dark at 4 °C on wet filter paper in Petri dishes for 4–5 days, then transferred to 25 °C for 1–2 days until germination. Roots were dissected and treated for accumulation of mitotic cells and then seeds were transferred to pots and grown in the greenhouse under semi-controlled conditions of temperature (25 °C day/15 °C night) and relative humidity (40%). After two weeks, the plant material was ready for genomic DNA isolation.
Satellitome analysis
Genomic DNA (gDNA) was isolated from young frozen barley leaves using the standard CTAB procedure (Murray and Thompson 1980) with some modifications (Hernández et al. 2001). The DNA quality and concentration were determined using a NanoDrop1000 spectrophotometer (NanoDrop Technologies, USA).
Next Generation Sequencing was carried out at Macrogen Inc. (Macrogen Inc., Seoul, Korea) based on Illumina NovaSeq 6000 150PE (2 × 151 bp), yielding about 20.4 Gb (~ 4 × coverage), 20.7 Gb (~ 4 × coverage) and 20.3 Gb (~ 4 × coverage) data for H. chilense (accession H1), H. chilense (accession H7) and Hordeum vulgare (accession H106), respectively. Illumina sequencing raw data can be accessed at SRA-Genbank database in the BioProjects PRJNA1039805 and PRJNA1040438.
We applied the protocol satMiner (Ruiz-Ruano et al. 2016), based on consecutive rounds of clustering of Illumina reads by RepeatExplorer 2 (Novák et al. 2013, 2020), using a subset of reads (2,000,000 per library), and subsequent filtering of the already assembled reads using DeconSeq (Schmieder and Edwards 2011). RepeatExplorer 2 (Novák et al. 2013, 2020) executes an integrated version of the TAREAN tool (Novák et al. 2017), which performs automated identification of satellite DNA repeats based on the topology of their cluster graphs.
We first performed a quality trimming with Trimomatic (Bolger et al. 2014), and randomly selected 2 × 2,000,000 Illumina reads with SeqTK (https://github.com/lh3/seqtk), to run RepeatExplorer2 with default options. Cluster graphs with circular shapes were selected using TAREAN which generates a consensus monomer sequence for each satDNA cluster.
We filtered out the reads showing homology with the already clustered contigs and the already identified satDNA using DeconSeq, and selected a new set of 2 × 2,000,000 reads from the filtered libraries, that were clustered with RepeatExplorer2 in a second round. This allows detecting satDNAs being poorly represented in the raw reads. We repeated the filtering using the clusters in the second round and selected 2 × 2,000,000 reads for three additional rounds. Performing additional rounds of clustering and filtering have shown to be highly successful as it allows the detection of satDNAs which, due to their low abundance, had gone unnoticed because those of highly abundant elements masked their signals (Ruiz-Ruano et al. 2016).
To estimate abundance and divergence for each identified satDNA, we aligned 2 × 10 millions of randomly selected read pairs to the consensus sequences in the resulting satDNA database, using RepeatMasker with a publicly available script (https://github.com/fjruizruano/satminer/blob/master/repeat_masker_run_big.py). We used the calcDivergenceFromAlign.pl built-in tool of RepeatMasker to obtain a histogram of the Kimura two-parameter divergence for each element. Next, we transformed the abundance values to express them as genome proportions by dividing the number of aligned nucleotides by the total number of nucleotides in the selection of 20 million reads. The resulting histograms are referred to as Repeat Landscapes (RLs). We also used this procedure to search for each isolated satellite in each accession in the rest of the accessions analyzed by aligning 2 × 10 millions randomly selected read pairs from each barley accession to the consensus sequences in each accession-specific satDNA database.
We searched for homology between barley satellitomes with the rm_homolgy script (Ruiz-Ruano et al. 2016) that makes all-to-all alignments with Repeat-Masker v4.0.5 (Smit et al. 2015). In addition, we searched for homologies with transposable elements with RepeatMasker with “no_low” and “no_is” options.
The Adenine/Thymine content (AT%) was calculated using the bioinformatic tool “GC Content Calculator” (https://www.biologicscorp.com/tools/GCContent) developed by (Guerra et al. 2016).
A search of these satellite sequences in the barley genome was carried out using Basic Local Alignment Search Tool (BLAST®) trailing the genome assembly of Hordeum vulgare subsp. vulgare (https://www.ncbi.nlm.nih.gov/assembly/ GCF_904849725.1), through NCBI's Genome Data Viewer (GDW) to identify the locations of each satDNA family within the barley genome.
Probes designed for cytogenetic validation of satDNA sequences by in situ hybridization
The different satDNAs families were amplified by PCR (“Polymerase Chain Reaction”) using specific primers designed with the Primer-BLAST software tool from NCBI (https://www.ncbi.nlm.nih.gov/tools/primer-blast/index.cgi?LINK_LOC=BlastHome) (Table S1). For monomers shorter than 80 bp, primers were designed manually. OligoAnalyzer™ tool (https://eu.idtdna.com/calc/analyzer) was used to confirm the absence of putative secondary structures in the sequences of the primers (hairpins, self-dimers and hetero-dimers). 20 ng of each H. chilense accession or H. vulgare (H106) genomic DNA were used to perform the PCR reaction with different polymerases according to the size of the sequence (MyTaqTM Plant-PCR Kit or MyFiTM DNA Polymerase, both from Bioline) (see Table S1). For families with sequences consisting of monomers between 80–1000 bp, we performed PCR amplification with the following conditions: a starting denaturation step at 94 °C for 5 min (minutes), 35 cycles at 94 °C for 30 s (seconds), followed by an annealing step of 42–60 °C (primer-dependent, see Table S1) for 30 s, and an extension at 72 °C during 1 min. A final extension step at 72ºC for 6 min was added. In the cases of amplifying satDNAs shorter than 80 bp, we reduced the time of annealing to 10 s to get longer amplicons according to (Ruiz-Ruano et al. 2016). PCR products from short monomers were displayed as a smear in agarose gels and were re-amplified using 1 μL of the previous PCR product in a new PCR mix. For satDNAs with a monomer sequence larger than 1 Kb, the extension time was modified, to find the optimal condition, around 45 s/Kb. FavorPrepTM Gel/PCR Purification Mini Kit (FAVORGEN) was used to extract the desired size band from the agarose gel. DNA amplification samples were sequenced to confirm the reliability of the PCR products.
PCR products were loaded in 1% agarose gel electrophoresis in 1 × TAE (40 mM TrisBase, 20 mM Acetate and 1 mM EDTA, dH2O until the volume of 1 L) running buffer and visualized using Quantity One 1-D Analysis Software Bio-Rad. According to the size of the sequence, a 100 bp or 1 Kb DNA Ladder ready to Load (Solis BioDyne) were used as a reference for molecular weight DNA.
The satDNAs families sequences from H. chilense and H. vulgare were labelled by nick translation with digoxigenin-11-dUTP (Roche Applied Science, Indianapolis, IN, USA) and with biotin-11-dUTP (Boehringer Mannheim Biochemicals, Germany), respectively, according to the manufacturer’s instructions. Nick translation was performed in a thermocycler (ThermoBrite® Leica) at 15 °C for 90 min.
Chromosome preparations of root tip cells in somatic metaphase
Barley seeds were germinated on wet filter paper in the dark for 5 days at 4 °C, followed by 24 h at 25 °C. Emerging seedling roots 1–2 cm (centimetres) long were cut, incubated for 4 h in 0.05% w/v colchicine at 25 °C, fixed in 100% ethanol- acetic acid, 3:1 (v:v) and stored at 4 °C until their use.
Preparation of chromosome spreads was done as described in (Prieto et al. 2001; Prieto et al. 2004a) with some modifications. That is, before squashing, roots were washed in 1 × enzyme buffer (4 mM citric acid and 6 mM sodium citrate) 3 times for 5 min each. Then, meristems were cut and incubated for one hour in the enzyme mixture (0.5% pectolyase Y23 (Kyowa Chemical Products Co., LTD), 1% cellulose “Onozuka” RS (Yakult Pharmaceutical Ind. Co., LTD) and 20% peptinase (Sigma) in dH2O).
Fluorescence in situ hybridization (FISH)
For in situ hybridization experiments, biotin and digoxigenin labelled probes were mixed to a final concentration of 5 ng/μl in the hybridization mixture (50% formamide, 2 × SCC, 5 ng of each digoxigenin and biotin-labelled probes, 10% dextran sulphate, 0.14 μg of yeast tRNA, 0.1 μg of sonicated salmon sperm DNA, and 5 ng of glycogen). The in situ hybridization protocol was performed according to (Cabrera et al. 2002). At least two slides per satDNA per H. chilense or H. vulgare lines were hybridized in FISH experiments.
Post-hybridization washes were conducted twice at 2 × SSC (5 min each) at 37 °C followed by another wash in 1 × SSC at room temperature (RT). Biotin- and digoxigenin-labelled probes were detected with streptavidin-Cy3 conjugates (Sigma, St. Louis, MO, USA) and antidigoxigenin FITC antibodies (Roche Diagnostics, Meylan, France), respectively. Total DNA was counterstained with 4′,6-diamidino-2- phenylindole (DAPI) and mounted in Vectashield (Vector Laboratories, Burlingame, CA, USA). Hybridization results were visualized using a Nikon Eclipse 80i epifluorescence microscope and images were captured with a Nikon CCD camera using the Nikon 3.0 software (Nikon Instruments Europe BV, Amstelveen, The Netherlands) and processed with Photoshop 11.0.2 software for adjustment of brightness and contrast (Adobe Systems Inc., San Jose, CA, USA).
Chromosomes from the wild and cultivated barley accessions displaying positive signals for the satDNAs sequences were identified using the repeat sequence GAA as described previously (Pedersen et al. 1996; Pedersen and Langridge 1997; Prieto et al., 2004a; Kruppa et al. 2013).
Results
Barley satellitome survey
SatDNA mining revealed 25 satDNA families in H. chilense (accession H1), representing ~ 4.3% of the genome (Table 1), 27 satDNA families in H. chilense (accession H7), representing ~ 3.6% of the genome (Table 2) and 18 satDNA families in H. vulgare representing ~ 2.1% of the genome (Table 3). Figures S1–S3 show the reconstruction of representative monomer sequences for each satDNA family. Intra-specific homologies between several of these satDNAs which form different superfamilies were found. For example, the superfamily SF3 in H. chilense-H1 is composed of three satDNA families (HchH1Sat03-118, HchH1Sat04-118 and HchH1Sat12-118). There are homologous satellites to these satDNAs in H. chilense-H7 (three families) and H. vulgare-H106 (three families) and the nine satDNA families comprise the Group number 3 (GR3) of satellite sequences (Tables 1, 2, 3).
It is striking at first sight that very few satellites of H. chilense would have homologues in H. vulgare and vice versa (Tables 1–3). Thus, in principle, the two species would share only 5 satDNA families. In addition, certain differences between accessions H1 and H7 of H. chilense have been found, that is 2 satellites in accession H1 not found in H7 and 8 satellites in H7 not found in accession H1. It is also striking that in the wild varieties of barley, the proportion of satellite DNA in the genome is about twice that in cultivated barley (about 2%) and, in addition, there are also considerable differences in this proportion between H. chilense accessions (about 4.3% in H1 and 3.6% in H7). To search for the presence of the presumed accession-specific satDNAs in each barley accession, we aligned 2 × 10 millions of randomly selected read pairs from each barley accession to the consensus sequences in the accession-specific satDNA database, using RepeatMasker as indicated in Materials and Methods. As a result, all thirteen alleged H. vulgare-specific satellites were evident in both H. chilense H1 and H7 accessions (Table 4). However, in most cases, the proportion of these satellites in H1 and H7 was very low (below 0.01% in 10 of 13 satellites and between 0.02% and 0.07% in the remaining three). Similarly, among the twenty-one alleged H. chilense-specific satDNAs, all but two were found in H. vulgare (Table 4). In this case, one satDNA was found to represent 0.036% of the H. vulgare genome but the remaining 18 were represented in a percentage below 0.01% (in fact, 9 satellites were under 0.001%). Table 4 shows the relationships of all these satDNA families. Collectively, they are grouped in 36 homology groups and 8 superfamilies, summing a total of 46 different satDNA families. According to Table 4, all indications are that 44 of these satellites were already present in the common ancestor of these barley species. Most of the groups are constituted by satDNA families that have been somewhat more amplified either in H. chilense (GR2, GR4, GR5, GR6, GR7, GR8, GR10, GR11, GR13, GR14, GR15, GR17, GR18, GR19, GR20, GR21 and GR24) or in H. vulgare (GR28, GR29, GR30, GR31, GR32, GR34 and GR35), with traces of each of them remaining in the other species (Table 4; Fig. 1). Conversely, amplifications were significant in the three accessions analyzed in seven groups of satDNAs, although differently among them (GR1, GR3, GR9, GR12, GR25, GR26 and GR27; a total of 10 different families). In addition, within H. chilense, we can also distinguish between accessions H1 and H7 since most shared satDNAs are more abundant in the H1 accession. Furthermore, 4 satDNA families are exclusive of this accession (Tables 2, 3 and 4): (a) HchH1Sat02-92 (GR2 and SF2) is composed of two 46-bp subunits and it is almost replacing the 46-bp satellite (HchH1Sat18-46) in H1 (0.30% vs 0.01%), which is present in the H7 accession (HchH7Sat06-46) in a smaller proportion (0,11%); (b) HchH1Sat21-332 is a new satDNA of the GR4 group and SF4 superfamily that is not present in H7; (c) In the GR10, HchH1Sat14-88 (0.02%) is composed of two 44-bp subunits and it has almost replaced in H1 the 44-bp satellite present in the H7 accession (HchH7Sat15-44) in a similar proportion (0.03%) while in the H1 accession (HchH1Sat25-44) it has almost disappeared (0.0001%); (d) HchH1Sat24-1932 is a newly emergent and still poorly represented satDNA included in GR11 group. Among the 10 satellite families with significant amplifications in the two species, it is also worth noting that some are more abundant in H. chilense and others in H. vulgare. Of note are the satellites included in GR1 group, which represent the higher proportion of the satellitome in H1 (71% of the satellitome) and H7 (66% of the satellitome) accessions of H. chilense. HvuSat01-338 (GR1) is also the most abundant satDNA in H. vulgare but represents 0.6% of the genome and it is four times less abundant than the representative satellites (HchH1Sat01-337 and HchH7Sat01-337) in H. chilense accessions (3.05% and 2.4% respectively). On the contrary, HvuSat03-118 (GR3 and SF3), is 0.37% of the H. vulgare genome while the respective counterparts in H. chilense H1 and H7 (HchH1Sat03-118 and HchH7Sat02-118) represent 0.17% and 0.18%, respectively, of their genomes. Also notable is the case of HvuSat02-444 (GR25) which is 0.55% of the H. vulgare genome but only 0.02%-0.03% in H. chilense (we couldn’t isolate these counterparts). Interestingly, this satellite consists of a cassette-like structure in which a 174 bp sequence is flanked by imperfect inverted repeats (130/139 bp) of the sequence (GAA)n [(TTC)n-174 bp-(GAA)n], noting that several of the TTC and GAA subrepeats of the (TTC)n and (GAA)n regions are degenerate (Figure S4). Repeat Landscape (i.e. the histogram representing abundance (y-axis) and divergence (x-axis) concerning a satDNA consensus sequence; see Materials and Methods) plots of each of the two parts of this satellite can be seen in Figure S5.
In the comparison between H. chilense and H. vulgare, some specificity based on the RE results and the proportion that they represent in each genome can be highlighted, although trace copies of most satellites in each other species have been also detected. Thus, we can consider that there are 19 groups of sequences including 25 different satDNA families that are more characteristic of H. chilense (GR2, GR4, GR5, GR6, GR7, GR8, GR10, GR11, GR13, GR14, GR15, GR16, GR17, GR18, GR19, GR20, GR21, GR23 and GR24) while there are 6 groups (7 different satDNA families) more characteristic of H. vulgare (GR25, GR28, GR29, GR31, GR32 and GR34) (Table 4 and Fig. 1).
satDNA homologies
BLAST search found significant similarity between several barley satDNA families (especially among those belonging to the groups GR1 and GR3) and other barley satDNAs previously reported (Table S2). In addition, these satellites, as well as other barley satDNAs, were homologous to other satDNAs previously described in other grass species (Table S2). Specifically, the GR1 satDNAs showed homology to the known Afa family found in Hordeum and several species of Poaceae while GR3 satDNAs are 118-bp subtelomeric satellites (see below) homologous to other satDNAs described under different designations in different grasses (Table S2). In addition, H. chilense satellites included in groups GR4, GR5, GR7, GR9, GR10 and GR12, were homologous to satDNAs sequences found in other Poaceae but not in Hordeum (Table S2).
An analysis using RepeatMasker revealed that 14 satDNA groups identified in this article as satDNAs showed homology with transposable elements, particularly DNA/CMC-EnSpm/CACTA elements although some of them are related to LTR retrotransposons (Table S3). Part of the sequence (between 20 and 100%, depending on the family) of six different satDNA families of H. chilense H1 showed homology to DNA/CMC-EnSpm/CACTA transposons. All these sequences were subtelomeric/interstitial except HchH1Sat09-1236 which showed a dispersed pattern (Table S3; see below). Part of the sequence (between 8 and 100%, depending on the family) of seven different satDNA families of H. chilense accession H7 also showed homology to DNA/CMC-EnSpm/CACTA transposons. All these sequences were subtelomeric/interstitial except HchH7Sat12-728 and HchH7Sat23-692, which were dispersed (Table S3; see below). Additionally, 25% of the (peri)centromeric HchH7Sat14-2790 satDNA sequence (see below), was homologous to LTR/Gypsy retrotransposons and 4.5% of the subtelomeric HchH7Sat26-satDNA sequence was homologous to DNA/hAT transposons. Two satDNA sequences of H. vulgare, HvuSat01-338 and HvuSat02-444, were related to DNA/CMC-EnSpm/CACTA transposons (95%-100% of their sequences; Table S3). However, another five satDNAs were homologous to LTR retrotransposons (between 12 and 89% of the sequence), including the (peri)centromeric HvuSat15-1590 satDNA (53% of its sequence is homologous to LTR/Copia elements) (Table S3).
Satellite DNA location
Most of the satellites analyzed, 38 in total, have a subtelomeric location (satellites included in groups GR1, GR2, GR3, GR4, GR5, GR7, GR9, GR11, GR12, GR15, GR20, GR23, GR28, GR29, GR30, GR31 and GR36) ranging from situations in which their presence is in a single chromosome pair to those in which their presence is in all seven chromosome pairs, passing through intermediate situations (Fig. 2; Table 5). It should be noted that some of these groups are the biggest ones (such as GR3 with 9 satellites). Some of these subtelomeric satellites (included in groups GR1, GR2, GR4, GR5, GR9 and GR31) displayed additional interstitial loci on some chromosomes (Fig. 2). In the cases of the GR5 and GR12 groups, composed mostly of subtelomeric satellites four satDNAs are interstitial, located at a single locus (Table 5).
A few satellites (satellites of groups GR2, GR10, GR13, GR14, GR19, GR32 and GR35), 16 in total, are located interstitially on one or a few pairs of chromosomes (Fig. 3 and Table 5). Among the satellites of GR2, HchH1Sat02-92 (Fig. 3a) and HchH7Sat18-46 are interstitial (Fig. 3e) in chromosomes 2–7, while satellite HchH7Sat06-46 occupies subtelomeric (2) loci in chromosomes 5 and 6, interstitial (3) loci in chromosomes 3, 6 and 7, and centromeric (1) loci in chromosome 2 (Table 5). This group of subtelomeric satellites would also include the four aforementioned satellites from the GR5 and GR12 groups.
Four Satellite DNAs are identified as (peri)centromeric (Fig. 4 and Table 5). The homologs HchH1Sat07-493 and HchH7Sat07-484 (GR6), are present on one chromosomal pair (pair 7) of both H. chilense accessions H1 and H7 (Fig. 4a, b). In situ hybridization of these satellites in H. vulgare did not reveal FISH signals. In situ hybridization experiments also revealed that the (peri)centromeric satellite HchH7Sat14-2790 (GR18), present in all chromosomes of H. chilense H7 (Fig. 4c) was also detected in H. chilense H1 (Fig. 4e), while in H. vulgare signals were not detected in any chromosome (data not shown). In H. vulgare, the satellite HvuSat15-1590 (GR33) is (peri)centromeric in all chromosomes (Fig. 4d). Although we were unable to isolate a homologous satellite in H. chilense, in situ hybridization with the HvuSat15-1590 satellite in H. chilense accessions revealed its presence in the peri(centromeric) region of all chromosomes except in chromosome 5 in both H. chilense accessions, confirming that this satellite is conserved in both H. vulgare and H. chilense species (Fig. 4f, g).
In addition to these satDNAs, HvuSat02-444 (GR25) is a satellite that has multiple locations along the chromosomes and resembles the hybridization pattern of the GAA sequence (Dennis et al. 1980; Pedersen et al. 1996) although the pattern is more diffuse and a multitude of additional loci were observed scattered throughout the chromosomes of H. vulgare (Fig. 4h; Table 5).
Few satellites showed a dispersed pattern (HchH1Sat09-1236, HchH1Sat23-4077, HchH7Sat11-505, HchH7Sat12-728, HchH7Sat23-692, HvuSat05-5500 and HvuSat06-4925) or were not visualized by in situ hybridization (HchH1Sat24-1932, HchH7Sat25-82, HchH7Sat27-410 and HvuSat16-483) (data not shown).
Table 5 summarizes the FISH data of different homologous satDNAs among the accessions studied. This summary highlights how different satellites have different distribution patterns in different accessions. For example, homologous satellites of GR1 have a different hybridization pattern on chromosomes 1, 4, 5 and 6 when comparing H. chilense with H. vulgare (Table 5). In the case of homologous satellites of GR2, the hybridization signals differentiate chromosomes 2, 5 and 6 of the two accessions of H. chilense (Table 5). Chromosomes 2, 3, 6 and 7 have different hybridization patterns between H. vulgare and H. chilense (and between the two accessions of this species) for the homologous satellites of GR3 (Table 5). The hybridization patterns of GR4 satellites are very different on the 7 chromosomes of the two accessions (H1 and H7) of H. chilense (Table 5). And so, we could continue with most of the satellites studied (Figs. 2–4 and Table 5). This can be seen in Fig. 5 as an additional comparative example between the three accessions. Figure 5 shows four out of five homologous satellites of GR 12 (H1Sat16, H1Sat17, H7Sat20 and HvuSat13) that have FISH patterns specific for each accession analyzed and could be used as chromosome markers.
BLAST search of satDNAs to the genome of H. vulgare
Taking advantage of the reference genome of H. vulgare (https://www.ncbi.nlm.nih.gov/datasets/genome/GCF_904849725.1/), we have traced the genome assembly with the satDNAs isolated from this species. The number of hits found for each satellite was proportional to the estimated abundance for each satellite, although not in all cases (Table S4). However, an important number of repetitive units from each of these satellites have been omitted from the total assembly analysis if we compare the number of hits revealed with the expected number of repeats (based on the proportion of each satellite and a genome size of ~ 5 Gb) (Table S5). Moreover, many hits appeared at the unplaced contigs, particularly for the most abundant satDNAs (Table S4). On the other hand, a coincidence between the positions of BLAST hits in the assembled chromosomes and FISH signals was found for most satDNAs. However, in addition to the regions detected by FISH, some satellites in the genome assembly using BLAST alignments were additionally scattered in other regions and different chromosomes. In some cases, especially when the satellites are long, the alignments revealed that an important part of the matches are incomplete sequences in the assembly. This is particularly the case for HvuSat05-5500 and HvuSat06-4925, which showed a dispersed pattern and homology to LTR retrotransposons and a scattered BLAST pattern in the assembly of H. vulgare genome. However, the case of HvuSat16-483, which was not located by in situ hybridization, is different: BLAST pattern in the assembly of H. vulgare genome is also scattered but most of the matches aligned completely or almost completely repetitive units and, in addition, some contiguous repeat units were detected, usually two repetitive units in a row, but in some other cases three or even six in a row (Table S4).
We also checked for the presence of repeats of the different satDNAs identified in H. chilense in the H. vulgare genome. Table S6 shows the number of hits of each of these H1 and H7 satDNAs in the H. vulgare genome assembly. The results are compatible with the results presented in Table 4. The numbers of hits of the satellites of the GR1, GR3 and GR12 groups were the highest in H. vulgare, which were also the ones detected by RE2 in the two species. The satellites of groups GR4 and GR11, which were not detected by RE2 in H. vulgare but tracked in the raw reads, also present an important number of hits, although much more discrete (Table 4). Of the remaining groups, most satellites were very poorly represented in the H. vulgare genome (Tables 4 and S6). Interestingly, (peri)centromeric satellites from H. chilense yielded very few (HchH1Sat07-493/HchH7Sat07-484) or no hits (HchH7Sat14-2790) in H. vulgare (Table S6) according to the search performed among the raw reads, that only revealed traces of these satellites in the H. vulgare genome (Table 4). Hordeum chilense satellites that showed a dispersed FISH pattern (HchH1Sat09-1236, HchH1Sat23-4077, HchH7Sat11-505, HchH7Sat12-728 and HchH7Sat23-692) or that did not give a signal in the hybridization experiments (HchH1Sat24-1932, HchH7Sat25-82, HchH7Sat27-410) displayed compatible BLAST results in the H. vulgare genome, which allows us to ensure that these sequences show a dispersed pattern in both species, in some cases sufficiently accumulated in H. chilense (but not in H. vulgare) to give FISH signals (dispersed) or not, explaining the absence of in situ hybridization signals. In addition, HchH7Sat27-410 sequences were not present in the genome H. vulgare as the result of the screening of the raw reads (Table 4).
Discussion
The contrasting satellitomes of the different barley accessions
Both species analyzed in this article have an undoubted interest in agriculture. One, H. chilense, is a wild species with the potential to add value to crops of other grass species, and the other, H. vulgare, is one of the most important cereal crops in the world together with wheat, maize, rice, or millet. In addition, sets of both cultivated (H. vulgare) and wild (H. chilense) barley addition lines in a hexaploid wheat background were developed several decades ago (Islam et al. 1978, 1981; Miller et al. 1982), which have an enormous potential to transfer desirable traits into wheat but also in plant meiosis studies (Calderón et al. 2012, 2014, 2018). Although both congeneric species belong to lineages that were separated no more than 9–12 million years ago (mya) (Blattner 2009, 2018), the profile of their satellitomes is very different. For example, they share a satDNA family composed of repeats of 337–338 bp (HchH1Sat01-337/HchH7Sat01-337 and HvuSat01-338 satellites of the GR1), which is the most abundant in both species, between four and five times more abundant in H. chilense than in H. vulgare (Table 4), and widely conserved in other grasses being known as the Afa family (Nagaki et al. 1995), closely related to the pAs1 sequence (Rayburn and Gill 1986). Furthermore, both wild and cultivated barley species share three different but related satDNA families composed of repeat units of 118 bp (included in GR3), which are among the most abundant in both species and some of them are also conserved in other Poaceae (Table S2). As an example, the abundance of HvuSat03-118 (H. vulgare) is twice the abundance of their H. chilense counterparts (Table 4). However, in addition to the important quantitative differences found in both species for these satellites, large quantitative differences for the rest of satDNAs analyzed in this paper have been shown. Only traces of most of the satDNAs of H. chilense remain in H. vulgare and vice versa in such a way that species-specific dominant satDNAs in one species are characterized by the presence of low-copy counterparts in the other species. This entails that, in most cases, our satDNA mining failed to isolate these counterparts. However, we revealed these traces by tracking the species-specific satDNA databases with the raw sequencing reads of every species (Table 4). Therefore, after all, both species share all satDNAs of their satellitomes except two low-represented satellite families of H. chilense that were not found in H. vulgare. As predicted by the "library hypothesis" (Fry and Salser 1977), differential amplifications of each satDNA family have occurred in the two species analyzed from the same common ancestral satellite library. In this case, moreover, it appears that the divergent paths of wildlife and domestication have led to an accelerated divergence in satDNA profiles of barley species. Important changes in satDNA content between wild and cultivated species have been associated with domestication in maize (Bilinski et al. 2015). Interestingly, the differences between the satellitomes of the two accessions studied of H. chilense were found remarkable (Tables 4 and 5; Figs. 1–6). It has been proposed that H. chilense consists of different morphologically and genetically distinct groups and our results contribute to support this view given the differences in abundance and location of most satDNAs between H1 and H7 accessions (Tables 4 and 5; Figs. 1–6). In addition, our results support the fact that H1 and H7 H. chilense accessions were selected to generate a H. chilense F2 population due to their morphological and ecophysiological differences (Hernández et al. 2001).
Although it is not easy to explain what has happened for each satDNA in each lineage for whether the reduction in the number of copies in one lineage and increase in the other lineage has occurred or vice versa, our results suggest that most of the differences are due to amplifications of some satellites versus others in each lineage. Although the library hypothesis does not foresee the emergence of new satDNAs, our data support the emergence of some new satDNAs in one or both accessions of H. chilense either from new random sequences (HchH7Sat22-364) or from pre-existing satDNAs (HchH1Sat02-92, HchH1Sat14-88, HchH1Sat21-332 and HchH1Sat24-1932) as occurred in other plant and animal species (Navajas-Pérez et al. 2005; Ruiz-Ruano et al. 2019; Sales-Oliveira et al. 2024).
The second most abundant satellite in H. chilense H1 is indeed one of those. HchH1Sat02-92 (GR2) is a 92 bp satellite. The homologous to this satellite in H7 accession (HchH7Sat06-46, GR2) is just half as long (46 bp), as a satellite of the same group found in H1 (HchH1Sat18-46). Interestingly, HchH1Sat02-92 is represented 30 times more in the H1 genome than HchH1Sat18-46 and up to 3 times more than HchH7Sat06-46 in H7. In addition, hardly any of its relics remain in H. vulgare (Table 4). These results suggest an enormous amplification in H1 of a new satellite that has emerged from the duplication and divergence of a shorter ancestral satellite. This is in agreement with longer-length satellites originated from different duplication and divergence cycles of shorter satellites of diverse random origin (Navajas-Pérez et al. 2005; Ruiz-Ruano et al. 2019; Sales-Oliveira et al. 2024), probably by unequal crossing-over (Smith 1976). However, it contrasts with the origin of satellites from other types of tandemly repetitive sequences such as ribosomal DNA (Stupar et al. 2002; Neumann et al. 2003; Jo et al. 2009; Almeida et al. 2012) or dispersed repetitive sequences such as the different types of transposable elements of eukaryotic genomes (Meštrović et al. 2015; Šatović-Vukšić and Plohl 2023), as we have found for several barley (Table S3) and wheat (Gálvez-Galván et al. 2024) satDNAs. As can be seen in Table 4, the longer satellite of GR2 group is practically replacing the shorter satellite in H1, the same as occurs between the 44 and 88-bp satellites of the GR10 group, although there the amplification of the longer satellite has not been so dramatic.
On the other side, the second most abundant satellite in H. vulgare (HvuSat02-444, GR25) is poorly represented in H. chilense and, in fact, was not detected in this species using the satDNA mining protocol. This satellite is very curious because its repetitive sequence has a structure that resembles the structure of a MITE (Miniature Inverted-repeat Transposable Element) since a sequence of 174 bp (in this case a non-coding sequence of unknown origin) is flanked by two imperfect inverted repeats (Figure S4). However, in this case, the flanking sequences, which in DNA transposons and MITEs are called TIR (Terminal Inverted Repeats), are composed of the repetition of perfect and degenerated GAA (TTC) trinucleotides, which distinguishes it from transposable elements. Notwithstanding, this structure suggests an amplification mechanism of this satellite that would mimic the amplification mechanism of other satellites from MITEs and DNA transposons (Meštrović et al. 2015). Indeed, our search using RepeatMasker revealed its homology with EnSpm elements (Table S3). A set of clones composed of repetitive sequences similar to HvuSat02-44 was identified by (Kato 2011) (Table S2): the sequence of clone pHv-1966, for example, consisting of two partial units of the HvSat02-444 sequence (not shown). The (GAA)n region of HvuSat02-444 is similar to the GAA satellite sequence, a 334-bp composed of perfect and degenerated GAA trinucleotides (Dennis et al. 1980; Pedersen et al. 1996), that accumulates in different loci at a certain distance in both sides of the centromere on all H. vulgare chromosomes (Szakács et al. 2013). It is possible that FISH probes of this satellite cross-hybridize with (GAA)n-rich sites and, vice versa. What is remarkable is that Repeat Landscapes plots of each of the two parts of this satellite support that the 174 bp non-repetitive part is an ancient satellite that in H. vulgare, but not in H. chilense, has been recently amplified after its assemblage between the (TTC)n/(GAA)n inverted repeats (Figure S5).
Barley satellitomes in the framework of breeding
The correct identification of each chromosomal pair in species of agronomic interest has been an important objective in recent years. Thus, in the case of barley, several markers, either derived from its genome or other grasses, have been used for this purpose. Several satDNAs such as pSc119.2 from rye (Bedbrook et al. 1980), the Afa family from wheat (Nagaki et al. 1995) or HvT01 from barley (Belostotsky and Ananiev 1990) are routinely used for such identification (Prieto et al. 2004b; Rey et al. 2018; Jouve et al. 2018) among other repetitive markers (Kato 2011). But, above all, it is the GAA sequence (Dennis et al. 1980; Pedersen et al. 1996) that has been most used for such identification and is the one we have used in this study (Figure S6). In barley, different types of microsatellites tend to accumulate in different regions, especially heterochromatin, forming loci visible by FISH, which tend to have different patterns between chromosomes and between different barley accessions, making them useful markers for chromosomal differentiation within and between Hordeum species (Cuadrado and Jouve 2007; Carmona et al. 2013). In this article, we provide the genomic information of the satellitome of H. vulgare and H. chilense with the analysis of 70 satellites from three different accessions that have revealed the existence of a total of 46 satDNA families, most of which had not been previously described and that provide a large number of new markers that differentiate chromosomes both within and between species as well as between the two accessions of H. chilense.
In this context, satDNAs are located in regions important for the correct homologous chromosome pairing during meiosis, most in the subtelomeric and peri(centromeric) regions. Centromeres have been extensively studied in barley (Houben et al. 2007). The functional centromere region of barley chromosomes is composed of an LTR/Gypsy-like retrotransposon conserved in several cereals, the centromeric retrotransposon (CR) cereba (Presting et al. 1998; Hudakova et al. 2001) and a short 6-bp species-specific GC-rich satellite (Hudakova et al. 2001; Nasuda et al. 2005) that we have not identified in this analysis. Only a fraction of the centromeric DNA is utilized in the kinetochore assembly (Houben et al. 2007). Thus, the pericentromeric region would also be composed of cereba elements and of the 6 bp GC-rich satellite as well as probably other sequences that we have found in this study. We have isolated a satellite, HvuSat15-1590, present in the (peri)centromeric region of all the chromosomes of H. vulgare (also present in six of the seven chromosomes of the two accessions of H. chilense in in situ hybridization experiments) which is not homologous to cereba but to retrotransposons of the LTR/Copia type (Table S3). The satellite HchH7Sat14-2790, related to LTR/Gypsy retrotransposons (Table S3), was identified in accession H7 of H. chilense and it also hybridizes (peri)centromeric to all chromosomes of both accessions (H1 and H7) of H. chilense. However, this satellite was not detected by FISH signal in H. vulgare chromosomes. In potato, several very long chromosome-specific satellites have been amplified from retrotransposon-related sequences (Gong et al. 2012; Zhang et al. 2014). Indeed, an association has been found between transposable elements and newly emerged (peri)centromeric satDNAs in several plant species (Gong et al. 2012; Sharma et al. 2013; Zhang et al. 2014; Meštrović et al. 2015; Vondrak et al. 2020). It has been proposed that centromere-specific satellites have originated during evolution by nested transposition (Sanmiguel and Bennetzen 1998; Hudakova et al. 2001) and that there would be a tendency to replace CRs by satellites (Langdon et al. 2000; Hudakova et al. 2001). In this context, HvuSat15-1590 and HchH7Sat14-2790 might provide new repetitive elements that could emerge as new dominant centromeric satellites in the future. The different organization of the HchH7Sat14-2790 satellite in H. vulgare and H. chilense (probably scattered in a few locations vs. accumulated (peri)centromerically) is very relevant in this regard. So is the emergence of HchH1Sat07-493/HchH7Sat07-484 homologs in the (peri)centromeric region of a chromosome pair of the two accessions of H. chilense but not in H. vulgare. These reflect the enormous plasticity and dynamic capacity of the (peri)centromeric region of barley and the set of satDNAs found in this study would provide the substrate for future DNA sequence replacements at centromeres.
Although some subtelomeric satellites such as those of GR1 are located in all chromosomes, in the two barley species there is great variability at the subtelomeric level. Such diversity is reflected in specific patterns of different satellites for each chromosome of both species which, as we have suggested for wheat (Calderón et al. 2014; Aguilar and Prieto 2020; Gálvez-Galván et al. 2024) and for barley itself (Serrano-León et al. 2023), may be very relevant for the correct pairing of chromosomes during meiosis which is also of interest in these species since in breeding the use of addition and substitution lines is very frequent. Cytogenetic studies of homologous pairing between a pair of H. chilense chromosomes lacking the subtelomeric region on one chromosome arm have demonstrated that the subtelomeric region is important for the process of homologous chromosome recognition and pairing (Calderón et al. 2014). In this work, we have identified several satDNAs in different barley subtelomeric chromosome regions that might suggest conserved functions of these DNA sequences in early meiosis. These subtelomeric satDNAs can contribute to shedding light on the putative role of these subtelomeric regions during chromosome recognition and pairing stages in barley meiosis.
Data availability
All raw data included in this work are available in the article.
References
Aguilar M, Prieto P (2020) Sequence analysis of wheat subtelomeres reveals a high polymorphism among homoeologous chromosomes. Plant Genome. https://doi.org/10.1002/tpg2.20065
Aguilar M, Prieto P (2021) Telomeres and subtelomeres dynamics in the context of early chromosome interactions during meiosis and their implications in plant breeding. Front Plant Sci. https://doi.org/10.3389/fpls.2021.672489
Almeida C, Fonsêca A, dos Santos KGB, Mosiolek M, Pedrosa-Harand A (2012) Contrasting evolution of a satellite DNA and its ancestral IGS rDNA in Phaseolus (Fabaceae). Genome 55:683–689. https://doi.org/10.1139/g2012-059
Baden C, von Bothmer R (1994) A taxonomìc revision of Hordeum sect. Critesion. Nord J Bot 14:117–136. https://doi.org/10.1111/j.1756-1051.1994.tb00579.x
Bedbrook JR, Jones J, O’Dell M, Thompson RD, Flavell RB (1980) A molecular description of telomeric heterochromatin in secale species. Cell 19:545–560. https://doi.org/10.1016/0092-8674(80)90529-2
Belostotsky DA, Ananiev EV (1990) Characterization of relic DNA from barley genome. Theor Appl Genet 80:374–380. https://doi.org/10.1007/BF00210075
Bennett MD, Smith JB (1976) Nuclear DNA amounts in angiosperms. Philos Trans R Soc Lond Ser B 274:227–274. https://doi.org/10.1098/rstb.1976.0044
Bilinski P, Distor K, Gutierrez-Lopez J, Mendoza GM, Shi J, Dawe RK, Ross-Ibarra J (2015) Diversity and evolution of centromere repeats in the maize genome. Chromosoma 124:57–65. https://doi.org/10.1007/s00412-014-0483-8
Blattner FR (2009) Progress in phylogenetic analysis and a new infrageneric classification of the barley genus Hordeum (Poaceae: Triticeae). Breed Sci 59:471–480. https://doi.org/10.1270/jsbbs.59.471
Blattner FR (2018) Taxonomy of the genus Hordeum and Barley (Hordeum vulgare) BT—the Barley genome. In: Muehlbauer GJ (ed) Stein N. Springer International Publishing, Cham, pp 11–23
Bolger AM, Lohse M, Usadel B (2014) Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120. https://doi.org/10.1093/bioinformatics/btu170
Bothmer R, Jacobsen N, Baden C, Jørgensen R, Linde-Laursen I (1995) An ecogeographical study of the genus Hordeum. 2nd edition. In: Systematic and ecogeographical studies on crop genepool. International Plant Genetic Resources Institute, Rome
Brown TA, Jones MK, Powell W, Allaby RG (2009) The complex origins of domesticated crops in the Fertile Crescent. Trends Ecol Evol 24:103–109. https://doi.org/10.1016/j.tree.2008.09.008
Cabrera A, Martín A, Barro F (2002) In-situ comparative mapping (ISCM) of Glu-1 loci in Triticum and Hordeum. Chromosom Res 10:49–54. https://doi.org/10.1023/A:1014270227360
Calderón MC, Ramírez MC, Martín A, Prieto P (2012) Development of Hordeum chilense 4Hch introgression lines in durum wheat: a tool for breeders and complex trait analysis. Plant Breed 131:733–738. https://doi.org/10.1111/j.1439-0523.2012.02010.x
Calderón MDC, Rey MD, Cabrera A, Prieto P (2014) The subtelomeric region is important for chromosome recognition and pairing during meiosis. Sci Rep 4:1–6. https://doi.org/10.1038/srep06488
Calderón MC, Rey MD, Martín A, Prieto P (2018) Homoeologous chromosomes from two hordeum species can recognize and associate during meiosis in wheat in the presence of the Ph1 locus. Front Plant Sci. https://doi.org/10.3389/fpls.2018.00585
Carmona A, Friero E, de Bustos A, Jouve N, Cuadrado A (2013) Cytogenetic diversity of SSR motifs within and between Hordeum species carrying the H genome: H. vulgare L. and H. bulbosum L. Theor Appl Genet 126:949–961. https://doi.org/10.1007/s00122-012-2028-y
Cuadrado A, Jouve N (2007) Similarities in the chromosomal distribution of AG and AC repeats within and between Drosophila, human and barley chromosomes. Cytogenet Genome Res 119:91–99. https://doi.org/10.1159/000109624
Dennis ES, Gerlach WL, Peacock WJ (1980) Identical polypyrimidine-polypurine satellite DNAs in wheat and barley. Heredity (Edinb) 44:349–366. https://doi.org/10.1038/hdy.1980.33
Doležel J, Čížková J, Šimková H, Bartoš J (2018) One major challenge of sequencing large plant genomes is to know how big they really are. Int J Mol Sci 19:3554
Forster BP, Phillips MS, Miller TE, Baird E, Powell W (1990) Chromosome location of genes controlling tolerance to salt (NaCl) and vigour in Hordeum vulgare and H. chilense. Heredity (Edinb) 65:99–107. https://doi.org/10.1038/hdy.1990.75
Fry K, Salser W (1977) Nucleotide sequences of HS-α satellite DNA from kangaroo rat dipodomys ordii and characterization of similar sequences in other rodents. Cell 12:1069–1084. https://doi.org/10.1016/0092-8674(77)90170-2
Gálvez-Galván A, Garrido-Ramos MA, Prieto P (2024) Bread wheat satellitome: a complex scenario in a huge genome. Plant Mol Biol 114:8. https://doi.org/10.1007/s11103-023-01404-x
Garrido-Ramos MA (2015) Satellite DNA in plants: more than just rubbish. Cytogenet Genome Res 146:153–170. https://doi.org/10.1159/000437008
Garrido-Ramos MA (2017) Satellite DNA: an evolving topic. Genes (Basel). https://doi.org/10.3390/genes8090230
Garrido-Ramos MA (2021) The genomics of plant satellite DNA. Prog Mol Subcell Biol 60:103–143
Gong Z, Wu Y, Koblížková A, Torres GA, Wang K, Iovene M, Neumann P, Zhang W, Novák P, Robin Buell C, Macas J, Jianga J (2012) Repeatless and repeat-based centromeres in potato: implications for centromere evolution. Plant Cell 24:3559–3574. https://doi.org/10.1105/tpc.112.100511
Guerra AP, Calvo E, Wasserman M, Chaparro-Olaya J (2016) Producción de proteínas recombinantes de Plasmodium falciparum en Escherichia coli. Biomedica 36:97–108
Hernández P, Dorado G, Prieto P, Giménez MJ, Ramírez MC, Laurie DA, Snape JW, Martín A (2001) A core genetic map of Hordeum chilense and comparisons with maps of barley (Hordeum vulgare) and wheat (Triticum aestivum). Theor Appl Genet 102:1259–1264. https://doi.org/10.1007/s001220000514
Houben A, Schroeder-Reiter E, Nagaki K, Nasuda S, Wanner G, Murata M, Endo TR (2007) CENH3 interacts with the centromeric retrotransposon cereba and GC-rich satellites and locates to centromeric substructures in barley. Chromosoma 116:275–283. https://doi.org/10.1007/s00412-007-0102-z
Hudakova S, Michalek W, Presting GG, ten Hoopen R, dos Santos K, Jasencakova Z, Schubert I (2001) Sequence organization of barley centromeres. Nucleic Acids Res 29:5029–5035. https://doi.org/10.1093/nar/29.24.5029
Islam AKMR, Shepherd KW, Sparrow DHB (1978) Production and characterization of wheat-barley addition lines. In: Proceedings of 5th international wheat genetics symposium (New Delhi). pp 365–371
Islam AKMR, Shepherd KW, Sparrow DHB (1981) Isolation and characterization of euplasmic wheat-barley chromosome addition lines. Heredity (Edinb) 46:161–174. https://doi.org/10.1038/hdy.1981.24
Jo S-H, Koo D-H, Kim JF, Hur C-G, Lee S, Yang T, Kwon S-Y, Choi D (2009) Evolution of ribosomal DNA-derived satellite repeat in tomato genome. BMC Plant Biol 9:42. https://doi.org/10.1186/1471-2229-9-42
Jouve N, Carmona A, De Bustos A, Cuadrado A (2018) The phylogenetic relationships of species and cytotypes in the genus Hordeum based on molecular karyotyping. Curr Res Phylogenet Evol Biol. https://doi.org/10.29011/CRPEB
Kato A (2011) High-density fluorescence in situ hybridization signal detection on barley (Hordeum vulgare L.) chromosomes with improved probe screening and reprobing procedures. Genome 54:151–159. https://doi.org/10.1139/G10-098
Kruppa K, Türkösi E, Szakács É, Cseh A, Molnár-Láng M (2013) Development and identification of a 4HL.5DL wheat/barley centric fusion using GISH, FISH and SSR markers. Cereal Res Commun 41:221–229. https://doi.org/10.1556/CRC.2012.0038
Langdon T, Seago C, Mende M, Leggett M, Thomas H, Forster JW, Thomas H, Jones RN, Jenkins G (2000) Retrotransposon evolution in diverse plant genomes. Genetics 156:313–325. https://doi.org/10.1093/genetics/156.1.313
Martin A, Martin L, Ballesteros J (1998) The potential of Hordeum chilense in breeding Triticeae species. Science Publishers Inc., Enfield, pp 377–386
Martín A, Alvarez JB, Martin LM, Barro F, Ballesteros J (1999) The development of Tritordeum: a novel cereal for food processing. J Cereal Sci 30:85–95. https://doi.org/10.1006/jcrs.1998.0235
Martín AC, Atienza SG, Ramírez MC, Barro F, Martín A (2010) Molecular and cytological characterization of an extra acrocentric chromosome that restores male fertility of wheat in the msH1 CMS system. Theor Appl Genet 121:1093–1101. https://doi.org/10.1007/s00122-010-1374-x
Mascher M, Wicker T, Jenkins J, Plott C, Lux T, Koh CS, Ens J, Gundlach H, Boston LB, Tulpová Z, Holden S, Hernández-Pinzón I, Scholz U, Mayer KFX, Spannagl M, Pozniak CJ, Sharpe AG, Šimková H, Moscou MJ, Grimwood J, Schmutz J, Stein N (2021) Long-read sequence assembly: a technical evaluation in barley. Plant Cell 33:1888–1906. https://doi.org/10.1093/plcell/koab077
Meštrović N, Mravinac B, Pavlek M, Vojvoda-Zeljko T, Šatović E, Plohl M (2015) Structural and functional liaisons between transposable elements and satellite DNAs. Chromosom Res 23:583–596. https://doi.org/10.1007/s10577-015-9483-7
Miller TE, Reader SM, Chapman V (1982) The addition of Hordeum chilense chromosomes to wheat [Phenotypes]. Induced variability in plant breeding : international symposium/of the section Mutation and Polyploidy of the European Association for Research on Plant Breeding, Wageningen, August 31–September 4, 1981/Chairman, C. Broertjes
Murray MG, Thompson WF (1980) Rapid isolation of high molecular weight plant DNA. Nucleic Acids Res 8:4321–4326. https://doi.org/10.1093/NAR/8.19.4321
Nagaki K, Tsujimoto H, Isono K, Sasakuma T (1995) Molecular characterization of a tandem repeat, Afa family, and its distribution among Triticeae. Genome 38:479–486. https://doi.org/10.1139/g95-063
Naranjo T (2015) Contribution of structural chromosome mutants to the study of meiosis in plants. Cytogenet Genome Res 147:55–69. https://doi.org/10.1159/000442219
Nasuda S, Hudakova S, Schubert I, Houben A, Endo TR (2005) Stable barley chromosomes without centromeric repeats. Proc Natl Acad Sci USA 102:9842–9847. https://doi.org/10.1073/pnas.0504235102
Navajas-Pérez R, la de Herrán R, Jamilena M, Lozano R, Rejón CR, Rejón MR, Garrido-Ramos MA (2005) Reduced rates of sequence evolution of Y-linked satellite DNA in Rumex (polygonaceae). J Mol Evol 60:391–399. https://doi.org/10.1007/s00239-004-0199-0
Navrátilová P, Toegelová H, Tulpová Z, Kuo Y-T, Stein N, Doležel J, Houben A, Šimková H, Mascher M (2022) Prospects of telomere-to-telomere assembly in barley: analysis of sequence gaps in the MorexV3 reference genome. Plant Biotechnol J 20:1373–1386. https://doi.org/10.1111/pbi.13816
Neumann P, Požárková D, Macas J (2003) Highly abundant pea LTR retrotransposon Ogre is constitutively transcribed and partially spliced. Plant Mol Biol 53:399–410. https://doi.org/10.1023/B:PLAN.0000006945.77043.ce
Novák P, Neumann P, Pech J, Steinhaisl J, Macas J (2013) RepeatExplorer: a galaxy-based web server for genome-wide characterization of eukaryotic repetitive elements from next-generation sequence reads. Bioinformatics 29:792–793. https://doi.org/10.1093/bioinformatics/btt054
Novák P, Ávila Robledillo L, Koblížková A, Vrbová I, Neumann P, Macas J (2017) TAREAN: a computational tool for identification and characterization of satellite DNA from unassembled short reads. Nucleic Acids Res 45:e111–e111. https://doi.org/10.1093/nar/gkx257
Novák P, Neumann P, Macas J (2020) Global analysis of repetitive DNA from unassembled sequence reads using RepeatExplorer2. Nat Protoc 15:3745–3776. https://doi.org/10.1038/s41596-020-0400-y
Pedersen C, Langridge P (1997) Identification of the entire chromosome complement of bread wheat by two-colour FISH. Genome 40:589–593. https://doi.org/10.1139/G97-077
Pedersen C, Rasmussen SK, LindeLaursen I (1996) Genome and chromosome identification in cultivated barley and related species of the Triticeae (Poaceae) by in situ hybridization with the GAA-satellite sequence. Genome 39:93–104. https://doi.org/10.1139/g96-013
Presting GG, Malysheva L, Fuchs J, Schubert I (1998) A Ty3/gypsy retrotransposon-like sequence localizes to the centromeric regions of cereal chromosomes. Plant J 16:721–728. https://doi.org/10.1046/j.1365-313x.1998.00341.x
Prieto P, Ramíarez MC, Ballesteros J, Cabrera A (2001) Identification of intergenomic translocations involving wheat, Hordeum vulgare and Hordeum chilense chromosomes by FISH. Hereditas 135:171–174. https://doi.org/10.1111/j.1601-5223.2001.t01-1-00171.x
Prieto P, Martín A, Martín M, Cabrera A, Martín A, Cabrera A (2004a) Chromosomal distribution of telomeric and telomeric-associated sequences in Hordeum chilense by in situ hybridization. Hereditas. https://doi.org/10.1111/j.1601-5223.2004.01825.x
Prieto P, Shaw P, Moore G (2004b) Homologue recognition during meiosis is associated with a change in chromatin conformation. Nat Cell Biol 6:906–908. https://doi.org/10.1038/ncb1168
Rayburn AL, Gill BS (1986) Isolation of a D-genome specific repeated DNA sequence fromAegilops squarrosa. Plant Mol Biol Rep 4:102–109. https://doi.org/10.1007/BF02732107
Rey MD, Moore G, Martín AC, Scoles G (2018) Identification and comparison of individual chromosomes of three accessions of Hordeum chilense, Hordeum vulgare, and Triticum aestivum by FISH. Genome 61:387–396. https://doi.org/10.1139/gen-2018-0016
Rubiales D, Reader SM, Martín A (2000) Chromosomal location of resistance to Septoria tritici in Hordeum chilense determined by the study of chromosomal addition and substitution lines in “Chinese Spring” wheat. Euphytica 115:221–224. https://doi.org/10.1023/A:1004097830103
Ruiz-Ruano FJ, López-León MD, Cabrero J, Camacho JPM (2016) High-throughput analysis of the satellitome illuminates satellite DNA evolution. Sci Rep 6:28333. https://doi.org/10.1038/srep28333
Ruiz-Ruano FJ, Navarro-Domínguez B, Camacho JPM, Garrido-Ramos MA (2019) Characterization of the satellitome in lower vascular plants: the case of the endangered fern Vandenboschia speciosa. Ann Bot 123:587–599. https://doi.org/10.1093/aob/mcy192
Sales-Oliveira VC, dos Santos RZ, Goes CAG, Calegari RM, Garrido-Ramos MA, Altmanová M, Ezaz T, Liehr T, Porto-Foresti F, Utsunomia R, Cioffi MB (2024) Evolution of ancient satellite DNAs in extant alligators and caimans (Crocodylia, Reptilia). BMC Biol 22:47. https://doi.org/10.1186/s12915-024-01847-8
Sanmiguel P, Bennetzen JL (1998) Evidence that a recent increase in maize genome size was caused by the massive amplification of intergene retrotransposons. Ann Bot 82:37–44. https://doi.org/10.1006/anbo.1998.0746
Šatović-Vukšić E, Plohl M (2023) Satellite DNAs—from localized to highly dispersed genome components. Genes (Basel) 14
Schmieder R, Edwards R (2011) Fast identification and removal of sequence contamination from genomic and metagenomic datasets. PLoS ONE 6:e17288
Serrano-León IM, Prieto P, Aguilar M (2023) Telomere and subtelomere high polymorphism might contribute to the specificity of homologous recognition and pairing during meiosis in barley in the context of breeding. BMC Genomics 24:642. https://doi.org/10.1186/s12864-023-09738-y
Sharma A, Wolfgruber TK, Presting GG (2013) Tandem repeats derived from centromeric retrotransposons. BMC Genomics 14:142. https://doi.org/10.1186/1471-2164-14-142
Smit A, Hubley R, Green P (2015) RepeatMasker Open-4.0. http://www.repeatmasker.org
Smith GP (1976) Evolution of repeated DNA sequences by unequal crossover. Science (80-) 191:528–535. https://doi.org/10.1126/science.1251186
Stupar RM, Song J, Tek AL, Cheng Z, Dong F, Jiang J (2002) Highly condensed potato pericentromeric heterochromatin contains rDNA-related tandem repeats. Genetics 162:1435–1444. https://doi.org/10.1093/genetics/162.3.1435
Szakács É, Kruppa K, Molnár-Láng M (2013) Analysis of chromosomal polymorphism in barley (Hordeum vulgare L. ssp. vulgare) and between H. vulgare and H. chilense using three-color fluorescence in situ hybridization (FISH). J Appl Genet 54:427–433. https://doi.org/10.1007/s13353-013-0167-8
Thormann I, Reeves P, Reilley A, Engels JMM, Lohwasser U, Börner A, Pillen K, Richards CM (2016) Geography of Genetic Structure in Barley Wild Relative Hordeum vulgare subsp. spontaneum in Jordan. PLoS ONE 11:e0160745
Vondrak T, Robledillo LÁ, Novák P, Koblížková A, Neumann P, Macas J (2020) Characterization of repeat arrays in ultra-long nanopore reads reveals frequent origin of satellite DNA from retrotransposon-derived tandem repeats. Plant J 101:484–500. https://doi.org/10.1111/tpj.14546
Zhang J, Zhang B, Su H, Birchler JA, Han F (2014) Molecular mechanisms of homologous chromosome pairing and segregation in plants. J Genet Genomics 41(3):117–123
Acknowledgements
This research was supported by PID2019-103996RB-I00 grant from the MCIN/AEI/https://doi.org/10.13039/501100011033 from Ministerio de Ciencia e Innovación and Qualifica Project QUAL21_023 IAS from Consejería de Transformación Económica, Industria, Conocimiento y Universidades/Cofinanciación and Programa Operativo FEDER de Andalucía 2014-2020. The first author also thanks Spanish MCIN for her PhD PRE2020-094798 grant. We are highly indebted to Dr. Beatriz Navarro-Domínguez and to Dr. Francisco J. Ruiz-Ruano for support using the satMiner pipeline, and to Ángel Martín Alganza, who helped us managing the computer server. We also thank Borja Rojas Panadero for his support with the management of BLAST search results.
Funding
Open Access funding provided thanks to the CRUE-CSIC agreement with Springer Nature. Funding was provided by Ministerio de Ciencia e Innovación (PID2019-103996RB-I00) and Consejería de Transformación Económica, Industria, Conocimiento y Universidades (QUAL21_023 IAS).
Author information
Authors and Affiliations
Contributions
M.A.G.R. and P.P. conceived the work. A.G.G., L.B. and M.A.G.R. performed the experiments. M.A.G.R. and P.P. wrote the manuscript. The final version of the manuscript before submission was revised and approved by all authors.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Gálvez-Galván, A., Barea, L., Garrido-Ramos, M.A. et al. Highly divergent satellitomes of two barley species of agronomic importance, Hordeum chilense and H. vulgare. Plant Mol Biol 114, 108 (2024). https://doi.org/10.1007/s11103-024-01501-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11103-024-01501-5