Hostname: page-component-6766d58669-88psn Total loading time: 0 Render date: 2026-05-21T03:34:05.988Z Has data issue: false hasContentIssue false

Structured, inbred, and plastic: the genome and population genetics of the weed false cleavers (Galium spurium)

Published online by Cambridge University Press:  30 October 2024

Sara L. Martin*
Affiliation:
Research Scientist, Ottawa Research and Development Centre, Agriculture and Agri-Food Canada, Ottawa, Canada
Elizabeth Sears
Affiliation:
Research Technician, Ottawa Research and Development Centre, Agriculture and Agri-Food Canada, Ottawa, Canada
Tracey James
Affiliation:
Research Technician, Ottawa Research and Development Centre, Agriculture and Agri-Food Canada, Ottawa, Canada
Shaun Sharpe
Affiliation:
Research Scientist, Saskatoon Research and Development Centre, Agriculture and Agri-Food Canada, Saskatoon, Canada
Breanne D. Tidemann
Affiliation:
Research Scientist, Lacombe Research and Development Centre, Agriculture and Agri-Food Canada, Lacombe, Canada
*
Corresponding author: Sara L. Martin; Email: sara.martin@agr.gc.ca
Rights & Permissions [Opens in a new window]

Abstract

False cleavers (Galium spurium L.) is an aggressive weed from the Rubiaceae. Here we assemble a chromosome-scale draft of its genome, laying the foundations for determining the genetic basis of auxinic herbicide resistance and for systematic research into its polyphyletic genus. We use the genome to examine the population genetics of material from the Canadian Prairies and, in concert with a common greenhouse experiment, to examine whether the phenotypic variation observed in the field results primarily from genetic or environmental factors. The genome assembly covers approximately 85% of G. spurium’s expected 360-Mbp genome size, with 94% of BUSCO (Benchmarking Universal Single-Copy Orthologs) genes complete and most single copy (89%). Approximately 37% of the genome is repetitive elements and 35,540 genes were annotated using RNA-seq data, including 100 homologues for genes involved in, or potentially involved in, herbicide resistance. The genome shows strong synteny with other members of the Rubiaceae, including smooth bedstraw (Cruciata laevipes Opiz) and robusta coffee [Coffea canephora (Pierre ex Froehner]. Double-digested RAD-seq data for the 19 populations from the Canadian Prairies indicated that G. spurium has high levels of population structure (FST = 0.54) and inbreeding (FIS = 0.86) with low levels of hetrozygosity (HO = 0.02) and nucleotide diversity (π = 0.0003). Variation in flowering time and seed weight largely overlapped among populations grown in the greenhouse. A redundancy analysis investigating genotype–phenotype associations showed few associations between single-nucleotide polymorphism (SNP) variation and these characteristics. In contrast, the majority of SNPs under selection were associated with mericarp hook density. This suggests that for most traits, environmental variation rather than genetic variation likely underlies phenotypic differences observed in the field. Several genes of interest, including several homologues involved in the assembly of the Skp1-Cullin-F-Box IR1/AFB E3 ubiquitin ligase complex (e.g., CAND1, ECR1), are located in areas of the genome with evidence of selection and are targets for further investigation.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© Crown Copyright - His Majesty the King in Right of Canada, as represented by the Minister of Agriculture and Agri-Food Canada, 2024. Published by Cambridge University Press on behalf of Weed Science Society of America
Figure 0

Figure 1. Map of the Canadian Prairie provinces Albert and Saskatchewan indicating locations for sample populations collected for use in this research, with latitude indicated on the left and longitude indicated along the bottom.

Figure 1

Figure 2. The three visual categories scored for hook density on the mericarps of Galium spurium: (A) naked, (B) sparsely hooked, and (C) densely hooked.

Figure 2

Table 1. Genome assembly statistics for Galium spurium.

Figure 3

Figure 3. Circle plot of the Galium spurium genome with (A) gene density, (B) single-nucleotide polymorphism (SNP) density, (C) heterozygosity, (D) alternative allelic frequency, and (E) long terminal repeat frequency. An indication of chromosome length is plotted along the outside edge.

Figure 4

Figure 4. A phylogenetic tree indicating the evolutionary relationships among species studied here produced by OrthoFinder. The values at the nodes indicate support calculated by OrthoFinder using the Species Tree Inference from All Genes (STAG) method and indicates the proportion of orthogroup trees that had the bipartition. Galium, Cruciata, and Sherardia are all within the tribe Rubieae (Rubioideae), while Leptodermis is in the tribe Paederieae (Rubioideae), and Coffea canephora is in a different subfamily, Ixoroideae. Asclepias syriaca is an outgroup from the Apocynaceae.

Figure 5

Figure 5. A riparian plot of syntenic relationships among species from the Rubiaceae generated by GENESPACE showing genomic structure across the family, with Coffea canephora at the bottom and working upward: Leptodermis oblonga, Sherardia arvensis, Cruciata laevipes, and finally Galium spurium at the top of the plot. Asterisks indicate that the chromosome was inverted to clarify the plot. Note that C. canephora’s cc09 has been shifted to better indicate its position within the chromosomes of the Rubiaceae species.

Figure 6

Figure 6. Plots of synteny between proteins of Galium spurium and (A) Cruciata laevipes and (B) Coffea canephora. Red dots indicate the best syntenic matches and blue dots indicate secondary matches, with gray dots indicating lower-order matches. For G. spurium and C. laevipes, chromosomes are colored based on syntenic relationships with C. canephora. We see from these plots that several chromosomes are almost entirely syntenic (e.g., gs06, gs07, and gs10), but there is no clear pattern suggesting that end-to-end joining or nested chromosome fusion led to the reduction in chromosome number in G. spurium from n = 11 to n = 10.

Figure 7

Figure 7. For the 19 populations studied here (A) overall proportion of individuals showing homozygosity for the major allele (blue), heterozygosity (purple), and homozygosity (magenta) for the minor allele; (B) levels of heterozygosity by population; and (C) a heat map of pairwise FST values for with cooler colors indicating lower FST values and warmer indicating higher FST values. Heterozygosity was low across all populations with the majority of individuals (83%) homozygous for the reference allele with evidence of limited outcrossing. On average, the FST values were 0.54, but AB-08 showed lower pairwise FST values across comparisons and SK-05 showed higher values.

Figure 8

Figure 8. Provesti’s genetic distance (y axis) by geographic (landscape) distance (m) for pairings of the 19 populations. This graph (graph41g::scatter_dist) shows a smooth (loess) line with 95% confidence interval. A Mantel test with 1,000 Monte Carlo repetitions indicates a significant (P < 0.001) positive relationship providing evidence of isolation by distance.

Figure 9

Figure 9. Principal component analysis (PCA) of the genetic variation of single-nucleotide polymorphism (SNP) data. Circles in warm colors represent individuals from Alberta and triangles in cool colors represent those from Saskatchewan. Clumping of individuals from the same population is evident, but variation among populations provides spread. Axis1 explains 76% of the variation, while axis 2 explains 44%.

Figure 10

Figure 10. Phenological data from the 19 populations grown in the greenhouse, including (A) proportion of seed germinated after 7 d, (B) proportion flowering at 6 wk, and (C) proportion of plants with seed at 8 wk. In all cases, statistical differences are observed, but the majority of populations are highly variable and show strong overlap with values from the other populations.

Figure 11

Figure 11. Morphological data from the 19 populations grown in the greenhouse for key characteristics, including (A) plant height at 6 wk, (B) total estimated seed count, and (C) seed (mericarp) diameter.

Figure 12

Figure 12. Redundancy analysis (RDA) examining how variation in the single-nucleotide polymorphism (SNP) data is associated with variation in key traits. A total of 5.5% of the variation in the SNP data was explained by the morphological characteristics included. The majority of highly loading SNPs (70%), with a cutoff of 3 SDs from the mean, were associated with hook density and are highlighted in red. Most of the remaining SNPs are associated with total seed weight (26%) and highlighted in purple, while the one SNP associated with flowering time is in yellow.

Figure 13

Figure 13. Manhattan plots of almost 4,000 single-nucleotide polymorphisms (SNPs) across the genome, with 192 SNPs showing evidence of selection after the Benjamini-Hochberg procedure was used to adjust significance value to control false discovery, the rate was set to 1%, and SNPs were thinned to reduce the effects of linkage disequilibrium, highlighted in red. The green bar at the top of each chromosome plot indicates gene density, with darker areas annotated with more genes. The locations of genes with proteins that are homologues to known or potential contributors to herbicide resistance are shown on the x-axes.

Figure 14

Table 2. Top 20 homologues of genes with known or potential roles in herbicide resistance ranked by proximity to single-nucleotide polymorphisms (SNPs) under selection.a

Supplementary material: File

Martin et al. supplementary material

Martin et al. supplementary material
Download Martin et al. supplementary material(File)
File 162 KB