Research Article |
Corresponding author: Russell S. Pfau ( rspfau@proton.me ) Academic editor: Jack Neff
© 2024 Jessica L. Beckham, Jeff A. Johnson, Russell S. Pfau.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Beckham JL, Johnson JA, Pfau RS (2024) Molecular data support Bombus sonorus and Bombus pensylvanicus (Hymenoptera, Apidae) as distinct species. Journal of Hymenoptera Research 97: 895-914. https://doi.org/10.3897/jhr.97.132937
|
Despite their distinctive banding patterns, setal coloration, and geographic ranges, the Sonoran bumble bee (Bombus sonorus Say, 1837) and the American bumble bee (Bombus pensylvanicus De Geer, 1773) are often treated as conspecific, with some authorities ranking B. sonorus as a subspecies of B. pensylvanicus. This lack of taxonomic clarity creates challenges, particularly for population monitoring and conservation initiatives. In this study, we used genetic analyses to assess the potential for clinal variation, ongoing hybridization, and historical introgression between B. pensylvanicus and B. sonorus within a broad area of sympatry across the state of Texas. Double digest restriction associated DNA sequencing (ddRADseq) was performed on 166 specimens (58 B. sonorus and 108 B. pensylvanicus), and a portion of the mitochondrial COI gene was sequenced for a subset of the specimens from Texas (46 B. sonorus and 32 B. pensylvanicus) and eight specimens of B. sonorus from California. An additional five sequences of B. pensylvanicus from Georgia and Florida were obtained from Genbank and BOLD along with one each of B. transversalis (Olivier, 1789), B. mexicanus Cresson, 1879, B. medius Cresson, 1863 and B. fervidus (Fabricius, 1798), and B. mesomelas Gerstäcker, 1869. For both genetic datasets (nuclear ddRADseq and mtDNA COI), individuals formed two distinct groups concordant with species identification based on setal coloration. We found no evidence supporting a clinal pattern of variation, ongoing hybridization, or historical introgression within the study area and conclude that B. sonorus should be recognized as a species under the Biological Species Concept.
Bombus pensylvanicus, Bombus sonorus, bumble bee, COI, RADseq, sympatry
Bombus sonorus Say, 1837, commonly referred to as the Sonoran bumble bee, has a contentious taxonomic history. First described from Mexico (Say 1837), B. sonorus continued to be recognized as a species by
Some populations of B. pensylvanicus and B. sonorus are in decline (
Genetic data can provide evidence to help resolve taxonomic uncertainty when morphological differences are ambiguous. Mitochondrial DNA sequences have been used to identify and delineate species among many taxonomic groups. However, the value of mtDNA sequence analysis is limited because it represents a single locus; moreover, mtDNA sequences may have a different evolutionary history relative to nuclear loci because of their maternal inheritance pattern (
To clarify whether B. sonorus should be considered a distinct species, we generated ddRADseq and mtDNA cytochrome c oxidase subunit I (COI) gene sequence datasets for B. sonorus and B. pensylvanicus individuals collected across a broad area of sympatry in Texas. We sought to determine if the genetic data were concordant with species identification based on setal coloration and assessed the evidence for clinal variation, hybridization, and historical introgression.
For purposes of this study, we visually identified B. sonorus and B. pensylvanicus based on setal coloration (banding pattern and shade of yellow) following the descriptions of
To determine the extent of geographic overlap of B. pensylvanicus and B. sonorus within the study area (Texas), we examined photographic documentation of occurrence data on the iNaturalist platform (www.inaturalist.org). We filtered iNaturalist observations by species and location (i.e., filtered for B. sonorus in Texas and B. pensylvanicus in Texas) and then verified species identifications for all observations using the identification criteria described above. Observations were only included in our dataset if the photograph quality allowed for species identification. We then downloaded these verified observations from the Global Biodiversity Information Facility (https://www.gbif.org) and mapped each locality using QGIS (
To gather specimens for genetic analyses, we surveyed for bumble bees during July – September, 2016 to coincide with times of peak bumble bee activity. We identified target areas for surveys within the projected area of sympatry in Texas; these target areas were prioritized for collection trips, but we also acquired bees opportunistically from other areas of the state. Most collection events took place along public roadsides, and survey locations were chosen based on accessibility and the presence of flowering plants. Bees were collected using an aerial net and placed in a mobile cooler for transport to the lab where they were maintained at -80 °C.
We collected 166 bumble bees from 113 sites in 66 Texas counties (Fig.
Occurrence of B. pensylvanicus (larger black circles) and B. sonorus (smaller yellow circles) within Texas based on iNaturalist observations (A) locations sampled for this study (B) showing locations with B. pensylvanicus (black circles), B. sonorus (yellow circles), and both species (half black and half yellow circles) as identified by ddRADseq. White dots over colored circles are locations from which mtDNA sequences were also obtained. Orange line indicates the apparent eastern- and southern-most extent of B. sonorus (based on iNaturalist data) for ease of comparison between the two figures.
DNA was extracted from either the head and thorax or all legs of each specimen in sterile conditions with QIAGEN® DNeasy Blood and Tissue kits using the manufacturer’s protocol for DNA extraction from insect tissue, with minor adjustments. Specifically, body parts were removed from each individual bee using a sterile razor blade, and then chopped into smaller pieces. Tissue was placed in a 1.7 μl microcentrifuge tube and homogenized with a disposable micropestle prior to adding the tissue lysis buffer. Two final elutions, the first of which (Q1) was 200 μl and the second (Q2) 100 μl, were performed for each sample. DNA concentrations were determined using a Qubit™ Fluorometer. In the case of samples whose Q1 DNA concentrations were lower than 10 ng/ul, Q1 and Q2 were combined and concentrated using a Thermo Savant SpeedVac Concentrator. Samples (Q1 or concentrated Q1 + Q2) were then prepared for sequencing in volumes that yielded approximately 2 ng of DNA.
Genomic sequence data were generated using a reduced-representation genomic sequencing approach (ddRADseq) following methods described in
For ipyrad, default parameters were specified unless otherwise noted for each of the seven steps used in the pipeline. Individual reads were filtered based on a minimum Phred quality score of 20 and a maximum number of Ns in a read set to 5. Reads were then de-replicated, followed by within sample de novo clustering using VSEARCH (
In order to assign individuals to clusters and identify potential admixture between B. pensylvanicus and B. sonorus, we performed a principal components analysis (PCA), conducted STRUCTURE analyses (a Bayesian clustering method), and created a NeighborNet network. PCA was performed using R (
For a subset of the Texas specimens (and eight B. sonorus from California), we amplified a portion of the mitochondrial cytochrome c oxidase subunit I (COI) gene using polymerase chain reaction (PCR) and sequenced (forward and reverse) the resulting PCR products using automated Sanger dideoxy chain termination sequencing. This dataset included 77 of the 166 Texas specimens (45 B. sonorus and 32 B. pensylvanicus) because COI failed to amplify or sequence in all 166 specimens, and the eight B. sonorus from California (Genbank accession numbers PP840056–PP840059). We carried out PCRs and sequencing reactions using the primers LCO1490 and HCO2198 (
In addition to sequences that we generated, we obtained five sequences of B. pensylvanicus from BOLD (http://www.barcodinglife.org; Georgia: BBBEE941-11 and BBBEE939-11; Florida: BBHYA070-12, BBHYA087-12, and BBHYA069-12) and one each of B. transversalis (Olivier, 1789), B. mexicanus Cresson, 1879, B. medius Cresson, 1863 and B. fervidus (Fabricius, 1798) from Genbank (www.ncbi.nlm.nih.gov/Genbank; KJ848950, MG279409, KC853364, and MG448456).
We aligned sequences using ClustalW within the program MEGA-X (
The iNaturalist occurrence data included 2,008 B. sonorus (GBIF.org 2024a; retrieved on 20 April 2024 and included observations spanning the years 2005–2024) and 11,984 B. pensylvanicus (GBIF.org 2024b; retrieved on 20 April 2024 and included observations spanning the years 1972–2024 [only two observations predated 2004]). iNaturalist records documented the occurrence of B. pensylvanicus state-wide and were largely consistent with
We obtained an average of 4.2 million paired-end reads per sample after q-score and adapter filtering. Sequencing reads are available on the NCBI Short Reads Archive (BioProject ID PRJNA1115193). After de novo assembly and subsequent filtering using ipyrad, 17,352 loci were identified. The final dataset consisted of 3,371 loci after further filtering using VCFtools. We conducted all subsequent analyses using this final dataset.
The PCA showed two well-separated clusters of individuals (Fig.
Of the 90 COI sequences analyzed, we identified five COI haplotypes: one from B. sonorus and 4 from B. pensylvanicus (Fig.
Bayesian phylogenetic tree of individuals (A) and median-joining haplotype network (B) of Bombus based on COI mtDNA sequences. The tree was rooted with B. mesomelas. Numbers along branches in the phylogenetic tree indicate support (posterior probability; provided only for major clades). Within the network, sizes of circles are proportional to haplotype frequency and lines across branches represent mutational steps between haplotypes. The color of circles in the network indicates species in which the haplotype was found.
We evaluated species boundaries of the taxonomically controversial bumble bees B. sonorus and B. pensylvanicus within a broad area of sympatry using two genetic datasets (nuclear ddRADseq and mitochondrial COI) combined with examination of setal coloration (banding pattern and shade of yellow). To aid in interpreting our genetic data, we documented the extent of geographic overlap of B. sonorus and B. pensylvanicus using occurrence data from the iNaturalist platform. While the iNaturalist dataset is limited to presence data, we relied on these data because there are numerous records documented within the past two decades, each of which can be assessed visually to verify correct species identification. We found the range of B. sonorus documented in iNaturalist to be broader than that reported by either
All of our analyses were consistent in providing evidence that B. sonorus and B. pensylvanicus are genetically distinct across the documented area of sympatry in Texas. We found no evidence of a pattern of clinal variation, hybridization, or historical introgression within the broad area of sympatry sampled for this project (Texas). If these color forms represent a single species, or if extensive interbreeding occurred, we would have expected one of three patterns: (1) a single genetic group, (2) individuals not grouping concordant with coloration, or (3) individuals having mixed membership to more than one cluster (for example, an F1 hybrid between two divergent lineages would have approximately 50% membership to each of two clusters in STRUCTURE). Instead, all analyses showed two distinct genetic groups. The mitochondrial COI dataset showed all individuals forming reciprocally monophyletic clades in concordance with their identification as B. sonorus or B. pensylvanicus according to coloration. Additionally, we found no instances of individuals identified as one species by ddRADseq possessing a mtDNA haplotype of the other species. If the two forms hybridize extensively and produce fertile offspring within the area of sympatry, historical introgression of mitochondria (due to backcrossing) would be expected, resulting in a lack of concordance between datasets. The lowest STRUCTURE membership coefficient was 0.842, which could represent a hybrid backcross individual; however, incomplete lineage sorting (the presence of shared variation predating speciation) results in patterns such as this which can be attributed falsely to hybridization (Hey 2006). Furthermore, perfect assignment of all individuals is never seen in this type of analysis. Given the low COI sequence divergence between B. sonorus and B. pensylvanicus, combined with no evidence of mtDNA introgression, incomplete sorting of nuclear alleles is to be expected.
Our results showed the two taxa as reciprocally monophyletic. This contrasts with
All specimens that we collected for this study were clearly identifiable to species based on coloration, and we observed no phenotypically intermediate specimens among our samples.
Upon examining photographic records on iNaturalist, we did find several instances of female B. pensylvanicus in the U.S. with numerous yellow hairs on the scutellum (GBIF.org 2004c), as mentioned by
The observation by
Our data demonstrate that hybridization events between B. pensylvanicus and B. sonorus are absent or very rare, despite ample opportunity for mating in sympatry, justifying the status of B. sonorus as a distinct species under the Biological Species Concept (Mayr 1963). Nuclear ddRADseq, mitochondrial COI, and coloration were concordant in distinguishing B. sonorus from B. pensylvanicus. Given the lack of evidence supporting clinal variation, hybridization, or historical introgression between these species across the broad area of sympatry in Texas, it seems unlikely that these phenomena would be occurring in Mexico where the two species are also reportedly sympatric. However, future research should focus on the assessment of populations of B. sonorus and B. pensylvanicus in Mexico, to include documentation of areas in which the two species are sympatric and confirmation of the anecdotal reports of individuals displaying intermediate coloration. Until evidence of clinal variation or extensive hybridization between B. sonorus and B. pensylvanicus is documented using multilocus genetic datasets, we recommend that B. sonorus be recognized as a distinct species and that conservation efforts designate them as such.
Raw sequencing reads (ddRADseq) are available on the NCBI Short Reads Archive (BioProject ID PRJNA1115193). COI mtDNA sequences are available at NCBI (Genbank accession numbers: PP840056–PP840059). The ddRADseq VCF, COI sequence alignment, and photographic vouchers of specimens are available from the Dryad Digital Repository https://doi.org/10.5061/dryad.08kprr5b2.
This research was supported by the Texas Parks and Wildlife Department (State Wildlife Grant, 2016), the Advanced Environmental Research Institute at the University of North Texas, and Tarleton State University. We thank the following individuals who assisted with collection efforts: Tom Barnes, David Beckham, Deborah Douglas, Tom Fisher, Danny Kern, Michael Eckenfels, and Scott Longing. We thank John Ascher, Jack Neff, Leif Richardson, Robbin Thorp, Michael Warriner, and Paul Williams for providing insight via personal communications. We thank Joel Neylon for providing a list of iNaturalist observations of attempted interspecific mating and examples in the literature of interspecific behaviors. We thank the >5000 observers and >1000 identifiers on the iNaturalist platform for their contributions to documenting and identifying, respectively, Bombus within the state of Texas and beyond. Special thanks to Joel Neylon and John Ascher who confirmed and contributed identifications for over >13,000 and >8000 specimens, respectively, of Texas Bombus on iNaturalist. We also thank the four reviewers for helpful comments.
Specimens of Bombus examined for this study including locality data, species, sex, whether RADseq data was collected, and mtDNA haplotype
Data type: xlsx
Clumpak output from STRUCTURE run files of RADseq data
Data type: pdf