Dr. Rob Dillon, Coordinator





Friday, February 4, 2022

Character phase disequilibrium in the Gyraulus of Europe

Editor’s Note – This essay was subsequently published as: Dillon, R.T., Jr. (2023c)  Character phase disequilibrium in the Gyraulus of Europe.  Pp 215 – 226 in The Freshwater Gastropods of North America Volume 7, Collected in Turn One, and Other EssaysFWGNA Project, Charleston, SC.

I get a lot of requests for freshwater gastropod samples for sequencing studies.  And I confess, I can sometimes be a jerk about it.  In reply, I always ask, “What hypothesis do you plan to test?”  Note that answers of the form, “I want to work out the phylogeny of gastropod family X” or “I want to figure out how many species there are in the gastropod genus Y” do not qualify as hypotheses [1].

So back in October of 2020 I was pleased to receive an email from Jeff Nekola of Masaryk University in Brno, the hometown of Gregor Mendel, posing a very well-considered hypothesis to test with sequence data from specimens that I might provide him.  Jeff introduced me to his colleague Michal Horsak, and the two of them together pitched me a project to test whether their European Gyraulus laevis (Adler 1838) might be conspecific with our North American Gyraulus parvus (Say 1817).  Let’s back up for a bit of context.

G. laevis [4]
All I know about the planorbid genus Gyraulus in the Old World, I learned from Prof. Claus Meier-Brook, late of the University of Tubingen [3].  The symphony of malacology he composed in 1983, entitled “Taxonomic Studies on Gyraulus [4]” was a masterwork of rigor, scholarship, technical brilliance and timeless beauty, both scientific and otherwise [5].  Meier-Brook examined 492 Gyraulus sampled from 94 Eurasian populations, thoroughly cataloguing variation in scores of anatomical and morphological characters both common and obscure, carefully documenting ecophenotypic variance as well as variance due to method of preservation.  Ultimately, he recognized eight species in Europe (excluding Macedonia [6]) and seven in Asia (one shared) for a total of 14 species of Gyraulus in the Palearctic.

So G. laevis and G. parvus were both among the eight European species recognized as valid by Prof. Meier-Brook, the former ranging widely across Europe but “rather stenotopic and rare.”  The latter he considered native in Iceland, but introduced into Germany by the early 1970s, and rapidly spreading through Europe as he wrote his monograph.

The two species are quite difficult to distinguish in the field, and even as early as 1952 European workers were suggesting that laevis might be a junior synonym of parvus [7].  Meier-Brook disagreed.  And here is the couplet by which he suggested that we distinguish them, transcribed verbatim:

6A. Penultimate whorl distinctly elevated, distal portion of spermoviduct slender, not wider than widest portion of sperm duct; distal half of vas deferens much wider (2:1 on an average) than proximal half (introduced from N America)… G. parvus.

6B. Penultimate whorl not or not distinctly elevated, distal portion of spermoviduct wider than widest portion of sperm duct and vas deferens, distal half of vas deferens not conspicuously widened (1:1) … G. laevis.

The six samples of G. parvus upon which Meier-Brook based his diagnosis, however, were collected from Ann Arbor, two places in Canada, two places in Iceland, and Germany – none from Thomas Say’s type locality in the Delaware River near Philadelphia.  So the hypothesis advocated by my colleagues in Brno back in October of 2020 was an interesting one, and it did, indeed, seem likely to me that sequence data might cast some light upon it.

And my loyal readership may remember an expedition I undertook up the Delaware River in 2012, hunting the type locality of Thomas Say’s Lymnaea catascopium [8].  I had picked up incidental samples of G. parvus at two sites on that field trip, which I agreed to post to Brno at my next opportunity.

G. parvus [4]

Jeff and Michal also asked if I might be able to spare a couple samples of Gyraulus circumstriatus.  My antennae raised slightly at this request – it was hard to see how sequence data from G. circumstriatus might bear on the hypothesis before us.  And all I had on my shelves was a couple odd samples from Montana and NW Pennsylvania, nowhere near the Connecticut type locality.  But OK – in for a penny, in for a pound [9].

So in late January of 2021 I was pleased to receive an email from Michal reporting success in sequencing the samples that I had sent, and that “the preliminary phylogenetical analysis clearly says that these are the same species and there is also no difference from all European populations of G. parvus/laevis.”  And in mid-April a first draft manuscript arrived, courtesy of Ms. Erika Lorencova, the lead author on the project.

I found some things to like about the research as the manuscript unfolded before my eyes last spring.  Lorencova, Nekola, Horsak, and five additional coauthors [10] had sequenced four genes (COI, CYTB, ITS1, ITS2) from 65 individual snails representing seven nominal species of Gyraulus, plus a Planorbis outgroup – just one single individual per population, in almost all cases, alas.  They did not sequence all four genes for all 65 individuals, alas again.  Their data set included 34 individuals identified as G. parvus (27 Czech, 2 Slovak, 1 Croat, 1 Aust, 1 Serb, 2 USA) and 14 individuals identified as G. laevis (12 Czech, 2 Croat), none of the latter collected from anywhere near Alder’s type locality in England, which pissed me off royally.  Their data set did, however, include one individual collected from Scotland, one from Northern Ireland, and two from Wales, all identified as “parvus/laevis,” none of which helped one little bit.

Erika and our mutual colleagues opted to analyze their (mitochondrial) COI + CytB dataset separately from their (nuclear) ITS1 + ITS2 dataset.  They generated gene trees using four different methods for each data set: neighbor-joining, maximum parsimony, maximum likelihood, and Baysian inference, for a total of eight separate analyses.  All of these “yielded very similar tree topologies.”  If not, I suppose our friends in Brno could have picked the one they liked best.

In any case, the Baysian tree based on CO1 + CytB is shown on the left in their Figure 2 modified below, and the Baysian tree based on ITS1 + ITS2 is shown on the right.  Both trees vividly demonstrate character phase disequilibrium of a high and aggravated nature.  If at any point during the next seven paragraphs you fail to understand anything I am saying to you, please go back to my essay of [4Jan22] and read forward.
Fig 2 of Lorencova et al [16] modified

Let us first focus on the COI+CytB mtDNA tree at left.  Erika, Jeff, Michal and colleagues divided the 42 individuals depicted on the giant branch at the bottom of their tree into three “races,” which they distinguished using Roman numerals I, II, and III.  The mean uncorrected p-difference between Race I and Race II was 4.4%, between Race I and Race III it was 5.1%, and between Race II and Race III it was 5.5% [11].

Right-click on the figure above, open it in a new window, and count with me.  In Race I we see 6 parvus (red) and 12 laevis (blue), setting aside the 1 parvus/laevis (no count).  In Race II we see the two circumstriatus [12].  And in Race III we see 15 parvus, 2 laevis, and 4 parvus/laevis (no count).

In Table 1 below I have combined all 35 of the (classified) Gyraulus and categorized them simultaneously by nominal species and mtDNA race. The contingency chi-square is a very significant 10.98 (1 df).  This is character phase disequilibrium.  These 35 snails are not drawn from a single randomly-breeding population.

Tab 1. Gyraulus classified by species and mtDNA

Shifting our attention to the nDNA tree at right, we find the percent sequence divergence much less, but the character phase disequilibrium even more significant.  Lorencova and colleagues did not divide the gigantic branch at the bottom of their nDNA tree into “races” as they did for their mtDNA tree, but they easily could have done so.  There seem to be just two ITS1+ITS2 races, rather than the three we saw for mtDNA, and they differ by only a couple nucleotides.  But the match between the top half of the big branch of the nDNA tree is very close to the top half (“Race I”) of the big branch of the mtDNA tree.  And ditto for the two bottom halves.  Note that the same two individuals labelled “laevis” in the lower (Race III) clade on the mtDNA tree at left are exactly the same two individuals labelled “laevis” in lower clade by their nDNA at right, for example.

And indeed, laevis is again very much over-represented in the upper clade in the nDNA tree, which we might as well go ahead and call Race I, and parvus very much over-represented in the lower clade, which we might as well call Race III.  For the nDNA, Table 2 below shows that the contingency chi-square is 12.89 (1 df), even more significant than the mtDNA results.  What gives?

The 42 individual Gyraulus hanging on the big, low branches of the two gene trees shown above were not drawn from a single randomly-breeding population.  In large and complexly-structured environments, such as Erika’s six-country sample area most certainly is, the first thought of an evolutionary biologist would be that any such non-random mating might arise from barriers to dispersal or isolation by distance.  So let’s re-run our analysis focusing just on the 29 individual Gyraulus collected from the Czech Republic.  The mtDNA breakdown is Race I parvus = 5, Race I laevis = 11, Race III parvus = 11, Race III laevis = 2, chi-square (1 df) = 8.26, still quite significant.

Tab 2. Gyraulus classified by species and nDNA

I suppose it is possible that there might be some sort of geographic structure in the 29 Czech samples below the level of the country in which they were sampled, but I do not have the patience to plot 29 sets of lat/long coordinates and look for it.  I will simply note that there is another very plausible explanation for the character phase disequilibrium apparent in the branches of the gene trees developed by Erika, Jeff, Michal and our mutual colleagues, beyond some sort of fine geographic structure that I cannot see.  Perhaps the two sets of populations they have identified as G. parvus and G. laevis are in fact good, biological species, genetically and morphologically similar but reproductively isolated, with a smattering of misidentifications [13] or a hybridization event here and there.  In any case, the results of Lorencova et al. (2021) do not support their hypothesis that G. parvus and G. laevis are conspecific.

All through the second half of the month of April I carried on a detailed and earnest email correspondence with my colleagues in Brno, desperately trying to get them to see the reasoning I have outlined in the seven paragraphs above.  I explained character phase disequilibrium every way I know how to explain it, backing all the way up to Mendel and coming forward, drawing diagrams with circles and arrows and a paragraph on the back of each one explaining what each one was [14].  I calculated probabilities to the fourth decimal place and highlighted them in bold.  And I simply could not make Jeff, Erika, or Michal understand character phase disequilibrium, or its origins in non-random mating, or the likelihood that the disequilibrium evident in the branches of their trees might reflect reproductive isolation, or indeed (in retrospect) the connection between reproductive isolation and speciation.

Alas, we seemed to be speaking different languages.  Rather than the term, “species,” my colleagues preferred the term,  “species-level clade.”  And they concluded that G. parvus and G. laevis were “part of the same species-level clade” because “although reasonably well-supported races exist in the mtDNA tree their base pair variability is three times smaller than that seen between the other analyzed Gyraulus species” [15].

Nor apparently could the editor of the international journal to which my colleagues submitted their paper see the character phase disequilibrium in their Figure 2 as modified above, nor understand its significance, nor apparently could the reviewers he chose.  Apparently, there is widespread blindness in evolutionary biology today to simple principles of transmission genetics that I have considered fundamental for my entire professional career.  The paper by Lorencova and colleagues was published in July [16].  Going beyond their proposed synonymization of G. laevis under G. parvus, our colleagues in Brno went so far as to suggest that “North American G. circumstriatus might simply represent another race within G. parvus.”  Good grief.

Well, science is a self-correcting process.  Someday some bright young student will realize that the answer to the parvus/laevis question does not lie in sampling a single individual from 30 populations, but in sampling 30 individuals from a single population.  He will find a lake or a pond inhabited by Gyraulus bearing both the Race 1 and Race 3 haplotypes [17].  He will collect and sort 30 adults by the elevation of their penultimate shell whorl, sequence those genes, and look for disequilibrium between mtDNA race, nDNA race, and shell morphology.  And here is his hypothesis: big excesses of Race 1/flat whorl and Race 3/elevated whorl.  Will our bright young student find any Race 1/elevated or Race 3/ flat recombinants?  Will he find any ITS heterozygotes?  Good questions.

I will conclude with a caveat.  I am not aware of any research directly bearing on mode of reproduction in Gyraulus, but my biological intuition suggests to me that populations of trashy, weedy planorbids such as G. parvus most certainly is might demonstrate significant frequencies of self-fertilization.  Asexual reproduction voids the biological species concept and necessitates a retreat to the solid rock of morphology, not a blundering-forward into the slough of DNA [18]

So if our bright young student of the future confirms preferential selfing in Gyraulus, and googles up this essay and reads it, I would ask him to go back to my essay of [4Jan22], read to note [3] and stop.  The first nine paragraphs of my January essay will have turned out to be important for Gyraulus, but the remainder not.  And then I would suggest that he come forward to the present blog post, re-read from the top of my essay to Claus Meier-Brook’s couplet distinguishing G. parvus and G. laevis morphologically, and then stop again.  Because none of the rest of this month’s post will have turned out to be relevant, either.  To the extent that Gyraulus populations self-fertilize, Professor Meier-Brook’s brilliant 1983 monograph stands as the final word thus far.


Notes

[1] If, however, the opening lines of the present essay happen to fall upon the eyes of a bright young student with access to a molecular lab and an interest in real science [2], I stand ready to help in any way I can.  I even have a list of genuine hypotheses about the evolution of freshwater gastropods ripe for testing with sequence data, for students in the market for thesis topics.  A long list.

[2] Science is the construction of testable hypotheses about the natural world.

[3] Claus Meier-Brook (1934 - 2018) has occupied a niche in my malacological pantheon since the dawn of my professional career.  I never met him, although we enjoyed a cordial correspondence in the 1980s.  He was best known for his work on pisiidid clams, but also made important contributions to our understanding of the medically-important planorbids.  He was unable to resist the Willi Hennig fad, alas, but the cladist gobbledygook is easy to subtract from his otherwise inspiring oeuvre [5].  For a biography and bibliography, see:

  • Jungbluth, J. H. (2006) Claus Meier-Brook 70-Jahre (28  April  1934).   Mitteilungen  der Deutschen Malakozoologische  Gesellschaft 75:  39-47.

[4] Meier-Brook, C. (1983)  Taxonomic studies on Gyraulus (Gastropoda: Planorbidae).  Malacologia 24: 1 – 113.

[5] And I feel like Salieri.  Rifling the pages of his papers, hearing the beauty of the science in my mind’s ear – a rusty squeezebox, high above it a single oboe – my heart breaking.  But jeeze, crap!  Hit the fast-forward button through Movement VI, Cladiste.

[6] Meier-Brook recognized six Gyraulus species endemic to Macedonian lakes.

[7] Jaeckel, S. (1952) Zur Oekologie der Mollusken-fauna in der westlichen Ostsee. Sctiriften des Naturwissenscfiaftlictien Vereins fur Sctileswig-Holstein. 26: 18-50.

[8] It is traditional to assume that Thomas Say’s type localities for both G. parvus and L. catascopium were in the Delaware River at his home town of Philadelphia.  But 200 years of commercial development at the Philadelphia waterfront have rendered that environment unrecognizable today.  For an account of my explorations upstream, see:

  • The Type Locality of Lymnaea catascopium [14July15]

[9] Or more accurately, “In for 200 mg of snail flesh, might as well go all in for a half gram.”

[10] Michal offered me a coauthorship, which was very considerate of him, and I did appreciate the honor, but ultimately declined.

[11] MtDNA sequence divergences in the 4 – 5% range are quite consistent with the values typically reported among valid, biological species of freshwater pulmonate snails.  But that is not the direction I want to go in the present essay.

[12] The two G. circumstriatus sequences, sampled as they were from Montana and British Columbia, geographically remote from all 40 of the samples of parvus and laevis available for comparison, have no bearing on the specific status of circumstriatus whatsoever.

[13] Meier-Brook considered G. laevis and G. parvus sibling species, which means that they cannot be distinguished morphologically, and then spent many pages developing evidence to the contrary, ironically.

[14] By now I imagine that the origin of last month’s blog post will have become obvious to my perceptive readership.  My essay of 4Jan22 was an adaptation of the extensive email correspondence I carried on with Jeff Nekola and Michal Horsak in April of 2021.  And the reason for all those examples involving Mendel’s peas was that I was trying to communicate with geneticists in Brno.  And vainly searching for a language that we might have in common.  See:

  • What is character phase disequilibrium? [4Jan22]

[15] Darn it, I know I said in footnote [11] that this is not a direction I want to go with the present essay.  But it’s eating at me.  Although the “base pair variability” observed among the laevis, parvus, and circumstriatus clusters may indeed be “three times smaller than the values seen between the other analyzed Gyraulus species,” the values of 4 – 5% mtDNA sequence divergence among those groups reported by Erika, Jeff, Michal and colleagues are NOT small when compared to interspecific values typical for biological species of freshwater pulmonates worldwide.  See, for example:

  • The Lymnaeidae 2012: Stagnalis yardstick [4June12]

[16] Lorencova, E., L. Beran, M. Novakova, V. Horsakova, B. Rowson, J. Hlavac, J. Nekola, and M. Horsak (2021).  Invasion at the population level: a story of the freshwater snails Gyraulus parvus and G. laevis.  Hydrobiologia 848: 4661 – 4671.

[17] A clean answer to the question that has sent nine scientists, including yours truly, scampering around the world to not answer may be waiting 60 km south of Brno.  For according to the supplementary materials available with their paper, Erika, Michal, Jeff & Company did collect snails they identified as both parvus and laevis together in a fishpond on the northern edge of the Czech town of Hlohovec.

[18] I vividly illustrated the folly of attempting to classify asexually-reproducing populations by DNA sequence data in two recent series of blog posts: one on the (parthenogenic) prosobranch Campeloma and a second on the (self-fertilizing) pulmonate Galba.  See:

  • Fun with Campeloma! [7May21]
  • What Lymnaea (Galba) schirazensis is not, might be, and most certainly is [3Aug21]

Tuesday, January 4, 2022

What is character phase disequilibrium?

Editor’s Notes - About halfway through this essay I refer to original research conducted with two forms of Lithasia on the Duck River, GEN and DUT.  That work was published as a note in Ellipsaria 22(3) [pdf] if you are looking for something citable [1]. The present essay was subsequently published as: Dillon, R.T., Jr. (2023b)  What is character phase disequilibrium?  Pp 153 – 162 in The Freshwater Gastropods of North America Volume 6, Yankees at The Gap, and Other EssaysFWGNA Project, Charleston, SC.

It has come to our attention that some non-negligible fraction of our professional colleagues do not understand transmission genetics.  I suppose I might have seen it coming.  Over the 33 years I spent as a genetics professor at a mid-sized college of regional reputation, textbook chapters devoted to Mendel and Morgan became skinnier and skinnier, squashed by bulging chapters of molecular ephemera.  Some of my younger colleagues seemed to consider that transmission and population genetics were epiphenomena, to be mentioned, perhaps, on the way to the subject matter that might actually be useful or interesting to the students.

So before launching into this month’s lecture, I am springing a pop quiz.  Everybody please return with me to my November essay on Leptoxis.  You can either scroll down below the lengthy January essay that follows and the (somewhat shorter) December essay below that or open this link in a fresh window [2Nov21].  Now read the four paragraphs about “the third important lesson” halfway through that essay, including footnotes #13 and #14, paying special attention to the material about “snail 5S.”  When you get to “0.05^2 = 0.0025” in footnote #14, stop.  If you understand the importance of that calculation, feel free to skip this month’s essay, and I’ll see you next month.  But if all that stuff about snail 5S and its parentage is foggy to you, read on.

Gregor Mendel proposed his “second law” on the strength of an experiment he conducted in his pea garden using two genes: seed color (yellow/green) and seed shape (smooth/wrinkled).  He interpreted the 9:3:3:1 phenotypic ratio he observed in the F2 from his dihybrid cross as the result of independent assortment of those two genes.

From Lobo & Shaw [2]
Very shortly after the rediscovery of Mendel’s work, however, results began to crop up that did not jive with Mendel's second law.  In 1905 the team of Bateson, Saunders and Punnett obtained an F2 ratio from a dihybrid cross involving flower color (purple/red) and pollen shape (round/long) that did not match 9:3:3:1 expectation.  They got too many of the purple/long and red/round “parental” phenotypes, and not enough of the purple/round and red/long “recombinants.”

And in 1910-11, the American Thomas Hunt Morgan confirmed the “chromosomal theory of inheritance” in fruit flies, coining the term “linkage” to describe the situation where a pair of genes might not assort independently because they were located on the same chromosome [2].  Although the term was not invented at the time, the situation in William Bateson’s field of peas would today be called linkage disequilibrium.

Now let me ask you to do a thought experiment.  Plant your own 50:50 mixed field of peas, using Mendel’s genes, not Bateson’s – half smooth/yellows and half wrinkled/greens.  Plant them in alternating rows, smooth/yellow mixed evenly with wrinkled/green.  Go away for two generations and come back to check the F2.  Remember that seed color and seed shape are not linked.  Would you expect to count the same 9:3:3:1 that Mendel observed? 

Surprise!  No, you do not.  You would find way too many of the “parental” phenotypes – the smooth/yellows and the wrinkled/greens, just like William Bateson found too many purple/long and red/round in 1905.

I have asked you what my students used to call a “trick question.”  They called it that because that’s what it is.  Peas are normally self-pollinating.  Mendel physically dusted all the round/yellow stigmas of his parental generation with wrinkled/green anthers, allowed the parents to set seeds, planted them, reared the F1, and then allowed the F1 to (naturally) self-pollinate to get his F2 results.  But in the field of peas we just imagined in the paragraph above, I told you simply to “go away for two generations.”  Absent intervention by you or an Augustinian monk, in two generations your pea field will remain exactly as you planted it, half smooth/yellows and half wrinkled/greens.  Way, way too many parentals.

Independent assortment assumes random mating.  If (for example) there is any tendency for round/yellows to pollinate other round/yellows or wrinkled/greens to pollinate other wrinkled/greens, the seed shape gene and the seed color gene will remain associated in the pea field, as though there were linked on the same chromosome, even though they are not.  Some people also call this phenomenon “linkage disequilibrium,” even though the genes involved are not physically located on the same chromosome.  I prefer a newer term, “gametic phase disequilibrium.”  And I like to describe the situation in our monkless field of peas as stable gametic phase disequilibrium [3].

So, let’s take a concrete example.  Last month we featured a dataset documenting variance at three allozyme-encoding loci (Odh, Mpi, and Hexdh) in seven subpopulations of Lithasia geniculata sampled down the length of the Duck River in Middle Tennessee [4].  Although I did not mention it, there are two different-looking shell forms of Lithasia at the three most downstream sites: one bearing typical-looking, oblong, bumpy shells (GEN), and a form with a more acute apex and angled (sometimes even tuberculate) shell (DUT).  Paul Johnson, Steve Ahlstedt and their colleagues sampled both the oblong- bumpy GEN form and the angulate-tuberculate DUT form at these three sites in 2002, bagging them separately [1].  The gene frequencies I reported last month were for the GEN form only.

In Table 1 below I have combined the GEN and DUT samples collected at Wright Bend (site E) for a total of 2(78) = 156 haploid genomes [5].  Each haploid genome is classified jointly by the allele resolved at the Odh and Hexdh loci, with observed counts above the slash, and expected below.

Table 1 shows evidence of disequilibrium between the Odh and the Hexdh locus, with excesses of Odh112/Hexdh93 and Odh109/Hexdh99 association.  The 3x2 contingency chi-square (setting aside rare allele Odh114) was 7.69, significant at the 0.05 level.  This sample does not seem to have been collected from a randomly-breeding population.

And indeed, a similar analysis combining the 32 GEN individuals collected at Site D with 34 DUT individuals would yield similar results, and ditto again were we to combine the 33 GEN and 35 DUT individuals collected at Site F.  The evidence suggests that this case of gametic phase disequilibrium is stable.

Now let’s generalize the situation one step further.  The gametic phase disequilibrium documented in Table 1 is correlated with shell phenotype.  It disappears when either GEN or DUT is analyzed separately, at all three sample sites.  Lithasia bearing shells of the oblong-bumpy GEN phenotype demonstrate significantly higher frequencies of Odh112 and Hexdh93 than Lithasia bearing acute, angulate DUT shells at all three sites where they were collected together. 

Tab 1. Site E, GEN and DUT combined
I have tabulated allele frequencies by shell morphology (observed/expected) for Site E in Table 2 below.  The contingency chi-square at the Hexdh locus is a whopping 41.2 (1 df) and that at the Odh locus (again setting aside rare allele Odh114) is 27.0 (2 df), both highly significant.  And again, the complete data set, for all three sites and all three loci, is available in the report I published in Ellipsaria 22(3) last year [1].

Shell morphology is a function of both genetics and environment, of course.  Heaven knows that enough pixels have sacrificed their flickering lives to both those fractions of the variance on this blog [6].  But Table 2 shows that the component of shell morphological variation that Johnson, Ahlstedt and their colleagues used to distinguish GEN from DUT in 2002 is correlated with variance in two demonstrably genetic markers, Hexdh and Odh.  And there is no apparent correlation between shell morphology and environment in this situation, whatever environmental variance there might have been within sites rendered negligible by the variance between. 

Let us broaden the concept of gametic phase disequilibrium to include measures of the phenotype, shall we?  Here I suggest a new term, “character phase disequilibrium” to describe situations such as documented in Table 2, where variation in the phenotype correlates with variation in the genotype.  I define CPD as “any violation of independent assortment between one or more morphological characters and one or more characters of demonstrably genetic origin” [7].

The potential underlying causes of character phase disequilibrium are identical to the causes of the better-known gametic phase disequilibrium.  In the case of the Lithasia of the Duck River, the cause of the disequilibrium between genotype and phenotype is almost certainly a violation of the assumption of random mating.  But of what origin?

In natural populations inhabiting large and complex environments, the first thought of an evolutionary biologist would be that any observed non-random mating would arise from barriers to dispersal or simple isolation by distance, as we obsessed over on this blog from September to December [8].  And in fact, we have very thoroughly documented both phenomena in the Lithasia populations of the Duck River [1, 4].

Tab 2. Site E, GEN and DUT separate
But here we see the same association between shell morphology and allozyme frequency in a sample (D) collected at Duck River mile 145.6, and in a second sample (E) sampled at Duck River mile 42.6, and in a third sample (F) collected from Duck River mile 25.7.  The CPD phenomenon cannot be attributed to physical barriers to dispersal or to isolation by distance in this case.

Rather, the data suggest that the GEN subpopulation bearing bumpy, oblong, typical shells and the DUT subpopulation bearing acute, angulate shells are reproductively isolated biological species, at all three of the sample sites tested down a 120-mile length of the Duck River.  The situation is apparently stable.

OK, coming off the backstretch now, heading for home.  Let me ask you to do one more thought experiment with Mendel’s peas.  For this experiment I will ask you to assume that peas are outcrossers, not selfers.  And this time, plant a field almost entirely of green/wrinkled peas, with one single yellow/smooth pea planted in the center.  Remember that seed color and seed shape are not physically linked, and this time your singleton yellow/smooth pea plant can freely cross-pollinate with its neighbors, as in a normal field of flowers.  Would the two loci appear to assort independently?

The answer is no.  I’ve asked you another trick question.  I didn’t mention anything about the passage of generations.

Initially, standing on the edge of your newly-planted pea garden with your hands on your hips, seed color and seed shape appear to be tightly linked, because the only plant with yellow seeds has smooth seeds.  And if you let your plants cross-pollinate, set seed, die, and bring forth a fresh generation, seed color and seed shape will still appear tightly linked, because every pollen grain made by your yellow plant was also marked with the smooth allele.  All the yellow F1* will still have smooth seeds, remembering that both yellow and smooth are dominant.  It might take quite a few generations [9] for the two loci, seed color and seed shape, to diffuse independently into the pea population before linkage disappears.  But it will, eventually, disappear.  This is unstable gametic phase disequilibrium.

And I am sure that my readership will have no difficulty extending the situation with the shell morphology in the Lithasia populations of the Duck into the pea garden of thought-experiment #2.  If our mixed field of randomly outcrossing peas were to demonstrate variance in some morphological trait of unknown heritability, as well as variance in the two for which the genetic basis is well-documented, we would not be surprised to observe unstable character phase disequilibrium.

So finally.  Character phase disequilibrium describes the situation with the singleton 5S snail that Nathan Whelan collected in his sample from the “Sixmile” Leptoxis subpopulation [10].  That snail simultaneously bore both the genetic markers and the shell morphology of the “Bulldog” population upstream.  But is that disequilibrium stable or unstable?   All those obscure calculations in footnotes #13 and #14 of my November essay were my efforts to answer that question.

Detail from Whelan et al [10] Fig 2 
My first thought was that snail 5S might be the offspring of a pure Sixmile parent and a pure Bulldog parent, analogous to the F1 pea generation marked with an asterisk(*) three paragraphs above.  Nathan seemed to imply that the 5S shell morphology was indistinguishable from the Bulldog parent, just as the phenotype of the F1 peas in thought experiment #2 were indistinguishable from their yellow, smooth parents, because of dominance.  The disequilibrium looks unstable at first blush, Bulldog and Sixmile freely interbreeding.

But if my first thought were true, the genotype of snail 5S would be 50/50.  The apparent excess of Bulldog genome made me consider the possibility that snail 5S might be an F2 or higher, with a pure Bulldog parent backcrossed to a second parent containing some Bulldog genome, which would be highly unlikely unless there were some reproductive isolation between Bulldog and Sixmile.  In other words, the character phase disequilibrium at Sixmile might be stable.  And the Bulldog and Sixmile populations have speciated?  Really?

In retrospect, all that wheel-spinning in footnotes #13 and #14 of my November post was entirely unnecessary.  One need only look downstream to Nathan’s Centreville sample, which was a melting-pot of Bulldog, Sixmile, Centreville, and other populations as well.  The excess Bulldog genome in snail 5S is just slop, there is no evidence of reproductive isolation, the character-phase disequilibrium at Sixmile is unstable.

Thus endeth the lecture.  I apologize for going full-professor on you all this month, especially since in 33 years of effort I was never able to advance beyond a second-rank professorship at a third-rank college in a fourth-rank state.  But a firm grasp of the principles I have outlined above is essential for a successful career in evolutionary biology.  Or at least, it used to be.  Tune in next time.


Notes

[1] Dillon, R. T. (2020) Reproductive isolation between Lithasia populations of the geniculata and duttoniana forms in the Duck River, Tennessee.  Ellipsaria 22(3): 6 - 8.  [pdf]

[2] Here’s a link [html] to a very nice educational resource published by Nature, reviewing the early work on genetic linkage of Bateson et al (1905) and Morgan (1910, 1911):
  • Lobo, I. & Shaw, K. (2008) Discovery and types of genetic linkage. Nature Education 1(1):139.

[3] This is a place-keeper footnote, which I will call up next month.

[4] Dillon, R. T., Jr. (2020) Population genetic survey of Lithasia geniculata in the Duck River, Tennessee.  Ellipsaria 22(2): 19 - 21 [pdf].  For more, see:

  • Intrapopulation gene flow: Lithasia geniculata in the Duck River [7Dec21]

[5] This is somewhat less than you might expect summing the locus totals published in [1], because for a few individuals I was able to obtain results at one locus and not the other.  Data from both loci are required for the joint analysis shown in Table 1.

[6] If you hit the “Phenotypic Plasticity” label above right you will find (as of January 2022) no fewer than 28 essays touching on the heritability of shell morphology in freshwater gastropods.   If I had to pick two, my favorite from the environmental side would probably be the first listed below, and my favorite from the genetic side would most certainly be the second, with the references cited in both:

  • Pleurocera acuta is Pleurocera canaliculata [3June13]
  • The heritability of shell morphology h^2 = 0.819! [15Apr15]

[7] We have also touched on the concept of character phase disequilibrium in several previous essays on this blog.  An understanding of CPD was critical to my series on the cryptic Pleurocera of Maryville:

  • The cryptic Pleurocera of Maryville [13Sept16]
  • The fat simplex of Maryville matches type [14Oct16]
  • One Goodrich missed: The skinny simplex of Maryville is Pleurocera gabbiana [14Nov16]

I didn’t call the phenomenon anything at the time because I thought that the concept of CPD was obvious and didn’t need a name any more than it needed an explanation.  Looking back, my Maryville research demonstrated the power that an understanding of character phase disequilibrium can bring to evolutionary science.  But maybe I should have posted the present essay back in 2016.

[8] Here is my series on gene flow within populations of freshwater gastropods:

  • Intrapopulation gene flow: King Arthur’s lesson [7Sept21]
  • Intrapopulation gene flow: The polymorphic Pleurocera of Naked Creek [11Oct21]
  • Intrapopulation gene flow, the Leptoxis of the Cahaba, and the striking of matches [2Nov21]
  • Intrapopulation gene flow: Lithasia geniculata in the Duck River [7Dec21]

[9] The number of generations required will be a function of both the rate of gene flow and the effective population size.  In fact, the level of gametic phase disequilibrium observed is one of two methods by which the effective size of a population can be estimated.  For more, see:

  • The best estimate of the effective size of a gastropod population, of any sort, anywhere, ever [14Jan19]

[10] Whelan, N. V, M. P. Galaska, B. N. Sipley, J. M. Weber, P. D. Johnson, K. M. Halanych and B. S. Helms (2019)  Riverscape genetic variation, migration patterns, and morphological variation of the threatened Round Rocksnail, Leptoxis ampla. Molecular Ecology 28:1593-1610.  For a review, see:

  • Intrapopulation gene flow, the Leptoxis of the Cahaba, and the striking of matches [2Nov21]

Tuesday, December 7, 2021

Intrapopulation gene flow: Lithasia geniculata in the Duck River

Editor’s Notes - This essay is the fourth and final installment of a series on isolation by distance and barriers to dispersal within populations of pleurocerid snails.  The original research upon which it was based was published in Ellipsaria v22(2) [pdf], if you’re looking for something citable. This essay was subsequently published as: Dillon, R.T., Jr. (2023b)  Intrapopulation gene flow: Lithasia geniculta in the Duck River.  Pp 147 – 152 in The Freshwater Gastropods of North America Volume 6, Yankees at The Gap, and Other EssaysFWGNA Project, Charleston, SC.

The modern era of American malacology dawned in 1934, on page 8 of the University of Michigan Museum of Zoology Miscellaneous Publication #286 [1].  For it was there that Calvin Goodrich first advanced his model of shell variation in the Lithasia geniculata population of the Duck River in Middle Tennessee.  Goodrich argued that that pleurocerid populations previously identified as Anculosa pinguis, Lithasia fuliginosa, and Lithasia geniculata were all variants in a 270 mile cline, lowering the first two nomina to subspecific status under the third.

This was not taxonomic bit-fiddling.  It was a throwing off of typological thinking, and an anticipation of the biological species concept eight years before Mayr hove it like Simba before the African host [2].  We first featured Goodrich’s model for the Duck River Lithasia on this blog way back in February of 2007, and have returned to it twice since then, in July of 2014 and September of 2019 [3].

So when I opened an email from Paul Johnson in the summer of 2002, offering to ship me an extensive series of Lithasia samples collected down the length of the Duck, frozen and ready for allozyme electrophoresis, I jumped at the chance.  Paul was working with our good friend Steve Ahlstedt and a host of collaborators on a survey of the unionid mussel fauna of the Duck [4] and had sampled Lithasia as an ancillary.  This seemed like a marvelous opportunity to generalize some of the observations I had made on intrapopulation genetic variation in Pleurocera proxima during my dissertation years [5] and add a dramatic shell morphological component as well.

The samples arrived in Charleston on dry ice shortly thereafter.  Paul, Steve, and their colleagues had collected 50+ individuals from six sites down the length of the Duck, as shown in the figure above, plus a seventh site on the Buffalo River, a major downstream tributary.  The two upstream subpopulations (pinA and pinB) Paul identified as L. geniculata pinguis, the next one (fulC) was L. geniculata fuliginosa, and the three most downstream sites (genD, genE, genF) he identified as typical L. geniculata geniculata.  Paul’s sample from the Buffalo River he identified as L. geniculata fuliginosa (fulG).

And I went straight to work, initially screening for allozyme variance at 17 loci, finding three polymorphic: Octopine dehydrogenase (Odh, 2 alleles), Mannose-phosphate isomerase (Mpi, 3 alleles) and Hexanol dehydrogenase (Hexdh, 2 alleles).  For those three loci, I cranked up the sample size to around 30 – 40 individuals for each of the 7 subpopulations.

The graph below shows the frequency of the most common allele [9] at the three polymorphic loci as a function of Duck River mile.  The similarity between this graph and the graph of Odh frequency in the P. proxima population of Naked Creek [12Oct21] is striking, both data sets showing a dramatic swing in allele frequencies between their most upstream sample sites. 

Although separated by just 3.5 river miles, subpopulations pinA and pinB demonstrated a fixed difference at the Mpi locus and a nearly fixed difference at the Odh locus.  All N=41 pinA snails were homozygous for Mpi97 and Odh112, while all N=28 pinB snails were homozygous for Mpi94, with 25 of the 28 homozygous for Odh109.  This result is also quite reminiscent of the results reported by Whelan and colleagues for the Leptoxis of the Cahaba, reviewed here last month [10].

In Pleurocera proxima, this phenomenon was attributable to a barrier to gene flow at a corrugated metal pipe.  In Lithasia geniculata, the exactly analogous phenomenon is attributable to the falls of the Duck River – a bit more dramatic, but no Niagara.  The main Duck River and the Little Duck River drop over several named waterfalls in Old Fort State Park west of Manchester.  Judging from Paul’s collection data, the barrier between pinA and pinB may be Step Falls, a pretty little 30 ft drop, figured below.  That’s one small step for Man, one giant leap for Snail-kind.

Gene frequency variance was also significant down the remainder of the Duck River at the sample sizes tested, although less dramatic.  Subpopulations pinB, fulC and genD differed from each other at the Odh locus, while genD, genE and genF differed from each other at the Mpi locus.  Buffalo River subpopulation fulG differed significantly from genF at both Odh and Mpi [11].  Differences of these magnitudes seem attributable to isolation by distance, rather than any discrete barrier to dispersal.

Step Falls of the Little Duck R.

But it was shell morphological variation that attracted us to the Lithasia geniculata population of the Duck River, and it is with shell morphological variation that we will conclude.  No keen-eyed 19th-century taxonomist has ever assigned different Latin nomina to the Lithasia above and below the falls of the Duck River.  The shells born by subpopulations pinA and pinB are indistinguishable, despite the fact that gene flow from the latter to the former is apparently negligible.

Taxonomists have, however, fallen over themselves racing to describe new species of Lithasia down the remainder of the Duck River, even though gene flow among the subpopulations inhabiting its lower 270 miles is apparently greater.  Such observations are consistent with the hypothesis that much of the shell morphological variation observed in the Lithasia geniculata population of the Duck River is ecophenotypic in origin.

It was the work of Calvin Goodrich that inspired me back in 2007 to propose the phenomenon ultimately re-christened CPP, cryptic phenotypic plasticity [12]. I never insisted that CPP must have its origins in the non-genetic components of shell morphological variance but results such those as I published in 2011, 2013 and 2014 have certainly pointed in that direction.  The Lithasia geniculata population of the Duck River does, too.


Notes

[1] Goodrich, C. (1934) Studies of the gastropod family Pleuroceridae - I. Occas. Pprs. Mus. Zool. U. Mich. 286: 1 - 17.

[2] Mayr, E. (1942) Systematics and the Origin of Species.  Columbia University Press, NY.

[3] I have explicitly featured Goodrich’s model for the Duck River Lithasia geniculata in three previous essays. The influence of that seminal work has been papable in many more:

  • Goodrichian taxon shift [20Feb07]
  • Elimia livescens and Lithasia obovata are Pleurocera semicarinata [11July14]
  • CPP Diary: The spurious Lithasia of Caney Fork [4Sept19]

See my 2007 essay for the L. geniculata detail of Goodrich’s Plate I, and my 2014 essay for a scan of the entire plate.

[4] Ahlstedt, S. A., J. R. Powell, R. S. Butler, M. T. Fagg, D. W. Hubbs, S. F. Novak, S. R. Palmer and P. D. Johnson (2017) Historical and current examination of freshwater mussels (Bivalvia: Margaritiferidae: Unionidae) in the Duck River basin Tennessee, USA. Malacological Review 45:1-163.

[5] My research project on gene flow in the Naked Creek population of Pleurocera proxima was ultimately published in three different ways.  The 1979 results, in their entirety, were published as Chapter 2 of my 1982 dissertation [6].  The barrier-to-dispersal portion of that 1979 study was combined with data from 1980 and 1985 and published in 1988 [7].  The isolation-by-distance portion of my 1979 study was published just last year in Ellipsaria [8].  For an overview of the entire research program, see:

  • Intrapopulation gene flow: The polymorphic Pleurocera of Naked Creek [12Oct21]

[6] Dillon, R.T. Jr (1982)  The correlates of divergence in isolated populations of the freshwater snail, Goniobasis proxima.  Ph.D. Dissertation, University of Pennsylvania. 182 pp.  Dissertation Abstracts 43: 615B

[7] Dillon, R.T., Jr. (1988) The influence of minor human disturbance on biochemical variation in a population of freshwater snails. Biological Conservation 43: 137-144.  [pdf]

[8] Dillon, R. T. (2020) Fine scale genetic variation in a population of freshwater snails. Ellipsaria 22(1): 24 - 25.  [pdf]

[9] Weighted average over all seven sites.

[10] Whelan, N. V, M. P. Galaska, B. N. Sipley, J. M. Weber, P. D. Johnson, K. M. Halanych and B. S. Helms (2019)  Riverscape genetic variation, migration patterns, and morphological variation of the threatened Round Rocksnail, Leptoxis ampla. Molecular Ecology 28:1593-1610.  For a review, see:

  • Intrapopulation gene flow, the Leptoxis of the Cahaba, and the striking of matches [2Nov21]

[11] The Buffalo River Lithasia population has been described as a unique species.  Allozyme data do not support that hypothesis, however.  See:

  • Minton, R. L. (2013) A new species of Lithasia (Gastropoda: Pleuroceridae) from the Buffalo River, Tennessee, USA. The Nautilus 127:119-124.

[12] “Cryptic phenotypic plasticity” is defined as “interpopulation morphological variance so extreme as to prompt an (erroneous) hypothesis of speciation.”  For more, see:

  • Dillon, R. T., Jr.  (2011)  Robust shell phenotype is a local response to stream size in the genus Pleurocera. Malacologia 53:265-277. [pdf]
  • Dillon, R. T., S. J. Jacquemin & M. Pyron (2013)  Cryptic phenotypic plasticity in populations of the freshwater prosobranch snail, Pleurocera canaliculata.  Hydrobiologia 709: 117-127.  [pdf]
  • Dillon, R. T., Jr. (2014)  Cryptic phenotypic plasticity in populations of the North American freshwater gastropod, Pleurocera semicarinata.  Zoological Studies 53:31. [pdf]

Tuesday, November 2, 2021

Intrapopulation gene flow, the Leptoxis of the Cahaba, and the striking of matches

Editor’s Note – This essay was subsequently published as: Dillon, R.T., Jr. (2023b)  Intrapopulation gene flow, the Leptoxis of the Cahaba, and the striking of matches.  Pp 133 – 146 in The Freshwater Gastropods of North America Volume 6, Yankees at The Gap, and Other EssaysFWGNA Project, Charleston, SC.

Last month we reviewed a research project I conducted in the late 1970s on isolation-by-distance and barriers to dispersal in a population of Pleurocera proxima inhabiting a small stream in NW North Carolina [1].  The markers I used were allozyme variants – bands of protein migrating at different speeds in starch gels.  Even back in the 1970s, there was already a lot of hand-wringing about the genetic variation being missed by such a gross and clumsy technique.

I was missing, of course, all of the silent variation – variation due to the redundancy of the genetic code.  And of the non-redundant variation, mutations that yield variation in the amino-acid sequence, I was only catching a tiny fraction – that subset changing the charge on the protein.  And the technique I was using worked only on loci that encode enzymes, the evolution of which must certainly be constrained by selection, right?  What about junk DNA?  What about everything else? 

A lot of my colleagues worried about these problems in the 1970s.  And I can see why, if one invested in all that cumbersome equipment for protein electrophoresis, and all those expensive reagents to demonstrate allozyme bands, and conducted one’s field work, and ran one’s gels, and found no variation, and scraped one’s gels into one’s trash can, and washed one’s giant sink-full of dishes, one might grumble.

Allozyme variation in Naked Creek

But the grossness and clumsiness of allozyme electrophoresis never bothered me, through my entire career, even into the 21st century [5] because I did find allozyme variation.  With hard work, patience, and good technique, I found a lot of very useful genetic markers.  I realized that I was “binning” a bazillion silent variants together when I scored a snail homozygous for Odh106 back in 1979.  But I could distinguish those bazillion variants together from the bazillion silent variants I had binned together in snails I scored as homozygous Odh109F.  And those two big bins were inherited in Mendelian fashion.  Fine.

Well, technology marches on.  The first fully automated sequencing DNA machine came onto the market in 1987, even as Kerry Mullis’ patent for PCR amplification was approved, and by the 1990s everybody was sequencing DNA.  Massively-parallel (“Next-Generation”) DNA sequencing machines were introduced for commercial use around 2005, enabling rapid and cheap sequencing of gigabases of DNA.  And in 2008 a method was first proposed to identify single-nucleotide polymorphisms in random lengths of DNA amplified from population samples, “Restriction Associated DNA sequencing,” or RADseq for short.

The DNA from each individual (let’s say, for example, each snail) is isolated and cut into a bazillion little pieces with a carefully-chosen restriction enzyme or set of enzymes.  The millions of pieces in this mess that are (usually) a couple hundred base pairs long [6] are electrophoretically separated from the zillions of littler pieces and bigger hunks and ligated to adapters that facilitate their amplification and uniquely identify each individual snail.  These millions of pieces are called “reads.”  Feed all those reads through the front door of your local Next-Generation sequencing factory.

As mind-boggling as the previous paragraph most certainly is, I will now ask you to imagine repeating that process for (let’s say) 20 snails from (let’s say) 8 populations.  So 160 times.  And as mind-boggling as the preparation and the sequencing of all those 160 million reads most certainly is, I will now ask you to imagine that all of those reads can be “quality-filtered” to remove the crappy ones, screened by their utility as markers across the entire 160-individual population, and analytically matched to each other using some gargantuan gear-grinding smoke-belching computer program.  If you’re interested in the technical details, see the references at footnote [7] below.

What comes back out of the sequencing factory is, in our example, a comparison of 8 sets of 20 snails amplified at some huge number of random, anonymous (“restriction-site associated”) reads of DNA.  If there is any polymorphism, even for one single synonymous nucleotide, the researcher will know it.  The data are typically reported in tightly-packed bar graph form, standardized by the number of matching reads, coded by some color, let’s say orange to start.  If snail #1 and snail #2 are genetically identical, as far as can be determined by this mind-boggling technique, the bar graphs depicting their genomes will match in orange along 100% of their height.  If snail #1 and snail #2 differ by (let’s say) 40%, a second color is selected, let’s say blue, and the bar graph depicting snail #2 is 60/40 orange/blue.

So in 2019 our colleague Nathan Whelan, together with six coworkers, published the first RADseq study of a population of pleurocerid snails [8].  His results nicely augment and compliment the results I myself obtained using allozyme variants in the 1970s, and as such make an important contribution to our understanding freshwater gastropod evolution generally.

Whelan et al. Figure 2 [8]

Nathan collected samples of 20 Leptoxis from each of 8 sites in the Cahaba River drainage of central Alabama, stretching approximately 45 km (30 miles) from the Helena suburbs at County Road 52 (“CR52”) downstream to Centreville.  Four of his eight samples came from the main river, and four from the tributaries, as shown in his Figure 2 reproduced above.  This is a marvelously data-rich figure, from which those of us interested in pleurocerid evolution can learn at least four important lessons.  Let’s unpack each lesson, one at a time.

First, all eight of Nathan’s samples (let’s call them “subpopulations”) were genetically unique – some entirely unique, others just mostly unique.  I am not a fan of Nathan’s palate, but what he is trying to convey with his four shades of blue and four shades of orange is eight unique genomes associated with eight distinct Leptoxis subpopulations.  And the shade of orange 100% covering the Shades Creek box is completely different from the shade of orange 100% covering the Schultz Creek box.  And both of those oranges are completely different from the shade of blue 100% covering the CR52 box and the shade of blue 100% covering the Bulldog Bend box.  Those four subpopulations seem to be entirely unique, as far as Nathan’s data extend.  The other four subpopulations – Schultz Creek, Marvel Slab, Sixmile, and Centreville, are mostly unique, with other shades of color more or less impinging around the edges of their mostly-uniquely-colored box interiors.

Apparently, gene flow among most of Nathan’s subpopulations of Leptoxis is zero, at least on the time scale of single-nucleotide mutations.  That is the most surprising result of the 2019 study conducted by Whelan and colleagues.  All eight of their subpopulations are connected by water, with stepping-stone distances generally in the 10 – 30 km range.  The research we reviewed in September would have led us to expect slow but measurable active migration upstream, with rapid and episodic transport downstream [9].  My previous research on Pleurocera [1] suggested an average gene flow of 6.5 migrants among sites sampled 1 kilometer apart which, one would assume, ought to extend 10 km to some value greater than 0.0.  No?

The second important lesson conveyed by Nathan’s Figure 2 is that what little dribble of gene flow does occur among a few of his subpopulations seems to occur 100% downstream.  The four completely-unique subpopulations (Shades Creek, Schultz Creek, CR52, and Bulldog Bend) are from the four most upstream sites.  All four of Nathan’s downstream boxes show at least a little bit of upstream color.  One’s eye is especially drawn to the box at Nathan’s downstream-most subpopulation, Centreville, which is primarily painted dark blue, but demonstrates significant bars of NWR yellow, Schultz Creek dark orange, Shades Creek orange, Marvel Slab blue, and Bulldog blue.

More than any other pleurocerid, Leptoxis populations are strongly associated with rock substrate and rapid water flow.  Individuals of the genus Pleurocera, by contrast, are at least occasionally observed grazing across softer substrates.  This includes P. proxima, which although apparently adapted for small, trout-stream-sized creeks tumbling through the southern Appalachians, is not uncommonly spotted crawling on sand and firmer mud.  See the last photo I published in last month’s post [12Oct21].

But I can never, in my 60 years of field experience, ever remember collecting an individual Leptoxis on anything but rock.  So, Nathan’s eight Leptoxis subpopulations were collected from eight riffle areas that must (inevitably) have been separated by pool areas, with extensive bottoms of mud substrate.  Leptoxis can wash downstream through such pools but (apparently) cannot effectively crawl upstream through them.

Then how did Leptoxis get upstream in the first place?  The two answers to that question are great age + dirty birds.  While the lower regions of the Mobile Basin were yet covered by the Cretaceous embayment, the upper Mobile Basin had long been flowing free from the mountains of what is now North Georgia.  In 2009 I offered several lines of evidence suggesting that the pleurocerid populations of this region are “The Snails The Dinosaurs Saw,” living fossils of great antiquity [10].  I subsequently penned a series of essays showing that aerial dispersal among such populations is not as unlikely as one might think [11].

Whelan et al. Fig 1, modified [8]

And in 2016, Nathan together with our colleague Ellen Strong published a paper documenting extensive mitochondrial superheterogeneity among these same Cahaba River populations of Leptoxis for which he and his co-workers now report RADseq data [12].  Nathan’s 2016 results strongly imply very long periods of isolation, punctuated by very rare introductions of genomes from very great distances away.  Now in light of Nathan’s 2019 research findings, his 2016 paper makes more sense. 

The third important lesson to be taken from Nathan’s RADseq study is that divergence among Leptoxis subpopulations of the Cahaba is phenotypic, as well as genotypic.  The shells born by most of Nathan’s eight samples were almost entirely smooth, as shown in (B) of his Figure 1, modified above.  But the subpopulation inhabiting the Little Cahaba River below Sixmile Creek bears lightly tuberculate shells, typically with carination, as shown in (C).

In the figure below I have reproduced my diagram of Naked Creek from last month’s post and inset a slice of topographic map showing the Little Cahaba River between Bulldog Bend and Sixmile Creek.  These two maps are depicted at the same scale, see the Naked Creek scale bar at upper left.  The water distance from Bulldog to Sixmile is a serpentine 7.85 km (flowing from right to left), comparable to the distance between Naked Creek Site 7 and Naked Creek Site 8.

The Bulldog Bend box is 100% blue, demonstrating no gene flow from any other subpopulation sampled.  The Sixmile box is almost entirely orange but shows a bunch of little Bulldog-blue nibbles at the bottom, plus one big dramatic blue cut.  That singleton snail, the one individual whose genotype seems [13] to match Bulldog more than Sixmile, also bore a smooth shell (B) like the Bulldog subpopulation, not a tuberculate/carinate shell (C) like the other 19 in Nathan’s Sixmile sample.

Clearly this phenomenon is attributable to washdown gene flow from Bulldog to Sixmile.  The Smooth-Shelled Singleton in Nathan’s Sixmile Sample (Let’s call him “5S.”) did not demonstrate a 100% Bulldog genome, however, but rather only about 50% [13] matching Bulldog.  The implication is that Snail 5S is not a first-generation washdown, but a second generation washdown, born at Sixmile but retaining the shell morphology along with half the genome of a Bulldog parent.  This is indirect but nevertheless compelling evidence for the heritability of tuberculate/carinate shell morphology in pleurocerid snails.  But wait, there’s more.

The final lesson from Nathan’s RADseq study, and the most important lesson, is this.  Although these eight subpopulations have diverged both genetically and morphologically, they have not speciated.  The isolation between them is physical, not reproductive.  When snails wash down, albeit rarely, they are apparently able interbreed freely with the snails in the riffles downstream.  The Cahaba River at Centreville is not populated by an admixture of five different Leptoxis species.  All 20 of the snails Nathan collected at Centreville belonged to the same biological species as the seven subpopulations Nathan sampled upstream.

Nathan identified all eight of his subpopulations as “Leptoxis ampla.”  OK, that’s a good start.  We all agree on the conspecific status all the Leptoxis subpopulations of the Cahaba.  Now let us see if we can apply the lessons we have learned in the Cahaba to the greater Mobile Basin beyond.

I have reviewed the tangled taxonomic history of the Mobile Basin Leptoxis fauna on several occasions in the tangled epistemological history of the FWGNA blog [15].  But not recently.  So to refresh the collective memory.

Timothy Abbot Conrad got the ball rolling back in 1834, describing four species, two from the Alabama/Coosa and two from the Black Warrior.  Isaac Lea [16] added seven, J. G. Anthony added three, H. H. Smith added eleven, and Calvin Goodrich [17] one, so that by 1922, Goodrich tallied 26 nominal species of Leptoxis in drainages of the Mobile Basin [18].  I reproduced Goodrich’s figure of all 26 in my essay of [15Sept09] and have re-reproduced it below.

Most of these nominal species were nominally-extincted by extensive damming and impoundment conducted throughout the Mobile Basin, starting in 1912, accelerating in the 1920s and 1930s, and continuing into the 1960s.  By the 1990s, Goodrich’s Leptoxis list had been reduced to four nominal species, each inhabiting small fragments of its former range: L. picta (Conrad 1834) in the main Alabama River, L. ampla (Anthony 1855) in the Cahaba, L. taeniata (Conrad 1834) in the lower reaches of three creeks in the Coosa drainage, and L. plicata (Conrad 1834) in the Black Warrior.

Goodrich [18]
How many of these might be biologically valid?  In 1998 I published a paper with Chuck Lydeard [19], reporting that the allozyme divergence among L. picta, L. ampla, and L. taeniata was no greater than the allozyme divergence among a set of conspecific Leptoxis praerosa controls sampled from equally-distant quarters of Tennessee [20].  We suggested that ampla and taeniata be synonymized under picta (Conrad 1834).

Modern fashion has trended in the other direction, however.  Even as our 1998 paper was in review, a pleurocerid population identified as “Leptoxis downei” was discovered in the Oostanaula River of Georgia, a nomen subsequently dropped in favor of L. foremani.  And in 2011 a population identified as Leptoxis compacta was discovered in the Cahaba River at the Shades Creek confluence, sympatric with snails Nathan identifies as L. ampla [21].  Today the list of nominal Leptoxis species inhabiting the Mobil Basin has rebounded to six [22].

Now research results have crossed our desk demonstrating that the Leptoxis population of the Cahaba River is strikingly fragmented into subpopulations, that these subpopulations have diverged both genetically and morphologically, and that they have not speciated.  Another 50 km downstream will bring us to the main Alabama River, and 100 km back up the Alabama/Coosa will bring us to the mouth of Buxahatchee Creek.  Does this new evidence support the assignment of three different specific nomina to “Leptoxis ampla” in the Cahaba, “L. picta” in the main river, and “L. taeniata” in Coosa tributaries such as Buxahatchee Ck?

Forty years ago, while I was yet a doe-eyed graduate student, it was clear to me that the key to understanding speciation was to understand population divergence, and the key to understanding population divergence was to understand intrapopulation gene flow.  I did not have any big grants, and I did not have any fancy tools, and I did not have legions of collaborators.  But I did have, even at that tender age, quite a few years of field observation on the biology of pleurocerid populations in rivers of the American southeast, and an openness to learn more, and that took me a long way.

Now I am delighted to discover colleagues in Alabama bringing sophisticated tools to bear on questions I myself pondered in my youth.  Can my colleagues extend their newfound understanding of intrapopulation gene flow forward through population divergence and generalize to the species level?  Can they bring dawn to the darkness that has enveloped the pleurocerid fauna of the Mobile Basin for 200 years?  That remains to be seen.  But Nathan Whelan and his colleagues have struck the first match.


Notes 

[1] My research project on gene flow in the Naked Creek population of Pleurocera proxima was ultimately published in three different ways.  The 1979 results, in their entirety, were published as Chapter 2 of my 1982 dissertation [2].  The barrier-to-dispersal portion of that 1979 study was combined with data from 1980 and 1985 and published in 1988 [3].  The isolation-by-distance portion of my 1979 study was published just last year in Ellipsaria [4].  For an overview of the entire research program, see last month’s post:

  • Intrapopulation gene flow: The polymorphic Pleurocera of Naked Creek [12Oct21]

[2] Dillon, R.T. Jr (1982)  The correlates of divergence in isolated populations of the freshwater snail, Goniobasis proxima.  Ph.D. Dissertation, University of Pennsylvania. 182 pp.  Dissertation Abstracts 43: 615B

[3] Dillon, R.T., Jr. (1988) The influence of minor human disturbance on biochemical variation in a population of freshwater snails. Biological Conservation 43: 137-144.  [PDF]

[4] Dillon, R. T. (2020) Fine scale genetic variation in a population of freshwater snails. Ellipsaria 22(1): 24 - 25.  [PDF]

[5] I was still frantically running allozyme gels when I was kicked out of my lab at the College of Charleston in the spring of 2016.  And still getting interesting results, too!  See:

  • The best estimate of the effective size of a gastropod population, of any sort, ever [14Jan19]

[6] Actually, Nathan and his colleagues used a clever modification called 2b-RADseq, involving a special restriction enzyme called ALF1, that cuts DNA into fragments exactly 36 bp long.

[7] A few of the better references on RADseq:

  • Baird N, Etter P, Atwood T, et al. (2008) Rapid SNP Discovery and Genetic Mapping Using Sequenced RAD Markers. PLoS ONE 3:e3376.
  • Davey, JW & M.L Blaxter (2010) RADSeq: next-generation population genetics.  Briefings in Functional Genomics 9: 416-423. doi: 10.1093/bfgp/elq031
  • Rubin, B.E.R., R.H. Ree, and C.S. Moreau (2012)  Inferring phylogenies from RAD sequence data.  Plos One 7(4): e33394.
  • Wang, S, E. Meyer, J.K. McKay, and M. Matz (2012)  2b-RAD: A simple and flexible method for genome-wide genotyping.  Nature Methods 9: 808 – 810.

[8] Whelan, N.V., M.P. Galaska, B.N. Sipley, J.M. Weber, P.D. Johnson, K.M. Halanych, and B.S. Helms (2019)  Riverscape genetic variation, migration patterns, and morphological variation of the threatened Round Rocksnail, Leptoxis ampla.  Molecular Ecology 28: 1593 – 1610.

[9] For a review of previous research on this important topic, see:

  • Intrapopulation gene flow: King Arthur’s lesson [7Sept21]

[10] Dillon, R.T., Jr. and J.D. Robinson (2009)  The snails the dinosaurs saw: are the pleurocerid populations of the Older Appalachians a relict of the Paleozoic Era?  Journal of the North American Benthological Society 28: 1 – 11 [PDF].  For more, see:

  • The snails the dinosaurs saw [16Mar09]

[11]  My four-part series on aerial dispersal:

  • Freshwater gastropods take to the air, 1991 [15Dec16]
  • A previously missed symbiosis? [11Jan17]
  • Accelerating the snail’s pace 2012 [24Apr17]
  • Freshwater snails and Passerine Birds [26May17]

[12] Whelan, N.V. & E. E. Strong (2016)  Morphology, molecules and taxonomy: extreme incongruence in pleurocerids (Gastropoda, Cerithiodea, Pleuroceridae). Zoologica Scripta 45: 62 – 87. For an independent analysis of these fascinating results, see:

  • Mitochondrial superheterogeneity: What we know [15Mar16]
  • Mitochondrial superheterogeneity: What it means [6Apr16]
  • Mitochondrial superheterogeneity and speciation [3May16]

[13] The singleton blue streak in the orange Sixmile box seems to slice much more than halfway through.  Possibly 75 – 80%?  I feel sure that’s just slop [14].  If a snail collected at Sixmile doesn’t bear 100% of the Bulldog genome, it must bear 50% or less.

[14] The only other explanation would be that Snail 5S has one parent and one grandparent washed down from Bulldog.  In other words, the mother of 5S washed down from Bulldog and was inseminated by a father with one parent born at Bulldog, yielding Snail 5S, with 75% Bulldog genome. 

That scenario seems quite unlikely.  The proportion of first-generation Bulldog-washdowns at Sixmile seems to be less than 1/20 = 0.05, and the proportion of second-generation washdowns approximately 1/20 = 0.05, so the likelihood of a first-generation x second-generation mating must be less than 0.05^2 = 0.0025. 

The implication of the highly-unlikely scenario outlined above would be that the father of Snail 5S actively sought the mother of 5S.  In other words, there is some sort of reproductive isolation between the Bulldog and Sixmile Leptoxis populations.  The two populations have speciated.

Might a unique species of Leptoxis have evolved on the Sixmile rapids in the middle of the Cahaba? Nah.  Just look downstream at Centreville.  I count six snails bearing hunks of Sixmile genome together with native Centreville genome, upstream Schultz Creek genome, and everything else.  So I agree with Nathan on this one.  The apparent excess in the length of that skinny blue cut in the orange Sixmile box is almost certainly just slop.

[15] For additional background on the taxonomy of Leptoxis populations in the Mobile Basin:

  • Mobile Basin I: Two pleurocerids proposed for listing [24Aug09]
  • Mobile Basin II: Leptoxis lessons [15Sept09]

[16] For the record:

  • Isaac Lea Drives Me Nuts [5Nov19]

[17] Actually, if you’re digging around in these footnotes for more homework, I would recommend reading my 2007 biographical sketch of Calvin Goodrich before reading the two Mobile Basin essays I posted in 2009.  Start here:

  • The Legacy of Calvin Goodrich [23Jan07]

[18] Goodrich, C. (1922) The Anculosae of the Alabama River Drainage.  University of Michigan Museum of Zoology Miscellaneous Publication 7: 1 – 57.

[19] Dillon, R.T., and C. Lydeard (1998) Divergence among Mobile Basin populations of the pleurocerid snail genus, Leptoxis, estimated by allozyme electrophoresis.  Malacologia. 39: 111-119. [PDF]

[20] Leptoxis plicata populations of the Black Warrior appear to be genetically distinct.

[21] Whelan, N.V. P.D. Johnson and P.M. Harris (2012)  Rediscovery of Leptoxis compacta (Anthony 1854) (Gastropoda: Cerithioidea: Pleuroceridae)  PlosOne 7(8) e42499 [html]

[22] Shelton-Nix, E. (2017)  Alabama Wildlife, Volume 5.  University of Alabama Press, Tuscaloosa. 355 pp.