Pangenomics transforms evolutionary biology

08 - 09 June 2026 09:00 - 17:00 Apex Grassmarket Hotel, Edinburgh Free
Request an invitation

Theo Murphy meeting organised by Dr Joana Meier, Dr Henry North and Dr Charlotte Wright.

Genomic studies often use a reference genome from a single individual. This underestimates genetic diversity and biases against more distant relatives. Novel pangenome graphs eliminate both issues by combining the genetic variation of diverse individuals. This meeting will showcase cutting-edge pangenome tools, applied uses of pangenomes which are transforming health and agriculture, and opportunities to revolutionise biodiversity genomics.

Programme

The programme, including the speaker biographies and abstracts, is available below. Please note the programme may be subject to change.

Poster session

There will be a poster session on Monday 8 June. If your request to attend is approved, you will be sent further information on how to submit your poster abstract. Submissions made within one month of the meeting may not be included in the programme booklet.

Attending this event

  • Free to attend and in-person only
  • Please note this meeting is now at capacity, and you can only request an invitation to be added to the reserve attendee list
  • When requesting an invitation, please briefly state your expertise and reasons for attending
  • Requests are reviewed by the meeting organisers on a rolling basis. You will receive a link to register if your request has been successful
  • Catering options will be available to purchase upon registering. Participants are responsible for booking their own accommodation
  • Please do not book accommodation until you have been invited to attend the meeting by the meeting organisers

Please note that scientific meetings hosted by the Royal Society do not necessarily represent a Royal Society position or signify an endorsement of the speakers or content presented.

Enquiries: contact the Scientific Programmes team.

Image credit: iStock.com / natrot

Organisers

  • Joana Meier

    Dr Joana Meier

    Joana Meier studies why the species richness is so unevenly distributed across the tree of life, particularly how hybridisation and chromosomal rearrangements affect rapid species radiations. After a PhD and postdoc at the University of Bern in Switzerland on cichlid fish speciation, she held two concurrent fellowships at the University of Cambridge in the UK, working on butterfly speciation. Since 2022, she has been leading a group at the Wellcome Sanger Institute, combining her group leader position with a Royal Society URF. Her research team focuses on rapid speciation in butterflies and peacock spiders, and she also leads large collaborative sequencing projects like Project Psyche – sequencing reference genomes of all Lepidoptera found in Europe.

  • Henry North

    Dr Henry North

    Henry is a research fellow at Girton College, Cambridge, broadly interested in adaptation and hybridization. He completed his undergraduate degree at the University of Queensland, Australia, working on speciation in Senecio in the Ortiz-Barrientos laboratory. He then completed his MSc through the Erasmus Mundus Programme in Evolutionary Biology, working with Carole Smadja, Robin Hopkins, and Chris Jiggins. He recently completed his PhD with Chris Jiggins, focusing on bidirectional adaptive introgression of structural variants between native and invasive species of the noctuid moth genus Helicoverpa. Henry’s current research uses this study system, as well as other instances of hybridization between native and invasive species, to investigate a range of questions in evolutionary genomics. His current interests include the mismatch of transposable elements and their suppressors in hybrid genomes, parallel evolution of structural variants, and the relationship between nucleotide diversity and gene absence-presence polymorphism.

  • Charlotte Wright

    Dr Charlotte Wright

    Dr Charlotte Wright is a postdoctoral fellow at the Wellcome Sanger Institute and is the Charles and Katherine Darwin research fellow at Darwin College, University of Cambridge. Charlotte’s research uses comparative and population genomic approaches to understand the evolutionary processes that shape genome evolution and how changes in genome structure impact the generation of biodiversity. Charlotte leverages large-scale sequencing datasets to investigate changes in genome evolution both within species and across the diversification of taxa. As part of this, Charlotte is particularly interested in using pangenomic approaches to understand how changes in genome structure such as chromosome rearrangements evolve. Her research focuses on butterflies and moths, and is one of the leaders of Project Psyche that aims to generate and utilise reference genomes of all butterflies and moths in Europe.

Schedule

Chair

Henry North

Dr Henry North

University of Cambridge, UK

09:00-09:05 Welcome
Dr Joana Meier

Dr Joana Meier

Wellcome Sanger Institute, UK

09:05-09:30 Concerted evolution and unorthodox recombination of human subtelomeres

Human subtelomeric regions are among the most dynamic and structurally complex parts of our genome, yet their interchromosomal relationships have remained difficult to characterize due to the limitations of both assembly completeness and alignment methodology. Here we present the most comprehensive survey of subtelomeric sequence relationships to date, leveraging 466 near-complete haplotype assemblies from the Human Pangenome Reference Consortium (HPRC) version 2. To analyze these regions, we introduce the implicit pangenome graph, a reference-free alignment approach that performs all-to-all pairwise comparisons across haplotypes—sampling approximately 12% of all possible combinations—without imposing chromosomal partitioning or positional bias. This yields a truly unbiased view of interchromosomal homology across the pangenome, where every haplotype serves as its own point of reference.

A genome-wide survey of alignment identity reveals extended regions of interchromosomal homology at nearly all subtelomeres, alongside known systems such as the acrocentric short arms and pseudoautosomal regions. Cladistic analysis based on neighbor-joining trees of subtelomeric similarity uncovers both expected relationships—Xp/Yp and Xq/Yq via the pseudoautosomal regions, acrocentric short arms—and novel associations, including strong 10p–18p homology, a tightly linked clade involving 22q, 21q, 19q, 1q, 13q, and 17q, and extended DUX4-containing homology between 4q and 10q with wide copy number diversity. A large clade of many chromosome arms shares homology at moderate similarity, suggesting broad ongoing interchromosomal exchange. Principal component analysis of the similarity matrix further resolves subtelomeric clustering. We hypothesize that these patterns are maintained by recombination facilitated by the physical proximity of subtelomeres at the nuclear envelope, and evaluate this using Hi-C-derived three-dimensional genome maps.

Dr Erik Garrison

Dr Erik Garrison

University of Tennessee Health Science Center, US

09:30-09:45 Discussion
09:45-10:15 Evolution of structural variation and mutation cross vertebrates

Structural variants (SVs) contribute substantially to genetic variation and play vital roles in adaptation and disease. However, SVs are poorly captured by short read sequencing and thus are understudied, particularly in non-model organisms. Here, taking advantage of recently generated haplotype-resolved genome assemblies from >600 vertebrate species, we present the most comprehensive survey of the diversity of SVs and single nucleotide variants (SNVs) across the vertebrate tree of life to date.

Professor Peter Sudmant

Professor Peter Sudmant

University of California, Berkeley, US

10:15-10:30 Discussion
10:30-11:00 Break
11:00-11:30 Sightseeing in Pangenheim: A tour of pangenome visualization tools

The term "pangenome" (sometimes spelled "pan-genome") encompasses a wide range of concepts and merges diverse, often complex genomics data into a single, somewhat fuzzy entity. Alongside efforts to build, analyze, and manipulate pangenomes, a dedicated branch of research focuses on how to visualize them for exploration and interpretation.

This talk will cover the main definitions of pangenomes and their associated visual representations, from gene sets to genome graphs. It aims to give a clear, practical overview of existing tools and their applications, drawing on both approaches developed across the community and our own work. In particular, it will present a glyph-based representation of structural variations and its implementation in SaVanache, the graph pangenome browser of GraSuite, an ecosystem of tools for graph pangenome manipulation.

Dr Éloi Durant

Dr Éloi Durant

Luxembourg Institute of Science and Technology, Luxembourg

11:30-11:45 Discussion
11:45-12:15 Pangenome-based genome inference

Typical analysis workflows map reads to a reference genome in order to genotype genetic variants. Generating such alignments introduces reference biases and comes with substantial computational burden. In contrast, recent k-mer based genotypers are fast, but struggle in repetitive or duplicated genomic regions. We introduced a new algorithm, PanGenie, that leverages a haplotype-resolved pangenome reference in conjunction with k-mer counts from short-read sequencing data to genotype a wide spectrum of genetic variation – a process we refer to as genome inference. We could demonstrate that our method produces better results compared to mapping-based approaches. Improvements are especially pronounced for structural variants (SVs) and variants in repetitive regions. We studied SVs across large cohorts sequenced with short-reads, using pangenome graphs generated by the HGSVC and HPRC consortia, which enables the inclusion of these classes of variants in genome-wide association studies.

Dr Jana Ebler

Dr Jana Ebler

Heinrich Heine University Düsseldorf, Germany

12:15-12:30 Discussion

Chair

Charlotte Wright

Dr Charlotte Wright

Wellcome Sanger Institute, UK

13:30-14:00 Talk title TBC
Dr Mona Schreiber

Dr Mona Schreiber

Philipps University Marburg, Germany

14:00-14:15 Discussion
14:15-14:45 Using pangenomes to understand the role of structural variants in resistance to new biotic threats

Natural populations face increasing threats, including from new pests and pathogens. Understanding how species adapt to such threats could be crucial for informing effective actions to protect biodiversity in the future. Structural variants (SVs) may play a key role in facilitating rapid adaptation to novel biotic stressors due to their disproportionate effect on the phenotype, but, until recently, their involvement has been largely overlooked. Fraxinus excelsior (European ash) is a keystone forest tree species under strong selection from the ash dieback disease epidemic, making it an ideal system is which to understand the role of SVs in resistance and adaptation to biotic threats. Although some F. excelsior individuals in the starting population have low susceptibility to ash dieback, and evidence suggests the frequency of such individuals is increasing in the next generation through selection on standing genetic variation, the contribution of SVs to this phenotypic variation and evolutionary response is currently unknown. To address this, we have sequenced the genomes of hundreds of F. excelsior individuals with variable susceptibility to the disease, from common garden experiments and natural populations. Using long-read data, we have built both reference-based and all-vs-all pangenome graphs and conducted association analyses to examine the contribution of SVs to the evolution of low susceptibility to ash dieback. Our analyses reveal large numbers of segregating SVs in F. excelsior, including presence/absence variation in disease resistance genes, but also highlight ongoing challenges with comprehensively capturing SVs underlying adaptive traits even when equipped with a multitude of genome assemblies.

Dr Laura Kelly

Dr Laura Kelly

Royal Botanic Gardens, Kew, UK

14:45-15:00 Discussion
15:00-15:30 Break
15:30-16:00 The role of structural variation during adaptation to whole genome duplication

Whole genome duplication (WGD) is a dramatic mutation, hugely disrupting cellular processes. To survive and thrive, polyploids undergo rapid evolution at genes regulating fundamental processes such as meiosis. Recent studies suggest an important role for structural variants (SVs) in the adaptability of young polyploids. Do SVs play a role in adaptation to WGD? We investigate this in Arabidopsis arenosa, which has undergone WGD ~30k generations ago. Here, we build a pangenome graph of phased diploid and autotetraploid genome assemblies to detect SVs, and we leverage existing short-read datasets to look for evidence of SVs involved in adaptation to WGD. We compare this to SNP-based genome scans. We find novel candidates for genes involved in adaptation to WGD from both the SV and the SNP based analyses, with a small number of candidates indicated by overlapping SNPs and SVs. This suggests that by ignoring structural variation we are also ignoring an important component of the genetic basis of adaptation.

Dr Emma Curran

Dr Emma Curran

University of Sheffield, UK

16:00-16:15 Discussion
16:15-16:45 Blinded by the beauty of our weapons: how short-read sequencing limits our understanding of complex biology in non-model organisms

Short-read sequencing is a remarkably powerful technology for assessing genomic information. For many non-model organisms, it remains the primary source of evidence for genomic variation and for linking this variation to phenotype. However, short reads generally rely on a reference genome for analysis, which limits the detection of large or complex forms of variation, leading to reference bias.

A broader consequence of this bias is that genomics often focuses on the variation that is simplest to detect, most commonly SNPs or small structural variants. For non-model organisms, this leaves substantial unknowns surrounding larger and more complex forms of variation, such as those driven by transposable elements or gene presence–absence variation, which are more aligned with the types of variation that pangenomic approaches are focused on assessing.

Here, I focus on the global agricultural pest moth Helicoverpa armigera and the emergence of resistance to human control efforts, primarily through genetically modified crops. Identifying the genomic basis of resistance to the Vip3Aa Bt toxin has proven elusive for over a decade, despite extensive field exposure and experimental trials. At CSIRO, we have identified two independent resistance mechanisms from field-derived lines, both of which are exceptionally difficult to detect using short-read sequencing. In addition, analysis of alternate haplotypes even within a long-term laboratory line reveals extensive hidden variation, including complete gene absences between haplotypes, raising questions about how much diversity remains undocumented in wild populations.

Together, these case studies illustrate how long-read sequencing and pangenomic frameworks are transforming our ability to characterise genomic variation, with important implications for resistance evolution, biodiversity genomics, and our broader understanding of evolutionary processes in non-model organisms, as well as how our technology shapes the questions we ask, and what we are able to see.

Dr Andreas Bachler

Dr Andreas Bachler

Commonwealth Scientific and Industrial Research Organisation, Australia

16:45-17:00 Discussion

09:00-09:30 Pangenome sequencing reveals dramatic transposable-element induced structural variation in species radiations

Advances over the last few years in long-read reference genome assembly enable us now to obtain essentially complete genome sequences including through highly repetitive regions. Here we apply this to multiple samples from two adaptive radiations: first the cichlid fish radiation of hundreds of species in Lake Malawi in the last half million years, and second the northern European small Yponomeuta moth radiation.

A large fraction of species diversity is created during adaptive radiations, in which multiple species lineages separate within a short period. It is established that the sequences of genomes in the resulting lineages are not related according to a simple bifurcating species tree. Known causes for this are incomplete lineage sorting (ILS) and hybridisation. However we also see extensive mobile element activity in both these radiations, with hundreds of active transposon families, many differentially active across the radiation, and associated with this large scale structural variation between species/lineages whose associated variable content dwarfs the single nucleotide mutation-derived variation.

Professor Richard Durbin FRS

Professor Richard Durbin FRS

University of Cambridge, UK

09:30-09:45 Discussion
09:45-10:15 The human pangenome reference as a new foundation for variant discovery and interpretation

The human pangenome reference represents a fundamental shift in how human genetic variation is represented, analyzed, and shared, with broad implications for genomics research and medicine. Building on recent data releases, I will describe the transition toward a stable pangenome reference that establishes a new standard for base-level accuracy and telomere-to-telomere (T2T) representation of all chromosomes across diverse, fully phased haplotypes sampled to reflect global human genetic diversity. This resource enables more comprehensive variant discovery and interpretation by supporting analyses across multiple high-quality haplotypes and incorporating local ancestry inference (LAI) to contextualize variation within admixed genomes. The release highlights expanded international partnerships through the GA4GH Human Pangenome Project Telomere-to-Telomere (T2T) initiative, broadening both development and application of this global resource. Further, the pangenome now enables systematic analysis of satellite-rich regions of the genome, including centromeres, allowing satellite variants to be confidently and systematically reported across multiple haplotypes for the first time. Together, these advances enable more comprehensive structural variant analysis and improved functional interpretation, while driving continued pangenomics tooling and workflow innovation in support of a shared, global genomic resource.

Professor Karen Miga

Professor Karen Miga

University of California, Santa Cruz, US

10:15-10:30 Discussion
10:30-11:00 Break
11:00-11:30 The evolution, function, and real-world relevance of hyper-divergent haplotypes in nematodes

Structural variants are often thought of as simple events such as inversions, duplications, and indels, but many do not fit neatly into these categories. Hyper-divergent haplotypes (HDHs) are large stretches of sequence that differ markedly between individuals of the same species. HDHs were first described in the free-living model nematode Caenorhabditis elegans, where they span approximately 20% of the genome and are enriched for genes involved in sensory perception, pathogen defence, and xenobiotic stress. Similar haplotypes have recently been reported in other invertebrates, suggesting that this form of structural variation may be more widespread than previously realised.

We have developed new low-input long-read sequencing approaches and applied them to nematodes that parasitise humans, livestock, and wild animals. In doing so, we have discovered that HDHs are common features of parasitic nematode genomes, and often encode genes implicated in host interaction, including those previously trialled as vaccine antigens. In some cases, alternative haplotypes encode different repertoires of functionally related genes, meaning that individuals of the same species can express distinct sets of host-interacting proteins. Phylogenetic analyses suggest that these haplotypes are maintained by long-term balancing selection and may, in some cases, be generated through introgression between closely related species. We hypothesise that this “hyperdiversity” facilitates evasion of host immunity during infection.

In this talk, I will present what we currently know about hyper-divergent haplotypes in nematodes and other organisms and explain how pangenomic approaches can be used to address the many outstanding questions concerning their evolution, function, and real-world relevance.

Dr Lewis Stevens

Dr Lewis Stevens

Wellcome Sanger Institute, UK

11:30-11:45 Discussion
11:45-12:15 Pangenomes from passerine birds: evolutionary processes and bioinformatic implications

Birds are a good test case for the power of pangenome methods because they have been characterized as conservative in mode of evolution and depauperate in repeat content. Here we present a synthesis of findings of structural variation derived from five multi-haplotype, long read pangenomes from both oscines and suboscines, the two major lineages of passerine birds. We find that the abundance of repetitive elements, particularly satellite DNA, is vastly greater than revealed by short-read genome assemblies and that the annotation of specific repeat types depends heavily on the particular annotation tools used. The abundance of structural variants scales with population size, a pattern that extends to complex multigene families like the major histocompatibility complex (MHC), which shows evidence for extensive lineage-specific proliferation of gene duplicates, including a high incidence of MHC pseudogenes. However, accurate estimates of the frequency of multi-allelic structural variants remains challenging. Gene copy number variants reveal relationships with population size expected if gene deletions are deleterious and gene multiplications are neutral or advantageous, but discriminating between true deletions and false positives is challenging with current pipelines. We attempt to estimate the de novo mutation rate of structural variants using long read data from five family trios of the Florida Scrub-Jay (Aphelocoma coerulescens), but in our hands the incidence of false positives is prohibitively high. Early pangenomes in birds reveal broad agreement with expected rates of structural variation, yet specific types of variants, such as gene copy numbers and de novo mutations, continue to present bioinformatic challenges.

Professor Scott Edwards

Professor Scott Edwards

Harvard University, US

12:15-12:30 Discussion

Chair

Mara Lawniczak

Dr Mara Lawniczak

Wellcome Sanger Institute, UK

13:30-14:00 Pangenomics reveals hidden genetic variation underlying rapid ecological diversification

Cichlids are an exceptional model for dissecting the genomic basis of rapid adaptive radiation and phenotypic diversification, and ultimately the mechanisms underlying speciation. In the Neotropical Midas cichlid radiation (Amphilophus citrinellus species complex), decades of genetic and genomic research have generated substantial insight into the basis of multiple adaptive traits. However, fully resolving how these traits evolve and contribute to divergence has remained challenging, largely because of their complex, multilocus architecture and the limitations of single-reference, SNP-centric approaches. This contribution demonstrates how a pangenomic framework applied to the Midas system enables a more complete characterization of the genomic architecture underlying adaptation and phenotypic diversification in young, rapidly diversifying fishes. A high-quality pangenome built from 16 chromosome-level, haplotype-resolved genomes and integrated with extensive short-read whole-genome resequencing and transcriptomic data revealed layers of genetic variation that are inaccessible to conventional analyses. Overall, these results highlight pangenomics as a powerful tool for linking genome structure to adaptive divergence and for advancing our understanding of rapid ecological diversification.

Professor Paolo Franchini

Professor Paolo Franchini

Tuscia University, Italy

14:00-14:15 Discussion
14:15-14:45 Haplotype architecture shapes phenotypic diversity across plant and animal pangenomes

Pangenomes provide a powerful lens into adaptation and phenotypic variation across agriculture, human health, and fundamental biology. Structurally diverse sequences revealed by pangenomes are increasingly recognized as potential contributors to phenotypic diversity, yet understanding how these sequences are organized and relate to traits remains poorly resolved. Despite rapid progress in genome sequencing, methods for analysing pangenomes are computationally demanding and have shown limited ability to resolve how complex genetic variation is organized into functional units that shape phenotypes. Addressing this need, we introduce Panagram (https://github.com/kjenike/panagram), an ultrafast reference-free platform for assembling and annotating the haplotype architecture within a pangenome consisting of evolutionarily coherent blocks of shared sequence variation. The core metric of Panagram is the pan-kmer bitmap that quantifies local sequence similarity across samples and enables rapid, on-the-fly clustering and analyses of haplotypic blocks. These capabilities provide a unified framework for association testing, introgression detection, and the flexible discovery of biologically meaningful variation across pangenomes.

Dr Katharine Jenike

Dr Katharine Jenike

University of Cambridge, UK

14:45-15:00 Discussion
15:00-15:15 Break
15:30-16:00 What do collections of genomes tell us about structural variants in evolution?

A significant fraction of genetic diversity lies in structural genomic variation (SV), eg chromosomal rearrangements or copy-number variants. New high-quality reference assemblies whether taken as collection of comparable genomes or gathered as pangenome graphs provide unprecedent access into SVs, showing their prevalence and their implication in adaptation or diversification.

Here, we will reflect on the role of SVs in the evolution of diversity and adaptation, with insights from experimental and ecological studies in the seaweed flies Coelopa sp. In particular, large polymorphic chromosomal inversions are particularly important for parallel adaptation to heterogeneous environments. For instance, they underlie morphotypes with different life-history strategies. We are now building a pangenome including genomes from different geographical populations and related species to target the origin and evolution of those large balanced rearrangements.

Next, we will ask how the new genome resources may help to more systematically investigate SVs. We have developed a pipeline leveraging long-reads and genome assemblies from the Darwin Tree of Life project to address patterns of intra-specific structural variation across a broad range of taxa. We’ll discuss how such type of datasets provides opportunities to investigate the role of species’ life-history and ecology in shaping the architecture of genetic diversity and architecture.

Dr Claire Mérot

Dr Claire Mérot

Centre National de la Recherche Scientifique - UMR 6553 - University of Rennes, France

16:00-16:15 Discussion
16:15-17:00 Panel discussion: future directions
Dr Joana Meier

Dr Joana Meier

Wellcome Sanger Institute, UK