< Return to Video

CARTA: DNA – Neandertal and Denisovan Genomes; Neandertal Genes in Humans; Neandertal Interbreeding

  • 0:00 - 0:01
  • 0:01 - 0:05
    This UCSD TV program is
    presented by University
  • 0:05 - 0:06
    of California Television.
  • 0:06 - 0:08
    Like what you learn?
  • 0:08 - 0:11
    Visit our website or follow
    us on Facebook and Twitter
  • 0:11 - 0:13
    to keep up with the
    latest programs.
  • 0:13 - 0:19
  • 0:19 - 0:23
    [MUSIC PLAYING]
  • 0:23 - 0:25
    We are the paradoxical ape.
  • 0:25 - 0:33
    Bipedal, naked, large-brained,
    long the master of fire, tools,
  • 0:33 - 0:39
    and language, but still trying
    to understand ourselves.
  • 0:39 - 0:46
    Aware that death is inevitable,
    yet filled with optimism.
  • 0:46 - 0:48
    We grow up slowly.
  • 0:48 - 0:51
    We hand down knowledge.
  • 0:51 - 0:54
    We empathize and deceive.
  • 0:54 - 0:57
    We shape the future from
    our shared understanding
  • 0:57 - 1:00
    of the past.
  • 1:00 - 1:03
    CARTA brings together experts
    from diverse disciplines
  • 1:03 - 1:09
    to exchange insights on who
    we are and how we got here,
  • 1:09 - 1:12
    an exploration made
    possible by the generosity
  • 1:12 - 1:14
    of humans like you.
  • 1:14 - 1:30
  • 1:30 - 1:33
    [MUSIC PLAYING]
  • 1:33 - 2:10
  • 2:10 - 2:14
    Well, thank you very much
    for the introduction, Anne,
  • 2:14 - 2:17
    and to the organizers for
    inviting me to participate
  • 2:17 - 2:20
    in the symposium today,
    and also to all of you
  • 2:20 - 2:22
    for coming back after the break.
  • 2:22 - 2:26
    So it's nice to
    see everyone again.
  • 2:26 - 2:28
    So I'm really happy to
    have the opportunity
  • 2:28 - 2:30
    to tell you about some
    of our recent work
  • 2:30 - 2:36
    on mapping archaic hominin DNA
    in the genomes of modern humans.
  • 2:36 - 2:39
    And actually, my
    talk, you'll see,
  • 2:39 - 2:43
    is going to be pretty similar
    to our first speaker, Shrira.
  • 2:43 - 2:45
    And in fact, when
    he was talking,
  • 2:45 - 2:49
    I was thinking to myself
    how nice it was actually
  • 2:49 - 2:51
    that some of the
    things he was saying
  • 2:51 - 2:54
    overlapped with what I
    was going to talk about,
  • 2:54 - 2:57
    because we were working
    on these projects
  • 2:57 - 2:58
    completely independently.
  • 2:58 - 3:01
    We developed very different
    statistical methods
  • 3:01 - 3:05
    to answer the same questions,
    and yet, by and large, we
  • 3:05 - 3:07
    came to many of the
    same conclusions.
  • 3:07 - 3:09
    So I think it engenders
    confidence in the things
  • 3:09 - 3:11
    that we're presenting today.
  • 3:11 - 3:13
    And my graduate student,
    Benjamin Vernot, and I
  • 3:13 - 3:17
    first became interested in this
    question of archaic admixture
  • 3:17 - 3:18
    a few years ago.
  • 3:18 - 3:22
    And I think this is actually one
    of the most fascinating topics
  • 3:22 - 3:26
    in all of genetics and genomics
    these days, is all of the things
  • 3:26 - 3:29
    that we've learned from
    ancient DNA sequencing.
  • 3:29 - 3:32
    And one of the more
    contentious questions, I think,
  • 3:32 - 3:34
    in human evolution has
    been whether or not
  • 3:34 - 3:38
    modern humans
    mated or hybridized
  • 3:38 - 3:41
    with archaic humans, like
    Neanderthals and Denisovans.
  • 3:41 - 3:45
    And for many decades, this was
    just an acrimonious debate,
  • 3:45 - 3:46
    and that was largely
    because the data didn't
  • 3:46 - 3:49
    exist to answer the question.
  • 3:49 - 3:53
    But with technologies developed
    by Svante Paabo and some
  • 3:53 - 3:55
    of the other speakers we've
    heard from this morning,
  • 3:55 - 4:01
    Matias and Kai, in the
    not too many years ago,
  • 4:01 - 4:04
    we were able to get
    high-quality genome sequences
  • 4:04 - 4:07
    from the Neanderthal
    and Denisovan genomes.
  • 4:07 - 4:10
    And this provided
    unambiguous evidence
  • 4:10 - 4:12
    that modern humans and
    these archaic humans
  • 4:12 - 4:16
    did, in fact, hybridize
    and exchange genes.
  • 4:16 - 4:19
    And as Matias talked about
    this morning, though,
  • 4:19 - 4:24
    studying ancient DNA from fossil
    still remains really challenging
  • 4:24 - 4:27
    because you have to find an
    appropriate specimen, first
  • 4:27 - 4:30
    of all, and you have to
    hope that the DNA has
  • 4:30 - 4:33
    been preserved over hundreds
    of thousands of years.
  • 4:33 - 4:37
    So my student and
    I thought, well,
  • 4:37 - 4:39
    if there was gene flow
    between modern humans
  • 4:39 - 4:43
    and archaic humans, maybe we
    could excavate ancient DNA,
  • 4:43 - 4:47
    not directly from fossils, but
    indirectly from the genomes
  • 4:47 - 4:49
    of modern humans.
  • 4:49 - 4:53
    And to give you a little bit of
    an intuition of how this works,
  • 4:53 - 4:56
    I'd like to argue that a little
    bit of archaic introgression
  • 4:56 - 4:57
    goes a long way.
  • 4:57 - 5:01
    And so in this schematic,
    I'm showing you a picture
  • 5:01 - 5:04
    of 10 or so individuals.
  • 5:04 - 5:05
    These aren't random individuals.
  • 5:05 - 5:10
    These are my colleagues in the
    department of genome sciences.
  • 5:10 - 5:12
    And this is what
    happens when you put
  • 5:12 - 5:13
    your picture on the internet.
  • 5:13 - 5:17
  • 5:17 - 5:22
    So each line here, let's
    imagine represents a stretch
  • 5:22 - 5:24
    of each person's genome.
  • 5:24 - 5:28
    And from previous work, we
    knew that all non-Africans
  • 5:28 - 5:31
    had about 2% of
    their DNA inherited
  • 5:31 - 5:32
    from Neanderthal ancestors.
  • 5:32 - 5:36
    And that's what's represented by
    these yellow chunks of sequence.
  • 5:36 - 5:39
    And so what we wanted to
    do was develop a method
  • 5:39 - 5:42
    where we could walk along
    an individual's genome
  • 5:42 - 5:45
    and pull out the parts
    that were inherited
  • 5:45 - 5:46
    from Neanderthal ancestors.
  • 5:46 - 5:50
    And the key here is that
    the 2% of my genome that
  • 5:50 - 5:52
    was inherited from Neanderthals
    might be a little bit
  • 5:52 - 5:55
    different than your 2%.
  • 5:55 - 5:58
    So that when we aggregate the
    data across many individuals,
  • 5:58 - 6:01
    we can actually recover
    a substantial amount
  • 6:01 - 6:03
    of the Neanderthal genome.
  • 6:03 - 6:06
    And actually, what I find most
    compelling about this approach
  • 6:06 - 6:10
    is that as opposed to
    sequencing ancient DNA
  • 6:10 - 6:13
    from a single fossil, by
    recovering these all surviving
  • 6:13 - 6:16
    archaic lineages,
    we're potentially
  • 6:16 - 6:19
    getting data that
    was-- or getting
  • 6:19 - 6:23
    sequences that were inherited
    from multiple archaic ancestors.
  • 6:23 - 6:25
    So we're getting
    population level data.
  • 6:25 - 6:27
    And that will allow
    us to make inferences
  • 6:27 - 6:29
    that are difficult or
    impossible to do if you just
  • 6:29 - 6:32
    have genetic data from
    a single individual.
  • 6:32 - 6:36
    So, I'm not going to talk a lot
    about the details of how we scan
  • 6:36 - 6:40
    along an individual's genome
    and look for archaic sequence,
  • 6:40 - 6:42
    but I did want to give you
    a little bit of intuition.
  • 6:42 - 6:46
    So what are the characteristics
    of introgressed archaic sequence
  • 6:46 - 6:47
    that we look at?
  • 6:47 - 6:51
    Well, what I'm showing you here
    is a simple schematic showing
  • 6:51 - 6:56
    that Europeans diverged from
    Africans about 80,000 years ago
  • 6:56 - 6:56
    or so.
  • 6:56 - 6:59
    And what we want to find--
  • 6:59 - 7:03
    or if we look at lineages
    superimposed on this tree,
  • 7:03 - 7:07
    we can see that what we're
    actually interested in finding
  • 7:07 - 7:09
    are cases like this.
  • 7:09 - 7:13
    So sequences that are found in
    non-Africans that were inherited
  • 7:13 - 7:15
    from a Neanderthal ancestor.
  • 7:15 - 7:18
    So what are the
    features that we expect
  • 7:18 - 7:20
    for these types of sequences?
  • 7:20 - 7:23
    Well, the first is
    that in contrast
  • 7:23 - 7:26
    to two modern human sequences
    that have a much more
  • 7:26 - 7:32
    recent common ancestor, mutation
    will have had a long time to act
  • 7:32 - 7:33
    and accumulate.
  • 7:33 - 7:36
    We want to find these sequences.
  • 7:36 - 7:39
    So mutation will have
    had a longer time to act
  • 7:39 - 7:41
    and accumulate on
    this lineage compared
  • 7:41 - 7:43
    to two modern human lineages.
  • 7:43 - 7:46
    But the other key
    feature is that admixture
  • 7:46 - 7:47
    happened relatively recently.
  • 7:47 - 7:51
    So in the last 60,000
    to 80,000 years or so.
  • 7:51 - 7:54
    And therefore, the
    Neanderthal haplotypes
  • 7:54 - 7:57
    will still persist over
    sizeable genomic regions.
  • 7:57 - 8:01
    So it's this combination of
    highly divergent sequences
  • 8:01 - 8:04
    that stretch over large
    genomic distances that
  • 8:04 - 8:07
    allow us to accurately
    and robustly predict
  • 8:07 - 8:09
    what are archaic
    sequences versus what
  • 8:09 - 8:11
    are modern human sequences.
  • 8:11 - 8:13
    And actually, the
    approach we're using
  • 8:13 - 8:16
    is a modification of a
    statistic called S star that
  • 8:16 - 8:18
    was developed by Jeff Wall.
  • 8:18 - 8:21
    And one of the nice
    things about this approach
  • 8:21 - 8:25
    is it doesn't explicitly use
    the Neanderthal or Denisovan
  • 8:25 - 8:28
    genome when making
    the initial inference.
  • 8:28 - 8:31
    And the really powerful
    thing about that is we
  • 8:31 - 8:34
    can potentially discover
    archaic lineages from groups
  • 8:34 - 8:36
    that we don't even
    know about yet.
  • 8:36 - 8:37
    And actually,
    that's a major part
  • 8:37 - 8:39
    of what we're looking at now.
  • 8:39 - 8:42
    But that will have to be a
    story for a different day.
  • 8:42 - 8:44
    So what I'm going to
    tell you about, though,
  • 8:44 - 8:49
    is applying this method to
    1,500 geographically diverse
  • 8:49 - 8:50
    individuals.
  • 8:50 - 8:55
    So whole genome sequences
    from about 1,500 people all
  • 8:55 - 8:56
    throughout the world.
  • 8:56 - 9:00
    These are largely sequences
    from the 1,000 genomes project.
  • 9:00 - 9:02
    So this is a
    publicly-available data set.
  • 9:02 - 9:06
    But to supplement this, we also
    sequenced, in collaboration
  • 9:06 - 9:11
    with Svante Pablo's group, 35
    individuals from Melanesia.
  • 9:11 - 9:15
    And the idea here was that
    we knew from previous work
  • 9:15 - 9:17
    that these individuals
    should have
  • 9:17 - 9:20
    substantial amounts of both
    Neanderthal and Denisovan
  • 9:20 - 9:22
    sequence.
  • 9:22 - 9:26
    And if we look at the
    amount of archaic ancestry
  • 9:26 - 9:28
    that we find per
    individual, that's
  • 9:28 - 9:30
    what I'm showing on this slide.
  • 9:30 - 9:33
    So this shows the
    distribution of the amount
  • 9:33 - 9:35
    of archaic sequence
    per individual
  • 9:35 - 9:40
    in Melanesians, East Asians,
    South Asians and Europeans.
  • 9:40 - 9:43
    And you can see that
    Melanesians have, on average,
  • 9:43 - 9:46
    much more archaic
    sequence per individual
  • 9:46 - 9:48
    compared to some of the
    other non-African groups.
  • 9:48 - 9:51
    And the reason is,
    as I just mentioned,
  • 9:51 - 9:56
    they have substantial amounts of
    both Neanderthal and Denisovan
  • 9:56 - 9:57
    sequence.
  • 9:57 - 10:03
    And so each row here is an
    individual, and the bar plots
  • 10:03 - 10:05
    correspond to how much
    Neanderthal versus Denisovan
  • 10:05 - 10:08
    sequence each individual has.
  • 10:08 - 10:11
    And if you look closely, there's
    a small amount of sequence
  • 10:11 - 10:13
    that we label as ambiguous.
  • 10:13 - 10:17
    This is sequence that we are
    confident is archaic in origin,
  • 10:17 - 10:20
    but we can't distinguish
    robustly, at least,
  • 10:20 - 10:23
    whether or it's
    Neanderthal or Denisovan.
  • 10:23 - 10:30
    So on average, Europeans have
    about 50 to 55 mega bases
  • 10:30 - 10:33
    of archaic sequence
    per individual,
  • 10:33 - 10:35
    and this is largely
    Neanderthal in origin.
  • 10:35 - 10:37
    South Asians have
    a little bit more,
  • 10:37 - 10:39
    East Asians have
    a little bit more,
  • 10:39 - 10:43
    and Melanesians have
    about, on average,
  • 10:43 - 10:47
    a hundred mega basis of archaic
    sequence per individual.
  • 10:47 - 10:49
    So that's 100
    million base pairs.
  • 10:49 - 10:50
    So that's great.
  • 10:50 - 10:53
    We can identify
    archaic sequence.
  • 10:53 - 10:54
    But I think the really
    interesting thing
  • 10:54 - 10:57
    is the things that we can
    potentially learn from it.
  • 10:57 - 11:00
    So what are the
    types of questions
  • 11:00 - 11:03
    that surviving archaic
    lineages allow us to ask?
  • 11:03 - 11:06
    So, I'm going to tell
    you about three things
  • 11:06 - 11:07
    that we've been interested in.
  • 11:07 - 11:10
    So the first is
    was hybridization
  • 11:10 - 11:13
    between archaic humans and
    modern humans deleterious?
  • 11:13 - 11:16
    That is, were there
    bad consequences?
  • 11:16 - 11:20
    Conversely, was hybridization
    beneficial or were there
  • 11:20 - 11:23
    some good consequences
    of hybridization?
  • 11:23 - 11:25
    And finally, what
    demographic model
  • 11:25 - 11:30
    is consistent with patterns of
    introgressed archaic sequences?
  • 11:30 - 11:32
    So let's start with
    the first question.
  • 11:32 - 11:36
    Were there deleterious
    consequences to hybridization?
  • 11:36 - 11:38
    And one of the most
    striking things
  • 11:38 - 11:41
    that we found when first
    looking at patterns
  • 11:41 - 11:44
    of Neanderthal sequence
    across the genome
  • 11:44 - 11:48
    is that it's very
    heterogeneous distributed.
  • 11:48 - 11:53
    So I'm showing you sequences
    from chromosome 7, 8, and 9.
  • 11:53 - 11:55
    So the blue ticks
    represent places
  • 11:55 - 11:57
    where we find
    Neanderthal sequence
  • 11:57 - 12:01
    in European individuals, the
    red lines indicate places
  • 12:01 - 12:03
    where we find Neanderthal
    sequence in East
  • 12:03 - 12:07
    Asian individuals, and
    the gray lines represent
  • 12:07 - 12:09
    parts of the genome that
    are too repetitive for us
  • 12:09 - 12:13
    to study and be confident
    in the predictions from.
  • 12:13 - 12:15
    And so if you squint long
    enough at this figure,
  • 12:15 - 12:17
    you can see that
    it doesn't appear,
  • 12:17 - 12:19
    and Shrira mentioned
    this earlier,
  • 12:19 - 12:22
    that patterns of
    surviving sequence
  • 12:22 - 12:26
    are randomly distributed
    across the chromosomes.
  • 12:26 - 12:29
    But you find these
    regions that have
  • 12:29 - 12:32
    been called deserts or
    depletions of archaic ancestry
  • 12:32 - 12:35
    that extend over really
    large genomic regions.
  • 12:35 - 12:37
    And this is
    consistent with there
  • 12:37 - 12:41
    being deleterious consequences
    to having Neanderthal sequence
  • 12:41 - 12:42
    in these regions.
  • 12:42 - 12:45
    And in fact, when we do
    extensive simulations
  • 12:45 - 12:48
    and try to model
    this process, we
  • 12:48 - 12:52
    see that there's an
    excess in the observed
  • 12:52 - 12:55
    data of these depletions,
    or archaic desserts,
  • 12:55 - 12:59
    compared to simulated data under
    neutral models of evolution.
  • 12:59 - 13:00
    So what does that mean?
  • 13:00 - 13:04
    It just means basically that
    under neutral evolution,
  • 13:04 - 13:06
    so where there's no
    fitness consequences
  • 13:06 - 13:08
    to the Neanderthal
    sequence, we really
  • 13:08 - 13:12
    wouldn't expect to see desserts
    this large in the real data.
  • 13:12 - 13:14
    So I think this is pretty
    compelling evidence
  • 13:14 - 13:17
    that there was
    deleterious fitness
  • 13:17 - 13:19
    consequences to hybridization.
  • 13:19 - 13:21
    And what's really
    fascinating to me
  • 13:21 - 13:27
    is that if you also superimpose
    Denisovan sequences on top
  • 13:27 - 13:30
    of this data, you
    find that there's
  • 13:30 - 13:34
    a significant overlap
    between Neanderthal desserts
  • 13:34 - 13:35
    and Denisovan desserts.
  • 13:35 - 13:38
    So the same places
    in the human genome
  • 13:38 - 13:41
    that are depleted of
    Neanderthal sequences
  • 13:41 - 13:44
    are also depleted of
    Denisovan sequences.
  • 13:44 - 13:46
    And again, this is very
    consistent with the idea
  • 13:46 - 13:50
    that these regions
    maybe are harboring
  • 13:50 - 13:53
    genetic changes that
    are very important
  • 13:53 - 13:55
    to modern human phenotypes.
  • 13:55 - 14:00
    So for example, the largest
    region or the largest depletion
  • 14:00 - 14:02
    is on chromosome 7.
  • 14:02 - 14:05
    It's about a 15-megabase desert.
  • 14:05 - 14:07
    So there's lots of genes.
  • 14:07 - 14:10
    One of the challenges in
    interpreting these regions is
  • 14:10 - 14:15
    that in a 15-megabase sequence,
    there's about 100 genes or so.
  • 14:15 - 14:18
    So you don't actually know
    which one is driving the signal
  • 14:18 - 14:19
    that you're interested in.
  • 14:19 - 14:23
    But one thing that caught our
    eye, and as Shrira mentioned
  • 14:23 - 14:26
    this morning, is right in the
    middle of this largest desert
  • 14:26 - 14:28
    is a gene called FOXP2.
  • 14:28 - 14:31
    And FOXP2 has previously
    been associated
  • 14:31 - 14:34
    with being important
    in speech and language.
  • 14:34 - 14:37
    And in fact, work
    from Svante's group
  • 14:37 - 14:40
    has shown that there's
    human-specific mutations
  • 14:40 - 14:43
    in regulatory regions of FOXP2.
  • 14:43 - 14:45
    So again, I want
    to be careful here.
  • 14:45 - 14:48
    And we haven't
    proven that FOXP2 is
  • 14:48 - 14:51
    driving this depletion
    of Neanderthal sequence
  • 14:51 - 14:52
    in this region.
  • 14:52 - 14:55
    But it's really
    interesting and I
  • 14:55 - 14:58
    think these deserts
    of archaic ancestry
  • 14:58 - 15:02
    can help us pinpoint places
    in the human genome that
  • 15:02 - 15:05
    might be important in
    modern human evolution.
  • 15:05 - 15:07
    So the search space
    is much narrower now
  • 15:07 - 15:10
    compared to when we
    first did these studies.
  • 15:10 - 15:14
    Another question that we
    were interested in asking
  • 15:14 - 15:17
    is, well, so it
    seems like there were
  • 15:17 - 15:20
    some deleterious consequences
    to hybridization.
  • 15:20 - 15:24
    Was there also evidence that
    maybe some of the sequences we
  • 15:24 - 15:26
    picked up from
    Neanderthals or Denisovans?
  • 15:26 - 15:27
    Was that beneficial?
  • 15:27 - 15:30
    And probably the simplest
    way to look at this question
  • 15:30 - 15:35
    is to look at the frequency
    of Neanderthal or Denisovan
  • 15:35 - 15:38
    sequences in modern populations.
  • 15:38 - 15:39
    And that's what I'm
    showing you here.
  • 15:39 - 15:42
    So each dot
    represents a frequency
  • 15:42 - 15:47
    of either a Neanderthal
    or a Denisovan haplotype
  • 15:47 - 15:51
    in East Asians, Europeans,
    Melanesians, and South Asians.
  • 15:51 - 15:53
    And you can see that
    for the most part,
  • 15:53 - 15:56
    the vast majority of
    archaic sequence that
  • 15:56 - 15:59
    persists in modern human
    populations is pretty rare.
  • 15:59 - 16:02
    So usually less
    than 10% frequency.
  • 16:02 - 16:05
    But there's a number
    of regions that
  • 16:05 - 16:07
    have risen to high frequency.
  • 16:07 - 16:12
    So 60% in some cases, in some
    cases, even a little bit higher.
  • 16:12 - 16:15
    And we've done extensive
    modeling, again,
  • 16:15 - 16:18
    to try to determine
    how likely it
  • 16:18 - 16:20
    is to see these high
    frequency haplotypes
  • 16:20 - 16:22
    in the absence of selection.
  • 16:22 - 16:26
    And it turns out
    that above 50% or so,
  • 16:26 - 16:29
    it's actually really
    unusual for a haplotype
  • 16:29 - 16:33
    to randomly drift up to
    such high frequencies.
  • 16:33 - 16:37
    So there's about
    100 or so, I think,
  • 16:37 - 16:41
    really high-confident targets
    of adaptive introgression.
  • 16:41 - 16:46
    And you might wonder, so what
    phenotypes were influenced
  • 16:46 - 16:48
    by adaptive introgression?
  • 16:48 - 16:53
    And so we knew previously
    that a version of a gene
  • 16:53 - 16:57
    called EPAS1 in certain
    Tibetan populations
  • 16:57 - 16:59
    was inherited from Denisovans.
  • 16:59 - 17:04
    And it's this gene that allows
    them to live at high altitude.
  • 17:04 - 17:07
    So, there was already some
    pretty good a priori evidence
  • 17:07 - 17:11
    that admixture
    with archaic humans
  • 17:11 - 17:14
    was beneficial for some genes.
  • 17:14 - 17:17
    And when we look
    carefully at these 100
  • 17:17 - 17:19
    high-frequency
    archaic haplotypes,
  • 17:19 - 17:21
    we see that they are
    largely comprised
  • 17:21 - 17:25
    of genes that can be
    categorized into two classes.
  • 17:25 - 17:28
    One, the immune system.
  • 17:28 - 17:32
    So many genes that
    influence immune
  • 17:32 - 17:35
    phenotypes, and in
    particular, innate immunity.
  • 17:35 - 17:37
    So the part of our
    immune system that deals
  • 17:37 - 17:39
    with viruses and bacteria.
  • 17:39 - 17:44
    So that seems to be a very
    enriched target or substrate
  • 17:44 - 17:46
    of adaptive introgression.
  • 17:46 - 17:48
    And I think you could have
    predicted this a priori.
  • 17:48 - 17:50
    So it's known that the
    immune system is often
  • 17:50 - 17:52
    a target of selection.
  • 17:52 - 17:56
    But the other category of genes
    that actually I would have never
  • 17:56 - 17:59
    predicted a priori
    turns out to be
  • 17:59 - 18:03
    a number of genes that have
    important functions in skin
  • 18:03 - 18:04
    biology.
  • 18:04 - 18:08
    So for example, one of these
    genes, as Shrira mentioned
  • 18:08 - 18:10
    is B and C2.
  • 18:10 - 18:14
    It's a gene called basal
    nucleon 2 that has recently
  • 18:14 - 18:16
    been shown to influence
    skin pigmentation
  • 18:16 - 18:20
    levels in Europeans.
  • 18:20 - 18:23
    So it's at very high frequency.
  • 18:23 - 18:25
    Each row here, again,
    is an individual
  • 18:25 - 18:27
    and columns are variant sites.
  • 18:27 - 18:31
    And these individuals carry
    the Neanderthal haplotype.
  • 18:31 - 18:33
    And you can see that it's a
    very high-frequency haplotype
  • 18:33 - 18:36
    in Europeans, not
    found in East Asians.
  • 18:36 - 18:37
    And finally, and
    real quickly, I'm
  • 18:37 - 18:42
    just going to give you a brief
    synopsis on the things we can
  • 18:42 - 18:45
    learn about demographic models.
  • 18:45 - 18:48
    And whenever I think
    of demographic models,
  • 18:48 - 18:51
    this image from National
    Geographic comes to mind.
  • 18:51 - 18:54
    I think it's a fascinating
    picture, actually.
  • 18:54 - 18:57
    My kids really like
    this too, because they
  • 18:57 - 19:05
    say I look like him, but
    that's a different story.
  • 19:05 - 19:08
    And so what are the things
    that we can try to learn?
  • 19:08 - 19:11
    Well, we want to know things
    like when did hybridization
  • 19:11 - 19:12
    happen?
  • 19:12 - 19:13
    How many times did it happen?
  • 19:13 - 19:16
    Did different populations have
    the same or different admixture
  • 19:16 - 19:18
    histories?
  • 19:18 - 19:20
    And I have a postdoc,
    Joshua Schrieber,
  • 19:20 - 19:23
    who developed a really
    clever method of trying
  • 19:23 - 19:27
    to infer whether two populations
    had the same admixture
  • 19:27 - 19:29
    history or different
    admixture histories.
  • 19:29 - 19:33
    And so when we apply this
    method to pairs of populations
  • 19:33 - 19:35
    that we analyzed--
  • 19:35 - 19:37
    the details here
    aren't important,
  • 19:37 - 19:41
    but we can infer this
    general picture of--
  • 19:41 - 19:45
    so this is Europeans, East
    Asians, Melanesians, Africans,
  • 19:45 - 19:47
    Neanderthals, and Denisovans.
  • 19:47 - 19:49
    And the main point I
    want to impress upon you
  • 19:49 - 19:53
    is that maybe even compared to
    as recently as a few years ago,
  • 19:53 - 19:55
    it seems like the
    admixture history
  • 19:55 - 19:59
    between modern and archaic
    humans is much more complex.
  • 19:59 - 20:03
    And in fact, the data is
    consistent with multiple pulses
  • 20:03 - 20:07
    of admixture between
    Neanderthals and modern humans,
  • 20:07 - 20:10
    and at least one pulse of
    admixture with Denisovans.
  • 20:10 - 20:13
    So, in conclusion,
    I've shown you
  • 20:13 - 20:16
    that substantial amounts of the
    Neanderthal and Denisovan genome
  • 20:16 - 20:20
    remain scattered in the
    DNA of modern humans,
  • 20:20 - 20:22
    that there were fitness
    consequences to hybridization,
  • 20:22 - 20:26
    both good and bad, and
    that the history of contact
  • 20:26 - 20:30
    was much more complex
    than previously thought.
  • 20:30 - 20:33
    And I would like to
    thank people in my lab.
  • 20:33 - 20:36
    So this guy right here in the
    middle is Benjamin Vernot.
  • 20:36 - 20:39
    He was a graduate student who is
    now a postdoc with Svante Paabo.
  • 20:39 - 20:42
    But he, by and large,
    did most of the work
  • 20:42 - 20:44
    that I talked about today.
  • 20:44 - 20:47
    So with that, I will thank you
    and I guess answer questions
  • 20:47 - 20:50
    after everyone's done.
  • 20:50 - 20:54
    [APPLAUSE]
  • 20:54 - 20:55
  • 20:55 - 20:58
    [MUSIC PLAYING]
  • 20:58 - 21:00
  • 21:00 - 21:03
    So I would like to start
    with acknowledgments.
  • 21:03 - 21:06
    So the work I'm going
    to present is actually
  • 21:06 - 21:08
    the work of many,
    many people who
  • 21:08 - 21:10
    were involved in
    sequencing these two
  • 21:10 - 21:13
    genomes I'm going to talk about
    and help to analyzing them.
  • 21:13 - 21:16
    I will give some more
    credit during the talk.
  • 21:16 - 21:19
    So let me start by
    just introducing
  • 21:19 - 21:22
    the samples that were used
    to generate these sequences.
  • 21:22 - 21:26
    So both of these
    samples were found
  • 21:26 - 21:29
    in the Denisova cave in the
    Altai Mountains in Russia.
  • 21:29 - 21:32
    And one was actually
    a small finger bone
  • 21:32 - 21:37
    that you see here on top, and
    another one was a small toe bone
  • 21:37 - 21:39
    that you see on the bottom.
  • 21:39 - 21:40
    And the reason that
    we are actually
  • 21:40 - 21:43
    having two genomes from the same
    place has to do with the fact
  • 21:43 - 21:48
    that the Denisova cave actually
    preserves the DNA in these bones
  • 21:48 - 21:50
    particularly well.
  • 21:50 - 21:51
    It's one of those
    exceptional places where
  • 21:51 - 21:55
    most of the DNA that we get
    out of these very old bones
  • 21:55 - 21:56
    really comes from
    the individual that
  • 21:56 - 22:02
    died and are not from bacterial
    or microbial contamination.
  • 22:02 - 22:04
    So what this allows
    us also to do
  • 22:04 - 22:09
    is to not just sequence these
    genomes to a low coverage,
  • 22:09 - 22:10
    meaning just a
    couple of sequences
  • 22:10 - 22:12
    from the nuclear genome,
    but we can actually
  • 22:12 - 22:15
    sequence them many times over.
  • 22:15 - 22:17
    And how this looks
    like then is that you
  • 22:17 - 22:21
    have small sequences
    stacking up like this
  • 22:21 - 22:24
    that are distributed randomly
    over the entire genome.
  • 22:24 - 22:30
    And you always have several
    of those for each position.
  • 22:30 - 22:34
    And so taken together,
    you have 30-fold coverage,
  • 22:34 - 22:36
    meaning at any position in--
  • 22:36 - 22:38
    on an average position
    in the genome,
  • 22:38 - 22:41
    you will have 30 different
    fragments for the finger bone,
  • 22:41 - 22:44
    and for the toe bone
    you have 50 fragments.
  • 22:44 - 22:47
  • 22:47 - 22:50
    So using these genomes
    then to actually understand
  • 22:50 - 22:52
    how they are related,
    you see that one
  • 22:52 - 22:56
    of those two genomes that
    we produced from this cave
  • 22:56 - 22:58
    falls together with
    other Neanderthals
  • 22:58 - 23:00
    that we have sequenced
    before to lower coverage.
  • 23:00 - 23:03
    And we call this the
    Altai Neanderthal.
  • 23:03 - 23:05
    And the second individual--
    the nuclear genome
  • 23:05 - 23:08
    is from a sister
    group of Neanderthals
  • 23:08 - 23:11
    that we call the Denisovan
    because they fall outside
  • 23:11 - 23:14
    of the variation that we
    observe of Neanderthals.
  • 23:14 - 23:16
    So they are more closely
    related to Neanderthals,
  • 23:16 - 23:21
    but they are not looking--
    close enoughly related
  • 23:21 - 23:24
    to deserve to be just
    called a Neanderthal,
  • 23:24 - 23:27
    and we rather call
    them Denisovan.
  • 23:27 - 23:29
    And so one of the
    questions that you
  • 23:29 - 23:32
    might ask yourself is, why
    are we actually bothering
  • 23:32 - 23:34
    with sequencing these
    genomes so deeply?
  • 23:34 - 23:38
    Why do we sequence them
    30 or even 50 times over?
  • 23:38 - 23:40
    And the reason for
    this is the fact
  • 23:40 - 23:42
    that we are diploid organism.
  • 23:42 - 23:47
    So we are actually having
    each chromosome twice.
  • 23:47 - 23:49
    One complete set
    are inherited from--
  • 23:49 - 23:52
    is inherited from the mother,
    and another complete set
  • 23:52 - 23:54
    of chromosomes is
    inherited from the father.
  • 23:54 - 23:56
    And so one of the
    things that you
  • 23:56 - 23:59
    can do when you have
    so many fragments
  • 23:59 - 24:03
    and so many sequences is you can
    call confidently the differences
  • 24:03 - 24:05
    between these two
    copies that you have
  • 24:05 - 24:07
    from the mother and the father.
  • 24:07 - 24:09
    And this is really the reason
    why we are sequencing it
  • 24:09 - 24:11
    so deeply so that we
    can call the differences
  • 24:11 - 24:14
    between these chromosomes.
  • 24:14 - 24:19
    And one of the most easy
    analysis that you can actually
  • 24:19 - 24:21
    do once you have
    sequenced so deeply
  • 24:21 - 24:23
    and call these differences
    between the chromosomes
  • 24:23 - 24:27
    is to just ask how different
    are they on average?
  • 24:27 - 24:30
    So this is called
    heterozygosity.
  • 24:30 - 24:32
    And you can actually put
    this into perspective
  • 24:32 - 24:36
    by also showing,
    as in this plot,
  • 24:36 - 24:38
    the level of
    heterozygosity, so the level
  • 24:38 - 24:42
    of differences between the
    chromosomes, in modern humans,
  • 24:42 - 24:43
    in present-day modern humans.
  • 24:43 - 24:46
    And we have some
    individuals from Africa here
  • 24:46 - 24:48
    and some individuals
    from outside of Africa.
  • 24:48 - 24:53
    And what you can see is that
    Africans have about 1 in 1,000
  • 24:53 - 24:55
    differences between the
    chromosomes that they inherited
  • 24:55 - 25:00
    from the mother and the father,
    while non-Africans have between
  • 25:00 - 25:02
    6 and 8 in 10,000.
  • 25:02 - 25:05
    And the archaics are much
    reduced compared to both
  • 25:05 - 25:09
    of these present-day human
    populations or present-day human
  • 25:09 - 25:15
    regions, and they are at a
    level of 2 to 3 in 10,000.
  • 25:15 - 25:17
    And there's even a quite
    significant difference
  • 25:17 - 25:20
    between the two archaics
    in that the Denisova
  • 25:20 - 25:22
    is higher than the Neanderthal.
  • 25:22 - 25:25
    So the Neanderthal
    is further reduced.
  • 25:25 - 25:27
    So one can actually look
    into this in more detail
  • 25:27 - 25:30
    by looking over the chromosomes.
  • 25:30 - 25:32
    So just going in a small
    window over the chromosomes
  • 25:32 - 25:35
    and just counting the
    differences that you observe.
  • 25:35 - 25:39
    And we have done this here for a
    French individual, the Denisova
  • 25:39 - 25:40
    and the Altai Neanderthal.
  • 25:40 - 25:46
    And what you can see is that
    the level of heterozygosity,
  • 25:46 - 25:47
    so the differences
    between the chromosomes,
  • 25:47 - 25:50
    varies over the genome.
  • 25:50 - 25:52
    But there's one thing that
    is very special, and that
  • 25:52 - 25:56
    is that the Altai Neanderthal
    has this very long stretch here,
  • 25:56 - 25:58
    for instance, on chromosome 21--
  • 25:58 - 26:00
    there are other stretches like
    this on the other chromosomes--
  • 26:00 - 26:03
    where there's hardly any
    difference between the two
  • 26:03 - 26:05
    parental copies.
  • 26:05 - 26:09
    And so now what one can
    do is one can actually
  • 26:09 - 26:12
    take the size of these stretches
    and how much of the genome
  • 26:12 - 26:14
    is actually residing
    in those stretches
  • 26:14 - 26:17
    to calculate back
    how closely related
  • 26:17 - 26:21
    the parents would have to be to
    generate stretches like this.
  • 26:21 - 26:26
    And this is an analysis that
    Flora Fay in Monty Slatkin's lab
  • 26:26 - 26:29
    in Berkeley was carrying out for
    for the analysis of the Altai
  • 26:29 - 26:30
    Neanderthal genome.
  • 26:30 - 26:32
    And what she found
    is that there are
  • 26:32 - 26:36
    several different relationships
    between the parents,
  • 26:36 - 26:38
    possible that would
    actually generate exactly
  • 26:38 - 26:40
    the patterns that we see.
  • 26:40 - 26:44
    And so I guess one
    easy way to say
  • 26:44 - 26:46
    this is that the parents
    of this individual
  • 26:46 - 26:48
    would have to be
    at least related
  • 26:48 - 26:52
    on the level of half siblings
    to generate these patterns.
  • 26:52 - 26:53
    So they were closely related.
  • 26:53 - 26:55
    And then you can take
    it a step further
  • 26:55 - 26:58
    and you just take
    your prediction
  • 26:58 - 26:59
    of how much you
    would actually expect
  • 26:59 - 27:01
    in terms of long
    stretches that are looking
  • 27:01 - 27:04
    like this, almost identical.
  • 27:04 - 27:06
    And you just ask, if
    I would subtract now,
  • 27:06 - 27:09
    based on what I
    understood, the family
  • 27:09 - 27:11
    relationship of the
    parents would be,
  • 27:11 - 27:15
    if I subtract this away, is
    there actually anything left?
  • 27:15 - 27:17
    And this is, in fact, the case.
  • 27:17 - 27:18
    So for the Altai
    Neanderthal, you still
  • 27:18 - 27:23
    see an excess over the stretches
    that you see in the Denisova
  • 27:23 - 27:24
    and in modern humans.
  • 27:24 - 27:26
    And this actually means that
    this is not just a singular
  • 27:26 - 27:28
    event that just once
    happened, that just by chance,
  • 27:28 - 27:30
    the parents were
    closely related,
  • 27:30 - 27:32
    but also further
    back in the past,
  • 27:32 - 27:37
    they were probably
    closely related ancestors.
  • 27:37 - 27:40
    So another topic that I
    would like to talk about
  • 27:40 - 27:41
    is archaic admixture.
  • 27:41 - 27:44
    And so we already heard
    about archaic admixture
  • 27:44 - 27:48
    from Neanderthals and
    Denisovans into modern humans.
  • 27:48 - 27:51
    What I would like to talk about
    is really archaic admixture
  • 27:51 - 27:52
    between both archaics.
  • 27:52 - 27:55
    But before I get to
    this, I would actually
  • 27:55 - 27:59
    like to go a little bit
    deeper into how we actually
  • 27:59 - 28:02
    know what signal we have to look
    for to understand that there
  • 28:02 - 28:03
    is really admixture.
  • 28:03 - 28:08
    And so as a very simple
    way of depicting this,
  • 28:08 - 28:11
    just imagine that you
    have a certain individual.
  • 28:11 - 28:13
    And of course, as I
    already explained,
  • 28:13 - 28:15
    every chromosome has two copies.
  • 28:15 - 28:19
    So this individual has these two
    copies of a certain chromosome,
  • 28:19 - 28:22
    an arbitrary one.
  • 28:22 - 28:24
    Of course, you can go
    back to his parents,
  • 28:24 - 28:26
    and one of those copies
    will come from the father,
  • 28:26 - 28:28
    and the other one will
    come from the mother.
  • 28:28 - 28:33
    So I can paint them
    now blue and red.
  • 28:33 - 28:35
    But I can also go a step
    further and actually
  • 28:35 - 28:38
    paint them according to whether
    they come from the grandparents
  • 28:38 - 28:40
    or from which
    grandparent they come.
  • 28:40 - 28:42
    And what you see
    in this picture now
  • 28:42 - 28:44
    is that there is actually a
    process called recombination
  • 28:44 - 28:47
    that is actually mixing up
    these different chromosomes
  • 28:47 - 28:49
    in the parents of
    the individual.
  • 28:49 - 28:51
    So now you have these
    random stretches
  • 28:51 - 28:56
    from all the grandparents that
    are painting these chromosomes.
  • 28:56 - 28:59
    And of course, you can take
    another step, another step,
  • 28:59 - 29:00
    another step.
  • 29:00 - 29:02
    And so essentially
    what this means
  • 29:02 - 29:04
    is that you're breaking
    up the ancestry-- when
  • 29:04 - 29:06
    you go through
    the ancestors, you
  • 29:06 - 29:09
    jump between
    different ancestries
  • 29:09 - 29:11
    when you go over the chromosome.
  • 29:11 - 29:15
    So you change which
    ancestor's genome you look at.
  • 29:15 - 29:19
    And so when you repeat this
    process for a very long time,
  • 29:19 - 29:23
    and let's say you have ancestry
    from one population for most
  • 29:23 - 29:26
    of the genome, but you
    have a couple of ancestors
  • 29:26 - 29:29
    from a different human group
    hiding among your ancestors,
  • 29:29 - 29:32
    then what the most
    common outcome will be
  • 29:32 - 29:35
    is that you will have these
    short stretches, where
  • 29:35 - 29:36
    one of the chromosomes
    actually shows
  • 29:36 - 29:40
    this ancestry of this
    other human group,
  • 29:40 - 29:43
    while the other
    chromosome is actually
  • 29:43 - 29:46
    looking like the chromosome
    of the majority of the groups
  • 29:46 - 29:49
    because these stretches
    will be randomly placed
  • 29:49 - 29:51
    on your two chromosomes.
  • 29:51 - 29:54
    And now we can actually
    use the Altai Neanderthal
  • 29:54 - 29:57
    and the Denisova to
    find out whether there
  • 29:57 - 30:01
    is any Neanderthal
    ancestry in the Denisova,
  • 30:01 - 30:03
    or whether there is
    any Denisovan ancestry
  • 30:03 - 30:05
    in the Neanderthal
    individual that we sequenced.
  • 30:05 - 30:08
    And so in one direction,
    just showing--
  • 30:08 - 30:10
    so if Neanderthals would
    contribute to the Denisovan,
  • 30:10 - 30:13
    what we would expect is that
    there are some stretches where
  • 30:13 - 30:16
    the Denisovan looks very
    much like a Neanderthal,
  • 30:16 - 30:19
    but on the other chromosome, we
    would expect that it actually
  • 30:19 - 30:21
    looks like a normal Denisovan.
  • 30:21 - 30:23
    That means that the two
    chromosomes are actually
  • 30:23 - 30:25
    very different.
  • 30:25 - 30:26
    And so the prediction
    that this makes
  • 30:26 - 30:30
    is that if you go
    to regions where--
  • 30:30 - 30:32
    shown here on the
    left-hand side,
  • 30:32 - 30:34
    so the x scale is giving
    you how closely related
  • 30:34 - 30:37
    you can this-- make any
    particular window that you look
  • 30:37 - 30:40
    through, how closely related
    you can actually make that
  • 30:40 - 30:43
    to the other archaic.
  • 30:43 - 30:46
    So when you have windows
    where the Altai Neanderthal is
  • 30:46 - 30:50
    very closely related to the
    Denisovan, shown here in blue,
  • 30:50 - 30:52
    you actually see no effect.
  • 30:52 - 30:54
    But if you look in the
    same for the Denisovans,
  • 30:54 - 30:59
    when the Denisova can be made
    very closely related-- or looks
  • 30:59 - 31:01
    actually very closely related
    to the Altai Neanderthal,
  • 31:01 - 31:03
    you see that the two
    chromosomes are very different,
  • 31:03 - 31:08
    and this is shown here
    in red at this position.
  • 31:08 - 31:10
    So this is a hallmark sign
    that there is actually,
  • 31:10 - 31:12
    among the ancestors
    of the Denisovans,
  • 31:12 - 31:16
    some Neanderthal ancestry.
  • 31:16 - 31:18
    The last signal I
    want to talk about
  • 31:18 - 31:21
    is actually the one of
    unknown archaic material
  • 31:21 - 31:23
    that we found in the Denisovan.
  • 31:23 - 31:26
    And so the first signal
    that we saw for that
  • 31:26 - 31:29
    is really just when you look
    for divergence to Africa,
  • 31:29 - 31:32
    so that's nothing else than just
    looking for how many differences
  • 31:32 - 31:36
    we observe, we actually see
    when we take larger windows
  • 31:36 - 31:38
    and we just compare
    to an African,
  • 31:38 - 31:41
    that the Denisova is always
    a little bit more different
  • 31:41 - 31:42
    than the Altai Neanderthal.
  • 31:42 - 31:44
    So these two
    distributions that you
  • 31:44 - 31:46
    see here, they are-- the
    one for the Denisova in blue
  • 31:46 - 31:49
    is slightly shifted
    to the right.
  • 31:49 - 31:50
    And you can look
    even deeper into this
  • 31:50 - 31:54
    by actually looking at
    different allele frequencies
  • 31:54 - 32:00
    and divide up your comparison by
    how many Africans actually carry
  • 32:00 - 32:03
    a certain derived variant,
    meaning a new variant that
  • 32:03 - 32:06
    occurred some time after
    the split from chimpanzee.
  • 32:06 - 32:08
    And then all Africans
    are the same,
  • 32:08 - 32:10
    you actually see that the
    signal is the strongest.
  • 32:10 - 32:13
    So you see the most differences.
  • 32:13 - 32:20
    And in an analysis that also
    Monty Slatkin's lab carried out
  • 32:20 - 32:25
    in Berkeley with Fernando
    Racimo, what they did
  • 32:25 - 32:29
    was essentially taking the
    signals that I just described,
  • 32:29 - 32:31
    and they tried three
    different models
  • 32:31 - 32:36
    to actually compare how
    this could come about.
  • 32:36 - 32:39
    And so the first model
    assumes that there
  • 32:39 - 32:42
    was gene flow from Neanderthals
    into the common ancestor
  • 32:42 - 32:44
    of all modern humans.
  • 32:44 - 32:47
    The second model is assuming
    that all modern humans actually
  • 32:47 - 32:50
    gave some material
    to the Neanderthals.
  • 32:50 - 32:55
    And so these first two
    models are essentially
  • 32:55 - 32:57
    trying to explain
    how you could make
  • 32:57 - 33:00
    the Neanderthals and the modern
    humans more closely related.
  • 33:00 - 33:03
    And the last model is one where
    you have some lineage that we
  • 33:03 - 33:06
    haven't observed, so we
    don't know what it is, that
  • 33:06 - 33:08
    contributed to the Denisovans.
  • 33:08 - 33:10
    And that would make the
    Denisovans more distantly
  • 33:10 - 33:12
    related to modern humans.
  • 33:12 - 33:15
    And so in most
    comparisons, the model 3
  • 33:15 - 33:17
    was actually the
    best explanation
  • 33:17 - 33:18
    that we could find for the data.
  • 33:18 - 33:22
    And so we believe that there
    is this super archaic admixture
  • 33:22 - 33:27
    of some very deeply divergent
    lineage into the Denisovan.
  • 33:27 - 33:29
    So what I would like
    to say in the end is--
  • 33:29 - 33:31
    or what I would like
    to show in the end
  • 33:31 - 33:33
    is really just a general
    overview of the different gene
  • 33:33 - 33:35
    flows that we have now observed.
  • 33:35 - 33:38
    And this picture is
    not quite complete yet.
  • 33:38 - 33:42
    So what you can
    see is that we have
  • 33:42 - 33:49
    the ancestry of these deeply
    divergent ancestor that
  • 33:49 - 33:51
    contributed to Denisovans.
  • 33:51 - 33:52
    We have the
    Neanderthal admixture
  • 33:52 - 33:57
    into the modern humans, we have
    contributions from Denisovans
  • 33:57 - 34:00
    to modern humans, and
    the Neanderthal admixture
  • 34:00 - 34:02
    into the Denisovans that
    I just talked about.
  • 34:02 - 34:06
    And they are-- it seems that
    there is-- by now there are also
  • 34:06 - 34:08
    other publications that say
    that there are contributions
  • 34:08 - 34:11
    to Africans and so on and so on.
  • 34:11 - 34:13
    And so I think what this all
    means, when you sum it up,
  • 34:13 - 34:17
    is that these different
    types of admixtures
  • 34:17 - 34:19
    are actually something
    that is quite common.
  • 34:19 - 34:21
    So it actually happens
    quite a lot in the past,
  • 34:21 - 34:22
    and that is something
    that is really
  • 34:22 - 34:25
    a transition in our thinking,
    because originally, I
  • 34:25 - 34:28
    think we were all very skeptical
    that there was actually
  • 34:28 - 34:30
    any admixture between
    these archaic groups.
  • 34:30 - 34:33
    And with this, I
    would like to end
  • 34:33 - 34:36
    and say thank you
    for your attention.
  • 34:36 - 34:40
    [APPLAUSE]
  • 34:40 - 34:41
  • 34:41 - 34:45
    [MUSIC PLAYING]
  • 34:45 - 34:47
  • 34:47 - 34:48
    Thank you very much.
  • 34:48 - 34:50
    So I couldn't have
    asked for a better
  • 34:50 - 34:53
    introduction for what I'm going
    to talk to you about here today.
  • 34:53 - 34:55
    We've heard from several
    of the previous speakers
  • 34:55 - 34:59
    about the genetic legacy of
    interbreeding with Neanderthals.
  • 34:59 - 35:02
    But I'm very interested in
    understanding what, if anything,
  • 35:02 - 35:06
    is the phenotypic legacy in
    modern human populations?
  • 35:06 - 35:09
    Is this Neanderthal DNA that
    remains in us, is it functional?
  • 35:09 - 35:12
    And if so, what
    function does it have?
  • 35:12 - 35:15
    And so as we've seen, thanks
    to the pioneering work
  • 35:15 - 35:17
    of many of these
    previous speakers,
  • 35:17 - 35:20
    we know that
    Neanderthal DNA remains
  • 35:20 - 35:23
    in certain modern
    human populations.
  • 35:23 - 35:27
    And if we look at a schematic
    of a human chromosome here,
  • 35:27 - 35:32
    you can think of this as a long
    string of As, Ts Cs and Gs.
  • 35:32 - 35:36
    I've colored in blue all the
    locations where we've ever
  • 35:36 - 35:40
    observed someone living
    today to have Neanderthal DNA
  • 35:40 - 35:41
    in their genome.
  • 35:41 - 35:44
    And if you look across
    many, many thousands
  • 35:44 - 35:47
    of European and
    Asian individuals,
  • 35:47 - 35:49
    you'll see that on average,
    around 2% of their genomes
  • 35:49 - 35:52
    are derived from
    Neanderthal interbreeding.
  • 35:52 - 35:55
    As we've heard, different
    people will have a different 2%.
  • 35:55 - 35:59
    My 2% is different than Ed's
    2%, is different than Anne's 2%.
  • 35:59 - 36:01
    And I want you to
    remember that because this
  • 36:01 - 36:04
    is a really important feature
    that we're going to use later
  • 36:04 - 36:07
    to try to understand the
    function of these different bits
  • 36:07 - 36:10
    of Neanderthal DNA that
    remain in our genomes.
  • 36:10 - 36:12
    And at some parts
    of our genome are
  • 36:12 - 36:16
    more likely to retain
    Neanderthal DNA than others.
  • 36:16 - 36:19
    So in one extreme, we see
    these Neanderthal deserts,
  • 36:19 - 36:22
    like the position here
    on the right=hand side,
  • 36:22 - 36:26
    where we've never observed
    anyone to have Neanderthal DNA.
  • 36:26 - 36:28
    And then on the
    left-hand side, we
  • 36:28 - 36:32
    have the other extreme
    where we have up to 60%
  • 36:32 - 36:34
    of European individuals.
  • 36:34 - 36:36
    If you went out and sequenced,
    a bunch of European people
  • 36:36 - 36:40
    would have Neanderthal
    DNA at that location.
  • 36:40 - 36:44
    And so ultimately, this
    suggests that Neanderthal DNA
  • 36:44 - 36:46
    had an influence
    on our ancestors
  • 36:46 - 36:48
    after the interbreeding.
  • 36:48 - 36:50
    In some cases, perhaps
    positive, in other cases,
  • 36:50 - 36:52
    perhaps negative.
  • 36:52 - 36:55
    And so for me, this
    raised a very big question
  • 36:55 - 36:58
    that I really wanted to
    answer is, OK, so then
  • 36:58 - 37:02
    what is the phenotypic legacy of
    this Neanderthal interbreeding
  • 37:02 - 37:06
    and the DNA that remains
    from it in modern humans?
  • 37:06 - 37:11
    And so I hope if you remember
    nothing else from my talk,
  • 37:11 - 37:13
    really just two main points.
  • 37:13 - 37:16
    The first is that indeed,
    interbreeding with Neanderthals
  • 37:16 - 37:20
    has left a phenotypic
    legacy in modern humans.
  • 37:20 - 37:22
    And the way I'm going
    to go about trying
  • 37:22 - 37:24
    to show what that
    legacy has been
  • 37:24 - 37:30
    is using a new type of resource
    that's just becoming available,
  • 37:30 - 37:33
    and that's of large
    clinical biobanks
  • 37:33 - 37:36
    with electronic medical
    records from patients,
  • 37:36 - 37:39
    from hospitals linked
    to genetic information.
  • 37:39 - 37:41
    And this is a really,
    really powerful resource
  • 37:41 - 37:43
    for studying the
    genetics of disease,
  • 37:43 - 37:45
    but I also think it's a really,
    really powerful resource
  • 37:45 - 37:48
    for studying the genetics
    of our recent evolution.
  • 37:48 - 37:50
    And so if you want
    to, you can go
  • 37:50 - 37:52
    to sleep now and just
    remember those two things
  • 37:52 - 37:54
    and I won't blame you.
  • 37:54 - 37:58
    So basically, we got the
    idea for this project
  • 37:58 - 38:03
    because I collaborate with a
    big national consortium called
  • 38:03 - 38:07
    the Electronic Medical
    Records and Genomics Network.
  • 38:07 - 38:09
    And what this is
    is a collaboration
  • 38:09 - 38:13
    of about 10 academic hospitals
    from across the nation
  • 38:13 - 38:17
    that have electronic medical
    record systems implemented
  • 38:17 - 38:21
    in their hospitals, and
    also genetic information
  • 38:21 - 38:24
    from those patients linked
    to their electronic medical
  • 38:24 - 38:25
    records.
  • 38:25 - 38:26
    And so this looks
    a little something
  • 38:26 - 38:29
    like this, where on
    the left-hand side
  • 38:29 - 38:31
    we have John Doe's
    patient record.
  • 38:31 - 38:33
    He's been coming to the
    hospital and seeing doctors,
  • 38:33 - 38:34
    let's say, for
    the last 10 years.
  • 38:34 - 38:37
    And we've got records of
    all those events and all
  • 38:37 - 38:40
    the treatments he's received
    in that electronic form.
  • 38:40 - 38:45
    And then some day John comes in
    to have blood drawn and he says,
  • 38:45 - 38:48
    "Yeah, actually, it'd be OK if
    you use any leftover material
  • 38:48 - 38:53
    from this blood draw for
    basic medical research."
  • 38:53 - 38:55
    And if he's
    consented to do that,
  • 38:55 - 38:58
    then all that
    information is sent
  • 38:58 - 39:00
    through a de-identifying
    process where
  • 39:00 - 39:03
    all the identifying
    information is removed
  • 39:03 - 39:05
    from that electronic
    medical record,
  • 39:05 - 39:08
    but the basics of the treatment
    history are maintained.
  • 39:08 - 39:10
    And then the blood
    sample is also
  • 39:10 - 39:13
    passed through and biobanked
    and given an ID that links it up
  • 39:13 - 39:17
    to that anonymized version of
    the electronic medical record.
  • 39:17 - 39:19
    And now this is really
    powerful because it
  • 39:19 - 39:22
    enables us to do
    genetic association
  • 39:22 - 39:24
    testing on a very large scale.
  • 39:24 - 39:26
    So what is genetic
    association testing?
  • 39:26 - 39:28
    Well, we can--
    let's imagine we've
  • 39:28 - 39:31
    got a number of patients
    here for which we have
  • 39:31 - 39:33
    these biobanked blood samples.
  • 39:33 - 39:36
    And let's say we're interested
    in studying something
  • 39:36 - 39:37
    about their genetics.
  • 39:37 - 39:39
    Well, we can look at
    these blood samples
  • 39:39 - 39:41
    and see at one given
    position in their genome
  • 39:41 - 39:44
    whether or not they
    have an A, T, C, or G.
  • 39:44 - 39:48
    And so in this example, patient
    1 has an A, patient 2 has an A,
  • 39:48 - 39:51
    and then patient N has a G.
  • 39:51 - 39:55
    And let's say we're also
    interested in heart disease
  • 39:55 - 39:57
    and whether or not this
    particular location
  • 39:57 - 40:00
    in those patients genome has any
    effect on their risk for heart
  • 40:00 - 40:01
    disease.
  • 40:01 - 40:05
    What we can do is then go look
    in their electronic medical
  • 40:05 - 40:08
    record and say, all right,
    well, has this person ever
  • 40:08 - 40:10
    been treated for heart disease?
  • 40:10 - 40:13
    And let's say in this case we
    find that, yes, patients 1 and 2
  • 40:13 - 40:16
    have, and then
    patient N has not.
  • 40:16 - 40:18
    And once we have
    that information,
  • 40:18 - 40:21
    we can perform statistical
    tests for association
  • 40:21 - 40:24
    between these individual's
    DNA at that given
  • 40:24 - 40:27
    position in their genome,
    and whether or not they've
  • 40:27 - 40:29
    ever been treated
    for heart disease.
  • 40:29 - 40:31
    And so in this
    simplistic example,
  • 40:31 - 40:34
    we might say that
    yes, having an A
  • 40:34 - 40:36
    at this location in your
    genome increases your risk
  • 40:36 - 40:37
    for heart disease.
  • 40:37 - 40:39
    Now, of course,
    we don't normally
  • 40:39 - 40:41
    do this on three people, we
    do this on tens of thousands
  • 40:41 - 40:44
    of people to try to find
    significant associations
  • 40:44 - 40:49
    between regions of our
    genome and disease.
  • 40:49 - 40:52
    Now, this is all well and
    good, but let's say we're
  • 40:52 - 40:54
    interested in another disease.
  • 40:54 - 40:56
    Let's say we're interested in
    arthritis and the genetic basis
  • 40:56 - 40:57
    for arthritis.
  • 40:57 - 41:00
    Well, if we didn't have this
    electronic medical record
  • 41:00 - 41:03
    system, we'd have to go out and
    collect a whole other cohort
  • 41:03 - 41:06
    of people that had arthritis
    and then some matched control
  • 41:06 - 41:09
    people that didn't have
    arthritis and then genotype
  • 41:09 - 41:12
    them and then test whether
    or not those genetic loci had
  • 41:12 - 41:13
    any effect on the risk.
  • 41:13 - 41:15
    But because we have the
    electronic medical record
  • 41:15 - 41:18
    system, we can instead
    just go look in the record
  • 41:18 - 41:21
    and say, all right, let's find
    a new set of cases and controls
  • 41:21 - 41:25
    for arthritis and perform
    genetic association tests,
  • 41:25 - 41:28
    again, on the genetic
    information we already have.
  • 41:28 - 41:31
    So that's all well and
    good, but we're here
  • 41:31 - 41:34
    because we care about human
    origins and human evolution.
  • 41:34 - 41:35
    So let's get back to that.
  • 41:35 - 41:38
    How can we use this kind of
    data to answer this question
  • 41:38 - 41:42
    about the effects of the
    Neanderthal DNA that remains
  • 41:42 - 41:43
    in modern human populations?
  • 41:43 - 41:47
    And so what we did
    was to start with data
  • 41:47 - 41:50
    from this large eMERGE,
    Electronic Medical Records
  • 41:50 - 41:52
    and Genomics Network,
    from across the country.
  • 41:52 - 41:57
    We got data for about
    28,000 patients from across
  • 41:57 - 41:58
    the country.
  • 41:58 - 42:01
    And we first looked
    at their genotypes.
  • 42:01 - 42:05
    We first found genetic
    information from about 600,000
  • 42:05 - 42:08
    positions across their genomes.
  • 42:08 - 42:10
    And so you can think of
    this as a string, again,
  • 42:10 - 42:14
    of about 600,000 As, Ts, Cs, and
    Gs that we've associated with
  • 42:14 - 42:16
    each one of these patients.
  • 42:16 - 42:20
    And then what we realized we
    could do was use these great
  • 42:20 - 42:24
    high-quality maps of
    Neanderthal DNA that remain in--
  • 42:24 - 42:26
    remains in modern
    human populations
  • 42:26 - 42:28
    that you've heard about
    from Shrira and Josh.
  • 42:28 - 42:30
    And so we could
    look at those maps
  • 42:30 - 42:34
    and then intersect them
    with our own patients
  • 42:34 - 42:36
    and apply those techniques
    to our patients genomes
  • 42:36 - 42:41
    and identify regions where each
    patient had Neanderthal DNA.
  • 42:41 - 42:44
    And so we could do this for
    about 1,500 of these positions
  • 42:44 - 42:46
    in these patient's genomes.
  • 42:46 - 42:48
    And we can see where some
    may have Neanderthal DNA
  • 42:48 - 42:51
    and others may not.
  • 42:51 - 42:54
    And then finally, the last
    piece, as I indicated before,
  • 42:54 - 42:57
    comes from using these
    electronic medical record
  • 42:57 - 43:01
    data to define a set
    of phenotypes or traits
  • 43:01 - 43:02
    for each of these patients.
  • 43:02 - 43:04
    We can ask for hundreds
    of different phenotypes,
  • 43:04 - 43:07
    covering the whole spectrum
    of things you might
  • 43:07 - 43:09
    be treated for by a doctor.
  • 43:09 - 43:13
    Whether or not each of these
    people either had that disease,
  • 43:13 - 43:14
    they were a case, or
    they were a control,
  • 43:14 - 43:16
    or we couldn't
    really figure it out
  • 43:16 - 43:18
    and we should leave them
    out of the analysis.
  • 43:18 - 43:23
    And so then using this matrix of
    data of genetic data annotated
  • 43:23 - 43:25
    with Neanderthal
    ancestry and then
  • 43:25 - 43:28
    many, many different
    phenotypes, we
  • 43:28 - 43:31
    were able to start testing for
    the effects of Neanderthal DNA
  • 43:31 - 43:35
    on a much broader scale than
    really had been possible before.
  • 43:35 - 43:38
    And so before I get into
    what we actually find,
  • 43:38 - 43:40
    I'll try to be a good
    scientist and think
  • 43:40 - 43:43
    about what we would expect to
    find before actually running
  • 43:43 - 43:45
    the experiment.
  • 43:45 - 43:47
    And so what did we expect?
  • 43:47 - 43:55
    Now, as modern humans migrated
    out of Africa where they first
  • 43:55 - 43:58
    appeared, they
    encountered a number
  • 43:58 - 44:00
    of different environments.
  • 44:00 - 44:02
    So they encountered different
    climates, different levels
  • 44:02 - 44:05
    of sun exposure,
    different temperatures,
  • 44:05 - 44:08
    different seasonal patterns.
  • 44:08 - 44:11
    They also encountered
    different animals and plants
  • 44:11 - 44:15
    that led to different diets,
    and very importantly, they also
  • 44:15 - 44:17
    encountered different pathogens.
  • 44:17 - 44:20
    And so it's been
    proposed that perhaps
  • 44:20 - 44:24
    by interbreeding with
    Neanderthals and Denisovans,
  • 44:24 - 44:26
    and perhaps other
    archaic human forms
  • 44:26 - 44:30
    that had been living in these
    environments for hundreds
  • 44:30 - 44:32
    of thousands of
    years, in many cases,
  • 44:32 - 44:34
    before anatomically
    modern human groups
  • 44:34 - 44:36
    ever arrived there,
    perhaps there really
  • 44:36 - 44:40
    was some adaptive benefit
    you could get from spending
  • 44:40 - 44:41
    a night with a Neanderthal.
  • 44:41 - 44:46
    Maybe that was not
    such a bad trade off.
  • 44:46 - 44:49
    But this is really a hypothesis.
  • 44:49 - 44:52
    This hasn't been shown at all.
  • 44:52 - 44:54
    So under this
    hypothesis, we might
  • 44:54 - 45:00
    expect that the Neanderthal DNA
    that could have been adaptive
  • 45:00 - 45:05
    in our modern human populations
    would have been influencing
  • 45:05 - 45:07
    human traits that are
    involved in interactions
  • 45:07 - 45:09
    with the environment.
  • 45:09 - 45:11
    So things like our
    immune system, of course,
  • 45:11 - 45:12
    be one of the most important.
  • 45:12 - 45:16
    But our skin perhaps,
    and perhaps also
  • 45:16 - 45:19
    our metabolism or other
    traits related to our diet.
  • 45:19 - 45:24
    And so we also expected that
    we might see some effects
  • 45:24 - 45:27
    on our bone or
    skeletal structure
  • 45:27 - 45:30
    because we also know about
    many important differences
  • 45:30 - 45:32
    or many very easily
    detectable differences
  • 45:32 - 45:35
    between the bones of
    anatomically modern humans
  • 45:35 - 45:37
    and Neanderthals.
  • 45:37 - 45:39
    So those are some of the
    things we were expecting
  • 45:39 - 45:41
    as we went into this analysis.
  • 45:41 - 45:43
    So what did we find?
  • 45:43 - 45:48
    And now in doing this analysis,
    we decided to split up our data,
  • 45:48 - 45:52
    our 28,000 individuals,
    into two different sets,
  • 45:52 - 45:55
    a discovery cohort of
    about 13,500 individuals,
  • 45:55 - 45:58
    which we'd run an
    initial analysis,
  • 45:58 - 46:00
    and then a replication cohort in
    which we would try to replicate
  • 46:00 - 46:03
    anything that we found
    in that first cohort.
  • 46:03 - 46:06
    So in the discovery, I'm
    going to show you just some
  • 46:06 - 46:10
    of the top associations we
    found between Neanderthal DNA
  • 46:10 - 46:16
    and potential phenotypes
    in European ancestry,
  • 46:16 - 46:18
    anatomically-modern
    human populations.
  • 46:18 - 46:21
    And so when I saw this, I
    almost couldn't believe it
  • 46:21 - 46:23
    because what do
    we see at the top?
  • 46:23 - 46:26
    We see osteoporosis,
    a bone trait.
  • 46:26 - 46:28
    Then we see
    hypercoagulable state.
  • 46:28 - 46:29
    So what is that?
  • 46:29 - 46:31
    That's just blood clotting.
  • 46:31 - 46:32
    Your blood's too thick.
  • 46:32 - 46:35
    It clots too much, which can
    lead to all sorts of problems.
  • 46:35 - 46:38
    Then we see protein calorie
    malnutrition, a metabolic trait.
  • 46:38 - 46:40
    And so this is really
    surprisingly matching
  • 46:40 - 46:42
    what we expected.
  • 46:42 - 46:44
    But before I go too
    far into interpreting
  • 46:44 - 46:47
    these, let's talk about
    that replication analysis
  • 46:47 - 46:48
    I mentioned.
  • 46:48 - 46:52
    So what we did here is we looked
    at the other 14,500 individuals
  • 46:52 - 46:55
    we left out of the initial
    analysis and tested to see
  • 46:55 - 46:59
    whether we saw consistent
    effects in that group.
  • 46:59 - 47:03
    And so luckily for four
    of these top associations
  • 47:03 - 47:05
    I'm telling about, we did
    see something consistent.
  • 47:05 - 47:07
    We did see a consistent effect.
  • 47:07 - 47:11
    Unfortunately, the osteoporosis
    one did not replicate there.
  • 47:11 - 47:13
    And I should say,
    just as an aside,
  • 47:13 - 47:15
    I don't think that necessarily
    means it's not true,
  • 47:15 - 47:19
    but it's notoriously difficult
    sometimes to replicate these
  • 47:19 - 47:21
    genetic associations and
    we're following that up
  • 47:21 - 47:22
    in some other cohorts.
  • 47:22 - 47:26
    So let's focus on these
    four that did replicate.
  • 47:26 - 47:29
    So first we have this
    hypercoagulable state
  • 47:29 - 47:31
    association that I already
    talked a little bit about.
  • 47:31 - 47:35
    So this means that your blood
    coagulates very quickly.
  • 47:35 - 47:37
    And this is actually
    a very important part
  • 47:37 - 47:39
    of the early immune response.
  • 47:39 - 47:41
    The coagulation
    factors are really
  • 47:41 - 47:44
    some of the first proteins that
    pathogens interact with when
  • 47:44 - 47:45
    they come into your body.
  • 47:45 - 47:47
    And so this really
    fits in with this idea
  • 47:47 - 47:50
    of the potential
    immune benefits.
  • 47:50 - 47:52
    And we've looked into
    the molecular basis
  • 47:52 - 47:54
    for this association
    and we've actually
  • 47:54 - 47:58
    been able to show that the
    Neanderthal DNA nearby--
  • 47:58 - 48:01
    sorry, this Neanderthal DNA that
    is associated with increased
  • 48:01 - 48:05
    coagulation increases the level
    of several nearby coagulation
  • 48:05 - 48:07
    factors in your blood.
  • 48:07 - 48:09
    So we have a very compelling
    molecular mechanism
  • 48:09 - 48:12
    for how that might be happening.
  • 48:12 - 48:15
    And now, I'm sure by now you've
    read the rest of this list
  • 48:15 - 48:19
    and seen one that's a little
    bit more difficult to interpret.
  • 48:19 - 48:21
    And that's tobacco use disorder.
  • 48:21 - 48:24
    And so that really just
    means addiction to nicotine.
  • 48:24 - 48:29
    And so I think should we
    be thinking about this?
  • 48:29 - 48:33
    Were Neanderthals sitting
    around outside of caves smoking?
  • 48:33 - 48:35
    And I want to say
    unequivocally, no.
  • 48:35 - 48:37
    No, we cannot say this.
  • 48:37 - 48:40
    You should not say this,
    you should not think this.
  • 48:40 - 48:45
    This extreme example highlights
    a really important point,
  • 48:45 - 48:49
    that the effects of genetic
    variation in modern environments
  • 48:49 - 48:54
    may not actually reflect its
    effects 50,000 years ago against
  • 48:54 - 48:56
    a very different genetic
    background and Neanderthals,
  • 48:56 - 48:59
    or in early human
    Neanderthal hybrids.
  • 48:59 - 49:03
    And on top of that, of course,
    tobacco is a new-world plant.
  • 49:03 - 49:06
    They didn't really have nicotine
    existing in their environment.
  • 49:06 - 49:09
    But what this does tell
    us is that Neanderthal DNA
  • 49:09 - 49:13
    in modern humans is influencing
    a system in our body
  • 49:13 - 49:15
    that is now, in
    modern environments,
  • 49:15 - 49:17
    relevant to this trait.
  • 49:17 - 49:21
    And in particular, this bit of
    Neanderthal DNA is very nearby,
  • 49:21 - 49:24
    a transporter for
    a neurotransmitter
  • 49:24 - 49:27
    called GABA that's involved in
    all sorts of important processes
  • 49:27 - 49:32
    in the brain and even may have
    a role in circadian processes.
  • 49:32 - 49:34
    So we don't really
    know what might have
  • 49:34 - 49:38
    been behind this association.
  • 49:38 - 49:41
    Now, just to move on, I want
    to tell you about one more
  • 49:41 - 49:42
    analysis that we did.
  • 49:42 - 49:45
    So in that first
    set of tests, we
  • 49:45 - 49:48
    were testing for the effect
    of one bit of Neanderthal DNA
  • 49:48 - 49:51
    with one trait in
    a human population.
  • 49:51 - 49:53
    That we wondered,
    well, what if we
  • 49:53 - 49:55
    looked at all the
    Neanderthal DNA
  • 49:55 - 49:58
    that a person might or
    might not have in aggregate,
  • 49:58 - 50:00
    and ask whether or not
    that could predict,
  • 50:00 - 50:03
    better predict someone's
    risk for a disease.
  • 50:03 - 50:07
    And so we did an
    analysis of that.
  • 50:07 - 50:11
    And again, we found several
    very interesting associations
  • 50:11 - 50:12
    that replicated.
  • 50:12 - 50:16
    And now I think this top one
    is really, really fascinating.
  • 50:16 - 50:19
    It's Neanderthal DNA-- if I know
    your Neanderthal DNA complement,
  • 50:19 - 50:24
    I can better predict your
    risk for actinic keratosis.
  • 50:24 - 50:28
    And this is-- in case you don't
    know, this is a skin disease.
  • 50:28 - 50:30
    It's not terribly serious.
  • 50:30 - 50:32
    It's often seen in
    fair-skinned people
  • 50:32 - 50:35
    after long-term sun exposure.
  • 50:35 - 50:37
    And it's caused
    by malfunctioning
  • 50:37 - 50:42
    of a gene-- of a type of cell in
    your skin called keratinocytes.
  • 50:42 - 50:46
    And I find this so fascinating
    for really several reasons
  • 50:46 - 50:48
    because keratinocytes, one
    of their main functions
  • 50:48 - 50:51
    is protecting our skin
    from UV radiation.
  • 50:51 - 50:55
    So again, a very important
    environmental difference
  • 50:55 - 50:58
    between Africa and other
    non-African environments.
  • 50:58 - 51:00
    But they're also really
    intimately involved
  • 51:00 - 51:02
    in early stages of the
    innate immune response
  • 51:02 - 51:06
    and signaling for the activation
    of other immune factors.
  • 51:06 - 51:10
    When we look at patterns of
    where Neanderthal DNA falls
  • 51:10 - 51:12
    in our genome, we
    see that many of
  • 51:12 - 51:16
    the Neanderthal-- high-frequency
    Neanderthal bits of DNA
  • 51:16 - 51:20
    are nearby genes that are
    involved in keratin biology.
  • 51:20 - 51:22
    And so this is taking
    it to the next step
  • 51:22 - 51:24
    and showing, not only
    is it enriched nearby
  • 51:24 - 51:27
    those genes, but actually
    in modern populations,
  • 51:27 - 51:28
    it's having an effect
    on a phenotype that's
  • 51:28 - 51:31
    very relevant to keratin.
  • 51:31 - 51:37
    But again here we'll see there's
    a second confusing, or at least
  • 51:37 - 51:39
    more complicated to
    interpret association
  • 51:39 - 51:43
    that we need to think about,
    and that's depression.
  • 51:43 - 51:46
    And so again, I really
    want to be very clear
  • 51:46 - 51:48
    that this is not what we
    should be thinking about.
  • 51:48 - 51:50
    Neanderthals, we cannot
    say they were depressed.
  • 51:50 - 51:53
    We cannot blame them for
    any depression we have.
  • 51:53 - 51:54
    These are very
    complex phenotypes
  • 51:54 - 51:56
    with major
    environmental components
  • 51:56 - 51:58
    and many other
    genetic components.
  • 51:58 - 52:00
    And the Neanderthal
    influence is really
  • 52:00 - 52:03
    quite modest in the whole
    constellation of all
  • 52:03 - 52:06
    the contributions to them.
  • 52:06 - 52:09
    So in conclusion, I
    want you to remember
  • 52:09 - 52:11
    that interbreeding
    with Neanderthals
  • 52:11 - 52:14
    has indeed left a phenotypic
    legacy in modern humans.
  • 52:14 - 52:17
    And in particular,
    it's left effects
  • 52:17 - 52:20
    on many different
    systems in our bodies.
  • 52:20 - 52:24
    Our immune systems, our skin,
    our metabolism, and in fact,
  • 52:24 - 52:26
    even likely, our brains.
  • 52:26 - 52:30
    And so, I think largely because
    of the nature of the data sets
  • 52:30 - 52:34
    we've been looking at,
    we've found many cases
  • 52:34 - 52:37
    where the Neanderthal DNA has
    a mildly deleterious effect
  • 52:37 - 52:38
    in modern environments.
  • 52:38 - 52:42
    But again, I want to remind
    you that's not necessarily true
  • 52:42 - 52:46
    50,000 years ago when this
    interbreeding likely occurred.
  • 52:46 - 52:49
    And so one of the main
    challenges going forward
  • 52:49 - 52:50
    is trying to understand
    what knowing something
  • 52:50 - 52:53
    about Neanderthal DNA
    in a modern environment
  • 52:53 - 52:56
    can actually tell us about
    what was happening back then.
  • 52:56 - 52:59
    And so then the second point
    I wanted you to remember
  • 52:59 - 53:02
    is that this was all enabled by
    using a new type of resource.
  • 53:02 - 53:05
    These large-scale databases
    of tens or hundreds
  • 53:05 - 53:07
    of thousands of
    electronic medical records
  • 53:07 - 53:11
    from patients linked up
    to genetic information.
  • 53:11 - 53:15
    And so, I think just as the
    ability to sequence people's
  • 53:15 - 53:18
    DNA at large scale
    has dramatically
  • 53:18 - 53:22
    changed our understanding of the
    genetic basis of human evolution
  • 53:22 - 53:24
    over the past 5 or
    10 years, thanks
  • 53:24 - 53:26
    to many of the speakers
    in the symposium,
  • 53:26 - 53:28
    I think that leveraging
    these sorts of data
  • 53:28 - 53:29
    and these sorts of
    projects that are popping
  • 53:29 - 53:32
    up all over the
    world will allow us
  • 53:32 - 53:35
    to do the same thing
    for the phenotypic basis
  • 53:35 - 53:36
    of recent human evolution.
  • 53:36 - 53:38
    And so with that, I would
    like to say thank you
  • 53:38 - 53:41
    all very much for listening and
    thank all of my collaborators.
  • 53:41 - 53:44
    And yeah, thanks.
  • 53:44 - 53:46
    [APPLAUSE]
  • 53:46 - 53:47
  • 53:47 - 53:50
    [MUSIC PLAYING]
  • 53:50 - 54:43
Title:
CARTA: DNA – Neandertal and Denisovan Genomes; Neandertal Genes in Humans; Neandertal Interbreeding
Description:

more » « less
Video Language:
English
Duration:
54:44

English subtitles

Revisions