Manolis Kellis: Biology of Disease | Lex Fridman Podcast #133
Aq9UPIXbtKI • 2020-10-25
Transcript preview
Open
Kind: captions
Language: en
the following is a conversation with
manolas kellis
his third time on the podcast he is a
professor at mit
and head of the mit computational
biology group
this time we went deep on the science
biology and genetics so this is a bit
of an experiment manolas went back and
forth
between the basics of biology to the
latest state of the art
and the research he's a master at this
so i just said back
and enjoyed the ride this conversation
happened at 7 00 am
so it's yet another podcast episode
after an all-nighter for me
and once again since the universe has a
sense of humor
this one was a tough one for my brain to
keep up
but i did my best and i never shy away
from good challenge quick mention of
your sponsor followed by some thoughts
related to
the episode first is sem rush
the most advanced seo optimization tool
i've ever come across
i don't like looking at numbers but
someone probably should
it helps you make good decisions second
is pessimist archive they're back one of
my favorite history podcasts on why
people
resist new things from recorded music to
umbrellas to cars
chess coffee and the elevator
third is eight sleep a mattress that
cools itself
measures heart rate variability has an
app and has given me yet another reason
to look forward to sleep including
the all-important power nap and finally
better help online therapy when you want
to face your demons with a licensed
professional
not just by doing the david goggins like
physical challenges
like i seem to do on occasion please
check out the sponsors in the
description to get a discount and
to support this podcast as a side note
let me say that biology
in the brain and in the various systems
of the body
fill me with awe every time i think
about how such
a chaotic mess coming from its humble
origins in the ocean
was able to achieve such incredibly
complex
and robust mechanisms of life that
survived
despite all the forces of nature that
want to destroy it
it is so unlike the computing systems we
humans have engineered
that it makes me feel that in order to
create artificial general intelligence
and artificial consciousness we may have
to completely rethink
how we engineer computational systems
if you enjoy this thing subscribe on
youtube review it with five stars in
apple podcast
follow on spotify support on patreon or
connect with me on twitter
at lex friedman and now here's my
conversation
with manolis kalis so your group at mit
is trying to understand the molecular
basis of human disease
what are some of the biggest challenges
in your view don't
get me started i mean irregularities
standing human disease
is the most complex challenge
in modern science so because human
disease
is as complex as the human genome
it is as complex as the human brain
and it is in many ways even more complex
because the more we understand disease
complexity
the more we start understanding genome
complexity and
epigenome complexity and brain circuitry
complexity
and immune system complexity and cancer
complexity and so on and so forth
so traditionally
human disease was following basic
biology
you would basically understand basic
biology and model organisms
like you know mouse and fly and yeast
you would understand sort of
mammalian biology and animal biology and
eukaryotic biology
in sort of progressive layers of
complexity
getting closer to human phylogenetically
and you would do perturbation
experiments in those species
to see if i knock out a gene what
happens
and based on the knocking out of these
genes you would basically then have a
way to drive
human biology because you would you
would sort of understand the functions
of these genes and then if you find that
a human gene
locus something that you've mapped from
human genetics
to that gene is related to a particular
human disease you say
now i know the function of the gene from
the model organisms
i can now go and understand the function
of that gene
in human but this is all changing this
is dramatically changed so that that was
the old way of doing basic biology you
would start with the animal models the
eukaryotic models
the mammalian models and then you would
go to human
human genetics has been so transformed
in the last
decade or two that human genetics is now
actually driving the basic biology
there is more genetic mutation
information in the human
genome than there will ever be in any
other species
what do you mean by mutation information
so perturbations is how you understand
systems
so an engineer builds systems and then
they know how they work from the inside
out
a scientist studies systems through
perturbations
you basically say if i poke that balloon
what's going to happen and i'm going to
film it in super high resolution
understand i don't know
aerodynamics or fluid dynamics if it's
filled with water etc so you can then
make experimentation by perturbation and
then the scientific process is sort of
building models that
best fit the data designing new
experiments that best test your models
and challenge your models and so forth
that's the same thing with science
basically if you're trying to understand
biological science
you basically want to do perturbations
that then drive
the models so how do these perturbations
allow you to understand
disease so if if you know that
a gene is related to disease you don't
want to just know that it's related to
the disease you want to know what is the
disease mechanism
because you want to go and intervene so
the way that i like to describe it is
that traditionally
epidemiology which is basically the
study of disease
you know sort of the observational study
of disease has been
about correlating one thing with another
thing so if you if you have a lot of
people with liver disease who are also
alcoholics
you might say well maybe the alcoholism
is driving the liver disease
or maybe those who have liver disease
self-medicate with alcohol
so that the connection could be either
way
with genetic epidemiology it's about
correlating changes in the genome
with phenotypic differences and then you
know the direction of causality
so if you know that a particular gene is
related
to the disease you can basically say
okay perturbing that gene in mouse
causes the mice to have x phenotype
so perturbing that gene in human causes
the humans to have the disease
so i can now figure out what are the
detailed molecular phenotypes
in the human that are related to that
organismal
phenotype in the disease so it's all
about understanding
disease mechanism understanding what are
the pathways what are the tissues
what are the processes that are
associated with the disease so that we
know how to intervene
you can then prescribe particular
medications that also alter these
processes
you can prescribe lifestyle changes that
also affect these processes
and so forth that's such a beautiful
puzzle to try to solve
like what kind of perturbations
eventually have this ripple effect that
leads to disease
across the population and then you study
that for animals
a mice first and then see how that might
possibly connect to
humans how hard is that puzzle of
trying to figure out how little
perturbations might lead
to in a stable way to a disease
in animals we make the puzzle
simpler because we perturb one gene at a
time
that's the beauty of it's the power of
animal models you can basically decouple
the perturbations you only do one
perturbation and you only do strong
perturbations at a time
in human the puzzle is incredibly
complex
because i mean obviously you don't do
human experimentation
you wait for natural selection and
natural genetic variation
to basically do its own experiments
which it has been doing for hundreds
and thousands of years in the human
population
and for hundreds of thousands of years
across
you know the the history leading to the
human population
so you basically take this natural
genetic variation
that we all carry within us every one of
us carries
six million perturbations so i've done
six million experiments on you
six million experiments for me six
million experiments on every one
of seven billion people on the planet
what's the six million correspond to
six million unique genetic variants
that are segregating the human
population every one of us
carries millions of polymorphic
sites poly many morph forms polymorphic
means many forms
variants that basically means that every
one of us has
single nucleotide alterations that we
have inherited from mom and from that
that basically can be thought of as tiny
little perturbations
most of them don't do anything but some
of them
lead to all of the phenotypic
differences that we see
between us the reason why two twins are
identical is because
these variants completely determine the
way that i'm going to look at exactly 93
years of age
how happy are you with this kind of data
set is it uh
large enough of the human population of
earth is that too big
too small yeah so so the the is it is it
large enough
is a power analysis question and in
every one of our grants we do a power
analysis
based on what is the effect size that i
would like to detect
and what is the natural variation
in the two forms so every time you do a
perturbation you're asking i'm changing
form a into for
b form a has some natural genetic vary
some natural phenotypic variation around
it
and form b has some natural phenotypic
variation around it
if those variances are large and the
differences between the mean of a and
the mean of b
are small then you have very little
power the further the means go apart
that's the effect size the more power
you have
and the smaller the standard deviation
the more power you have
so basically when you're asking is that
sufficiently large
certainly not for everything but we
already have enough power
for many of the stronger effects
in the more tight distributions so
that's a hopeful message that
there exists parts of the genome
that that have a strong effect that has
a small
variance that's exactly right
unfortunately
those perturbations are the basis of
disease in many cases
so it's not a you know hopeful message
sometimes it's a terrible message
it's basically well some people are sick
but if when
if we can figure out what are these
contributors to sickness
we can then help make them better and
help many other people better
who don't carry that exact mutation but
who carry
mutations on the same pathways
and that's what we like to call the
allelic series of a gene
you basically have many perturbations
of the same gene in different people
each with a different
frequency in the human population and
each with a different effect
on the individual charism so you said uh
in the past
there would be these small experiments
on perturbations and animal models
what does this puzzle solving process
look like today
so we basically have you know something
like seven billion people in the planet
and every one of them carries something
like six million mutations
you basically have an enormous matrix
of genotype by phenotype
by systematically measuring the
phenotype
of these individuals and the traditional
way of measuring this phenotype
has been to look at one trait at a time
you would gather families and you would
sort of paint
the pedigrees of a strong effect what we
like to call
mendelian mutation so a mutation that
gets transmitted in a dominant or a
recessive
but strong effect form where basically
one locus
plays a very big role in that disease
and you could then look at carriers
versus non-carriers
in one family carries versus
non-carriers in another family
and do that for hundreds sometimes
thousands of families
and then trace these inheritance
patterns and then figure out what is the
gene
that plays that role is this the matrix
that you're showing
in in talks or lectures
so that matrix is the
input to the stuff that i saw in talks
so basically that matrix has
traditionally been strong effect genes
what the matrix looks like now is
instead of pedigrees instead of families
you basically have thousands and
sometimes
hundreds of thousands of unrelated
individuals
each with all of their genetic variants
and each with their
phenotype for example height or lipids
or you know whether they're sick or not
for a particular trait
that has been the modern view instead of
going to families
going to unrelated individuals with one
phenotype at a time
and what we're doing now as we're
maturing
in all of these sciences is that we're
doing this in the context
of large medical systems or enormous
cohorts
that are very well phenotyped across
hundreds of
phenotypes sometimes with our complete
electronic health record
so you can now start relating not just
one gene segregating one family
not just thousands of variants
segregating with one phenotype
but now you can do millions of variants
versus hundreds of phenotypes
and as a computer scientist i mean
deconvolving that matrix partitioning it
into the layers of biology
that are associated with every one of
these elements
is a dream come true it's it's like the
world's greatest puzzle
and you can now solve that puzzle by
throwing in
more and more knowledge about the
function of different genomic regions
and
how these functions are changed across
tissues
and in the context of disease and that's
what my group and many other groups are
doing
we're trying to systematically relate
this genetic variation
with molecular variation at the
expression level
of the genes at the epigenomic level
of the gene regulatory circuitry and at
the cellular level
of what are the functions that are
happening in those cells at the single
cell level
using single cell profiling and then
relate all that
vast amount of knowledge computationally
with the thousands of traits that each
of these of thousands of variants
are perturbing i mean this is something
we talked about i think last time
so there's these effects at different
levels that happen you said at a single
cell level
you're trying to see things that happen
due to certain perturbations
and then so it's not just like a puzzle
of
um perturbation and disease
it's perturbation then effect at a
cellular level
at an organ level a body like
how do you disassemble this into like
what your group is working on
you're basically taking a bunch of the
hard problems in the space
how do you break apart a difficult
disease
uh and break it apart into problems that
you into puzzles that you can now start
solving
so there's a struggle here computer
scientists love hard puzzles
and they're like oh i want to you know
build a method that just deconvolves the
whole thing computationally
and you know that's very tempting and
it's very appealing
but biologists just like to decouple
that complexity experimentally to just
like peel off layers of complexity
experimentally and that's what many of
these modern tools that you know my
group and others have
both developed and used the fact that we
can now figure out tricks
for peeling off these layers of
complexity by testing
one cell type at a time or by testing
one cell
at a time and you could basically say
what is the effect of this genetic
variant associated with alzheimer's
on human brain human brain
sounds like oh it's an organ of course
just go one organ at a time
but human brain has of course dozens of
different brain regions
and within each of these brain regions
dozens of different cell types
and every single type of neuron every
single type of glial cell
between astrocytes oligodendrocytes
microglia
between you know all of the neural cells
and the vascular cells and the immune
cells
that are co-inhabiting the the brain
between the different types of
excitatory and inhibitory neurons that
are sort of interacting with each other
between different layers
of neurons in the cortical layers every
single one of these
has a different type of function
to play in cognition in
interaction with the environment in
maintenance
of the brain in energetic needs
in feeding the brain with blood with
oxygen
in clearing out the debris that are
resulting from the super high energy
production of cognition in in humans
so all of these things are basically um
potentially deconvolvable
computationally but experimentally
you can just do single cell profiling of
dozens of regions of the brain across
hundreds of individuals
across millions of cells and then now
you have
pieces of the puzzle that you can then
put back together
to understand that complexity
i mean first of all i mean the human
brain the cells in the human brain are
the most
okay maybe i'm romanticizing it but
cognition seems to be very complicated
so uh separating into the function
breaking alzheimer's down to
the cellular level seems very
challenging
is that basically you're trying to find
a way that
some perturbation and genome
results in some obvious
major dysfunction in the cell
you're trying to find something like
that exactly so so
what does human genetics do human
genetics basically looks at the whole
path
from genetic variation all the way to
disease
so human genetics has basically taken
thousands
of alzheimer's cases
and thousands of controls matched for
age for
sex for you know environmental
backgrounds and so forth and then looked
at that map
where you're asking what are the
individual genetic persuasions
and how are they related to all the way
to alzheimer's disease
and that has actually been quite
successful so we now have
you know more than 27 different loci
these are genomic regions
that are associated with alzheimer's at
this
end-to-end level but the moment you sort
of
break up that very long path into
smaller levels
you can basically say from genetics what
are the epigenomic alterations
at the level of gene regulatory elements
where that genetic variant perturbs
the control region nearby that effect is
much larger
you mean much larger in terms of this
down the line impact
or it's much larger in terms of the
measurable effect this a versus b
variance is actually so much cleanly
defined
when you go to the shorter branches
because for one genetic variant to
affect
alzheimer's that's a very long path that
basically means that in the context of
millions of these six million variants
that every one of us carries
that one single nucleotide has a
detectable effect
all the way to the end i mean it's just
mind-boggling that that's even possible
but indeed yeah but indeed there are
such effects
so the hope is or the most
scientifically speaking the
the most effective place where to detect
the alteration that results in disease
is
earlier on in the pipeline as early as
possible it's
it's a trade-off if you go very early on
in the pipeline
now each of these epigenomic alterations
for example this enhancer control region
is active maybe 50 less which is a
dramatic effect
now you can ask well how much does
changing one regulatory region in the
genome
in one cell type change disease well
that path is now long
so if you instead look at expression
the path between genetic variation the
expression of one gene goes through many
enhancer regions
and therefore it's a subtler effect at
the gene level but then now you're
closer because
one gene is acting on you know in the
context of only 20 000 other genes
as opposed to one enhancer acting in the
context of two million other enhancers
so you basically now have genetic
epigenomic the circuitry
transcriptomic the gene expression level
and then
cellular where you can basically say i
can measure various properties of those
cells
what is the calcium influx
rate when i have this genetic variation
what is the synaptic density what is the
electric
impulse conductivity and so on so forth
so you can measure things along this
path to disease and you can also measure
endophenotypes you can basically measure
you know
your brain activity you can do imaging
in the brain
you can basically measure i don't know
the heart rate the pulse the lipids
the amount of blood secreted and so
forth
and then through all of that you can
basically get at
the path to causality the path to
disease
and is there something beyond cellular
so you mentioned lifestyle
interventions or changes as a way to
or like be able to prescribe changes in
life style
like what what about organs what about
like
the function of the body as a whole yeah
absolutely so basically
when you go to your doctor they always
measure you know your pulse they always
measure your height those measure your
weight
your you know your bmis basically these
are just very basic variables
but with digital devices nowadays you
can start measuring hundreds of
variables for every individual
you can basically also phenotype
cognitively
through tests uh alzheimer's patients
there are cognitive tests that you can
imagine that you that you typically do
for uh cognitive decline these
minimental
you know observations that that you have
specific questions too
you can think of sort of enlarging the
set of cognitive tests
so in the mouse for example you do
experiments for how do they get out of
mazes
how do they find food whether they
recall a fear
whether they shake in a new environment
and so forth
in the human you can have much much
richer phenotypes where you can
basically say
not just imaging at the you know
organ level but and all kinds of other
activities at the organ level
but you can also do at the organism
level
you can do behavioral tests and how did
they do on empathy
how did they do on memory how did they
do on
long-term memory versus short-term
memory and so forth i love how you're
calling that phenotype
i guess it is it is but like your
behavior
patterns that might change over over uh
over a period of a life
it's yeah your ability to remember
things your ability to be
yeah empathetic or emotionally your
intelligence
perhaps even yeah but intelligence has
hundreds of variables
you can be your math intelligence your
literary intelligence your
puzzle-solving intelligence your logic
it could be like hundreds of things
and all of that is it's we were able to
measure that better and better so and
all that could be connected to the
entire pipeline
we used to think of each of these as a
single variable like intelligence i mean
that's ridiculous
it's basically dozens of different genes
that are controlling every single
variable you can basically think of you
know imagine us in a video game
where every one of us has measures of
you know strength stamina
you know energy left and so forth but
you could click on each of those like
five bars that are just the main bars
and
each of those will just give you then
hundreds of bars yeah and you can
basically say
okay great for my you know machine
learning task
i want someone who i'm a human
who has these particular forms of
intelligence i require now these
you know 20 different things and then
you can combine those things
and then relate them to of course
performance in a particular task
but you can also relate them to genetic
variation
that might be affecting different parts
of the brain
for example your frontal cortex versus
your temporal cortex versus your visual
cortex
and so forth so genetic variation that
affects expression of genes in different
parts of your brain
can basically affect your you know music
ability your auditory ability your smell
your you know just dozens of different
phenotypes
can be broken down into you know
hundreds of cognitive variables and then
relate each of those
to thousands of genes that are
associated with them
so somebody who loves rpgs role-playing
games
there's uh there's too few variables
that we can control so i'm excited
if we're in fact living in a simulation
and this is a video game
i'm excited by the quality of of the
video game the
the the the game designer did a hell of
a good job
so we're impressed oh i don't know the
sunset last night was a little
unrealistic
yeah yeah the graphics exactly come on
nvidia
to zoom back out we've been talking
about the
genetic origins of diseases but i think
it's fascinating to
talk about what are the most important
diseases to understand
and especially as it connects to the
things that you're working on
so it's very difficult to think about
important diseases to understand there's
many metrics of importance
one is lifestyle impact i mean if you
look at kovid the impact on lifestyle
has been enormous so understanding kovid
is important because it has impacted the
well-being
in terms of ability to have a job
ability to have an apartment ability to
go to work
ability to have a mental circle of
support
and all of that for you know millions of
americans like huge huge
impact so that's one aspect of
importance so basically mental disorders
alzheimer's has a huge importance in the
well-being of americans
whether or not it die it kills someone
for many many years it has a huge impact
so the first measure of importance is
just well-being
like impact on the quality of life
impact on the quality of life absolutely
the second metric which is much easier
to quantify is deaths
what is the number one killer the number
one killer
is actually heart disease it is actually
killing
650 000 americans per year
number two is cancer with 600 000
americans
number three far far down the list is
accidents
every single accident combined so
basically you you know you read the news
accidents like you know there was a huge
car crash all over the news
but the number of deaths number three by
far
167 000 lower respiratory disease so
that's
asthma not being able to breathe and so
forth 160 000
alzheimer's number four number five with
000 and then stroke brain aneurysms and
so forth that's 147
000 diabetes and metabolic disorders etc
that's 85 000.
the flu is 60 000 suicide
50 000 and then overdose et cetera
you know goes further down the list so
of course kovit has creeped up to be the
number three killer
this year with you know more than 100
000 americans
and counting um and you know
but but if you think about sort of what
do we use what are the most important
diseases you have to understand
both the quality of life and the
the sheer number of deaths and just
numbers of years lost if you wish
and and uh each of these diseases you
can think of as uh
and also including terrorist attacks and
school shootings for example
things which lead to fatalities you can
look at
as problems that could be solved
and some problems are harder to solve
than others
i mean that's part of the equation so
maybe if you look at these diseases if
you look at heart disease or cancer
or alzheimer's or just
like schizophrenia and obesity w like
not necessarily things that kill you but
affect the quality of life
which problems are solvable which aren't
which are harder to solve which aren't i
love your question because it puts it in
the context
of a global um effort
rather than just a local effort so
basically if you look at
the global aspect exercise
and nutrition are two interventions that
we can as a society
make a much better job at so if you
think about sort of the availability
of cheap food it's extremely high in
calories
it's extremely detrimental for you like
a lot of processed food
etc so if we change that equation
and as a society we made availability of
healthy food
much much easier and charged
a burger at mcdonald's the price that it
costs
on the health system then people would
actually start buying more healthy
foods so basically that's sort of a
societal intervention if you wish
in the same way increasing empathy
increasing education increasing the
social
framework and support would basically
lead to fewer suicides
it would lead to fewer murders it would
lead to fewer
you know deaths overall so
you know that's something that we as a
society can do you can you can also
think about external factors versus
internal factors so the external factors
are basically communicable diseases
like covid like the flu etc and
the internal factors are basically
things like you know cancer and
alzheimer's where basically your
your genetics will eventually you know
drive you there
um and then of course
with all of these factors every single
disease has both a genetic component
and environmental component so heart
disease you know huge then they
contribute
contribution alzheimer's it's like you
know
60 plus genetic
so i think it's like 79 heritability
so that basically means that genetics
alone explains
79 of alzheimer's incidence
and yes there's a 21 environmental
component
where you could basically enrich your
cognitive environment enrich your social
interactions read more books learn a
foreign language
go running you know sort of have a more
fulfilling life
all of that will actually decrease
alzheimer's but there's a limit to how
much that
that can impact because of the huge
genetic footprint so this is fascinating
so
each one of these problems have a
genetic component
and an environment component and so like
when there's a genetic component
what can we do about some of these
diseases what what have you worked on
what can you say that's uh in terms of
problems that are solvable here
or understandable so my group works on
the genetic component
but i would argue that understanding the
genetic component can have a huge impact
even on the environmental component why
is that
because genetics gives us access to
mechanism and if we can alter the
mechanism
if we can impact the mechanism we can
perhaps counteract
some of the environmental components
interesting so
understanding the biological mechanisms
leading to disease
is extremely important in being able to
intervene
but when you can intervene what you know
the analogy that i like to gay
to give is for example for obesity you
know think of it as a giant bathtub of
fat there's basically fat coming in from
your diet
and there's fat coming out from your
exercise
okay that's an in out equation and
that's the equation that everybody's
focusing on
but your metabolism impacts that
you know bathtub basically your
metabolism controls
the rate at which you're burning energy
it controls away the rate at which
you're storing energy
and it also teaches you about
the various valves that control the
input and the output equation
so if we can learn from the genetics
the valves we can then manipulate those
valves
and even if the environment is feeding
you a lot of fat
and getting a little that out you just
poke another hole at the bathtub
and just get a lot of the fat out yeah
that's fascinating
yeah so that we're not just passive
observers of our genetics
the more we understand the more we can
come up with actual treatments
and i think that's an important uh
aspect to realize
when people are thinking about strong
effect versus weak effect variants
so some variants have strong effects we
talked about these mendelian disorders
where a single gene has a sufficiently
large effect
pen and trans expressivity and so so
forth that basically
you can um trace it in families with
cases and not cases cases not cases and
so forth
but even the you know but
so so these are the genes that everybody
says oh that's the genes we should go
after
because that's a strong effect gene i
like to think about it slightly
differently
these are the genes where genetic
impacts that have a strong effect were
tolerated
because every single time we have a
genetic association with disease
it depends on two things number one the
obvious one
whether the gene has an impact on the
disease number two the more subtle one
is whether there is genetic in variation
standing and circulating and segregating
in the human population
that impacts that gene some genes
are so darn important that if you mess
with them
even a tiny little amount that person is
dead
so those genes don't have variation
you're not going to find the genetic
association if you don't have variation
that doesn't mean that the gene has no
role
it's simply that the gene it simply
means that the gene tolerates no
mutations
so that's actually a strong signal when
there's no variation that's so fast
exactly genes that have very little
variation
are hugely important you can actually
rank the importance of genes
based on how little variation they have
and those genes that have very little
variation but no association
with disease that's a very good metric
to say oh that's probably a
developmental gene
because we're not good at measuring
those phenotypes so it's genes that you
can tell
evolution has excluded mutations from
but yet we can't see them associated
with anything that we can measure
nowadays
it's probably early embryonic lethal
what are all the words you just said
earlier in brionic what
lethal meaning meaning that if you don't
have it
okay there's a bunch of stuff that um
is required for a stable functional
organism
exactly across the board for our entire
for for entire species i guess if you
look at sperm
it expresses thousands of proteins
does sperm actually need thousands of
proteins no
but it's probably just testing them
so my speculation is that misfolding of
these proteins is an early test for
failure
so that out of the you know millions of
sperm
that are possible you select the subset
that are just not grossly misfolding
thousands of proteins
so it's kind of an assert uh that this
is followed
correctly correct yeah this uh just uh
because
if this little thing about the folding
of a protein is incorrect
that probably means somewhere down the
line there's a bigger issue
that's exactly right so fail fast so
basically if you look at
the mammalian investment in
a new born that investment is enormous
in terms of resources
so mammals have basically evolved
mechanisms
for fail fast where basically in those
early
months of development i mean it's it's
horrendous of course at the personal
level
when you lose a uh you know your future
child
but in some ways
there's so little hope for that child to
develop
and sort of make it through the
remaining months that sort of fail fast
is probably
a good evolutionary principle from an
evolutionary perspective for
mammals and of course humans
have a lot of medical resources that you
can sort of give those children a chance
and you know we have so much more
success in sort of giving folks who have
these strong carrier mutations a chance
but if they're not even making it
through the first three months
we're not going to see them so that's
why when we
when we say what are the most important
genes to focus on the ones that have a
strong effect
mutation or the ones that have a weak
effect mutation
well you know the jury might be out
because the ones that have a strong
effect mutation
are basically you know not mattering as
much
the ones that only have weak effect
mutations
by understanding through genetics that
they have a weak effect mutation
and understanding that they have a
causal role on the disease
we can then say okay great evolution has
only tolerated a two percent
change in that gene pharmaceutically
i can go in and induce a 70 change in
that gene
and maybe i will poke another hole at
the bathtub
that was not easy
to control in you know many of the other
sort of strong effect genetic variants
so okay so there's this beautiful map
of uh across the population of things
that
you're saying strong and weak effects so
stuff with a lot of
mutations and stuff with little
mutations with
no mutations and you have this map and
it's it lays out the puzzle
yeah so so when i say strong effect i
mean at the level of individual
mutations so so basically
genes where so so
you have to think of first the effect of
the gene on the disease remember how i
was sort of
painting that map earlier from genetics
all the way to phenotype
that gene can have a strong effect on
the disease
but the genetic variant might have a
weak effect on
the gene so basically when you ask
what is the effect of that genetic
variant on the disease
it could be that that genetic variant
impacts the gene by a lot
and then the gene impacts the disease by
a little or it could be that the genetic
variant
impacts the gene by a little and then
the gene impacts the disease by a lot
so what we care about is genes that
impact the disease a lot
but genetics gives us the full equation
and what i would argue
is if we couple the genetics
with expression variation to basically
ask what
genes change by a lot
and you know which genes correlate with
disease by a lot
even if the genetic variants change them
by a little
then that those are the best places to
intervene
those are the best places where
pharmaceutically if i have
even a modest effect i will have a
strong effect on the disease
whereas those genetic variants that have
a huge effect on the disease i might not
be able to change that gene by this much
without affecting all kinds of other
things
interesting so yeah okay so that's what
we're looking at then
what have we been able to find in terms
of
which disease could be helped again
don't get me started this is um
we have found so much our understanding
of disease
has changed so dramatically with
genetics i mean places that we had no
idea would be involved
so one of the worst things about my
genome is that i have a genetic
predisposition to
age-related macular degeneration amd
so it's a form of blindness that causes
you to to lose the central part of your
vision
progressively as you grow older my
increased risk
is fairly small i have an eight percent
chance you only have a six percent
chance
you i'm on average yeah by the way when
you say my you mean literally yours
you know this about you i know this
about me
yeah which is kind of uh
i mean uh philosophically speaking is a
pretty powerful thing
so to live with i mean maybe that's uh
so we agreed to talk again by the way
for the
listeners to where we're going to try to
focus on science today and
a little bit of philosophy next time but
it's uh interesting to think about
the more you're able to know about
yourself from the genetic information in
terms of the diseases
how that changes your own view of life
yeah
so there's there's a lot of impact there
and there's a
something called genetic exceptionalism
which basically thinks of genetics as
something very very different
than everything else as a type of
determinism
and um you know let's talk about that
next time
so basically it's a good preview yeah so
let's go back to amd so basically with
amd
we have no idea what causes amd you know
it was it was a mystery
until the genetics were worked out and
now the fact that i know that i have a
predisposition
allows me to sort of make some life
choices number one
but number two the genes that lead to
that predisposition give us insights as
to how does it actually work
and that's a place where genetics gave
us something totally unexpected
so there's a complement pathway
which is an immune function pathway that
was in you know most of the loci
associated with amd and that basically
told us that wow there's an immune basis
to this eye disorder
that people had just not expected before
if you look at complement
it was recently also implicated in
schizophrenia
and there's a type of microglia
that is involved in synaptic pruning so
synapses are the connections between
neurons
and in this whole use it or lose it view
of
mental cognition and other capabilities
you basically have uh microglia which
are immune cells that are sort of
constantly traversing your brain
and then pruning neuronal connections
pruning synaptic connections
that are not utilized so
in schizophrenia there's thought to be
a change in the pruning that basically
if you don't prune your synapses the
right way
you will actually have an increased role
of schizophrenia this is something that
was completely unexpected
for schizophrenia of course we knew it
has to do with neurons but the role of
the complement complex
which is also implicated in amd which is
now also implicating schizophrenia was a
huge surprise what's the complement
complex
so it's basically a set of genes the
complement genes
that are basically having various immune
roles and as i was saying earlier our
immune system has been co-opted
for many different roles across the body
so they actually play
many diverse roles and somehow the
immune system
is connected to the synaptic pruning
process exact process
exactly so immune cells were co-opted to
prune synapses how did you figure this
out
how does one go about figuring this
intricate connection
uh like pipeline of connections out yeah
let me give you another example
so so alzheimer's disease the first
place that you would expect it to act is
obviously the brain
so we had basically this roadmap
epigenomics consortium view of the human
epigenome
the largest map of the human epigenome
that has ever been built
across 127 different tissues and samples
with dozens of epigenomic marks measured
in you know
hundreds of donors so what we've
basically
learned through that is that you you
basically can map
what are the active gene regulatory
elements for every one of the tissues in
the body
and then we connected these gene
regulatory
active maps of basically what regions
of the human genome are turning on in
every one of different tissues
we then can go back and say where are
all
the genetic loci that are associated
with disease
this is something that my group i think
was the first to do back in 2010
in this ernst nature biotech paper
but basically we were for the first time
able to show that specific
chromatin states specific epigenomic
states in that case enhancers
were in fact enriched enriched in
disease associated variants
we pushed that further in the ernst
nature paper a year later
and then in this roadmap epigenomics
paper
you know a few years after that but
basically that
matrix that you mentioned earlier was in
fact the first time that we could see
what genetic traits have genetic
variants that are enriched
in what tissues in the body
and a lot of that map made complete
sense if you looked at
a diversity of immune traits like
allergies and type 1 diabetes and so
forth
you basically could see that they were
enriching that the genetic variants
associated with those traits
were enriched in enhancers in these gene
regulatory elements
active in t cells and b cells and
hematopoietic stem cells and so forth
so that basically gave us a
confirmation in many ways that those
immune traits were instead
indeed enriching immune cells if you
look
if you if you looked at type 2 diabetes
you basically saw an enrichment in only
one type of sample and it was pancreatic
eyelids
and we know that type 2 diabetes in you
know sort of stems from the
dysregulation of insulin
in the beta cells of pancreatic eyelids
and that sort of was
you know spot on super precise if you
looked at blood pressure
where would you expect blood pressure to
occur
you know i don't know maybe in your
metabolism in ways that you process
coffee or something like that maybe in
your brain the way that you stress out
increases your blood pressure etc
what we found is that blood pressure
localized specifically
in the left ventricle of the heart so
the enhancers of the left technology in
the heart
contain a lot of genetic variants
associated with blood pressure
if you look at height we found an
enrichment specifically
in embryonic stem cell enhancers so the
genetic variants predisposing you to be
taller or shorter
are in fact acting in developmental stem
cells makes
complete sense if you looked at
inflammatory bowel disease
you basically found inflammatory which
is immune
and also bowel disease which is
digestive
and indeed we saw a double enrichment
both in the immune cells
and in the digestive cells so that
basically told us that
this is acting in both components
there's an immune component to
inflammatory bowel disease
and there's a digestive component and
the big surprise was for alzheimer's
we had seven different brain samples
we found zero enrichment in the brain
samples
for genetic variants associated with
alzheimer's and this is mind-boggling
our brains were literally hurting what
is going on
and what is going on is that the brain
samples are primarily
neurons oligodendrocytes and astrocytes
in terms of the cell types that make
them up
so that basically indicated that genetic
variants associated with alzheimer's
were probably not acting in
oligodendrocytes astrocytes or neurons
so what could they be acting in well the
fourth major cell type is actually
microglia
microglia are resident immune cells in
your brain
oh nice the immune oh wow
and they are cd14 plus which is this
sort of cell surface markers uh of those
cells
so their cd14 plus cells just like
macrophages that are circulating
in your blood the microglia are
resident monocytes that are basically
sitting in your brain they're
tissue-specific
monocytes and every one of your tissues
like your your fat for example
has a lot of macrophages that are resin
and the m1 versus m2 macrophage ratio
has a huge role to play in obesity and
you know so basically again these immune
cells are everywhere but basically what
we found
through this completely unbiased view of
what are the tissues that likely
underlie different disorders
we found that alzheimer's was
humongously enriched in microglia but
not at all in the other cell types so
what what are we supposed to make that
if you
look at the tissues involved is that
simply
useful for indication of uh
propensity for disease or does it give
us somehow a pathway of treatment
it's very much the second if you look at
the
um the way to therapeutics you have to
start somewhere
what are you gonna do you're gonna
basically make assays
that manipulate those genes
and those pathways in those cell types
so before we know the tissue of action
we don't even know where to start
we basically are at a loss but if you
know the tissue of action and even
better if you know the pathway of action
then you can basically screen your small
molecules
not for the gene you can screen them
directly for the pathway
in that cell type so you can basically
develop a high throughput multiplexed
you know robotic system for testing
the impact of your favorite molecules
that you know are safe efficacious and
you know sort of
hit that particular gene and so forth
you can basically screen those molecules
against either a set of genes that act
in that pathway
or on the pathway directly by having a
cellular assay
and then you can basically go into mice
and do experiments and basically
sort of figure out ways to manipulate
these processes
that allow you to then to go back to
humans and do a clinical trial that
basically says okay
i was able indeed to reverse these
processes in mice can i do the same
thing in humans
so that the the knowledge of the tissues
gives you the pathway
to treatment but that's not the only
part there are many
additional steps to figuring out the
mechanism of disease
i mean so that's really promising maybe
uh
to take a small step back you've you've
mentioned all these puzzles that were
figured out with the nature paper
for i mean you've mentioned a ton of
diseases
from obesity to alzheimer's even
schizophrenia i think you mentioned
and just what is the actual methodology
of figuring this out
so indeed i mentioned a lot of diseases
and and my lab works on a lot of
different disorders
and the reason for that is that
if you look at the
if you look at biology
it used to be you know zoology
departments in both technology
departments and you know virology
departments and so on so forth and mit
was one of the first schools to
basically create a biology department
like oh we're going to study
all of life suddenly why was that even
the case
because the advent of dna and the genome
and the central dogma of dna makes rna
mixed protein
in many ways unified biology you could
suddenly
study the process of transcription in
viruses
or in bacteria and have a huge impact on
yeast and fly and maybe even mammals
because of this realization of these
common underlying processes
and in the same way that dna unified
biology
genetics is unifying disease
studies so you used to have
um you used to have
uh you know i don't know um
cardiovascular disease department
and uh you know neurological disease
department
and neurodegeneration department and uh
you know um basically immune and cancer
and so forth
and all of these were studied in
different labs
you know because it made sense because
basically the first step was
understanding how the tissue functions
and we kind of knew the tissues involved
in cardiovascular disease and so forth
but what's happening with human genetics
is that all of that
all of these walls and edifices that we
had built are
crumbling and the reason for that is
that
genetics is in many ways revealing
unexpected connections so suddenly we
now have to bring the immunologists
to work on alzheimer's they were never
in the room they were in another
building altogether
the same way for schizophrenia we now
have to sort of worry about
all these interconnected aspects for
metabolic disorders we're finding
contributions from brain
so suddenly we have to call the
neurologist from the other building and
so forth
so in my view it makes no sense
anymore to basically say oh i'm a
geneticist
studying immune disorders i mean that's
that's ridiculous because i mean yeah of
course in many ways
you still need to sort of focus but what
what what we're doing is that we're
basically saying we'll go wherever the
genetics takes us
and by building these massive resources
by working on our latest map is now 833
tissues
sort of the the next generation of the
epigenomics roadmap which we're now
called epimap
is 833 different tissues and using those
we've basically found enrichments in 540
different disorders
those enrichments are not like oh great
you guys work on that and we'll work on
this
they're intertwined amazingly so of
course there's a lot of modularity
but there's these enhancers that are
sort of broadly active and these
disorders that are broadly active
so basically some enhancers are active
in on tissues and some disorders are
enriching
in all tissues so basically there's
these multifactorial
and this other class which i like to
call polyfactorial diseases
which are basically li
Resume
Read
file updated 2026-02-13 13:24:43 UTC
Categories
Manage