🎁Amazon Prime 📖Kindle Unlimited 🎧Audible Plus 🎵Amazon Music Unlimited 🌿iHerb 💰Binance
Video
Transcript
so I’m one of the people whose
background was in neuroscience I
finished my PhD before starting the
program and I had gotten progressively
more excited about deep learning
throughout my PhD and all the exciting
things that it can do especially as
evidenced by all the cool things that
open AI keeps doing so I’m really
excited to be in this program and I
picked a project that’s are close to my
heart and that works with neuroscience
data where I had some experience so I
was working on predicting epileptic
seizures using deep networks so what are
seizures a seizure is when the neurons
in your brain start to fire to
synchronously for your good
so at worst they will take over your
whole brain and you will literally pass
out if you have a lot of seizures
repeatedly so there recurring and
unpredictable you might be diagnosed
with a disorder known as epilepsy
epilepsy affects about 1% of people
across the globe and disproportionately
so in the developing world the fact that
seizures are unpredictable in particular
causes anxiety for sufferers they will
suffer physical injuries from accidents
because they didn’t see an oncoming
seizure and also in most countries
people with epilepsy understandably are
not allowed to drive cars which will
take a toll on their quality of life in
many places in the world so if we could
predict seizures better we could
contribute to reduce anxiety fewer
injuries and in the best case we might
bring the drivers licenses back and make
it possible to drive safely again and it
also has the potential to contribute to
brain stimulation treatments to actually
alleviate the epilepsy itself now up to
20 years ago it was actually unclear if
seizure prediction was even possible
because in order to predict an upcoming
seizure there needs to be some signal
somewhere in the brain that gives away
that that a seizure is coming before the
seizure has happened so 20 years ago
people were still debating whether this
signal exists at all
now anecdotally for people with epilepsy
it was not surprising that seizures can
be predicted because some people with
epilepsy and hot dogs who have been able
to alert them to an oncoming seizure
ahead of time and it was only recent
recently proven that this is in fact the
case and that the dogs do it based on
so there is some brain signal that gives
away that a seizure is coming it’s a bit
hard to pick out what exactly it is
epileptologists don’t know exactly what
the signature is and so it’s almost a
problem that asks for neural networks so
the data set that I worked on was
publicly available from Cargill and it
was she’s male select pointer here so
that people can see so I used two
publicly available data set from Cargill
that had recordings from from dogs from
voltage sensors so it has time series
recorded from different places in the
dark Sprint’s both from time points
right before a seizure is starting up to
when a seizure starts so when you see
this abnormal synchronous activity and
so the way that I worked with this data
is that I took the raw signal that was
available both from time periods really
far from a seizure so the brain is just
hanging out it’s at least four hours
until any seizure and from an hour
before a seizure so when there would be
these signals that something dangerous
is going to happen we I I pack this data
into smaller time chunks which were then
basically labeled data that could lend
itself for to a classification problem
so safe epoch and then dangerous epochs
from an upcoming seizure and converted
this into an image representation and I
apologize that part of this is covered
here so into a spectrogram
representation and what spectrograms do
is that they take time series
information and and will highlight
basically the frequency content of those
time series so both if there’s a lot of
lower frequency content or as here in
the simulated danger signal if there are
these periods of higher frequency
content in the original data set that I
worked with one second
yeah excuse me so in the data set that I
worked with it I had data from five dogs
and it was highly imbalanced so say dog
three here represented most of the data
and also we had much fewer of these
target sequences that that Herald in an
upcoming C shirt than that then we had
safe sequences this could cause a number
of problems when you train neural
networks so for example the network
might cheat and really overfit to dog
number three or just always guess that
every epoch is safe which is a problem
when you’re trying to predict a seizure
so the first thing that I did was that I
sub sampled this data so that we had
equal representation from all of the
dogs and also from all of the from both
of the classes so the danger episodes
that I labeled with X’s here and also
the safe episodes and some early results
that I find so in the balanced data set
were chance level performance would be
55% I got to 69 percent accuracy with a
rest net 18 model a baseline model which
was a regular logistic regression with
match normalization I got to 55 percent
accuracy now when you think about models
like this it’s important where you set
the threshold for whether you label a
tiny part to be dangerous or safe and so
if you’re working with predicting a
seizure you might want to make sure that
you catch every seizure so that you
definitely don’t leave anything
unpredicted but if you do that depending
on how good your model is you might also
be creating a lot of false alarms and so
an ROC curve is a way of quantifying how
is that trade-off going and so ideally
you basically want this RC curve to push
up towards the left hand corner and you
want that area to be one you never get a
model that’s that good but you want to
push it in that direction and our model
is about point 77 so what does this mean
in practice that if I want to catch 99%
of all of the upcoming seizures well
then I’m also going to label just over
80% of the safe episode as seizures
episode says seizures so we still have
somewhere to go but we’re not a chance
another way of looking at this trade-off
is with
precision recall curve and that has this
true positive rate on the x-axis and so
in that case if I say okay I want to
catch 99 percent of all of the seizures
well then out of the the epochs that I
labeled that I labeled as being
dangerous actually only like 55 percent
of them really were dangerous that’s
another way of looking at that trade-off
and a third way is something called the
confusion matrix and the way that you
want this one to look for a good model
is that you want to have really high
numbers on the diagonals and low numbers
on the off diagonals and so again we’re
getting there with this model we’re not
quite there
and so basically about 72% of the true
seizures are labeled as actually
seizures but not all of them so that’s
one thing that you can take from this
confusion matrix
so in conclusion so far we’ve shown that
our model has shows some promise but we
still have ways to go on this and the
directions that I wanted to take this
moving forward I’m really excited to
continue working on this project is that
I want to make the prediction better by
hyper parameter tuning and
regularization there’s also a lot of
opportunity to vary the neural network
architecture trying larger rest nets and
maybe sequence models and also looking
at different kinds of pre-processing
once I have a network that performs
hopefully really well I’m excited about
interpreting what the network is
learning using some of the methodology
that the clarity team at open AI is
working with I’ve started a bit with
visualizing the network weights but I’m
not ready to tell you about that yet
today and I’m also excited about looking
at activation maximization approaches
I’m really grateful for having had the
opportunity to be in this program I
learned a lot I really enjoyed working
with Pi torch every day learning how to
train and evaluate neural nets excuse me
debugging neural nets and also this side
of deep learning that you don’t see when
you just look at it from the results
side so working on a virtual machine and
cloud infrastructure that’s definitely
being both a challenge and really really
good a great to get experience with I’m
grateful for lots of people and I should
also say
now it’s a great time to start to send
your questions over I will take those
after my thank you slide see if there’s
anything there yeah so first and
foremost I’m really grateful to my
mentor Johannes who in Sam’s words from
yesterday has been like a co-pilot with
me helping me debug in person and doing
pair programming with me I’m really
really grateful for the existence of
this amazing program so thank you so
much to Greg and Sam and open AI for
making that happen I’m grateful for our
to our program mom and dad Christine and
Mariah after Frances for organizing this
event and also our interview day my
fellow scholars have been a really
really great source of community and
support during these challenging times
that we’ve been here and I really
enjoyed connecting with everyone at open
AI over the course of the program so the
clarity team especially little big
interests have given me great advice
Paul and the reflection team has also
given me really great ideas and I really
really enjoyed meeting people for lunch
when we were still in the office and for
other virtual donut chats that we have
had since after that okay and are there
questions coming through are the data
from the dogs detecting seizures or
having seizures themselves a great
question so it’s um I have data from
basically safe time periods let me go
back to that slide that are really far
from the seizures and then from up to
one hour before the seizure so we don’t
actually have any data from the seizure
itself we’re trying to find a signature
of when a seizure is about to happen
into the future thank you for that
question
you
yeah okay the next question says since
this is based on from neural data what
challenges do you see in making this
useful for those that suffer from
epilepsy
I am parenthesis perhaps falsely
assuming it would be difficult to have
access to this data for an average user
how can you model and how can your model
and future work plug in to their daily
lives great question so it’s an it’s a
matter of stages actually people with
epilepsy will sometimes choose to have
surgery to control their epilepsy and
some people will have implants like
closed loop deep brain stimulation to
control their seizures and so then you
would actually have exactly this type of
intracranial data to work with to make a
predictive model in other cases what
would be more relevant for most
sufferers of epilepsy is that you would
have not an implanted type of brain
recording device but a scalp EEG that’s
on top of that’s outside of the skull
and then you have less signal to work
with but it’s still a similar signal so
the hope is that once we can figure out
good models with intracranial data that
we might also be able to train models on
on other brain data that’s less invasive
but has a little bit less signal did we
have what was the reasoning for choosing
the pre trained arrest net 18 yeah so we
started with the idea that the
spectrogram is a good representation of
neural data source and neuroscientists
we often work in the spectrogram
representation and different oscillatory
frequency bands are thought to convey
different information in the brain so
once we decided to work in the
spectrogram representation really
they’re like images and so we also
worked with a bit of vanilla coordinates
but rest nets are really quite good
models for image classification so we
decided to use that one but for sure we
could we could try other architectures
as well and that’s a future direction
so sorry I should read out the question
the next question was which layers of
the network did you retrain if any so we
actually didn’t use a pre trained model
so we trained the model from scratch we
use the existing resonant 18a
architecture but then trained it from
scratch did we have more questions how
similar do you expect seizure patterns
to be from one patient to another are
there categories of seizure types or is
there a wide variety of signals okay
this is a great question from one
patient to another those might be pretty
different so right now we reused all of
the darks data together but I know that
most in the literature successful
seizure prediction models have been
within patient so they do seem to differ
by category there are absolutely a lot
of different kinds of seizure types and
that’s an open question in the
literature if you could categorize that
some types of seizures are more lend
themselves more to prediction than
others thank you
what do you think would improve the
model accuracy more data cleaner signal
yes more data more data cleaner signal
both of those things would improve the
model accuracy I should say also that
now the effects that we worked with were
from up to one hour before the seizure
started it’s actually not clear how much
in advance you can predict you can’t
predict a seizure it might be in some
cases that the process that that
generates the seizure has not actually
started an hour before so some of the
labels that we have as being danger
signals might actually not have the
information to predict the seizure so
one thing that we’re going to try is to
turn it into a multi-label
classification problem and separately
predict epochs that are maybe only 10
minutes before the seizure is starting
versus the ones that are one hour before
so that would improve model accuracy
also the way that this data was set up
from Cairo was actually in 10-minute
epics and we subsample them to 22nd
epochs so we could train a second model
a hierarchy up to predict the fall
ten minute chunks that will also improve
accuracy I’m also looking at increasing
the segment lengths because maybe the
warning signal is not there in every
single 22nd epoch another thing is that
right now we treated each spectrogram
from each electrode as a separate
observation implicitly assuming
independence between electrodes which is
likely not true and there might actually
be a lot of information in the
correlations between electrodes but each
dog had a different number of electrodes
so it wasn’t straightforward to just set
it up as a as a 3d tensor but that’s
definitely something we’d like to look
at in the future to also give the
network an opportunity to learn from the
correlations