How to Read Minds Using an FMRI (2013) | ||
In
the detective television series Homicide: Life on the Street,
interrogating suspects may often include the question, “Have you seen
this person before?” This
being a 1-hour television show, the suspect shown is normally guilty and
often attempts to evade prosecution by lying, “No, never seen him
before…” whereby the detectives then need to use the remaining time to
build their case. They need to walk the streets and find direct
or indirect evidence that supports their hypothesis that the suspect
has indeed seen the person and has in fact been involved in a capital
crime. Imagine
if the same scene can be played again, this time within a massive fMRI
scanner. “Have you seen
this person before? Wait,
don’t bother answering. According
to this fMRI scan, you have. Guilty.
Done. Now what do we
do for the next 59 minutes?” Technically,
a real life fMRI can do this. This
is not science fiction. This
is today. How would it work
and why is it not in everyday use? Nothing
is ever done in isolation and it pays to briefly scan the background with
similar technology and with similar labor-reducing goals.
A customer service desk needs to hire customer service employees
that await incoming calls. When a caller connects with the service employee, they
exchange greetings (“Hi, how can we help you?”) and get down to the
business (“Hi, I need help checking on my order…”).
It is a mundane phone call that allows two people to communicate at
a distance. To
reduce costs, enhance service employee productivity, and provide 24-hour
coverage at greatly reduced operating costs, the customer service desk
might add machine operators. In
this case, a caller connects to the machine operator and hears a recording
(“For order status say ‘status.’
For new orders say ‘new order.’”)
To get a mechanical view of what the machine operator hears back
and needs to interpret, figure 1 shows how the speech waveform might
appear.
Figure
1. A sample view of how the
speech waveform for “Status” and “New Order” might appear. The
machine operator needs to interpret the speech waveform pattern and
translate it back into a menu path - “Status” or “New Order.”
To a human operator, this is a trivial task.
Considering that this is a two-alternative-forced-choice task,
non-humans could do this. But
for a machine, it needs to consider linguistics, natural language
processing, speech sample time warping, and a heavy dose of machine
learning pattern recognition. This despite the fact that “Status” has two syllables
while “New Order” has three. It
bears stating that this is a hard task for a machine operator.
Reading
a brain fMRI uses the much of the same tools, albeit on both spatial and
temporal scales. Figure 2
shows a sample fMRI image.
Figure
2. Sample brain fMRI scan
image. fMRI
scans record oxygen use in brain regions at any given point in time.
More oxygen use in a region implies more activity.
Correlating the brain regions with a brain function atlas could
indicate which functions just occurred.
The colors in a typical fMRI image indicate changes in oxygen use.
This forms a spatio-temporal pattern of brain activity.
Norman,
et al. (2006) provides a brief overview of pattern analysis in fMRI
data. If visually
recognizing a known person activates a specific pattern of brain activity
detectable via fMRI, then seeing that specific fMRI pattern again implies
visual recognition of the same person. In
a crime interrogation scenario, if the detectives could see the
suspect’s fMRI scan patterns while posing the question, “Have you seen
this person before?” they can get their answer.
While dimensionally more complex than the speech waveform pattern
recognition task, the sample warping, normalization, and pattern
recognition algorithms essentially remain the same.
Detecting whether a suspect recognizes a photo or not follows the
same procedure as determining whether a caller said, “Status,” or
“New Order.” However,
there are key functional differences that render this simplistic technical
process translation moot. Speech
is voluntary; a caller can say, “Status” or “New Order” or
possibly remain silent. The
machine operator only needs to be programmed to detect one of two
potential choices in a direct answer detection mode.
A brain is involuntarily constantly active; the suspect could
recognize, partially recognize, be distracted, be confused, not recognize,
and still need to breathe, plan, feel hungry, feel sleepy, tap the foot,
feel the seat underneath, and more, all of which gets picked up in an fMRI
scan. Speech can be isolated
into a static sample. Brain
fMRI scans may need to be placed in context over time and space.
Speech
into an automated machine operator is deliberate and transmitted as
a deliberate message intended to be unambiguous.
Brain activity is internal. Reading
brain activity inside the skull while it quietly transmits
neurotransmitters in the dark is far more intrusive than interpreting
audio signals intended for public consumption.
Using an fMRI to read a brain is like placing hidden cameras in
someone’s dressing room. Privacy concerns aside, most of the time nothing of note
happens. When something
does happen, it has no context unless anchored to a known external event.
Inducing a known external event (e.g. posing a question) biases the
internal activity and introduces non-stationary behavior.
Comparison to past baseline patterns becomes moot.
Speech
is fundamentally one-dimensional. A speaker may think and plan several different utterances
concurrently, but is physically constrained to produce only a single
intentional vocalized output via a single larynx. The difference between thoughts and speech may not be solely
technical differences that can be bridged with better data mining, noise
cancellation, and pattern recognition techniques. Finding a single coherent, un-vocalized, internalized thought
in an active brain is not merely an issue of finding a needle in a
haystack. It is an issue of
determining the intent of a person before that person knows their own
intent. And doing so without introducing the observer’s intent.
In legal terms, reading unspoken thoughts in the brain with an fMRI
machine picks up so many divergent and immature thoughts that can be
directed and interpreted so many ways it should be considered an
objectionable leading question.
|