When deep-learning models are deployed in the real world—perhaps to detect financial fraud from credit card activity or identify cancer in medical images—they are often able to outperform humans.
ADVERTISEMENT |
But what exactly are these deep-learning models learning? Does a model trained to spot skin cancer in clinical images, for example, actually learn the colors and textures of cancerous tissue, or is it flagging some other features or patterns?
These powerful machine-learning models are typically based on artificial neural networks that can have millions of nodes that process data to make predictions. Researchers often call these models “black boxes” because even the scientists who build them don’t understand everything that is going on under the hood.
Stefanie Jegelka isn’t satisfied with that “black box” explanation. A newly tenured associate professor in the MIT Department of Electrical Engineering and Computer Science, Jegelka is digging deep into deep learning to understand what these models can learn and how they behave, and how to build certain prior information into these models.
…
Add new comment