Using LLMs to bridge data-driven models and scientific theories in language neuroscience

Date: 
November 13, 2024
Time: 
3 to 4 p.m. PT
Place: 
via Zoom

Chandan Singh
Senior Researcher, Microsoft

Science faces an explainability crisis: data-driven deep learning methods are proving capable of predicting many natural phenomena but not explaining them. One emblematic field is language neuroscience, where LLMs are highly effective at predicting human brain responses to natural language, but are virtually impossible to interpret or analyze by hand. To overcome this challenge, we introduce a framework that translates deep learning models of language selectivity in the brain into concise verbal explanations and then design follow-up experiments to verify that these explanations are causally related to brain activity. This approach is successful at explaining selectivity both in individual voxels and cortical regions of interest, demonstrating that LLMs can be used to bridge the widening gap between data-driven models and formal scientific theories. This talk covers 2 papers: Benara et al. (NeurIPS 2024) & Antonello et al. (arXiv, 2024).

Register for Zoom

Event Type: 
Biostatistics and Bioinformatics Seminar