Toward a universal decoder of linguistic meaning from brain activation
- Francisco Pereira
- Bin Lou
- Brianna Pritchett
- Sam Ritter
- Samuel J. Gershman
- Nancy Kanwisher
- Matthew Botvinick
- Evelina Fedorenko
Date created: 2018-01-04 05:29 PM | Last Updated: 2021-06-25 08:56 PM
Category: Project
Description: Prior work decoding linguistic meaning from imaging data has been largely limited to concrete nouns, using similar stimuli for training and testing, from a relatively small number of semantic categories. Here we present a new approach for building a brain decoding system in which words and sentences are represented as vectors in a semantic space constructed from massive text corpora. By efficiently sampling this space to select training stimuli shown to subjects, we maximize the ability to generalize to new meanings from limited imaging data. To validate this approach, we train the system on imaging data of individual concepts, and show it can decode semantic vector representations from imaging data of sentences about a wide variety of both concrete and abstract topics, from two separate datasets. These decoded representations are sufficiently detailed to distinguish even semantically similar sentences, and to capture the similarity structure of meaning relationships between sentences.
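As a rough illustration of the decoding idea described above, the following sketch learns a linear map from imaging features to semantic vectors and then checks whether decoded vectors can tell pairs of held-out stimuli apart. It is not the project's released code: the use of closed-form ridge regression, the function names `fit_ridge_decoder` and `pairwise_accuracy`, the regularization value, and the synthetic data are all assumptions made for illustration.

```python
import numpy as np


def fit_ridge_decoder(X, Y, alpha=1.0):
    """Fit a linear decoder W so that X @ W approximates Y.

    X: (n_samples, n_voxels) imaging features.
    Y: (n_samples, n_dims) semantic vectors.
    Uses the closed-form ridge solution (X^T X + alpha I)^-1 X^T Y.
    Hypothetical helper, not taken from the project's code.
    """
    n_voxels = X.shape[1]
    A = X.T @ X + alpha * np.eye(n_voxels)
    return np.linalg.solve(A, X.T @ Y)


def pairwise_accuracy(Y_pred, Y_true):
    """Fraction of stimulus pairs that are matched correctly.

    For each pair (i, j), the match counts as correct when the decoded
    vectors correlate more with their own targets than with each
    other's targets. Chance level is 0.5.
    """
    def normalize_rows(M):
        M = M - M.mean(axis=1, keepdims=True)
        return M / np.linalg.norm(M, axis=1, keepdims=True)

    # C[i, j] is the correlation between decoded vector i and target j.
    C = normalize_rows(Y_pred) @ normalize_rows(Y_true).T
    n = C.shape[0]
    correct, total = 0, 0
    for i in range(n):
        for j in range(i + 1, n):
            if C[i, i] + C[j, j] > C[i, j] + C[j, i]:
                correct += 1
            total += 1
    return correct / total


# Synthetic stand-in for real data (assumption): voxel responses are a
# noisy linear function of the semantic vectors.
rng = np.random.default_rng(0)
n_train, n_test, n_voxels, n_dims = 120, 40, 200, 20
B = rng.normal(size=(n_dims, n_voxels))
Y_train = rng.normal(size=(n_train, n_dims))
Y_test = rng.normal(size=(n_test, n_dims))
X_train = Y_train @ B + 0.5 * rng.normal(size=(n_train, n_voxels))
X_test = Y_test @ B + 0.5 * rng.normal(size=(n_test, n_voxels))

W = fit_ridge_decoder(X_train, Y_train, alpha=1.0)
accuracy = pairwise_accuracy(X_test @ W, Y_test)
print(f"pairwise accuracy on held-out stimuli: {accuracy:.2f}")
```

With real data, `X_train` would hold imaging responses to the individual training concepts and `X_test` responses to the held-out sentences; the pairwise score is one simple way to ask whether decoded vectors distinguish stimuli better than chance.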
Note (if any links are not working)
- All materials, except individual subject datasets, are also included under Files/Pereira_Materials.zip
- Subject data can be downloaded from Dropbox or Drive (these files are large and downloads may take a while).
Article
http://rdcu.be/IqEP (link to Nature Communications web site)
Datasets
Code