@shehadak commented Nov 8, 2023

This PR builds on the Huggingface subject, which assumes that models are autoregressive (following the ModelForCausalLM interface). It adds support for bidirectional models with masked language modeling (following the ModelForMaskedLM interface). Since bidirectional models rely on future context, I use a sliding window approach (see google-research/bert#66): for each text part, up to w/2 tokens are included for the previous context plus the current part, and the remaining w/2 tokens are masked.
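
For illustration, here is a minimal sketch of the sliding-window idea using plain `transformers`/`torch` rather than the code in this PR; the window size `w` and the helper name are illustrative assumptions, not the actual implementation.

```python
# Illustrative sketch only: the window size and helper name are assumptions,
# not this PR's actual implementation.
import torch
from transformers import AutoTokenizer, AutoModelForMaskedLM

tokenizer = AutoTokenizer.from_pretrained('bert-base-uncased')
model = AutoModelForMaskedLM.from_pretrained('bert-base-uncased')

def predict_next_word(context: str, w: int = 64) -> str:
    """Predict the next word from `context` with a bidirectional model:
    keep up to w/2 real tokens, mask the remaining 'future' positions."""
    context_ids = tokenizer(context, add_special_tokens=False)['input_ids']
    keep = context_ids[-(w // 2):]                        # previous context + current part
    masks = [tokenizer.mask_token_id] * (w - len(keep))   # masked future positions
    input_ids = torch.tensor([keep + masks])
    with torch.no_grad():
        logits = model(input_ids).logits                  # [1, w, vocab_size]
    next_token = logits[0, len(keep)].argmax().item()     # prediction at the first masked slot
    return tokenizer.decode([next_token]).strip()

print(predict_next_word('the quick brown fox jumps over the'))
```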

The region_layer_mapping for the language system was determined by scoring every transformer layer in BERT's encoder against the Pereira2018.243sentences-linear, Pereira2018.384sentences-linear, and Blank2014-linear benchmarks, and choosing the layer with the highest average score.
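
For reference, that selection procedure roughly follows the sketch below. It assumes `brainscore_language.load_benchmark` and that a benchmark can be called directly on an `ArtificialSubject`; the exact calls may differ from the scripts actually used.

```python
# Sketch of the layer-selection loop; exact brainscore_language calls are assumptions.
from brainscore_language import load_benchmark
from brainscore_language.artificial_subject import ArtificialSubject
from brainscore_language.model_helpers.huggingface import HuggingfaceSubject

benchmarks = [load_benchmark(identifier) for identifier in
              ['Pereira2018.243sentences-linear', 'Pereira2018.384sentences-linear', 'Blank2014-linear']]
candidate_layers = [f'bert.encoder.layer.{i}' for i in range(12)]  # all 12 encoder layers in bert-base-uncased

scores = {}
for layer in candidate_layers:
    subject = HuggingfaceSubject(
        model_id='bert-base-uncased', bidirectional=True,
        region_layer_mapping={ArtificialSubject.RecordingTarget.language_system: layer})
    layer_scores = [float(benchmark(subject)) for benchmark in benchmarks]
    scores[layer] = sum(layer_scores) / len(layer_scores)

best_layer = max(scores, key=scores.get)  # layer with the highest average score
```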

This PR also provides unit tests for reading time estimation, next word prediction, and neural recording, using the bert-base-uncased model. Future models can use the same format, as long as they implement the ModelForMaskedLM interface. For example, to add the base DistilBERT model:

```python
model_registry['distilbert-base-uncased'] = lambda: HuggingfaceSubject(
    model_id='distilbert-base-uncased',
    region_layer_mapping={ArtificialSubject.RecordingTarget.language_system: 'distilbert.transformer.layer.5'},
    bidirectional=True)
```
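
As a rough illustration of the test format mentioned above, a next-word test for the bidirectional path might look like the sketch below. It assumes the existing `ArtificialSubject` behavioral-task API (`start_behavioral_task` / `digest_text`), and the assertion is deliberately loose.

```python
# Sketch of a unit test for the bidirectional path; assumes the existing
# ArtificialSubject task API and only checks that a prediction is produced.
from brainscore_language.artificial_subject import ArtificialSubject
from brainscore_language.model_helpers.huggingface import HuggingfaceSubject

def test_next_word_bidirectional():
    subject = HuggingfaceSubject(model_id='bert-base-uncased', bidirectional=True,
                                 region_layer_mapping={})
    subject.start_behavioral_task(task=ArtificialSubject.Task.next_word)
    prediction = subject.digest_text('the quick brown fox jumps over the')['behavior']
    assert len(str(prediction.values.item())) > 0  # some next-word string was predicted
```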

@shehadak force-pushed the ks/bidirectional-huggingface branch from bc44421 to 66311de on November 9, 2023, 20:00