Warning
This project was developed in roughly six days as a final project for MIT's 6.7960 Deep Learning class; as such, there are some slight errors in our results and final analysis. Nonetheless, the project, its methodology, and its codebase may still be informative to others.
This repository contains the source code for the LEAD blog post by Gatlen Culp and Adriano Hernandez from MIT.
Published December 10th, 2024
Recent advances in Large Language Models (LLMs) have demonstrated their remarkable ability to capture semantic information. We investigate whether different language embedding models learn similar semantic representations despite variations in architecture, training data, and initialization. Previous work explored model similarity through top-k results and Centered Kernel Alignment (CKA), with mixed results. In the field of large language embedding models, which we focus on, there is a gap: more modern similarity quantification methods from computer vision, such as model stitching, which operationalizes the notion of "similarity" in a way that emphasizes downstream utility, have not been explored. We apply stitching by training linear and nonlinear (MLP) mappings, called "stitches," between embedding spaces, which aim to biject between embeddings of the same datapoints. We define two spaces as connectivity-aligned if stitches achieve low mean squared error, indicating approximate bijectivity.
Our analysis spans 6 embedding datasets (5,000-20,000 documents), 18 models (20-30 layers each, including both open-source and OpenAI models), and stitches ranging from linear maps to 7-layer MLPs, with a focus on linear stitches. We hoped that stitching would recover the similarity between models, aligning with a strong interpretation of the Platonic Representation Hypothesis. However, things appear to be more complicated. Our results suggest that embedding models are not linearly connectivity-aligned: linear stitches do not perform significantly better than mean estimators. A brief foray into MLPs suggests that shallow MLPs do not necessarily work out of the box either, but more work remains to be done on non-linear stitches, since we have not fully maximized their potential here. Stitches matter because their success can be used to define operational, and therefore useful, notions of representational similarity. Our findings support the hypothesis that alignment metrics such as CKA are not always informative of behavior or feature overlap between models.
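As a minimal sketch of the linear-stitch setup described above (using synthetic stand-in embeddings, not our actual models or datasets): a linear stitch from space A to space B can be fit by least squares, and its test MSE compared against the mean-estimator baseline mentioned in the abstract.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical stand-ins for two models' embeddings of the same 1,000 documents.
# Real embeddings would come from the actual embedding models under study.
n_docs, dim_a, dim_b = 1000, 64, 48
X = rng.normal(size=(n_docs, dim_a))                      # embeddings from model A
W_true = rng.normal(size=(dim_a, dim_b))
Y = X @ W_true + 0.1 * rng.normal(size=(n_docs, dim_b))   # embeddings from model B

# Train/test split.
X_tr, X_te = X[:800], X[800:]
Y_tr, Y_te = Y[:800], Y[800:]

# Linear stitch: least-squares map from space A to space B.
W, *_ = np.linalg.lstsq(X_tr, Y_tr, rcond=None)

# Compare stitch MSE against the mean-estimator baseline
# (always predicting the training mean of Y).
mse_stitch = np.mean((X_te @ W - Y_te) ** 2)
mse_mean = np.mean((Y_tr.mean(axis=0) - Y_te) ** 2)

print(f"stitch MSE: {mse_stitch:.4f}, mean-baseline MSE: {mse_mean:.4f}")
```

On this synthetic, linearly related pair of spaces the stitch easily beats the mean baseline; our finding is that on real embedding-model pairs it often does not.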
To read the rest of the blog, click here.
This project investigates whether different embedding models learn similar semantic representations by applying model stitching — a technique that maps between embedding spaces. We define and test the concept of "connectivity-alignment" to measure how effectively embeddings from one model can be mapped to another.
- Do different language embedding models learn similar semantic representations?
- Can we efficiently translate between different models' embedding spaces using linear or shallow non-linear maps?
- Are connectivity-alignment metrics more informative than traditional similarity measures like CKA?
```bash
# Clone the repository
git clone https://github.com/GatlenCulp/embedding_translation
cd embedding_translation

# Install dependencies with uv (recommended)
uv sync

# Alternative: install with pip
pip install -r requirements.txt
```
- `src/` - Source code for running experiments and analysis
- `blog/` - Code for the blog visualization and presentation
- `docs/` - Documentation and project planning materials
- `owler_fork/` - Modified code from the Beyond Benchmarks paper
Our experiments use:
- 6 embedding datasets (5,000-20,000 documents)
- 18 embedding models (including open-source and OpenAI models)
- Stitches ranging from linear mappings to 7-layer MLPs
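To illustrate the non-linear end of this range, here is a minimal sketch of a one-hidden-layer MLP stitch trained with plain full-batch gradient descent on synthetic, nonlinearly related embeddings. All sizes, data, and the training loop are illustrative assumptions, not our actual training code.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical embeddings of the same documents from two models (stand-ins):
# space B is a nonlinear function of space A, so no linear stitch is exact.
n, d_in, d_hid, d_out = 512, 32, 64, 24
X = rng.normal(size=(n, d_in))
Y = np.tanh(X @ (0.3 * rng.normal(size=(d_in, d_out))))

# One-hidden-layer MLP stitch: Y_hat = relu(X W1 + b1) W2 + b2
W1 = 0.1 * rng.normal(size=(d_in, d_hid))
b1 = np.zeros(d_hid)
W2 = 0.1 * rng.normal(size=(d_hid, d_out))
b2 = np.zeros(d_out)

lr = 0.05
for step in range(500):
    H = np.maximum(X @ W1 + b1, 0.0)   # forward pass (ReLU hidden layer)
    Y_hat = H @ W2 + b2
    err = Y_hat - Y                    # gradient of MSE w.r.t. Y_hat (up to a constant)
    gW2 = H.T @ err / n                # backprop through the output layer
    gb2 = err.mean(axis=0)
    dH = (err @ W2.T) * (H > 0)        # backprop through the ReLU
    gW1 = X.T @ dH / n
    gb1 = dH.mean(axis=0)
    W2 -= lr * gW2; b2 -= lr * gb2
    W1 -= lr * gW1; b1 -= lr * gb1

mse = np.mean((np.maximum(X @ W1 + b1, 0.0) @ W2 + b2 - Y) ** 2)
mse_mean = np.mean((Y - Y.mean(axis=0)) ** 2)  # mean-estimator baseline
print(f"MLP stitch MSE: {mse:.4f}, mean-baseline MSE: {mse_mean:.4f}")
```

In practice our stitches were trained with a deep learning framework rather than hand-rolled backprop; this sketch only shows the shape of the optimization problem.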
View our HuggingFace datasets here and our trained stitches here.
Our work was heavily influenced by the Beyond Benchmarks paper, whose code we used and modified; the original source can be found here.
If you use this work in your research, please cite our blog post:
```bibtex
@article{culp2024lead,
  title={LEAD: Linear Embedding Alignment across Deep Neural Network Language Models' Representations},
  author={Culp, Gatlen and Hernandez, Adriano},
  year={2024},
  url={https://gatlenculp.github.io/embedding_translation/}
}
```