LeanModels — Making Foundation Models Leaner and Meaner

Welcome to LeanModels, an organization founded by Tianyi Zhang and dedicated to making foundation models, such as LLMs and diffusion models, more memory- and compute-efficient through practical compression and inference optimization techniques.

Explore our key projects:

  • DFloat11: A lossless LLM compression framework enabling efficient GPU inference
  • Bagel-DFloat11: DFloat11-compressed version of Bagel, a unified multimodal model
  • LeanQuant: Scalable, loss-error-aware quantization for LLMs

We welcome contributors, collaborators, and feedback! If you're working on model compression or efficient inference, feel free to reach out.

Pinned

  1. DFloat11 (Public)

    DFloat11: Lossless LLM Compression for Efficient GPU Inference

    Python, 418 stars, 26 forks

  2. Bagel-DFloat11 (Public)

    Forked from ByteDance-Seed/Bagel

    Python, 74 stars, 7 forks

  3. LeanQuant (Public)

    Code repository for the ICLR 2025 paper "LeanQuant: Accurate and Scalable Large Language Model Quantization with Loss-error-aware Grid"

    Python, 15 stars, 1 fork
