swiss-ai

Popular repositories

  1. mmore Public

    Massive Multimodal Open RAG & Extraction: a scalable multimodal pipeline for processing, indexing, and querying multimodal documents. Ever needed to take 8000 PDFs, 2000 videos, and 500 spreadsheets …

    Python 49 17

  2. MoE Public

    Some mixture-of-experts architecture implementations

    Python 14 2

  3. parity-aware-bpe Public

    Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization [arXiv 2025]

    Python 7 2

  4. nanotron Public

    Forked from huggingface/nanotron

    Minimalistic large language model 3D-parallelism training

    Python 6 9

  5. data-PDF-pipeline Public

    PDF pipeline for creating training corpora (mainly for the LLM, multimodal, and alignment horizontals)

    Python 5

  6. Megatron-LM Public

    Forked from NVIDIA/Megatron-LM

    Ongoing research training transformer models at scale

    Python 4 10

Repositories

Showing 10 of 36 repositories
