Skip to content

Falcon H1 Model Support #657

@sbhavani

Description

@sbhavani

Add support for Falcon H1 models, following the implementation in Megatron-LM PR #1785

The upstream Megatron-LM implementation provides the foundation, but Megatron Bridge needs additional integration work to support:

  1. Model Provider Implementation: Create FalconH1ModelProvider similar to existing Llama provider
  2. Hugging Face Bridge Integration: Implement bidirectional conversion between HuggingFace Falcon H1 checkpoints and Megatron format
  3. Configuration Management: Add Falcon H1-specific configuration handling and parameter mappings
  4. Training Recipes: Provide pre-configured training recipes compatible for Falcon H1

Metadata

Metadata

Assignees

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions