Open
Description
Add support for Falcon H1 models, following the implementation in Megatron-LM PR #1785.
The upstream Megatron-LM implementation provides the foundation, but Megatron Bridge needs additional integration work to support:
- Model Provider Implementation: Create `FalconH1ModelProvider`, similar to existing Llama provider
- Hugging Face Bridge Integration: Implement bidirectional conversion between HuggingFace Falcon H1 checkpoints and Megatron format
- Configuration Management: Add Falcon H1-specific configuration handling and parameter mappings
- Training Recipes: Provide pre-configured training recipes for Falcon H1
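
To make the bidirectional conversion and parameter-mapping items more concrete, here is a minimal sketch of a name-mapping table that works in both directions. It is plain Python and does not use the Megatron Bridge API. The parameter names in `NAME_MAP` and the helpers `hf_to_megatron` / `megatron_to_hf` are hypothetical placeholders; the real names would have to come from the actual Falcon H1 checkpoints and the upstream Megatron-LM implementation.

```python
import re

# Hypothetical name pairs, for illustration only. "*" stands for a layer index.
# The real Falcon H1 / Megatron parameter names are NOT confirmed here.
NAME_MAP = {
    "model.embed_tokens.weight": "embedding.word_embeddings.weight",
    "model.layers.*.input_layernorm.weight": "decoder.layers.*.input_layernorm.weight",
    "model.final_layernorm.weight": "decoder.final_layernorm.weight",
}


def _compile(pattern):
    # Turn "a.*.b" into an anchored regex that captures the layer index.
    return re.compile("^" + re.escape(pattern).replace(r"\*", r"(\d+)") + "$")


def _rename(name, mapping):
    # Find the first pattern that matches and fill its wildcards into the target.
    for src, dst in mapping.items():
        match = _compile(src).match(name)
        if match:
            indices = iter(match.groups())
            return re.sub(r"\*", lambda _: next(indices), dst)
    raise KeyError(f"no mapping for parameter: {name}")


def hf_to_megatron(name):
    return _rename(name, NAME_MAP)


def megatron_to_hf(name):
    # The reverse direction reuses the same table with keys and values swapped.
    return _rename(name, {dst: src for src, dst in NAME_MAP.items()})
```

Keeping a single table and deriving the reverse direction from it means import and export cannot drift apart. A real bridge would also need to handle weights that are fused, split, or transposed between the two formats, which a pure rename table does not cover.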