Releases · EvolvingLMMs-Lab/lmms-engine · GitHub

25 Oct 11:11

kcz358

[v0.1.2] First official Release with new models and feature support (Latest)

What's Changed

feat: Bagel Image Understanding by @pufanyi in #43
fix: Bagel Docs Data Format by @pufanyi in #44
fix: Allow training bagel on understanding dataset when visual_gen=True by @pufanyi in #46
feat: Bagel naive implementation of sparse attention by @kcz358 in #45
feat: Better merge and print batch input by @kcz358 in #48
fix: Merge fsdp by @kcz358 in #49
add_single_gpu_muon&fix_some_bugs by @BIGKnight in #53
[v0.1.2] release: hydra launch config, sit, and rae training by @kcz358 in #50
fix: Fix launch from cli using config examples by @kcz358 in #54
feat: Support Qwen2.5 Omni Thinker by @kcz358 in #56
feat: Add llava_ov, bagel and better cicd readme and control by @kcz358 in #57
docs: Add a auto build docs, may be deprecated by @kcz358 in #58
feat: Support Qwen3-VL ulysses sequence parallel operation by @kcz358 in #59
fix: Fix random shuffle seed on same dp rank to prevent sp hang by @kcz358 in #60
docs: improve documentation accuracy and add Qwen-VL training guide by @mwxely in #62
Fix/reorg examples by @Luodian in #61
Dev/readme by @Luodian in #63
docs: Fix some examples error and better documentation on implementing new class by @kcz358 in #64

New Contributors

@mwxely made their first contribution in #62

Full Changelog: v0.1.1...v0.1.2

Contributors

Luodian, BIGKnight, and 3 other contributors

18 Sep 09:54

kcz358

[v0.1.1] Bagel, WanVideo, stream packing and refactor for better repo structure

What's Changed

feat: Custom FSDP2 trainer by @kcz358 in #8
feat: Add Save and Load logic for fsdp2 trainer by @kcz358 in #9
Dev/bo 0809 by @Luodian in #10
feat: Add flash-attn and liger-kernel dependencies by @Luodian in #11
feat: Support Qwen2 for remove padding training by @kcz358 in #14
[feat] enable dllm training by @BIGKnight in #15
feat: Add cicd by @kcz358 in #16
feat: LLaVA-Ov ops and liger-kernel rfc by @kcz358 in #17
rfc: Better base dataset abstract class and flexible args for kwargs by @kcz358 in #20
test: Multi-gpu cicd test for robustness by @kcz358 in #22
Dev/wan by @BIGKnight in #23
feat: Add Qwen2 ulysses sequence parallel by @kcz358 in #24
rfc: Refactor video loading logic and processor by @kcz358 in #25
rfc: Train implementation, monkey patch logic by @kcz358 in #28
Add efficient loss for dllms by @yshenaw in #27
feat: Add profiler by @kcz358 in #30
fix: profile error by @kcz358 in #31
feat: Support stream packing by @kcz358 in #32
Dev/muon by @BIGKnight in #34
fix: Force iterable max steps by @kcz358 in #35
feat: Support bagel training by @kcz358 in #33
fix: Image tensor size error by @pufanyi in #40

New Contributors

@yshenaw made their first contribution in #27

Full Changelog: v0.1.0...v0.1.1

Contributors

Luodian, BIGKnight, and 3 other contributors

21 Aug 05:01

kcz358

[v0.1.0.post1] Stable release before world model and video gen

What's Changed

feat: Custom FSDP2 trainer by @kcz358 in #8
feat: Add Save and Load logic for fsdp2 trainer by @kcz358 in #9
Dev/bo 0809 by @Luodian in #10
feat: Add flash-attn and liger-kernel dependencies by @Luodian in #11
feat: Support Qwen2 for remove padding training by @kcz358 in #14
[feat] enable dllm training by @BIGKnight in #15
feat: Add cicd by @kcz358 in #16
feat: LLaVA-Ov ops and liger-kernel rfc by @kcz358 in #17

Full Changelog: v0.1.0...v0.1.0.post1

Contributors

Luodian, BIGKnight, and kcz358

07 Aug 05:46

kcz358

[v0.1.0] Framework Init and support Sequence Parallel

What's Changed

[Feat] simplification by @Luodian in #2
Dev/fla by @kcz358 in #5
feat: Refactor Kernels structure and init sp logic by @kcz358 in #1
fix: Fsdp save and merge by @kcz358 in #6
feat!: Support ulysses sequence parallel for qwen2_5_vl model by @kcz358 in #7

New Contributors

@Luodian made their first contribution in #2
@kcz358 made their first contribution in #5

Full Changelog: https://github.com/EvolvingLMMs-Lab/lmms-engine-mini/commits/v0.1.0

Contributors

Luodian and kcz358