🍉
I may be slow to respond until the ACL deadline.
PhD@CUHK, Research Engineer@Alibaba
- Shatin, N.T., HKSAR
- https://lixin4ever.github.io/
- @lixin4ever
Pinned
- DAMO-NLP-SG/VideoLLaMA3 (Public): Frontier Multimodal Foundation Models for Image and Video Understanding
- DAMO-NLP-SG/VideoLLaMA2 (Public): VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
- DAMO-NLP-SG/Inf-CLIP (Public): [CVPR 2025 Highlight] The official CLIP training codebase of Inf-CL: "Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss". A super memory-efficiency CLIP training sc…
- DAMO-NLP-SG/VideoRefer (Public): [CVPR 2025] The code for "VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM"
- alibaba-damo-academy/RynnEC (Public): RynnEC: Bringing MLLMs into Embodied World
- alibaba-damo-academy/RynnVLA-001 (Public): RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation