Popular repositories
- dasheng-lm Public
Efficient audio understanding with general audio captions
- recogdrive Public
ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving
- dasheng-denoiser Public
Official PyTorch inference code for the Interspeech 2025 paper: Efficient Speech Enhancement via Embeddings from Pre-trained Generative Audioencoders
Repositories
- diffrhythm2 Public
- time-r1 Public
[NeurIPS'25] Time-R1: Post-Training Large Vision Language Model for Temporal Video Grounding
- mecat Public
- q-frame Public
[ICCV 2025] Implementation of the paper "Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs"
- genesis Public
[NeurIPS 2025] Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency
People
This organization has no public members. You must be a member to see who’s a part of this organization.