Xiaomi Research

dasheng-lm Public

Efficient audio understanding with general audio captions

Jupyter Notebook 369 37

recogdrive Public

ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving

Python 306 24

r1-aqa Public

🤗 R1-AQA Model: mispeech/r1-aqa

Python 303 26

lego-edit Public

Jupyter Notebook 94 1

dasheng-denoiser Public

Official PyTorch inference code for the Interspeech 2025 paper: Efficient Speech Enhancement via Embeddings from Pre-trained Generative Audioencoders

Python 69 6

genesis Public

[NeurIPS 2025]Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency

Provide feedback