thu-coai
Conversational AI groups from Tsinghua University
Pinned Loading
Repositories
Showing 10 of 99 repositories
- LRM-Safety-Study Public
thu-coai/LRM-Safety-Study’s past year of commit activity - TransferAttack Public
[ACL 2025] Guiding not Forcing: Enhancing the Transferability of Jailbreaking Attacks on LLMs via Removing Superfluous Constraints
thu-coai/TransferAttack’s past year of commit activity - Backdoor-Data-Extraction Public
thu-coai/Backdoor-Data-Extraction’s past year of commit activity - BARREL Public
thu-coai/BARREL’s past year of commit activity - Agent-SafetyBench Public
thu-coai/Agent-SafetyBench’s past year of commit activity - AISafetyLab Public
AISafetyLab: A comprehensive framework covering safety attack, defense, evaluation and paper list.
thu-coai/AISafetyLab’s past year of commit activity - Crisp Public
Crisp: Cognitive Restructuring of Negative Thoughts through Multi-turn Supportive Dialogues
thu-coai/Crisp’s past year of commit activity - CharacterBench Public
[AAAI'25] CharacterBench: Benchmarking Character Customization of Large Language Models
thu-coai/CharacterBench’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…