thu-coai
Conversational AI groups from Tsinghua University
Pinned Loading
Repositories
Showing 10 of 100 repositories
- ShieldVLM Public
thu-coai/ShieldVLM’s past year of commit activity - SafetyBench Public
Official github repo for SafetyBench, a comprehensive benchmark to evaluate LLMs' safety. [ACL 2024]
thu-coai/SafetyBench’s past year of commit activity - VPO Public
thu-coai/VPO’s past year of commit activity - SPaR Public
thu-coai/SPaR’s past year of commit activity - LRM-Safety-Study Public
thu-coai/LRM-Safety-Study’s past year of commit activity - TransferAttack Public
[ACL 2025] Guiding not Forcing: Enhancing the Transferability of Jailbreaking Attacks on LLMs via Removing Superfluous Constraints
thu-coai/TransferAttack’s past year of commit activity - Backdoor-Data-Extraction Public
thu-coai/Backdoor-Data-Extraction’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…