Skip to content
Change the repository type filter

All

    Repositories list

    • Python
      0400Updated Mar 5, 2025Mar 5, 2025
    • AISafetyLab: A comprehensive framework covering safety attack, defense, evaluation and paper list.
      Python
      MIT License
      68300Updated Mar 3, 2025Mar 3, 2025
    • MAPS

      Public
      Official Implementation of ICLR25 paper "MAPS: Advancing Multi-modal Reasoning in Expert-level Physical Science"
      Python
      0100Updated Mar 2, 2025Mar 2, 2025
    • Python
      MIT License
      0800Updated Feb 26, 2025Feb 26, 2025
    • Python
      01600Updated Feb 20, 2025Feb 20, 2025
    • Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)
      Python
      MIT License
      86840Updated Feb 20, 2025Feb 20, 2025
    • [EMNLP'24] CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models
      Python
      Apache License 2.0
      3645120Updated Jan 7, 2025Jan 7, 2025
    • SPaR

      Public
      Python
      Apache License 2.0
      34210Updated Dec 17, 2024Dec 17, 2024
    • [AAAI'25] CharacterBench: Benchmarking Character Customization of Large Language Models
      MIT License
      01310Updated Dec 10, 2024Dec 10, 2024
    • MiniPLM

      Public
      [ICLR 2025] MiniPLM: Knowledge Distillation for Pre-Training Language Models
      Python
      MIT License
      63430Updated Nov 23, 2024Nov 23, 2024
    • Python
      01710Updated Nov 7, 2024Nov 7, 2024
    • OpenMEVA

      Public
      Benchmark for evaluating open-ended generation
      Python
      74831Updated Nov 6, 2024Nov 6, 2024
    • CodePlan

      Public
      0610Updated Oct 16, 2024Oct 16, 2024
    • ShieldLM

      Public
      ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors [EMNLP 2024 Findings]
      Python
      MIT License
      817510Updated Sep 29, 2024Sep 29, 2024
    • PICL

      Public
      Code for ACL2023 paper: Pre-Training to Learn in Context
      Python
      MIT License
      410811Updated Jul 26, 2024Jul 26, 2024
    • PsyQA

      Public
      一个中文心理健康支持问答数据集,提供了丰富的援助策略标注。可用于生成富有援助策略的长咨询文本。
      1719300Updated Jul 21, 2024Jul 21, 2024
    • [ACL 2024] Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization
      Python
      11900Updated Jul 9, 2024Jul 9, 2024
    • Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks
      Python
      12420Updated Jul 9, 2024Jul 9, 2024
    • Python
      414160Updated Jul 1, 2024Jul 1, 2024
    • Official github repo for AutoDetect, an automated weakness detection framework for LLMs.
      Python
      MIT License
      14100Updated Jun 25, 2024Jun 25, 2024
    • BPO

      Public
      Python
      Apache License 2.0
      1630710Updated Jun 24, 2024Jun 24, 2024
    • Official github repo for SafetyBench, a comprehensive benchmark to evaluate LLMs' safety. [ACL 2024]
      Python
      MIT License
      919621Updated Jun 24, 2024Jun 24, 2024
    • Data and codes for ACL 2021 paper: Towards Emotional Support Dialog Systems
      Python
      Other
      3525130Updated Jun 19, 2024Jun 19, 2024
    • CrossWOZ

      Public
      A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue Dataset
      Python
      Apache License 2.0
      11567331Updated Jun 17, 2024Jun 17, 2024
    • ConvLab-2

      Public
      ConvLab-2: An Open-Source Toolkit for Building, Evaluating, and Diagnosing Dialogue Systems
      Python
      Apache License 2.0
      133462131Updated Jun 17, 2024Jun 17, 2024
    • Chinese safety prompts for evaluating and improving the safety of LLMs. 中文安全prompts,用于评估和提升大模型的安全性。
      Apache License 2.0
      8494210Updated Feb 27, 2024Feb 27, 2024
    • Official Code for EMNLP 2023 paper: "Unveiling the Implicit Toxicity in Large Language Models""
      Python
      01000Updated Nov 30, 2023Nov 30, 2023
    • Re3Dial

      Public
      Official Code for EMNLP 2023 paper: "Re3Dial: Retrieve, Reorganize and Rescale Conversations for Long-Turn Open-Domain Dialogue Pre-training"
      Python
      0610Updated Oct 22, 2023Oct 22, 2023
    • Official Implementation for the ICML2022 paper "Directed Acyclic Transformer for Non-Autoregressive Machine Translation"
      Python
      Other
      18124100Updated Sep 10, 2023Sep 10, 2023
    • The beamsearch algorithm for DA-Transformer
      C++
      Other
      1500Updated Sep 10, 2023Sep 10, 2023