llm-attack

Here are 4 public repositories matching this topic...

Official implementation of paper: DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM Jailbreakers

attack jailbreak safety adversarial-machine-learning adversarial-attacks llm llms llm-attack

Basic payloads for LLM attacks

sql attack xss-vulnerability xss-attacks sqlinjection injections llm paylods llm-attack

This repository hosts the Socratic Method, a dataset that helps LLMs stay aligned during fine-tuning.

fine-tuning-llm llama2 llm-attack

A project for CSE527A Natural Language Processing at Washington University
