ai-security

The official implementation of the CCS'23 paper, Narcissus clean-label backdoor attack -- only takes THREE images to poison a face recognition dataset in a clean-label way and achieves a 99.89% attack success rate.

adversarial-machine-learning adversarial-attacks ai-security backdoor-attacks deep- poisoning-attacks

Updated May 9, 2023
Python

LetterLiGo / SafeGen_CCS2024

Star

[CCS'24] SafeGen: Mitigating Unsafe Content Generation in Text-to-Image Models

text-to-image ai-safety ai-security generative-ai thrustworthy-ai

Updated Oct 13, 2024
Python

RjDuan / AdvDrop

Star

Code for "Adversarial attack by dropping information." (ICCV 2021)

pytorch adversarial-examples adversarial-attacks ai-security

Updated Jan 13, 2022
Python

jay-johnson / train-ai-with-django-swagger-jwt

Star

Train AI (Keras + Tensorflow) to defend apps with Django REST Framework + Celery + Swagger + JWT - deploys to Kubernetes and OpenShift Container Platform

machine-learning jwt deep-neural-networks ai openshift tensorflow rest-api django-rest-framework swagger drf keras celery network-analysis network-security celery-tasks machine-learning-security ai-security anti-nex

Updated Nov 2, 2018
Python

Hacking-Notes / VulnScan

Star

Performing website vulnerability scanning using OpenAI technologie

hacking-tool vulnerability-scanners vulnerability-scanning ai-security chatgpt

Updated Apr 19, 2024
Python

RomiconEZ / llamator

Star

Framework for testing vulnerabilities of large language models (LLM).

Updated Jan 22, 2025
Python

mitre-atlas / atlas-data

Star

ATLAS tactics, techniques, and case studies data

security machine-learning mitre-attack ai-security mitre-atlas

Updated Oct 2, 2024
Python

elliothe / CVPR_2019_PNI

Star

pytorch implementation of Parametric Noise Injection for adversarial defense

ai-security adversarial-defense

Updated Oct 23, 2019
Python

wssun / TiSE-CodeLM-Security

Star

This repository provide the studies on the security of language models for code (CodeLMs).

security language-model adversarial-attacks ai-security code-intelligence backdoor-attacks adversarial-defense backdoor-defense ai4se lm4se lm4code

Updated Dec 13, 2024
Python

HKU-TASR / Imperio

Star

[IJCAI 2024] Imperio is an LLM-powered backdoor attack. It allows the adversary to issue language-guided instructions to control the victim model's prediction for arbitrary targets.

ai-security backdoor-attacks llm

Updated Apr 17, 2024
Python

zhangzp9970 / MIA

Star

Unofficial pytorch implementation of paper: Model Inversion Attacks that Exploit Confidence Information and Basic Countermeasures

machine-learning research deep-learning ai-security model-inversion-attacks

Updated Oct 6, 2023
Python

LetterLiGo / Inaudible-Adversarial-Perturbation-Vrifle

Star

[NDSS'24] Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Time

artificial-intelligence iot-security ai-security

Updated Sep 28, 2024
Python

AI-Initiative-KAUST / VideoRLCS

Star

Learning to Identify Critical States for Reinforcement Learning from Videos (Accepted to ICCV'23)

reinforcement-learning computer-vision deep-learning explainable-ai ai-security iccv2023

Updated Aug 19, 2023
Python

modzy / sdk-python

Star

Python library for Modzy Machine Learning Operations (MLOps) Platform

python docker machine-learning microservices deployment api-client model-deployment model-serving serving explainable-ai production-machine-learning ai-security mlops kuberenetes drift-detection machine-learning-operations

Updated Sep 8, 2023
Python

reds-lab / Meta-Sift

Star

The official implementation of USENIX Security'23 paper "Meta-Sift" -- Ten minutes or less to find a 1000-size or larger clean subset on poisoned dataset.

ai-security backdoor-attacks data-poisoning dataset-security

Updated Apr 27, 2023
Python

moonwatcher-ai / moonwatcher

Star

Evaluation & testing framework for computer vision models

computer-vision ai-safety ethical-artificial-intelligence ai-security mlops ml-safety ml-validation trustworthy-ai ml-testing

Updated Jun 20, 2024
Python

jay-johnson / antinex-datasets

Star

Datasets for training deep neural networks to defend software applications

Updated Jun 4, 2018
Python

Improve this page

Add a description, image, and links to the ai-security topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the ai-security topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ai-security

Here are 50 public repositories matching this topic...

Giskard-AI / giskard

utkusen / promptmap

normster / llm_rules

reds-lab / Narcissus

LetterLiGo / SafeGen_CCS2024

RjDuan / AdvDrop

jay-johnson / train-ai-with-django-swagger-jwt

Hacking-Notes / VulnScan

RomiconEZ / llamator

mitre-atlas / atlas-data

elliothe / CVPR_2019_PNI

wssun / TiSE-CodeLM-Security

HKU-TASR / Imperio

zhangzp9970 / MIA

LetterLiGo / Inaudible-Adversarial-Perturbation-Vrifle

AI-Initiative-KAUST / VideoRLCS

modzy / sdk-python

reds-lab / Meta-Sift

moonwatcher-ai / moonwatcher

jay-johnson / antinex-datasets

Improve this page

Add this topic to your repo