Tags: #red-teaming
LLM Evaluation & Red Teaming Tool
Node.js
20.5k
promptfoo/promptfoo
A CLI and library for testing, evaluating, and red-teaming LLM applications to ensure security, reliability, and performance across various models.
AI Security Platform
Docker
3.6k
Tencent/AI-Infra-Guard
A full-stack AI Red Teaming platform designed to secure AI ecosystems by offering comprehensive vulnerability scanning and LLM jailbreak evaluation.
AI/ML Testing and Evaluation Framework
5.3k
Giskard-AI/giskard-oss
An open-source Python library for comprehensive testing, evaluation, and red teaming of LLM agents and AI systems, designed for dynamic, multi-turn interactions.
Autonomous Hacking Agent
Docker
3.2k
PurpleAILAB/Decepticon
Decepticon is an autonomous hacking agent that leverages AI to automate red team operations, executing reconnaissance, exploitation, and post-exploitation tasks at machine speed.