Tags: #red-teaming

LLM Evaluation & Red Teaming Tool

20.5k

promptfoo/promptfoo

A CLI and library for testing, evaluating, and red-teaming LLM applications to ensure security, reliability, and performance across various models.

llm evaluation red-teaming

Details

AI Security Platform

Docker

3.6k

Tencent/AI-Infra-Guard

A full-stack AI Red Teaming platform designed to secure AI ecosystems by offering comprehensive vulnerability scanning and LLM jailbreak evaluation.

ai-security red-teaming vulnerability-scanning

Details

AI/ML Testing and Evaluation Framework

5.3k

Giskard-AI/giskard-oss

An open-source Python library for comprehensive testing, evaluation, and red teaming of LLM agents and AI systems, designed for dynamic, multi-turn interactions.

llm testing evaluation

Details

Autonomous Hacking Agent

Docker

3.2k

PurpleAILAB/Decepticon

Decepticon is an autonomous hacking agent that leverages AI to automate red team operations, executing reconnaissance, exploitation, and post-exploitation tasks at machine speed.

autonomous-hacking red-teaming ai-agent

Details