Tags: #testing

LLM Evaluation & Red Teaming Tool

20.5k

promptfoo/promptfoo

A CLI and library for testing, evaluating, and red-teaming LLM applications to ensure security, reliability, and performance across various models.

llm evaluation red-teaming

Details

Mobile Automation Protocol Server

Node.js

4.8k

mobile-next/mobile-mcp

A platform-agnostic Model Context Protocol (MCP) server enabling scalable mobile automation and development across iOS and Android devices, emulators, and simulators.

mobile automation ios android

Details

AI Agent Testing CLI Tool

Node.js

3.4k

millionco/expect

Expect empowers AI coding agents with automated, real-browser QA capabilities by generating and executing test plans based on code changes.

testing ai-agent playwright

Details

AI/ML Testing and Evaluation Framework

5.3k

Giskard-AI/giskard-oss

An open-source Python library for comprehensive testing, evaluation, and red teaming of LLM agents and AI systems, designed for dynamic, multi-turn interactions.

llm testing evaluation

Details

Development Tool

Ruby

4.9k

simplecov-ruby/simplecov

A powerful code coverage analysis tool for Ruby, simplifying result processing and merging across test suites.

ruby code-coverage testing

Details