Tags: #testing
LLM Evaluation & Red Teaming Tool
Node.js
20.5k
promptfoo/promptfoo
A CLI and library for testing, evaluating, and red-teaming LLM applications to ensure security, reliability, and performance across various models.
Mobile Automation Protocol Server
Node.js
4.8k
mobile-next/mobile-mcp
A platform-agnostic Model Context Protocol (MCP) server enabling scalable mobile automation and development across iOS and Android devices, emulators, and simulators.
AI Agent Testing CLI Tool
Node.js
3.4k
millionco/expect
Expect empowers AI coding agents with automated, real-browser QA capabilities by generating and executing test plans based on code changes.
AI/ML Testing and Evaluation Framework
5.3k
Giskard-AI/giskard-oss
An open-source Python library for comprehensive testing, evaluation, and red teaming of LLM agents and AI systems, designed for dynamic, multi-turn interactions.
Development Tool
Ruby
4.9k
simplecov-ruby/simplecov
A powerful code coverage analysis tool for Ruby, simplifying result processing and merging across test suites.