Tags: #testing
LLM Evaluation & Red Teaming Tool
Node.js
20.0k
promptfoo/promptfoo
Test, evaluate, and red-team your LLM applications for security, reliability, and performance across multiple models.
Mobile Automation Server
Node.js
4.5k
mobile-next/mobile-mcp
A Model Context Protocol (MCP) server enabling scalable, platform-agnostic mobile automation and interaction for iOS and Android devices, optimized for AI agents and LLMs.
LLM Evaluation and Testing Framework
Python
5.3k
Giskard-AI/giskard-oss
An open-source Python library for comprehensive evaluation, testing, and red teaming of LLM agents and agentic systems.
AI Application Development Suite
Python
11.1k
microsoft/promptflow
A comprehensive development suite for building, testing, evaluating, deploying, and monitoring high-quality LLM-based AI applications.