Tags: #llm-deployment
awslabs/amazon-bedrock-agentcore-samples
A collection of samples and tutorials for deploying and operating scalable, secure, and framework-agnostic AI agents using Amazon Bedrock AgentCore.
awslabs/agentcore-samples
A collection of samples and tutorials for deploying and operating scalable, secure, and framework-agnostic AI agents using Amazon Bedrock AgentCore.
PaddlePaddle/FastDeploy
A high-performance inference and deployment toolkit for Large Language Models (LLMs) and Vision-Language Models (VLMs) based on PaddlePaddle.
gpustack/gpustack
An open-source GPU cluster manager that orchestrates high-performance AI inference engines across diverse environments, optimizing model deployment and resource utilization.
datawhalechina/handy-ollama
A comprehensive guide to deploying large language models (LLMs) locally on CPU using Ollama, making advanced AI accessible without powerful GPUs.