Overview
NVIDIA NIM™ provides prebuilt, optimized inference microservices for rapidly deploying the latest AI models on any NVIDIA-accelerated infrastructure—cloud, data center, workstation, and edge.
Sovereign AI Agents Think Local, Act Global With NVIDIA AI Factories
Validated design for AI factories pairs accelerated infrastructure with software, including new NVIDIA NIM™ capabilities and an expanded suite of NVIDIA blueprints.
Free Development Access to NIM
Get access to unlimited prototyping with hosted APIs for NIM accelerated by DGX Cloud, or download and self-host NIM microservices for research and development as part of the NVIDIA Developer program.
Accelerate AI Deployment With NVIDIA NIM
NVIDIA NIM combines the ease of use and operational simplicity of managed APIs with the flexibility and security of self-hosting models on your preferred infrastructure. NIM microservices come with everything AI teams need—the latest AI foundation models, optimized inference engines, industry-standard APIs, and runtime dependencies—prepackaged in enterprise-grade software containers ready to deploy and scale anywhere.
Benefits
Enterprise Generative AI That Does More for Less
Easy, enterprise-grade microservices built for high-performance AI—designed to work seamlessly and scale affordably. Experience the fastest time to value for AI agents and other enterprise generative AI applications powered by the latest AI models for reasoning, simulation, speech, and more.
Ease of Use
Accelerate innovation and time to market with prebuilt, optimized microservices for the latest AI models. With standard APIs, models can be deployed in five minutes and easily integrated into applications.
Enterprise Grade
Deploy enterprise-grade microservices that are continuously managed by NVIDIA through rigorous validation processes and dedicated feature branches—all backed by NVIDIA enterprise support, which also offers direct access to NVIDIA AI experts.
Performance and Scale
Improve TCO with low-latency, high-throughput AI inference that scales with the cloud, and achieve the best accuracy with support for fine-tuned models out of the box.
Portability
Deploy anywhere with prebuilt, cloud-native microservices ready to run on any NVIDIA-accelerated infrastructure—cloud, data center, and workstation—and scale seamlessly on Kubernetes and cloud service provider environments.
Demo
Build AI Agents With NIM
Learn how to set up two AI agents—one for content generation and another for digital graphic design—and see how easy it is to get up and running with NIM microservices.
Technology
Building Blocks for Agentic AI
Get the Latest AI Models
Access the latest AI models for reasoning, language, retrieval, speech, vision and more—ready to deploy in five minutes on any NVIDIA-accelerated infrastructure.
Jump-Start Development With NVIDIA Blueprints
Build impactful agentic AI applications with comprehensive reference workflows featuring NVIDIA acceleration libraries, SDKs, and NIM microservices.
Simplify Development With NVIDIA Agent Intelligence Toolkit
Weave NIM microservices into agentic AI applications with the NVIDIA Agent Intelligence Toolkit library, a developer toolkit for building AI agents and integrating them into custom workflows.
Benchmarks
Boost Throughput With NIM
NVIDIA NIM provides optimized throughput and latency out of the box to maximize token generation, support concurrent users at peak times, and improve responsiveness. NIM microservices are continuously updated with the latest optimized inference engines, boosting performance on the same infrastructure over time.
Models
Build With the Leading Open Models
Get optimized inference performance for the latest AI models to power multimodal agentic AI with reasoning, language, retrieval, speech, image, and more. NIM comes with accelerated inference engines from NVIDIA and the community, including NVIDIA® TensorRT™, TensorRT-LLM, and more—prebuilt and optimized for low-latency, high-throughput inferencing on NVIDIA-accelerated infrastructure.
Features
The Easy Button for AI Development and Deployment
Designed to run anywhere, NIM inference microservices expose industry-standard APIs for easy integration with enterprise systems and applications and scale seamlessly on Kubernetes to deliver high-throughput, low-latency inference at cloud scale.
Deploy NIM
Deploy NIM for your model with a single command. You can also easily run NIM with LLMs supported by NVIDIA TensorRT-LLM, vLLM, or SGLang, including fine-tuned models.
Run Inference
Get NIM up and running with the optimal runtime engine based on your NVIDIA-accelerated infrastructure.
Build
Integrate self-hosted NIM endpoints with just a few lines of code.
Use Cases
How NIM Is Being Used
See how NVIDIA NIM supports industry use cases, and jump-start your AI development with curated examples.
AI Virtual Assistants
Enhance customer experiences and improve business processes with generative AI.
Intelligent Document Processing
Use generative AI to accelerate and automate document processing.
AI for Hyperpersonalized Shopping
Deliver tailored experiences that enhance customer satisfaction with the power of AI.
3D Product Configurators
Use OpenUSD and generative AI to develop and deploy 3D product configurator tools and experiences to nearly any device.
Starting Options
Ways to Get Started With NVIDIA NIM
Start Prototyping for Free
Get started with easy-to-use API endpoints for NIM, powered by DGX Cloud.
Ensure your data isn't used for model training.
Access for development and testing as part of the NVIDIA Developer Program.
Download and Deploy
Run NVIDIA NIM to scale optimized AI models in the cloud or data center of your choice.
Ensure data never leaves your secure enclave.
Seamlessly transition from cloud endpoints to self-hosted APIs without code changes.
Start with free access for development and testing, and move to an NVIDIA AI Enterprise license for production.
Get in Touch
Talk to an NVIDIA AI specialist about moving generative AI pilots to production with the security, API stability, and support that comes with NVIDIA AI Enterprise.
Explore your generative AI use cases.
Discuss your technical requirements.
Align NVIDIA AI solutions to your goals and requirements.
Resources
The Latest NVIDIA NIM Resources
Blogs
Sessions
Courses
Videos
NVIDIA NIM in the News
See All Tech Blogs
See All Topic News