
Renen Hallak
CEO, VAST Data
The Conference for the Inference Era
Join leaders from Character.ai, Workato, VAST Data, Arcee, and the vLLM ecosystem as they break down how they are running AI in production at scale.
The hardest problem in AI is no longer training models. It's running inference in production: latency, throughput, system reliability, and unit economics, all at once.
Deploy is where you will see how leading AI-native companies solve this, through real architectures, real tradeoffs, and live systems running today.
Across sessions and hands-on demos, you'll learn how teams are designing inference systems that scale without breaking performance or their margin structure.



CEO, VAST Data

CEO, Inferact

CEO, Arcee

Chief Architect, Character.ai

CEO, DigitalOcean

AI Research Technical Lead, Workato

CTO/CISO, ISMG

CPTO, DigitalOcean

Head of Growth & Marketing, DigitalOcean

Senior Director, Marketing & Communications, DigitalOcean

VP of Engineering, DigitalOcean

Principal Engineer, DigitalOcean

Senior Director of AI/ML, DigitalOcean

VP/Fellow Engineer, DigitalOcean

Senior Director of Engineering, DigitalOcean

Director of Product Management, DigitalOcean
Learn how leading teams optimize inference performance and cost in production — including intelligent routing, benchmarking, and cost-per-token optimization to protect unit economics at scale.
Don't let infrastructure stall your roadmap. See how teams integrate and ship production-ready AI features into existing applications using smart defaults, streamlined workflows, and scalable architecture patterns.
AI belongs in your stack, not in a silo. Learn how teams are integrating security, data infrastructure, vector databases, and observability into unified production systems for AI workloads.
A technical deep-dive into modern inference architecture — from serverless to dedicated GPUs, containerized services, and intelligent routing systems that power high-performance production AI.
The Deploy 2026 agenda is taking shape. Check back for updates. Across sessions, expect live demos and hands-on examples showing how these systems actually run in production.
April 28, 2026 • 12:00pm – 8:00pm PT
📍 Convene 100 Stockton
Join the technical leaders and executives building the next generation of AI-native companies at Deploy, the Conference for the Inference Era.
Close out Deploy with the people building the future of AI. At 5pm, join the AI builder community for a casual mixer with demos, networking and more.

Deploy 2026 will be hosted in person at Convene 100 Stockton, 40 O'Farrell St, San Francisco. The mainstage keynote will also be streamed live to registrants.
Deploy is designed for teams responsible for managing or building AI workloads in production at scale.
This Deploy marks an evolution in cloud infrastructure, one that will change how companies running AI in production think about their businesses. DigitalOcean's vertically integrated agentic inference cloud delivers radically simple operations and predictable unit economics, setting AI-native companies on a path to success and growth.
Deploy is free to attend. See you in San Francisco.
Deploy follows the DigitalOcean Community Code of Conduct.