Enterprise AI transformation is accelerating faster than anyone imagined. Our latest customer story with WRITER shows what’s possible when innovation meets execution. WRITER partners with some of the world’s most sophisticated enterprises. Their customers demand ROI, accountability, and secure, scalable AI starting on day one. That’s why we’re proud to support WRITER with reliable model deployment across hyperscalers and infrastructure designed for rapid iteration, security, and scale. Their work on self-evolving models marks an exciting next chapter in enterprise AI. And we’re honored to be part of their journey. Watch the story 🎥
Baseten
Software Development
San Francisco, CA 17,101 followers
Inference is everything.
About us
Inference is everything. Baseten is an AI infrastructure platform giving you the tooling, expertise, and hardware needed to bring great AI products to market, fast. Our proprietary Inference Stack combines cutting-edge performance research with highly performant and reliable infrastructure to give you out-of-the-box global availability with 99.99% uptime.
- Website
- https://www.baseten.co/
- Industry
- Software Development
- Company size
- 51-200 employees
- Headquarters
- San Francisco, CA
- Type
- Privately Held
- Specialties
- developer tools, software engineering, and artificial intelligence
Products
Baseten
Machine Learning Software
At Baseten we provide all the infrastructure you need to deploy and serve ML models performantly, scalably, and cost-efficiently. Get started in minutes, and avoid getting tangled in complex deployment processes. You can deploy best-in-class open-source models and take advantage of optimized serving for your own models. Our horizontally scalable services take you from prototype to production, with light-speed inference on infra that autoscales with your traffic. Best in class doesn't mean breaking the bank. Run your models on the best infrastructure without running up costs by taking advantage of our scale-to-zero feature.
Locations
- San Francisco, CA, US (Primary)
- New York, NY, US
Updates
-
Baseten reposted this
🩺 AI is transforming how clinicians document, validate, and interpret patient information – freeing up more time for patient care instead of paperwork. In our new use case with Baseten, we show how healthcare teams can deploy multimodal clinical AI on Vultr Cloud GPUs accelerated by NVIDIA HGX B200 to automate documentation and imaging workflows with low latency, predictable costs, and HIPAA-ready security. See how Baseten is helping organizations move clinical AI from research to production – and what it means for the future of patient care. Read the full story and get inspired to scale your own healthcare AI solutions. https://lnkd.in/gV7AZ3BS #HealthcareAI #CloudGPU #ClinicalAutomation #NVIDIA
-
Tuhin Srivastava sits down on the Gradient Dissent podcast by Weights & Biases. They discuss all things inference and what sets Baseten apart:
> When to consider closed-source vs. open-source models
> Inference vs. runtime optimizations
> The importance of the developer experience
> Building a high-velocity product org
Links to the full episode are in the comments. Thanks for having us!
-
Our friends at Oxen.ai are on a mission: turn raw datasets into beautifully deployed, production-ready models, fast. And guess what? They’re doing it on Baseten. Their team moves quickly, and supporting them has been a total blast. Our case study shows how they’re pushing the boundaries of training while keeping things delightfully smooth for their customers. Check out their journey: From datasets to deployed models: How Oxen AI builds on Baseten https://lnkd.in/eAYNx3ev Massive thanks to the Oxen.ai team and Greg Schoeninger. You’re the kind of customers that make us want to run faster, build more, and high-five more often. #AI #MachineLearning #MLOps #Baseten
-
Happy Monday 👋 We're pleased to welcome some new crew members to the Baseten team. Say hello to Tom Berger, Paulina Pevzner, Tal Yaacovi, and Michael Cenni! Paulina and Michael join us on the GTM team, Tom joins us as an engineer on the Infrastructure team, and Tal joins us as an engineer on the Core Product team. Looking forward to all your future success!
-
Baseten used NVIDIA Dynamo to double inference speed for long-context code generation and increase throughput by 1.6x. Dynamo simplifies multi-node inference on Kubernetes, helping us scale deployments while reducing costs. Read the full blog ⏬ https://lnkd.in/e2_K33Y7
-
From cricket, to the earliest days of Baseten, to where we are today, Aditya Agarwal and Tuhin Srivastava go deep on the Minus One podcast by South Park Commons. Check out the full episode 👇
You have to be your first believer. Otherwise, what’s the point? South Park Commons alum Tuhin Srivastava joined us on Minus One to break down how he’s scaling Baseten into one of the fastest-growing AI infrastructure companies today. Spoiler, it's not just luck. Full episode out now ⬇️
-
Welcome to the new-age Defense Against the Dark Arts. It's called fast inference! (And Harry Potter would be jealous.) Check out our deep dive on how the Baseten wizards (model performance team) optimized Kimi-K2 Thinking (now faster and just as smart as GPT-5). https://lnkd.in/eKcGZ79S
-
Congrats to the World Labs team on the launch today! Marble lets you create 3D worlds from just a single image, text prompt, video, or 3D layout. We couldn't be more excited to power the inference behind this. Can't wait to see what everyone makes. 🔥
Introducing Marble by World Labs, the foundation for a spatially intelligent future. Marble lets anyone generate high-fidelity, persistent 3D worlds from a single image, video, or even a text prompt. Get started today: marble.worldlabs.ai Powered by our multimodal world models that can perceive, generate, and interact with the 3D world, Marble represents a major step toward spatial intelligence where digital and physical realities blend seamlessly. Learn more about the technology behind Marble in our technical blog: https://lnkd.in/gBKT2NiY Welcome to the world model era. We’re just getting started.