Baseten
Serve and scale open-source and custom AI models on the fastest, most reliable inference platform.
Job facts
- Location: San Francisco
- Workplace: hybrid
- Type: full-time
- Department: EPD
- Posted: Feb 24, 2026
Software Engineer - Baseten for Labs
at Baseten
ABOUT BASETEN
Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E, backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to when shipping AI products.
THE ROLE:
You'll join Baseten for Labs — a small, high-ownership team building the products that power how model labs and AI researchers ship and scale their models. This team moves fast and owns its outcomes end-to-end.
This is a role for a full-stack, product-minded engineer who likes working across the whole surface area: from shaping a clean API or user-facing feature, to building the backend systems that run it reliably in production. You'll contribute across three interconnected product areas:
- Model Library: The place developers discover, evaluate, and deploy the right model for their use case. You'll build the browsing, evaluation, and onboarding experiences that help developers navigate an exploding model landscape.
- Inference API Gateway: A production-ready, white-labeled API gateway that lets model labs serve their models to customers under their own domain. You'll build the auth, key management, rate limiting, metering, and multi-tenant isolation that power it.
You'll work on meaningful, high-impact projects with real ownership of your work — and you'll think about the developer experience as much as the systems design.
EXAMPLE INITIATIVES:
RESPONSIBILITIES:
- Take meaningful ownership of projects: from API design and backend implementation to frontend surfaces, rollout, and operation.
- Build backend services with high reliability and clear SLOs: auth, rate limiting, quotas, metering, and multi-tenant isolation.
- Ship developer-facing product surfaces: dashboards, onboarding flows, and self-serve tooling that reduce time-to-value.
- Collaborate closely with design, product, and GTM to define and ship what labs and developers actually need.
- Drive performance and reliability improvements through profiling, tracing, and load testing.
REQUIREMENTS:
- 4+ years building and operating production software, including at least some full-stack experience (backend-primary is fine, but you're comfortable touching the frontend).
- Demonstrated ability to take initiative and contribute beyond the spec; you think about the "why" behind what you build.
- Strong backend fundamentals: API design, distributed systems, observability, and operational rigor.
- Comfort working across the stack: backend services, data pipelines, and user-facing product surfaces.
- Strong written communication: clear design docs, effective async collaboration.
- Genuine curiosity about the AI/ML infrastructure space; you don't need ML expertise, but you want to understand the ecosystem.
NICE TO HAVE:
- Experience building developer-facing products: APIs, SDKs, CLIs, dashboards, or self-serve workflows.
- Experience with API gateways, auth systems, billing/metering infrastructure, or multi-tenant platforms.
- Frontend experience (React/TypeScript) or strong product UX instincts for developer tools.
- Familiarity with model serving, LLM runtimes, or inference platforms.
- Comfort with Kubernetes, distributed scheduling, or service mesh concepts.
BENEFITS
- Competitive compensation, including meaningful equity.
- 100% coverage of medical, dental, and vision insurance for employees and dependents.
- Flexible PTO policy, including a company-wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!).
- Paid parental leave.
- Fertility and family-building stipend through Carrot.
- Company-facilitated 401(k).
- Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.
Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.
At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.
We are an Equal Opportunity Employer and will consider qualified applicants with criminal histories in a manner consistent with applicable law (for example, the requirements of the San Francisco Fair Chance Ordinance, where applicable).