Senior Software Engineer [REMOTE]
Upbound - Job Posting
- Austin, Texas, United States
About
Our mission is to power the AI‑native enterprise with a foundational platform layer that helps teams provision, operate, and adapt infrastructure at scale, so platforms are ready for both humans and AI agents.
Today, Upbound supports Fortune 500 companies and platform engineers across 100+ countries. Crossplane has surpassed 100M downloads and is used by 1,000+ teams worldwide. Upbound is hiring a Senior Software Engineer to build and operate Upbound Spaces, the control‑plane management software at the heart of the Upbound Platform.
What You’ll Do
Actively build and operate Upbound Spaces in production, troubleshooting and resolving issues across multi-tenant SaaS environments, and contributing to Upbound's open-source projects, including Crossplane.
Take ownership of building features in high demand by Upbound’s customers and deliver new functionality that delights and amazes our users.
Investigate and debug complex issues in customer environments, including multi‑control‑plane scenarios, resource reconciliation problems, and performance bottlenecks.
Communicate through thoughtful and thorough design documents for new initiatives and detailed post‑incident reviews that drive system improvements.
Support the full project lifecycle for highly scalable and reliable services running in a cloud environment—discovery, analysis, architecture, design, review, documentation, building, migration, automation, deployment, production‑readiness, and ongoing operational support.
Write and maintain Go code that interfaces with the Kubernetes API, such as operators, controllers, and add‑ons, with a focus on observability, debuggability, and operational excellence.
Deploy, manage, and troubleshoot our Kubernetes services in production, using metrics, logs, and traces to identify and resolve issues quickly.
Build and maintain operational tooling for debugging customer environments, analyzing control‑plane health, and automating incident response.
Author documentation, user guides, runbooks, and blog posts to support and promote new features that you release.
Support the software release cycle for Spaces self‑hosted distributions, including diagnosing issues in customer‑managed deployments.
Participate in the on‑call rotation to support Upbound Cloud, responding to incidents and driving them to resolution.
What You’ll Bring
Experience operating production cloud services at scale: monitoring, alerting, incident response, post‑mortems, and continuous improvement of service reliability.
Strong debugging skills across distributed systems, with experience using observability tools (Prometheus, Grafana, OpenTelemetry, distributed tracing) and techniques for diagnosing issues in production environments.
Experience building and operating controllers that interact with the Kubernetes API server, including troubleshooting reconciliation loops, managing API rate limits, and optimizing controller performance.
Comfortable working directly with customers to understand, reproduce, and resolve complex technical issues in their environments.
Take responsibility and ownership for solving problems even when they are outside your lane, especially during incidents affecting customer workloads.
Demonstrate excellence in your work, constantly trying to improve your skills and the operational posture of the systems you build.
Empathy for customers: you keep them in mind as you build solutions and understand that reliability and debuggability are features.
Realize the importance of clear communication and effective collaboration to work as a team, deliver great results, and support customers through technical challenges.
Help create a safe environment where everyone can contribute, learn from failures, share on‑call knowledge, and help each other grow as operators and engineers.
Why Upbound?
At Upbound, you’ll help shape the systems and strategies that drive predictable, scalable growth in a product‑led company embracing usage‑based models. If you’re excited to build from the ground up, work with cutting‑edge cloud technologies, and directly impact how revenue is generated and scaled, this is your seat at the table.
As set forth in Upbound’s Equal Employment Opportunity policy, we do not discriminate on the basis of any protected group status under any applicable law.
Languages
- English