Dieses Stellenangebot ist nicht mehr verfügbar
Senior Site Reliability Engineer
- Somerville, Massachusetts, United States
- Somerville, Massachusetts, United States
Über
**This role is located in Somerville, MA (add Location) - We are a hybrid work environment and are in the office 3+ days/per week.**
Tulip, the leader in frontline operations, is helping companies around the world equip their workforce with connected apps, leading to higher quality work, improved efficiency, and end-to-end traceability across operations. Companies of all sizes and across industries have implemented composable solutions with Tulip's cloud-native, no-code platform to solve some of the most pressing challenges in operations: error-proofing processes and boosting productivity, capturing and analyzing real-time data, and continuous improvement.
A spinoff out of MIT, Tulip is headquartered in Somerville, MA, with offices in Germany and Hungary. Focused on composable, human-centric solutions for industrial environments, Tulip is disrupting the MES category and has been recognized as a World Economic Forum Global Innovator. Tulip has also been named one of Energage's Top Workplaces USA and one of Built In Boston's "Best Places to Work" and "Best Midsize Places to Work" for 2024.
About You:
- You have experience building and maintaining stable infrastructure at scale.
- You can reason about systems — their edge cases, failure modes, and life cycles.
- You're excited about setting the technical agenda and coming up with novel, broad ideas.
- You can debug complex issues across the entire stack.
- You're opinionated about the tools and frameworks that work best.
- You enjoy building for other engineers equally, if not more, than building for a customer.
- You know what a good SLA looks like, and can teach others how to spot one.
What skills do I need?
- You have 5+ years of experience working with open source Observability tools (e.g. LGTM stack)
- You have hands-on experience instrumenting distributed systems using OpenTelemetry and managing metrics pipelines with Prometheus at scale.
- You have experience working with time-series data, ideally using promQL
- You can pick up new languages/frameworks with ease. We currently run Go and Typescript services on Kubernetes.
- You can communicate as well as you can code. You understand the value of discussion and work best in a team that champions clear and frequent communication.
Key Responsibilities:
- Mentor and evangelize on observability best practices, SLIs/SLOs, and reliability culture across engineering teams.
- Help architect our systems for growth and scale.
- Implement internal tools to automate common developer tasks.
- Perform incident response and debug production issues across the entire stack.
- Design, build, and maintain the core infrastructure used by all of Tulip's engineering teams.
- Work to automate detection and resolution of recurring issues.
Key Collaborators:
Engineering team, Edge team, DevOps team, Hardware team
Working At Tulip
We know even great candidates experience imposter syndrome. Even if you don't match every requirement, applying gives you the opportunity to be considered.
We're building a strong, diverse team that values hard work, families, and personal well-being. Benefits of working with us include:
- Direct impact on product and culture
- Company equity
- Competitive benefits package including Health, Dental, Vision, Short-term Disability, Long-term Disability, Life Insurance, AD&D Insurance, Flexible Spending
Sprachkenntnisse
- English
Dieses Stellenangebot wurde von einem unserer Partner veröffentlicht. Sie können das Originalangebot einsehen hier.