- +2
- +11
- California, United States
About
At Tecton, we solve the complex data problems in production machine learning. Tecton’s feature platform makes it simple to activate data for smarter models and predictions, abstracting away the complex engineering to speed up innovation.
Tecton’s founders developed the first Feature Store when they created Uber’s Michelangelo ML platform, and we’re now bringing those same capabilities to every organization in the world.
Tecton is funded by Sequoia Capital, Andreessen Horowitz, and Kleiner Perkins, along with strategic investments from Snowflake and Databricks. We have a fast-growing team that’s distributed around the world, with offices in San Francisco and New York City. Our team has years of experience building and operating business-critical machine learning systems at leading tech companies like Uber, Google, Meta, Airbnb, Lyft, and Twitter.
As a member of Tecton’s Infrastructure Engineering DevOps team, you will contribute to and own the foundation for building, automating, and scaling Tecton. You will leverage your experience with cloud architectures, distributed systems, containerization technologies (Kubernetes), and Linux system internals to design, build, and maintain our multi-cloud deployments, ensure our systems are secure in-depth, and work closely with the rest of Tecton’s Infrastructure Engineering team to scale and optimize our core compute and online serving systems.
Prior experience with machine learning is not required. We are looking for exceptional DevOps, infrastructure, and software engineers who are driven to find simple solutions to complex challenges. You'll be at the intersection of design, engineering, and operational processes.
Responsibilities
Own the complete lifecycle of Tecton’s cloud infrastructure development from design through automation, deployment, and operation
Engage with other engineering and solutions teams to build tools that will accelerate engineering and deployments efficiency
Develop and maintain infrastructure and tooling to monitor observability of Tecton health, availability, and latency
Joint ownership building and managing Tecton’s CI/CD system to reliably deploy production components with a GitOps model - Including the multi-language, multi-platform Build System based on Bazel
Participate in an on-call rotation, triaging and addressing Tecton platform major incidents
Qualifications
Engineer with 4+ years of experience in DevOps, SRE, or Software Engineering
Experience with infrastructure-as-code tools such as Terraform
Fluent in one or more programming languages such as Python or Golang
Expertise in cloud providers such as AWS, Google Cloud, and/or Microsoft Azure
Experience building and troubleshooting robust and secure networks
Experience with microservices & container orchestration such as Kubernetes
Expertise in observability stack (Prometheus, ELK, Chronosphere, Datadog, etc.)
A passion for excellence and high developer productivity
Strong and effective verbal and written communication skills
In-depth experience with Linux systems administration and troubleshooting
Nice to have
Experience building reliable CI/CD pipelines (Github, CircleCI, Buildkite, etc.)
Experience with Kubernetes configuration management tools (Helm, Kustomize, etc.)
Experience with GitOps tools (Flux CD, Argo CD)
Experience with on-call rotation and support of production environments
Experience working with large-scale data infrastructure or batch/streaming data pipelines
The estimated US base salary range for this position is $176,000 - $210,000 annually for employees based within California & New York. In addition to base salary, we offer competitive equity & comprehensive benefits such as medical, dental, vision, life, 401(K), flexible paid time off, 10 paid holidays each calendar year, sick time, leave of absence as per the FMLA and other relevant leave laws. Individual compensation packages are based on multiple factors such as location, level, role scope, and complexity, as well as additional job-related factors such as skills, experience, and expertise.
Tecton is a remote-friendly company that employs a hybrid working policy for employees based in the SF, NY, and Seattle areas. We believe that working in-person helps us stay connected, collaborate faster, and promote a strong culture while still providing the flexibility of working from home. We expect SF & NY employees to be in the office at least two designated days per week, and those in the Seattle area at least two designated days per month.
Tecton values diversity and is an equal opportunity employer committed to creating an inclusive environment for all employees and applicants without regard to race, color, religion, national origin, gender, sexual orientation, age, marital status, veteran status, disability status, or other applicable legally protected characteristics. If you would like to request any accommodations from the application through to the interview, please contact us at recruitingteam@tecton.ai.
Seniority level
Mid-Senior level
Employment type
Full-time
Job function
Engineering and Information Technology
Industries
Technology, Information and Internet, Software Development, and Computer and Network Security
#J-18808-Ljbffr
Nice-to-have skills
- DevOps
- Distributed Systems
- Kubernetes
- Linux
- Terraform
- Python
- AWS
- Microservices
- Prometheus
- Github
- CircleCI
Work experience
- DevOps
- Site Reliability (SRE)
Languages
- English