This job offer is no longer available
About
Harvey is a secure AI platform for legal and professional services that augments productivity and automates complex workflows. Raised over $500 million
from strategic investors including Sequoia, Google Ventures, Kleiner Perkins, and OpenAI. Harvey is hiring the best
talent
from DeepMind, Google Brain, Stripe, FAIR, Tesla Autopilot, Glean, Superhuman, Figma, and more. Partnerships: Our engineers and researchers work directly with OpenAI to build the future of generative AI and redefine professional services. Performance: 4x ARR in 2024. As a Software Engineer on the Site Reliability team at Harvey, you will ensure the reliability, scalability, and performance of our legal AI platform. You’ll join a high-leverage team that sits at the intersection of infrastructure and product, owning the systems that keep our platform fast, secure, and always on. From scaling across 50+ regions to automating mission-critical operations, your work will ensure that Harvey remains resilient as we grow. If you’re passionate about building robust systems and reducing complexity through automation, we’d love to work with you.
We use an in-person work model and offer relocation assistance to new employees.
Design, implement, and manage monitoring, alerting, and infrastructure resources (compute, storage, networking) across 50+ global regions Lead incident management processes, including postmortems, root cause analyses, and driving actionable improvements Automate operational tasks and workflows, building tools and processes for capacity planning, graceful rollouts, and safe data access to maintain high reliability and reduce manual intervention Develop and enforce best practices for security, compliance, and infrastructure reliability while collaborating cross-functionally to integrate these principles throughout the software lifecycle Optimize infrastructure costs through strategic capacity planning and build-versus-buy decisions while maintaining system performance, reliability, and functionality.
3+ years of experience in Site Reliability Engineering or similar roles supporting production environments ~ Proficiency with cloud infrastructure platforms (Azure, GCP, AWS, etc.) ~ Strong programming skills (Python, Bash, Go, or similar languages) ~ Solid understanding of CI/CD, Kubernetes, containerization, networking, and cloud security principles ~ Harvey is an equal opportunity employer and does not discriminate on the basis of race, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition, or any other basis protected by law.
Languages
- English
Notice for Users
This job was posted by one of our partners. You can view the original job source here.