Sr. Production Engineer (Analytics Infrastructure)
- New York, New York, United States
About
Duration: 12 months (with possibility of extension based on performance and business needs)
Location: Remote, US (occasional travel to Yahoo offices in Sunnyvale, CA or New York, NY)
Coding test: Required
Pay: $85-$100/hr
How does this role fit within the team/department?
This role sits within Yahoo Mail's Production Engineering team. Engineers in this role directly support cloud infrastructure reliability, cost efficiency, and automation for one of the world's largest consumer email platforms, serving hundreds of millions of users globally.
Overview Of The Team
Yahoo Mail Production Engineering manages GCP-based infrastructure, including GKE clusters, Compute Engine, Dataproc, Vertex AI, and other GCP services. The team is responsible for production reliability, capacity planning, cost optimization, CI/CD pipelines, MLOps, and infrastructure-as-code across 40+ GCP projects at petabyte data scale. We work in close collaboration with software architects, developers, and product managers to deliver end-to-end results.
Primary responsibilities (daily/weekly)?
- Operate, monitor, and improve GKE apps, Analytics, and ML production workloads
- Manage Terraform/Ansible/Helm IaC for GCP resource provisioning and policy enforcement
- Participate in on-call rotation for production incidents
- Review and improve CI/CD pipelines for services deployed in Python, Node.js, and Java
- Collaborate with architects and developers on infrastructure architecture and design
- Automate cloud operations through programmable and secure solutions
- Leverage AI-driven tools for development agents, troubleshooting, and automation
Key projects or initiatives for the role?
- On-prem to GCP migration of large-scale Yahoo Mail workloads
- Analytics pipeline and reliability improvements
- Platform work (Vertex AI, Generative AI, BigQuery, Looker, Dataproc)
Success metrics or KPIs for this role?
- On-call incident resolution time and escalation rate (MTTD, MTTR, MTTE)
- Terraform/IaC coverage of managed resources
- CI/CD pipeline reliability and deployment velocity
- Progress on on-prem to GCP migration milestones
- Sprint goal achievement (SMART goals per sprint)
Technical (Required)
- 5+ years in SRE, DevOps, Infrastructure, or Cloud Operations with on-call duties
- GCP services proficiency: GKE, GCE, Networking, Security, CI/CD, and common cloud technologies
- IaC proficiency: Terraform, Ansible, and Helm Charts
- Programming in Python, Node.js, and Java; ability to build CI/CD pipelines in these languages
- Linux, TCP/IP, HTTP, mail protocols, DNS, CDN, load balancers, and troubleshooting
- Experience with large-scale production applications, systems, and networks
Technical (Advantageous)
- Cloud databases and storage: GCS, Cloud SQL, Spanner, Memorystore
- ML/AI platforms: Vertex AI, Generative AI, BigQuery, Looker, Dataproc
- Cloud Observability and OpenTelemetry
- Proven track record migrating on-prem infrastructure to GCP
- Operational experience in both on-prem and cloud environments
Ideal experience level (years, leadership, industries)?
5+ years of total cloud/SRE experience, with a preference for GCP. Experience at large-scale internet companies running petabyte-scale production data systems is strongly preferred.
Languages
- English
This job comes from a TieTalent partner platform.