Staff Network Engineer, Operations
G2 Venture Partners, United States
About
Crusoe Cloud is seeking a Staff Network Operations Engineer to help own production reliability across our global network infrastructure, including edge, backbone, data center fabric, and GPU cluster interconnects. This is a hands-on production ownership role focused on incident response, root cause analysis, and operational excellence initiatives that keep our hyperscale AI infrastructure running at scale. Your work will directly affect the availability of AI workloads running across thousands of GPUs worldwide.

The ideal candidate is a seasoned network engineer with deep operational experience in large-scale environments who thrives in high-pressure situations and takes pride in keeping systems healthy. You'll contribute to defining SLIs and SLOs, improving observability tooling, building automation to reduce toil, and mentoring peers, all while serving as a key escalation point during high-severity network events.

What You'll Be Working On:
- Production Reliability: Help own uptime across Crusoe's global edge, backbone, data center, and GPU cluster network, directly supporting AI workloads at scale.
- Incident Response: Lead and contribute to end-to-end response for high-severity network events, including mitigation, stakeholder communication, and postmortem documentation.
- Root Cause Analysis: Drive RCAs for production incidents, identify systemic issues, and author remediation plans tracked through to closure.
- Observability Improvements: Contribute to and improve Crusoe's network monitoring stack using streaming telemetry, SNMP, NetFlow, and tools such as Kentik, Grafana, Prometheus, and ThousandEyes.
- Operational Standards: Author and maintain runbooks, escalation playbooks, and SOPs used across the operations team.
- Operational Automation: Write Python-based tooling to reduce toil, automate common remediation workflows, and accelerate mean time to resolution.
- SLI/SLO Contribution: Partner with Architecture and SRE teams to define and track network reliability metrics and service level objectives backed by real-time dashboards.
- Mentorship: Provide technical guidance to Senior engineers and contribute to a culture of operational excellence and continuous learning.

What You'll Bring to the Team:
- 8+ years of production network engineering experience with a focus on operations, incident response, and reliability in large-scale or internet-scale environments.
- Hands-on experience with observability and monitoring tools including streaming telemetry, SNMP, NetFlow/sFlow, Grafana, Prometheus, and ThousandEyes.
- Experience operating RDMA/RoCE lossless fabrics for GPU or HPC workloads, including familiarity with PFC, ECN, and DCQCN tuning.
- Expert hands-on knowledge of BGP, EVPN-VXLAN, IS-IS, OSPF, MPLS, QoS, and TCP/IP in production data center environments.
- Proficiency with Arista (EOS) and Juniper (Junos) platforms in leaf-spine CLOS architectures across multi-vendor environments.
- Python proficiency for writing auto-remediation scripts, diagnostic tooling, and operational automation.
- Comfort operating large device fleets across multi-region environments with on-call responsibility, including experience as an escalation point during critical events.
- Bachelor's degree in Computer Science, Electrical Engineering, or a related field, or equivalent practical experience.

Bonus Points:
- Experience with NVIDIA/Mellanox networking platforms in GPU cluster environments.
- Familiarity with Kentik or Arbor for traffic analysis and DDoS visibility.
- Experience defining or contributing to SLIs and SLOs in partnership with SRE or product teams.
- Exposure to operating 10K+ device fleets across hyperscale or cloud environments.
- Background contributing to post-incident learning programs or operational excellence initiatives org-wide.

Benefits:
- Competitive compensation and equity packages
- Restricted Stock Units
- Paid time off, paid holidays & leave of absence programs
- Comprehensive health, dental & vision insurance
- Employer contributions to HSA account
- Paid parental leave
- Paid life insurance, short-term and long-term disability
- Professional development & tuition reimbursement
- Mental health & wellness support
- Commuter benefits (parking & transit)
- Cell phone stipend
- 401(k) Retirement plan with company match up to 4% of salary
- Volunteer time off
- Global travel insurance & emergency assistance
- Daily meals allowance
- Additional perks & programs specific to location

Compensation Range:
Compensation will be paid in the range of $195,000 - $235,000 + Bonus. Restricted Stock Units are included in all offers. Compensation to be determined by the applicant's knowledge, education, and abilities, as well as internal equity and alignment with market data.

Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.
Language skills
- English
This job posting comes from a TieTalent partner platform. Click "Apply Now" to submit your application directly on their website.