Offres d'emploi
Trouvez des postes près de chez vous, sur site, hybrides ou à distance.- Emplois similaires à : Cloud Platform Engineer
GPU Cloud Platform Engineer
Yotta LabsNew YorkLocation: Remote (Global)Type: Full-timeCompany: Yotta LabsApply: careers@yottalabs.aiAbout Yotta Labs Yotta Labs is pioneering the development of a Decentralized Operating System (DeOS) for AI worklo

Cloud Platform Engineer (m/w/d)
Wüstenrot GruppeAustriaUnterstützung bei Design und Administration unserer Cutting-Edge Systeme: Cloud Platform Engineer (m/w/d) Dienstort: Salzburg, Wien oder deine Wohnung Die Wüstenrot Technology GmbH gehört im Bereich d
GCP Cloud Platform Senior Engineer
Ontrac SolutionsNew YorkAbout Ontrac Solutions Ontrac Solutions is a leading technology consulting firm, specializing in cutting‑edge solutions that drive business transformation. We partner with organizations to modernize t
Cloud Platform Engineer (OpenStack, Python) - STACKIT (gn)
Schwarz DigitsHeilbronnErfahrener Entwickler/in mit guten Python-KenntnissenDu fühlst dich auf einem Linux-System heimischSicher im Troubleshooting in verteilten SystemenGutes Know-How in der Entwicklung und Bereitstellung

Senior Cloud & Data Platform Engineer (m/w/d)
green flexibility development GmbHMünstergreen flexibility treibt mit innovativen Batteriespeicherlösungen aktiv die Energiewende voran. Unser erfahrenes Team plant, realisiert und betreibt Großbatteriespeicher in ganz Europa. Von der Identi
IT DevOps / Cloud Platform Engineer (gn) - Kubernetes & GitOps
Schwarz DigitsWeinsbergContainer-Expertise: Du hast fundierte Erfahrung mit Docker-Images und beherrschst das Deployment innerhalb von Kubernetes (u. a. mittels Helm).Automatisierungs-Profi: Du bist sicher im Umgang mit mod
Senior Cloud Platform Engineer | Azure, Kubernetes, Terraform
Hispanic Alliance for Career EnhancementNew YorkThe Hispanic Alliance for Career Enhancement is looking for a seasoned Cloud Platform Engineer to lead the maintenance and evolution of our cloud infrastructure. Candidates should have 7 years of rele
Breezy HR : Senior Cloud Platform Engineer (AWS)
CloudDevsJacksonvilleHeadquarters: Jacksonville, FLURL: https://breezy.hr/Location: Remote (US)Team: Engineering • Platform/InfrastructureAbout Breezy HR Breezy HR is a remote‑first, hiring platform built for small and mi
(Senior) Cloud Native Platform Engineer / Distributed Cloud - STACKIT (m/w/d)
Schwarz DigitsHeilbronnDu verfügst über eine abgeschlossene Ausbildung oder ein Studium mit informationstechnischem Hintergrund (z. B. (Wirtschafts-)Informatik).Du hast mindestens 2 Jahre Erfahrung mit Virtualisierung und m
(Senior) Cloud Native Platform Engineer / Distributed Cloud - STACKIT (m/w/x)
Schwarz DigitsHeilbronnDu verfügst über eine abgeschlossene Ausbildung oder ein Studium mit informationstechnischem Hintergrund (z. B. (Wirtschafts-)Informatik).Du hast mindestens 2 Jahre Erfahrung mit Virtualisierung und m
Cloud Platform Engineer (OpenStack, Python) - STACKIT (m/w/d)
Schwarz DigitsHeilbronnErfahrener Entwickler/in mit guten Python-KenntnissenDu fühlst dich auf einem Linux-System heimischSicher im Troubleshooting in verteilten SystemenGutes Know-How in der Entwicklung und Bereitstellung
Cloud Platform Developer Engineer (AWS CDK/TypeScript) - Remote
Mutual of OmahaRemoteMutual of Omaha is seeking a Cloud Platform Developer Engineer who brings both strong AWS cloud infrastructure expertise and hands-on software development experience. This distinction matters. In this
Senior Data Engineer - Google Cloud Platform Berlin (gn)
Schwarz DigitsBerlinErfahrung: Mehrjährige fundierte Erfahrung im Data Engineering, Big Data Umfeld oder Software Engineering, idealerweise in einer Senior-Rolle.Cloud- & Streaming-Expertise: Du bringst fundierte Kenntni
Cloud Platform Engineer (OpenStack, Python) - STACKIT (m/w/x)
Schwarz DigitsHeilbronnErfahrener Entwickler/in mit guten Python-KenntnissenDu fühlst dich auf einem Linux-System heimischSicher im Troubleshooting in verteilten SystemenGutes Know-How in der Entwicklung und Bereitstellung
Senior Cloud Platform Engineer | Node.js/TS/JS | Remote
n8nStaten IslandThe AI orchestration of your wildest imagination.n8n is the open workflow orchestration platform built for the new era of AI. We give technical teams the freedom of code with the speed of no-code, so
IT DevOps / Cloud Platform Engineer (m/w/d) - Kubernetes & GitOps
Schwarz DigitsWeinsbergContainer-Expertise: Du hast fundierte Erfahrung mit Docker-Images und beherrschst das Deployment innerhalb von Kubernetes (u. a. mittels Helm).Automatisierungs-Profi: Du bist sicher im Umgang mit mod
IT DevOps / Cloud Platform Engineer (m/w/x) - Kubernetes & GitOps
Schwarz DigitsWeinsbergContainer-Expertise: Du hast fundierte Erfahrung mit Docker-Images und beherrschst das Deployment innerhalb von Kubernetes (u. a. mittels Helm).Automatisierungs-Profi: Du bist sicher im Umgang mit mod
Senior Data Engineer - Google Cloud Platform Berlin (m/w/d)
Schwarz DigitsBerlinErfahrung: Mehrjährige fundierte Erfahrung im Data Engineering, Big Data Umfeld oder Software Engineering, idealerweise in einer Senior-Rolle.Cloud- & Streaming-Expertise: Du bringst fundierte Kenntni
(Senior) Software Engineer - Data & AI Platform - STACKIT (gn)
Schwarz DigitsBerlinDu bringst die Leidenschaft und Begeisterung für neue Technologien im Kontext BI und AI mitDabei liegen deine Schwerpunkte in der Softwareentwicklung und Kubernetes und du fühlst dich in Cloud-Umgebun
(Senior) Software Engineer - Data & AI Platform - STACKIT (m/w/d)
Schwarz DigitsBerlinDu bringst die Leidenschaft und Begeisterung für neue Technologien im Kontext BI und AI mitDabei liegen deine Schwerpunkte in der Softwareentwicklung und Kubernetes und du fühlst dich in Cloud-Umgebun
(Senior) Software Engineer - Data & AI Platform - STACKIT (m/w/x)
Schwarz DigitsBerlinDu bringst die Leidenschaft und Begeisterung für neue Technologien im Kontext BI und AI mitDabei liegen deine Schwerpunkte in der Softwareentwicklung und Kubernetes und du fühlst dich in Cloud-Umgebun
Lead Engineer - SRE - Cloud Storage - STACKIT (gn)
Schwarz DigitsBerlinDu hast Lust, etwas Großes zu bewegen und dabei die Lösung mit modernsten Cloud-Technologien maßgeblich mitzugestaltenDu hast ausgeprägte Erfahrung im Marktumfeld mit verschiedenen Storageprodukten (z

Verkäufer als Fachkraft / Quereinsteiger Frischetheke (m/w/d)
REWEBerlinOrt: 10439 Berlin / Prenzlauer Berg, Schönhauser Allee 80 | Vertragsart: Voll-/Teilzeit, 25 bis 38 Wochenstunden, befristet | Job-ID: 959776 Die Frischetheke ist das Herzstück unserer REWE Karsten Sch

Junior Facility Manager (d/m/w)
wirkaufendeinauto.deBerlinFür unsere offene Position als Junior Facility Manager (d/m/w) suchen wir dich! Starte im Real Estate Team und sei zuständig für das Immobilienmanagement unseres WKDA Real Estate & Procurement Depart

Verkäufer Berlin Friedrichstraße (25h) m/w/d
Marc O´PoloBerlinDON'T JUST WORK. WORK WITH US. Arbeite dort, wo das Wir der stärkste Teamplayer ist. Wir sind ein diverses Team aus allen Altersgruppen und bieten unseren Kund:innen als Markenbotschafter:innen ei
GPU Cloud Platform Engineer
- New York, New York, United States
- New York, New York, United States
À propos
Type: Full-time
Company: Yotta Labs
Apply: careers@yottalabs.ai
About Yotta Labs Yotta Labs is pioneering the development of a Decentralized Operating System (DeOS) for AI workload orchestration at a planetary scale. Our mission is to democratize access to AI resources by aggregating geo-distributed GPUs, enabling high-performance computing for AI training and inference on a wide spectrum of hardware—from commodity to high-end GPUs. Our platform supports major large language models (LLMs) and offers customizable solutions for new models, facilitating elastic and efficient AI development.
️ Role Overview We are seeking a
GPU Cloud Platform Engineer
to join our core infrastructure team and help build the next-generation AI compute cloud. In this role, you will design, deploy, and operate large-scale, multi-cluster GPU infrastructure across data centers and cloud environments. You will be responsible for ensuring high availability, performance, and efficiency of containerized AI workloads—ranging from LLMs to generative models—deployed in Kubernetes-based GPU clusters. If you're passionate about high-performance systems, distributed orchestration, and scaling real-world AI infrastructure, this role offers a unique opportunity to shape the backbone of our AI cloud platform.
Responsibilities
Build and operate large-scale, high-performance GPU clusters; ensure stable operation of compute, network, and storage systems; monitor and troubleshoot online issues.
Conduct performance testing and evaluation of multi-node GPU clusters using standard benchmarking tools to identify and resolve performance bottlenecks.
Deploy and orchestrate large models (e.g., LLMs, video generation models) across multi-cluster environments using Kubernetes; implement elastic scaling and cross-cluster load balancing to ensure efficient service response under high concurrency for global users.
Participate in the design, development, and iteration of GPU cluster scheduling and optimization systems. Define and lead Kubernetes multi-cluster configuration standards; Optimize scheduling strategies (e.g., node affinity, taints/tolerations) to improve GPU resource utilization.
Build a unified multi-cluster management and monitoring system to support cross-region resource monitoring, traffic scheduling, and fault failover. Collect key metrics such as GPU memory usage, QPS, and response latency in real time; configure alert mechanisms.
Coordinate with IDC providers for planning and deploying large-scale GPU clusters, networks, and storage infrastructure to support internal cloud platforms and external customer needs.
✅ Qualifications
Bachelor's degree or higher in Computer Science, Software Engineering, Electronic Engineering, or related fields; 3+ years of experience in system engineering or DevOps.
5+ years of experience in cloud-native development or AI engineering, with at least 2 years of hands‑on experience in Kubernetes multi-cluster management and orchestration.
Familiarity with the Kubernetes ecosystem; hands‑on experience with tools such as kubectl, Helm, and expertise in multi‑cluster deployment, upgrade, scaling, and disaster recovery.
Proficient in Docker and containerization technologies; knowledge of image management and cross-cluster distribution.
Experience with monitoring tools such as Prometheus and Grafana; Has practical experience in GPU fault monitoring and alerting.
Hands‑on experience with cloud platforms such as AWS, GCP, or Azure; understanding of cloud-native multi-cluster architecture.
Experience with cluster management tools such as Ray, Slurm, KubeSphere, Rancher, Karmada is a plus.
Familiarity with distributed file systems such as NFS, JuiceFS, CephFS, or Lustre; ability to diagnose and resolve performance bottlenecks.
Understanding of high-performance communication protocols such as IB, RoCE, NVLink, and PCIe.
Strong communication skills, self‑motivation, and team collaboration
Preferred Experience
Experience in developing and operating MaaS platforms or large-scale model inference clusters. Proven track record of leading multi-cluster system development or performance optimization projects.
Proficiency in CUDA programming and the NCCL communication library; understanding of high-performance GPUs like H100.
Ability to develop standardized inference APIs (RESTful/gRPC) and automation tools using Golang or Python.
Hands‑on experience with optimization techniques such as model quantization, static compilation, and multi‑GPU parallelism; capable of profiling inference processes in multi-cluster setups and identifying bottlenecks like memory fragmentation and low compute efficiency.
Active engagement with open-source communities such as Hugging Face and GitHub; deep understanding of the design principles of inference frameworks like Triton, vLLM, and SGLang; ability to perform secondary development and optimization based on open-source projects and quickly translate cutting-edge techniques into production-ready multi-cluster solutions.
Why Join Yotta Labs?
Be part of a visionary team aiming to redefine AI infrastructure.
Work on cutting-edge technologies that bridge AI and decentralized computing.
Collaborate with experts from leading institutions and tech companies.
Enjoy a flexible, remote work environment that values innovation and autonomy.
How to Apply Interested candidates should apply directly or send their resume and a brief cover letter tocareers@yottalabs.ai. Please include links to any relevant projects or contributions.
#J-18808-Ljbffr
Compétences linguistiques
- English
Cette offre provient d’une plateforme partenaire de TieTalent. Cliquez sur « Postuler maintenant » pour soumettre votre candidature directement sur leur site.