This job posting is no longer available

Senior DevOps Engineer

iSpace, Inc.
  • US
    Westlake Village, California, United States

About

JOB DETAILS:

Sr DevOps Engineer - AI platform

Location – Westlake Village, CA (Onsite Work)

Contract Duration – Direct hire / Full-time employment

Base Salary/Compensation Range: $120K - $150K plus benefits.

Job Description:

Responsibilities:

The Sr DevOps Engineer - AI platform will:

  • Design, implement, and manage scalable and resilient infrastructure on AWS.
  • Architect and maintain Windows/Linux based environments, ensuring seamless integration with cloud platforms.
  • Develop and maintain infrastructure-as-code (IaC) using both AWS CloudFormation/CDK and Terraform/OpenTofu.
  • Develop and maintain configuration management for Windows and Linux servers using Chef.
  • Design, build, and optimize CI/CD pipelines using GitLab CI/CD for .NET applications.
  • Integrate and support AI services, including orchestration with AWS Bedrock, Google Agentspace, and other generative AI frameworks, ensuring they can be securely and efficiently consumed by platform services.
  • Enable AI/ML workflows by building and optimizing infrastructure pipelines that support large-scale model training, inference, and deployment across AWS and GCP environments.
  • Automate model lifecycle management (training, deployment, monitoring) through CI/CD pipelines, ensuring reproducibility and seamless integration with development workflows.
  • Collaborate with AI engineering teams to deliver scalable environments, standardized APIs, and infrastructure that accelerate AI adoption at the platform level.
  • Implement observability, security, data privacy and cost-optimization strategies specifically for AI workloads, including monitoring and resource scaling for inference services.
  • Implement and enforce security best practices across the infrastructure and deployment processes.
  • Collaborate closely with development teams to understand their needs and provide DevOps expertise.
  • Troubleshoot and resolve infrastructure and application deployment issues.
  • Implement and manage monitoring and logging solutions to ensure system visibility and proactive issue detection.
  • Contribute to the development and clear, concise documentation of DevOps standards and best practices.
  • Stay up-to-date with the latest industry trends and technologies in cloud computing, DevOps, and security.
  • Provide mentorship and guidance to junior team members.

Qualifications:

  • Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience).
  • 5+ years of experience in a DevOps or Site Reliability Engineering (SRE) role.
  • 1+ years of experience with AI services and LLMs.
  • Extensive hands-on experience with Amazon Web Services (AWS).
  • Solid understanding of Windows/Linux Server administration and integration with cloud environments.
  • Proven experience with infrastructure-as-code tools, specifically AWS CDK and Terraform.
  • Strong experience designing and implementing CI/CD pipelines using GitLab CI/CD.
  • Experience deploying and managing .NET applications in cloud environments.
  • Deep understanding of security best practices and their implementation in cloud infrastructure and CI/CD pipelines.
  • Solid understanding of networking principles (TCP/IP, DNS, load balancing, firewalls) in cloud environments.
  • Experience with monitoring and logging tools (e.g., New Relic, CloudWatch).
  • Strong scripting skills (e.g., PowerShell, Python, Ruby, Bash).
  • Excellent problem-solving and troubleshooting skills.
  • Strong communication and collaboration skills.
  • Experience with containerization technologies (e.g., Docker, Kubernetes) is a plus.
  • Relevant AWS and/or GCP certifications are a plus.
  • Experience with the configuration management tool Chef.

Preferred Qualifications:

  • Strong understanding of PowerShell and Python scripting.
  • Strong background with AWS EC2 features and services (Auto Scaling and warm pools).
  • Understanding of the Windows Server build process using tools like Chocolatey for packages and Packer for AMI/image generation.

Language skills

  • English