DevOps Machine Learning Infrastructure
- Redwood City, California, United States
- Redwood City, California, United States
À propos
Summary Description:
Syntiant Corp., a leader in the high-growth AI software and semiconductor solutions space, is looking for an experienced and talented DevOps Machine Learning Infrastructure Engineer to take on a critical role with expansive responsibilities to enhance the Software Engineering function in a growing organization.
This individual will play a crucial role in maintaining and enhancing our machine learning applications, specifically those deployed on AWS Elastic Kubernetes Service (EKS) and on-premises machine learning cluster. The ideal candidate will have experience with cloud infrastructure, containerization, and DevOps practices, as well as the ability to work on both on-premises and cloud environments.
Specific Duties and Responsibilities:
- Develop, maintain, and optimize RESTful APIs using Python and FastAPI.
- Deploy & manage applications on AWS EKS, ensuring high availability, scalability, and performance.
- Integrate with various data sources & services (e.g., MongoDB, Postgres) for efficient data storage and retrieval.
- Implement CI/CD pipelines to automate testing and deployment processes.
- Collaborate across teams to ensure seamless integration of cloud-native tools and practices.
- Maintain on-premises servers running Ceph, PostgreSQL, and NFS services.
- Troubleshoot & resolve issues across different environments (development, staging, production).
- Participate in code reviews, pair programming, and knowledge sharing sessions.
Requirements
Qualifications, Education, and Experience Required:
- Bachelor's Degree in Computer Science or equivalent work experience.
- Minimum of 5 years of relevant experience.
- Proficiency in Python, with experience using FastAPI for API development.
- Hands-on experience with AWS EKS, including deploying, scaling, and managing Kubernetes clusters.
- Knowledge of containerization technologies (Docker, Kubernetes).
- Experience with cloud infrastructure services such as S3, RDS, and VPCs.
- Familiarity with CI/CD tools like GitLab CI/CD or Jenkins.
- Strong understanding of database systems, including MongoDB, PostgreSQL, and NFS.
- Experience with on-premises storage solutions like Ceph.
- Expertise in Linux system administration.
- Ability to write clean, maintainable code and perform thorough testing.
- Excellent problem-solving skills and a strong attention to deta.
- Experience with cloud-native observability tools (Prometheus, Grafana) is a plus.
- Familiarity with Kubernetes operators and Helm charts is a plus.
- Knowledge of container orchestration on-premises using K8s or similar technologies is a plus.
- Experience with DevOps best practices and infrastructure as code (Terraform, Ansible) is a plus.
Benefits
Benefits Summary:
- Medical: Several plan options including PPO and HSA-compatible plans from Anthem Blue Cross, most of which are 100% paid by Syntiant Corp. for you and your family.
- Dental: Company-paid dental PPO coverage from MetLife, including coverage for Orthodontia.
- Vision: Company-paid vision PPO coverage from MetLife / VSP.
- Life Insurance / AD&D: Company-paid basic Life / AD&D coverage in the amount of 3x your salary (up to $1,000,000). Additional supplemental life insurance with low group rates is available for yourself and your family.
- Disability Coverage: Company-paid Short Term and Long-Term Disability coverage provides up to 60% income replacement protection.
- Spending and Savings Accounts: Flexible Spending
Compétences linguistiques
- English
Cette offre provient d’une plateforme partenaire de TieTalent. Cliquez sur « Postuler maintenant » pour soumettre votre candidature directement sur leur site.