Principal Software Engineer - Machine Learning

Diverse Lynx

United States

United States

À propos

Position: Principal Software Engineer - Machine Learning Location: Dallas, TX/ Atlanta /GA/ Plano, TX onsite day1 (Any of these three locations can be given) Fulltime only
AI, data science, Machine Learning, Computer Vision and Cloud computing This is a senior level role where this person is responsible for the development of high performance, distributed modeling tasks using Machine Learning and Data Science 1. Programming Languages: SQL(MySQL/ PostgreSQL) , Java, Scala, Python, No-Sql technologies (Cassandra/MongoDB, Redis) 2. DevOps: Docker, Kubernetes, Jenkins, CI/CD, Git, Jira, Azure DevOps 3. Data exploration, analysis, summarization, visualization using necessary tools like Tableau, excel, etc. 4. Experience with tools like Snowflake, Talend, and Informatica for extracting data from various sources 5. Expertise in Extract, Transform, Load (ETL) processes using tools like Apache NiFi, Talend, and Informatica 6. Knowledge of building and managing data pipelines with tools like Apache Kafka, Apache Flume, and Apache Storm, Apache Flink, BI Analytics and Databricks. 7. Experience with REST services, MQ/Rabbit, Redis/Hazelcast 8. Proficiency in Python, Java, or Scala 9. Understanding of data warehousing concepts and platforms like Snowflake 10. Knowledge of Telecom Domain
A Machine Learning (ML) Engineer plays a crucial role in designing, implementing, and maintaining machine learning models and systems. They bridge the gap between data science and software engineering, ensuring that ML models are scalable, efficient, and integrated into production environments.
Key Roles and Responsibilities 1. Model Development and Training • Algorithm Selection: Select and implement appropriate machine learning algorithms and models based on the problem and data characteristics. • Feature Engineering: Develop and transform features from raw data to improve model performance. This includes data preprocessing, normalization, and feature selection. • Model Training: Train machine learning models using historical data, optimizing model parameters to achieve the best performance. 2. Model Evaluation and Tuning • Model Evaluation: Evaluate model performance using metrics such as accuracy, precision, recall, F1 score, AUC-ROC, and others. Compare different models and select the best-performing one. • Hyperparameter Tuning: Optimize model hyperparameters using techniques such as grid search, random search, or Bayesian optimization to improve model performance and generalizability. • Cross-Validation: Implement cross-validation techniques to ensure the robustness and reliability of the model. 3. Model Deployment and Integration • Model Deployment: Deploy machine learning models into production environments, ensuring they are scalable, efficient, and reliable. • API Development: Develop APIs to expose machine learning models as services that can be consumed by other applications or systems. • Integration: Integrate machine learning models with existing systems, applications, or workflows. Collaborate with software engineers and IT teams to ensure seamless deployment and integration. 4. Monitoring and Maintenance • Model Monitoring: Monitor the performance of deployed models in real-time, tracking metrics such as latency, throughput, and prediction accuracy. • Model Maintenance: Update and retrain models as new data becomes available to ensure they remain accurate and relevant. Address issues such as model drift and data drift. • Error Analysis: Analyze model errors and misclassifications to identify areas for improvement and refine the model. 5. Infrastructure and Tooling • Infrastructure Management: Set up and manage the infrastructure required for training and deploying machine learning models, including cloud platforms, GPUs, and distributed computing resources. • Automation: Automate repetitive tasks such as data preprocessing, model training, and deployment using scripting languages (e.g., Python) and workflow orchestration tools (e.g., Apache Airflow). • Tooling: Utilize and maintain ML frameworks and libraries such as TensorFlow, PyTorch, scikit-learn, and others to streamline the development and deployment process. 6. Collaboration and Communication • Cross-Functional Collaboration: Work closely with data scientists, software engineers, product managers, and other stakeholders to understand their requirements and ensure alignment on project goals. • Documentation: Create and maintain comprehensive documentation for machine learning models, pipelines, and processes. Ensure documentation is accessible and up-to-date. • Stakeholder Communication: Communicate progress, issues, and solutions effectively with stakeholders. Provide regular updates on machine learning activities and projects. 1. Programming Languages: Python, SQL, Java/Scala, shell-scripting 2. Cloud Technologies: Azure ML, Databricks, Snowflake and Palantir Foundry 3. DevOps: Docker, Azure Kubernetes Service (AKS), Jenkins, CI/CD, Git 4. ML Frameworks: Numpy, Pandas, Scikit-learn, OpenCV, TensorFlow, PyTorch, Hugging Face's Transformers, Spacy & NLTK 5.
Diverse Lynx LLC is an Equal Employment Opportunity employer. All qualified applicants will receive due consideration for employment without any discrimination. All applicants will be evaluated solely on the basis of their ability, competence and their proven capability to perform the functions outlined in the corresponding role. We promote and support a diverse workforce across all levels in the company.

United States

Compétences linguistiques

English

Avis aux utilisateurs

Cette offre a été publiée par l’un de nos partenaires. Vous pouvez consulter l’offre originale ici.

Trouver des emplois similaires