Dieses Stellenangebot ist nicht mehr verfügbar
Über
Location:
Hybrid NYC Role Type:
Full-Time Compensation:
$200k - $300k base + bonus
About the Role
We are looking for an NLP / LLM Engineer to design, build, and deploy language-driven AI systems into production. You will work on large language models, retrieval-augmented generation (RAG), and NLP pipelines that power real-world products. This role blends applied research with hands-on engineering and production ownership.
Responsibilities Design, fine-tune, and evaluate LLMs for NLP use cases such as search, summarising, classification, and conversational AI. Build and optimize RAG pipelines using vector databases, embeddings, and retrieval strategies. Develop prompt engineering, evaluation, and guardrail frameworks for reliability and safety. Deploy NLP/LLM models into production environments and support ongoing optimization. Collaborate with product, data, and platform teams to translate business needs into ML solutions. Monitor model performance, drift, and quality using robust evaluation metrics. Qualifications
Strong experience in NLP and machine learning, with hands-on work on LLMs. Proficiency in Python and modern ML frameworks (PyTorch, TensorFlow, or similar). Experience with embeddings, vector search, and retrieval systems. Familiarity with LLM fine-tuning techniques (LoRA, adapters, quantization). Experience deploying ML models into production environments. Solid understanding of NLP fundamentals (tokenization, transformers, evaluation metrics). Experience training or fine-tuning proprietary models. Experience building agentic or multi-step LLM applications. Experience with LangChain, LangGraph, or similar orchestration frameworks. Experience with MLOps, CI/CD, and cloud platforms (AWS, GCP, Azure).
Sprachkenntnisse
- English
Hinweis für Nutzer
Dieses Stellenangebot wurde von einem unserer Partner veröffentlicht. Sie können das Originalangebot einsehen hier.