Senior Applied Scientist - Machine Learning Systems Engineer- PhotoshopAdobe • Seattle, Washington, United States
Cette offre d'emploi n'est plus disponible
Senior Applied Scientist - Machine Learning Systems Engineer- Photoshop
Adobe
- Seattle, Washington, United States
- Seattle, Washington, United States
À propos
Inference & Serving Optimization: Design and optimize high‑throughput, low‑latency inference systems. Optimize model architectures to improve deployment and runtime efficiency using techniques such as distillation, pruning, quantization, and Mixture‑of‑Experts (MoE). Implement advanced serving strategies including batching, caching, quantization (FP8/INT8), and distributed inference strategies covering data, tensor, pipeline, expert, and hybrid parallelism, while balancing computation and communication efficiency. Kernel Development & System Acceleration: Write and maintain high‑performance GPU kernels using Triton or CUDA to accelerate custom model layers and critical workloads. Improve GPU utilization through kernel fusion, asynchronous pipelines, and optimized scheduling strategies. Performance Profiling & System Optimization: Conduct deep performance analysis using tools such as PyTorch Profiler and NVIDIA Nsight to identify bottlenecks in compute, memory, and communication, and optimize end‑to‑end system performance across inference workloads. Distributed Systems & Infrastructure Collaboration: Partner with infrastructure teams to design scalable and reliable distributed serving systems across heterogeneous hardware environments (e.g., A100, H100, B200, CPU). Contribute to resource scheduling, GPU pooling, and elastic workload management. Cost‑Aware ML Engineering: Establish and track efficiency metrics such as cost per million inferences. Build benchmarking frameworks and dashboards to guide trade‑offs among quality, latency, and compute cost, enabling data‑driven system and product decisions. Technical Leadership & Best Practices: Serve as a trusted technical advisor to research and product teams on efficiency trade‑offs. Define best practices for scalable and cost‑efficient ML development and mentor other engineers on performance‑oriented systems design.
What You'll Need to Succeed
Education: Master’s or PhD in Computer Science, Electrical Engineering, or a related field, with a focus on machine learning systems, distributed systems, or high‑performance computing. Distributed Inference & Serving Expertise: Hands‑on experience implementing and scaling large‑scale inference or serving workloads using distributed frameworks and runtime systems (e.g., Triton, vLLM, SGLang, xDiT, or similar). Experience applying inference compilation and optimization tools (e.g., TensorRT, ONNX Runtime, AOTI), including techniques such as operator fusion and graph‑level optimization, with a strong understanding of system‑level performance trade‑offs. GPU & Performance Engineering Skills: Strong understanding of GPU architecture (e.g., memory hierarchy, compute throughput, communication bandwidth) and practical experience diagnosing performance bottlenecks across compute, memory, and I/O subsystems. Programming & Systems Development: Proficiency in Python and C++, with experience building high‑performance or distributed systems. Familiarity with CUDA or Triton for performance‑critical workloads is highly desirable. Data‑Driven Engineering Mindset: Demonstrated ability to make engineering decisions based on rigorous measurement and benchmarking, focusing on improving system efficiency, scalability, and reliability in production environments.
Preferred Experience
ML Frameworks & Tooling: Experience contributing to or maintaining performance‑ or efficiency‑focused libraries or systems. Hands‑on experience with open‑source serving frameworks (e.g., vLLM, SGLang, xDiT or similar), inference compilation tools (e.g., TensorRT, Triton, AOTI, or equivalent, operation fusion, or graph‑level optimization), and GPU profiling and performance analysis tools (e.g., PyTorch Profiler, NVIDIA Nsight, CUDA tooling). Distributed Systems & Communication: Exposure to low‑level communication libraries such as NCCL and a practical understanding of collective operations (e.g., AllReduce, AllGather) in large‑scale distributed serving environments. Containerization & Cluster Operations: Familiarity with containerized workflows (Docker, Kubernetes) and job scheduling in headless Linux environments, including experience operating production ML workloads on shared GPU clusters. Model Architectures: Working knowledge of model architectures such as Transformers, multimodal models, Mixture‑of‑Experts (MoE), or Diffusion Transformers (DiT).
Pay Range Our compensation reflects the cost of labor across several U.S. geographic markets, and we pay differently based on the defined markets. The U.S. pay range for this position is $164,000 – $313,300 annually. Pay within this range varies by work location and may also depend on job‑related knowledge, skills, and experience. Your recruiter can share more about the specific salary range for the job location during the hiring process. In California, the pay range is $216,400 – $313,300. In Washington, the pay range is $204,800 – $296,600. Equal Employment Opportunity Statement Adobe is proud to be an Equal Employment Opportunity employer. We do not discriminate based on gender, race or color, ethnicity or national origin, age, disability, religion, sexual orientation, gender identity or expression, veteran status, or any other protected characteristic. State‑Specific Notices California: Adobe will consider qualified applicants with arrest or conviction records for employment in accordance with state and local laws and fair chance ordinances. Colorado: If this role is open to hiring in Colorado, the application window will remain open at least until the stated date and time in Pacific Time, in compliance with Colorado pay transparency regulations. If not listed as a hiring location, no specific application window applies, and the posting may close at any time based on hiring needs. Massachusetts: It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability. #J-18808-Ljbffr
Compétences linguistiques
- English
Avis aux utilisateurs
Cette offre a été publiée par l’un de nos partenaires. Vous pouvez consulter l’offre originale ici.