Staff Software Engineer Search Platform, Ingestion & Indexing
- Thomson Reuters
- United States
About
Advanced Content Engineering (ACE) is seeking a Staff Software Engineer to serve as the technical anchor for the search platform's ingestion and indexing systems. The platform processes millions of documents across TR's legal, tax, and professional content corpora: parsing, chunking, enriching, embedding, and indexing them into a hybrid search engine that powers both human-facing search interfaces and autonomous AI agents. Getting this pipeline right, at scale, with zero-downtime operations and increasingly agentic retrieval patterns, is one of the platform's most consequential engineering challenges.

This role owns the design, implementation, and operational health of the document ingestion pipeline and search index management systems, from the Kafka-based streaming infrastructure that moves documents through processing stages to the Vespa application architecture that stores and serves them. Staff Engineers on this team define, build, test, deploy, scale, and operate what they ship; full-stack ownership is not a principle we aspire to, it is the daily reality. AI-assisted development is the team norm, not the exception, and constant delivery to production is the expectation.

This is a role for someone who sets architectural boundaries, not just executes within them.

About the Role

In this position, you will focus on:

Ingestion Pipeline Architecture & Engineering
- Plan, design, develop, and own the end-to-end document ingestion pipeline: a Kafka-based stream processing architecture that moves documents through parsing, chunking, enrichment (entity extraction, embedding generation, metadata enrichment), and indexing stages, including all fault tolerance, version ordering, and at-least-once delivery guarantees
- Architect and implement pluggable, configurable pipeline components (parsers, chunkers, enrichers, indexers) that client teams can assemble into custom topologies via the platform's self-service APIs, while maintaining reliable, observable, and performant execution
- Own the platform's Protobuf-based document schema and schema registry integration: establishing schema governance standards, enforcing backward-compatible evolution, and ensuring reliable serialization across all pipeline stages
- Design and implement dual-flow ingestion: a high-throughput batch path for full reindexing and a low-latency incremental path for real-time document updates, with strong guarantees around document version ordering and idempotent processing
- Lead the migration of ingestion infrastructure from OpenSearch to Vespa, including design of Vespa document processors, custom Kafka feeders, and application package architecture, resolving complex technical challenges that have little or no precedent within the team

Custom Model Operationalization
- Own the end-to-end lifecycle for custom models integrated into the ingestion pipeline (re-ranking models, embedding models, and enrichment components), including inference serving behind a stable API surface, latency SLO management, hardware and runtime configuration (batching, quantization), and scaling
- Build and operate the model promotion pipeline: the CI/CD workflow that moves a model artifact from the fine-tuning team through staging to production, including versioning, canary rollouts, and rollback mechanisms, ensuring the platform team can operate model updates independently without depending on the research team for production changes
- Define and maintain integration contracts between custom models and downstream pipeline components, governing input/output schemas, compatibility requirements, and the governance process for model updates that ensures search pipeline consumers are not broken by changes upstream
- Instrument model serving for production observability: latency distributions, throughput, error rates, and quality signals such as re-ranking score distributions, enabling the team to detect regressions or model drift without requiring the fine-tuning team's involvement

Search Engine & Index Management
- Own the search engine layer end-to-end: design and operate Vespa (and OpenSearch during transition) index configurations, ranking profiles, schema definitions, and application package lifecycle management, applying architectural principles that scale to the platform's long-term content and tenancy goals
- Build and operate zero-downtime index management: shadow indexing, blue/green index promotion, and rolling reindex workflows that keep the platform available during major infrastructure changes
- Implement and maintain the Component Registry and Index Registry (the platform's catalog of reusable processing components and active index configurations), with a focus on correctness, observability, and safe concurrent modification
- Develop the full-reindex and incremental-update orchestration logic, including change detection, document tracking, Kafka topic management, and DynamoDB-backed state management

Agentic Search Infrastructure
- Design ingestion and indexing infrastructure with agentic retrieval patterns as a first-class concern, including explicit latency budgets per retrieval hop, chunking and result compression strategies optimized for token economy in context windows, and index boundary definitions that give agents clean, predictable tool contracts
- Build trace-level observability into the retrieval stack that captures which tools were called, in what order, and with what inputs, enabling reliable diagnosis and reproduction of failures in non-deterministic agentic retrieval paths
- Design session state and cache invalidation patterns for multi-turn agentic search: reasoning carefully about cache validity windows, session state scope (per-user, per-session, per-query), and mechanisms to prevent stale context from corrupting downstream agent responses

Evaluation & Search Quality
- Build and own the integration between the ingestion pipeline and the platform's offline evaluation framework, ensuring that experiment runs produce query/result outputs that feed seamlessly into the search grading tool, supporting gold test set maintenance, LLM-as-judge evaluation, and side-by-side ranking comparison across pipeline versions
- Instrument the query and retrieval stack for online analytics: real-time query latency and throughput monitoring, query log collection for session analysis, and the infrastructure to support A/B and interleaved ranking experiments in production, generating the signals that connect low-level search metrics to downstream product KPIs
- Partner with TR Labs and research scientists to ensure that new search components can be evaluated in isolation, with automated offline evaluation on every build and a clear path from evaluation results to production promotion decisions

Reliability & Operational Ownership
- Take full operational responsibility for ingestion and indexing infrastructure: define SLOs, set measurable goals and meet them, build and maintain CloudWatch dashboards and alarms, and participate in on-call rotations. You built it, you own it, you run it
- Treat delivery friction as the enemy: identify and remove obstacles that slow the team's ability to ship ingestion and indexing changes to production safely and frequently, improving CI/CD pipelines, deployment automation, and local development workflows as a standing priority
- Instrument pipeline components with distributed tracing, structured logging, and rich metrics, establishing documentation standards and knowledge management practices so that the team and platform consumers can understand system behavior at all times
- Design and implement resilient fault tolerance mechanisms (dead-letter queues, retry strategies with exponential backoff, circuit breakers, consumer lag monitoring) that make the pipeline robust to downstream failures and transient errors
- Drive system-level performance architecture: profiling ingestion throughput and indexing latency, identifying bottlenecks, and implementing optimizations that meet platform SLOs under peak load

Technical Leadership
- Serve as the team's deepest technical authority on document processing pipelines and search engine internals: guiding architectural decisions, resolving technical ambiguity, and establishing cross-system design patterns that raise the quality bar across the team
- Lead significant projects and initiatives that span multiple engineers and interact with other teams; determine work priorities based on strategic direction; recommend modifications to team operations and make needed adjustments to short-term priorities while maintaining strategic focus
- Mentor and develop Senior and mid-level engineers, providing coaching, technical direction, and educational opportunities in modern distributed systems, stream processing, search infrastructure, and AI-assisted development practices
- Collaborate closely with TR Labs and research scientists to integrate new chunking strategies, embedding models, and enrichment techniques into the pipeline in a safe, well-instrumented, and ethically responsible way
- Deliver effective presentations on complex technical concepts to both technical and non-technical stakeholders; develop strategic plans for technology implementation that align with business objectives

About You

You're an ideal fit if you have:

Required Experience
- Bachelor's or Master's degree in Computer Science, Engineering, or a related field
- 8+ years of software engineering experience, with demonstrated progression to staff-level or equivalent technical leadership, including ownership of a functional area and leadership of significant cross-functional projects
- Deep expertise in distributed stream processing: designing, building, and operating high-throughput, fault-tolerant event-driven pipelines using Kafka or equivalent technologies at production scale
- Production experience with Vespa, OpenSearch, or Elasticsearch, including schema design, ranking profile configuration, and end-to-end
Language skills
- English
This job posting comes from a TieTalent partner platform. Click "Apply Now" to submit your application directly on their website.