Senior AI Curation Data Scientist
- New York, New York, United States
About
About the role: Reporting to the VP of Data Science, the AI Curation Data Scientist will use traditional computing and custom AI model training to work on mission-critical projects driven by xCures’ product development needs, and will expand xCures’ complex, innovative health data processing, extraction, and analysis capabilities. We’re looking for an individual who values team-building, cooperation, and communication with colleagues to serve the needs of our customers. You will reach out across the organization for guidance on engineering, clinical, PHI/PII, data policy, and business topics as needed. Projects will include significant data processing challenges, such as C-CDA XML parsing and de-identification of structured and unstructured EHR content. Equally important projects will address data curation and custom AI model training. You will author software and AI models and take the lead on data set curation. You will serve as the anchor of Data Science data set quality assurance within an innovative, fast-moving team. Team responsibilities are key requirements for this position, which will deliver large and complex data products and data analysis tools.
This position is fully remote, but you will coordinate very closely with a small team, so excellent communication and coordination skills are required. Occasional travel is required.
This job is right for you if you like:
A high-energy start-up working with a brilliant and passionate team
Working on problems that make a real difference in people’s lives
Understanding and delivering reliable, well-characterized products within a highly innovative and fast-changing environment: clinical data extraction and aggregation, the relationship between data processing and the QA framework, LLM tuning and training, pedantic data curation, compute architecture, and data exchange
Rockstar teammates: you will be working with a strong team with decades of prior work experience in artificial intelligence, software systems, molecular biology, and clinical medicine
Innovation and problem solving to provide order-of-magnitude improvements in capabilities for data handling and analysis while maintaining traceable data and methods development
Responsibilities:
Developing and testing data extraction and integration software for structured EHR content (XML, FHIR) and unstructured text content (attached documents)
Planning, explaining, and implementing projects to curate data sets used in model training
Maintaining understanding of current and new generative AI and transformer technologies
Tuning and training LLMs
Maintaining a strong understanding of PHI/PII and de-identification policies and strategies at xCures and implementing software solutions compliant with policies and strategies
Developing and implementing tests of data extraction and aggregation performance to improve efficiency, timeliness, and cost-effectiveness
Flexibly taking on technical leadership or participation roles per project
Implementing and maintaining code repositories
Working closely with manager to explore methods, test hypotheses, and collaboratively implement innovative solutions for data science
Coordinating as required for a fully remote role
Working with Engineering and other groups to improve overall company efficiency and effectiveness
Required Skills and Qualifications:
Ph.D. or equivalent experience in Computer Science, Software Engineering, Statistics, Biology, or a related field
Minimum of 10 years of hands-on experience in data science, machine learning, AI, data analysis, software development, and/or predictive analytics
Expertise in generative AI and transformer models, especially training of LLMs
Significant prior experience with curating data sets to train LLMs
Significant hands‑on coding experience with LLMs, embedding models, and sentence_transformers, and with authoring Python code to build data extraction and/or classification tools
Significant prior work experience with parsing XML, JSON, and/or other complex data formats, preferably C-CDA health data
Experience with TensorFlow, PyTorch, and/or scikit-learn
Software development skills including git
Proven efficiency using, and cautious approach to using, LLM-assisted coding
Experience writing unit and integration tests for scientific/clinical data software as well as with developing scientifically motivated data quality assessments
Experience as a senior technology development leader, contributing while coordinating the work of colleagues
Flexible, innovative, can‑do approach to delivering software and data products balanced with team cooperation
A passion for successful delivery of team work products
Must reside in the United States.
Must have authorization to work in the United States.
Preferred Skills and Qualifications:
Extensive experience with efficient data-handling tools such as jq and xq, Unix command-line tools such as sed, and Bash programming
Deep understanding of regex
Extensive AWS experience and understanding of tradeoffs for different types of data storage for AI training
Significant prior experience with PHI and PII, HIPAA, and de-identification is a major plus
Software development experience in multiple coding languages
Confidence extending the capabilities of open source tools
Experience with multiple approaches to LLM-assisted coding, such as within Visual Studio, Copilot, and Claude Code, and familiarity with frontier and open model capabilities
Experience with remote teams and solving technical project communications challenges
Notes: This is a big list. Don’t worry if you do not meet every qualification or wishlist item. If you are passionate, ambitious, adept, and mission‑aligned, then we want to hear from you — even if you don’t check every box listed here. True talent shines through and transcends a list of bullet points. To apply, please send your cover letter and resume to ds-jobs@xcures.com
Salary range: 150K to 200K annually
401k
xCures acknowledges that equal opportunity for all persons is a fundamental human value. Each employee and applicant will be considered on the basis of individual ability and merit, without regard to race, color, religion, age, sex, sexual orientation, gender identity, gender expression, pregnancy, national origin, marital status, physical disability, mental disability, medical condition, genetic information, protected military or veteran status, or any other characteristics.
Languages
- English