AI Curation Data Scientist
- New York, New York, United States
About
About the role: Reporting to the VP of Data Science, the AI Curation Data Scientist will use traditional computing and custom AI model training to work on mission-critical projects driven by xCures’ product development needs, and will expand xCures’ complex, innovative health data processing, extraction, and analysis capabilities. We’re looking for an individual who values team-building, cooperation, and communication with colleagues to serve the needs of our customers. Projects will include significant data processing challenges, such as C-CDA XML parsing and de-identification of structured and unstructured EHR content. Equally important projects will address data curation and custom AI model training. You will author software and AI models, contribute to data set curation, and coordinate data set quality assurance within an innovative, fast-moving team. Team responsibilities are key requirements for this position, which will deliver large and complex data products and data analysis tools.
This position is fully remote, but will coordinate very closely with a small team and thus requires excellent communication and coordination skills. Occasional travel is required.
This job is right for you if you like:
A high-energy start-up working with a brilliant and passionate team
Working on problems that make a real difference in people’s lives
Delivering reliable, well-characterized products within a highly innovative and fast-changing environment, spanning clinical data extraction and aggregation, the relationship between data processing and the QA framework, LLM tuning and training, pedantic data curation, compute architecture, and data exchange
Rockstar teammates: you will be working with a strong team with decades of prior work experience in artificial intelligence, software systems, molecular biology, and clinical medicine
Innovation and problem solving to provide order-of-magnitude improvements in capabilities for data handling and analysis while maintaining traceable data and methods development
Responsibilities:
Developing and testing data extraction and integration software for structured EHR content (XML, FHIR) and unstructured text content (attached documents)
Organizing and contributing to data set curation for model training
Tuning and training LLMs
Maintaining a strong understanding of PHI/PII and de-identification policies and strategies at xCures and implementing software solutions compliant with policies and strategies
Developing and implementing tests of data extraction and aggregation performance to improve efficiency, timeliness, and cost-effectiveness
Implementing and maintaining code repositories
Working closely with your manager to explore methods, test hypotheses, and collaboratively implement innovative solutions for data science
Coordinating as required for a fully remote role
Working with Engineering and other groups to improve overall company efficiency and effectiveness
Required Skills and Qualifications:
Master's degree or equivalent experience in Computer Science, Software Engineering, Statistics, Biology, or a related field
Minimum of 5 years of hands‑on experience in data science, machine learning, AI, data analysis, software development, and/or predictive analytics
Experience applying generative AI and transformer models, especially training of LLMs
Significant experience with curating data sets to train LLMs
Significant hands‑on coding experience with LLMs, embedding models, sentence_transformers, and authoring Python code to build data extraction and/or classification tools
Significant prior work experience with parsing XML, JSON, and/or other complex data formats, preferably C-CDA health data
Experience with TensorFlow, PyTorch, and/or scikit‑learn
Software development skills including git
Proven efficiency with, and a cautious approach to, LLM-assisted coding
Experience writing unit and integration tests for scientific/clinical data software as well as with developing scientifically motivated data quality assessments
Flexible, innovative, can‑do approach to delivering software and data products balanced with team cooperation
A passion for successful delivery of team work products
Must reside in the United States.
Must have authorization to work in the United States.
Preferred Skills and Qualifications:
Extensive experience with data handling efficiency tools, such as jq, xq, Unix command-line tools (e.g., sed), and Bash scripting
Deep understanding of regex
Extensive AWS experience and understanding of tradeoffs for different types of data storage for AI training
Significant experience with PHI and PII, HIPAA, and de-identification is a major plus
Software development experience in multiple coding languages
Confidence extending the capabilities of open source tools
Experience with multiple approaches to LLM-assisted coding, such as within Visual Studio, Copilot, or Claude Code, and familiarity with frontier and open model capabilities
Experience with remote teams and solving technical project communications challenges
Notes: This is a big list. Don’t worry if you do not meet every qualification or wishlist item. If you are passionate, ambitious, adept, and mission-aligned, then we want to hear from you, even if you don’t check every box listed here. True talent shines through and transcends a list of bullet points. To apply, please send your cover letter and resume to ds-jobs@xcures.com
Salary range: 100K to 165K annually
401k
xCures acknowledges that equal opportunity for all persons is a fundamental human value. Each employee and applicant will be considered on the basis of individual ability and merit, without regard to race, color, religion, age, sex, sexual orientation, gender identity, gender expression, pregnancy, national origin, marital status, physical disability, mental disability, medical condition, genetic information, protected military or veteran status, or any other characteristics.
Language skills
- English