Tech Lead Data Scientist, AI Evaluation & Monitoring
Geisinger
Details
Posted: 01-May-26
Location: Danville, Pennsylvania
Type: Full-time
Categories: Operations
Internal Number: R-95508
Job Summary
The Tech Lead Data Scientist, AI Evaluation & Monitoring is the principal technical expert for how Geisinger evaluates, monitors, and optimizes AI systems in production. This is a hands-on technical leadership role. The Tech Lead sets the technical direction for AI evaluation across a large and growing portfolio, provides technical leadership to a team of data analysts who execute evaluation work, and partners directly with AI program teams to raise the quality of how AI is validated, monitored, and improved over time.
The role exists because AI at Geisinger has scaled past the point where oversight can be a document-review exercise. We need a technical leader who can guide program teams toward better-designed evaluations up front, instrument meaningful production monitoring, and continually advance the methods we use, from LLM-as-Judge frameworks to simulation-based testing to pragmatic experiment design that actually scales in healthcare.
Job Duties
What You Will Own:
The technical evaluation methodology applied to AI programs across the enterprise: pre-production validation, production monitoring, and ongoing optimization
Hands-on guidance to program teams as they design validation studies, equity audits, monitoring plans, and escalation playbooks for their AI systems
Instrumentation of production monitoring: translating program-specific failure modes into concrete, measurable metrics
The evaluation toolkit: LLM-as-Judge frameworks, golden sets, simulation harnesses, experimental study designs, drift detection, subgroup fairness analysis
Reusable evaluation playbooks and templates that let each new program move faster than the last
Technical direction, design review, and mentorship for a team of data analysts supporting the evaluation function
What You Will Not Own:
People management, HR administration, or formal performance evaluations for the analyst team (those sit with the analysts' line manager; the Tech Lead provides technical input)
Program-level product strategy or go/no-go decisions
Final clinical validation judgment on whether a given AI is safe for a given clinical use
The software infrastructure behind the evaluation and monitoring tooling (built by the AI Platform team - the Tech Lead defines what's measured and how; Platform builds the backend)
Shape of the Work:
This is a role that lives at three altitudes at once:
With program teams (hands-on advisory). Partner with program owners early, before evaluations are designed, to shape study approach, sample size, stratification, gold-standard definition, and decision thresholds. Translate ambiguous failure modes into concrete, defensible evaluation designs. Coach teams through the technical work so that what arrives at governance review is rigorous, not performative.
With the evaluation toolkit (hands-on build). Design and operate the reusable assets that let evaluation scale: LLM-as-Judge rubrics and calibration methods, golden sets, simulation harnesses, A/B and shadow-mode study templates, subgroup fairness analyses, and drift monitors. Keep a pragmatic eye on what actually works in a clinical environment versus what works in a paper.
With the analyst team (technical leadership). Set technical direction, assign work across active evaluations, review analysis code and study designs, and raise the technical bar. Mentor analysts on methodology, statistical rigor, and the domain knowledge that makes evaluation credible. Grow them from execution into independent evaluation design.
Methods You'll Use:
Experimental and quasi-experimental design for production AI systems
LLM and generative AI evaluation: golden sets, judge-based evaluation, hallucination and grounding checks
Fairness and equity evaluation across patient and stakeholder subgroups
Production monitoring design: drift detection, performance decay, adoption, and outcome metrics
Causal inference methods appropriate to healthcare settings where full RCTs are often impractical
Simulation and adversarial testing for pre-production stress testing
Python, SQL, modern ML and evaluation tooling, cloud-native data platforms
Work is typically performed in an office or remote environment. Accountable for satisfying all job specific obligations and complying with all organization policies and procedures. The specific statements in this profile are not intended to be all-inclusive. They represent typical elements considered necessary to successfully perform the job.
*Relevant experience may be a combination of related work experience and degree obtained (Master's Degree = 2 years; PhD = 4 years).
Position Details
Required Skills & Qualifications:
6+ years in data science, statistics, ML engineering, or applied quantitative research, with demonstrated experience as the senior technical voice on cross-functional projects
Strong foundation in experimental design and causal inference - and judgment about which method fits which situation
Hands-on experience designing and running model evaluation studies in real production settings
Experience evaluating LLM or generative AI systems, or comparable experience evaluating complex ML systems where ground truth is messy
Proven ability to translate ambiguous failure modes into concrete, defensible evaluation designs and monitoring metrics
Strong fluency in Python and SQL; working comfort with modern ML tooling and cloud-native data environments
Experience with fairness and equity evaluation for ML systems
Track record of providing technical leadership and mentorship without formal people-management authority
Clear written communication - the role produces evaluation memos and specifications that non-technical decision-makers rely on
Healthcare, clinical, or regulated-industry experience strongly preferred
MS or PhD in a quantitative field preferred; equivalent experience accepted
Education
Bachelor's Degree - Related Field of Study (Required)
Experience
Minimum of 6 years - Relevant experience* (Required)
Certification(s) and License(s)
OUR PURPOSE & VALUES: Everything we do is about caring for our patients, our members, our students, our Geisinger family and our communities. KINDNESS: We strive to treat everyone as we would hope to be treated ourselves. EXCELLENCE: We treasure colleagues who humbly strive for excellence. LEARNING: We share our knowledge with the best and brightest to better prepare the caregivers for tomorrow. INNOVATION: We constantly seek new and better ways to care for our patients, our members, our community, and the nation. SAFETY: We provide a safe environment for our patients and members and the Geisinger family.
We offer healthcare benefits for full-time and part-time positions from day one, including vision, dental, and domestic partner coverage. Perhaps just as important, from senior management on down, we encourage an atmosphere of collaboration, cooperation and collegiality.
We know that a diverse workforce with unique experiences and backgrounds makes our team stronger. Our patients, members and community come from a wide variety of backgrounds, and it takes a diverse workforce to make better health easier for all. We are proud to be an affirmative action, equal opportunity employer, and all qualified applicants will receive consideration for employment regardless of race, color, religion, sex, sexual orientation, gender identity, national origin, disability or status as a protected veteran.
We are an Affirmative Action, Equal Opportunity Employer. Women and minorities are encouraged to apply. All qualified applicants will receive consideration for employment and will not be discriminated against on the basis of disability or their protected veteran status.
At Geisinger, our innovative ideas are inspired by the communities we serve – like our Fresh Food Farmacy, a program that delivers life-saving healthy alternatives to patients with diabetes. With additional tools like our MyCode Community Health Initiative, one of the first health system genome sequencing programs, and our new asthma app suite that we developed in partnership with AstraZeneca, it’s no wonder we’re ranked one of the Top 5 Most Innovative Healthcare Systems by Becker's Hospital Review. We work toward continuous improvement in a culture where everyone has a voice, and we firmly believe that better begins with all of us.
Founded more than 100 years ago, Geisinger serves more than three million residents throughout central, south-central and northeastern Pennsylvania and southern New Jersey. Our physician-led system comprises 30,000 employees, including 1,600 employed physicians, and consists of 1...3 hospital campuses, the Geisinger Health Plan, Geisinger Commonwealth School of Medicine and two research centers. What you do at Geisinger shapes the future of health and improves lives – for our patients, communities, and you.