AI Evaluation Engineer, Device Intelligence (m/f/d)

Danaher

Job Description

At Danaher, our work saves lives. And each of us plays a part. Fueled by our culture of continuous improvement, we turn ideas into impact – innovating at the speed of life.

Our 63,000+ associates work across the globe at more than 15 unique businesses within life sciences, diagnostics, and biotechnology.

Are you ready to accelerate your potential and make a real difference? At Danaher, you can build an incredible career at a leading science and technology company, where we're committed to hiring and developing from within. You'll thrive in a culture of belonging where you and your unique viewpoint matter.

Learn about the Danaher Business System which makes everything possible.

The AI Evaluation Engineer, Device Intelligence(m/f/d) will be a key member of the AI Product and Imaging Innovation team, reporting to its Senior Director. This new role is instrumental in the implementation of cutting-edge AI systems that leverage data created by Danaher devices to extract meaningful insights and dramatically improve user experience, with the goal of upleveling Danaher's devices across Life Sciences, Diagnostics and Biotechnology sectors. This position is remote in Germany.

Join our winning team today. Together, we'll accelerate the real-life impact of tomorrow's science and technology. We partner with customers across the globe to help them solve their most complex challenges, architecting solutions that bring the power of science to life.

For more information, visit www.danaher.com.


  • Define, own and run the AI evaluation strategy for AI products in life sciences, diagnostics, and biotechnology.
  • Design and implement robust evaluation frameworks for agentic workflows, LLMs / NLP, computer vision and multimodal models.
  • Develop and execute evaluation plans to measure performance, reliability, and safety across multimodal datasets.
  • Collaborate with the Sr. Director for the Initiative, Sr. AI Engineers and product teams to align evaluation criteria with product KPIs and regulatory needs.
  • Analyze evaluation results, identify weaknesses, and recommend improvements to AI models and workflows.
  • Build automated pipelines for continuous evaluation and monitoring of AI systems in production.

  • Bachelor's degree in Computer Science, Engineering, Data Science, or related field; MS/PhD preferred.
  • Proven experience designing and implementing evaluation methodologies for AI systems, including LLMs and computer vision.
  • Strong knowledge of metrics for AI performance, robustness, and fairness, especially in regulated domains.
  • Expertise in at least 3 of the following: benchmarking frameworks, statistical validation, synthetic data generation, adversarial testing, explainability techniques.
  • Proficiency in Python and ML libraries (e.g., PyTorch, TensorFlow) and familiarity with evaluation tools (e.g., OpenAI Evals, Dynabench, Promptfoo).
  • Ability to communicate complex evaluation results to technical and non-technical stakeholders and influence model improvements.

It would be a plus if you also possess previous experience in:

  • Experience with regulatory processes, especially for medical devices and AI/ML-based software as a medical device (SaMD).
  • Familiarity with quality management systems and standards relevant to the life sciences and diagnostics industries.
  • Knowledge of instrument control mechanisms and how they integrate with AI systems for enhanced automation.

#LI-AC1

View More