Observability and Event Management Architect (f/m/d)

Uniper SE

Job Description

At Uniper, we proactively transform the world of energy while ensuring the security of energy supply. As an internationally operating company, we work in very diverse teams with the greatest possible working time flexibility for our employees. Our corporate culture is characterized by equal opportunities, mutual appreciation, and respect. With us, you will be able to develop new business models, work on technological solutions for a modern, sustainable, and future-oriented energy supply, as well as proactively help shape changes. Interested? We look forward to meeting you!


The Observability and Event Management Architect is responsible for designing and implementing a robust observability framework and ensuring transparency in service operations. This role is crucial for maintaining system reliability and for providing proactive information that helps avoid critical incidents within a modern service management environment.

Architecting a Comprehensive Observability Framework:

  • Design and maintain a comprehensive observability framework covering cloud, traditional infrastructure, applications, and services.
  • Ensure the framework supports data ingestion from modern technologies such as Docker, Kubernetes, serverless architectures, and cloud services (especially Azure).
  • Enable monitoring of logs, metrics, traces, and user experience analytics to provide a holistic view of system performance.
  • Integrate with existing platforms like Elastic, Azure Monitor, AWS CloudWatch, and Netscout to leverage current intelligence and ensure continuity.
  • Utilize AIOps, event correlation, and suppression techniques to optimize incident management and prevent disruptions.

Driving Supplier and Internal Team Collaboration:

  • Lead and coordinate efforts between internal teams and external suppliers to ensure transparency in service operations and proactive information sharing.
  • Establish clear communication channels and regular progress reviews to ensure alignment and accountability.
  • Deliver full access to monitoring data and insights for service owners and users, enabling collaboration and accountability across teams.

Implementing AI-Driven Monitoring Tools:

  • Design and maintain AI-powered tools for automated anomaly detection, predictive analytics, and self-healing workflows to minimize manual intervention and prevent incidents.

Modern Service Management Practices:

  • Implement and promote modern service management practices, ensuring alignment with ITIL, IT4IT and other industry standards.
  • Drive continuous improvement initiatives to enhance service delivery and operational efficiency.
  • Ensure the observability framework supports seamless integration with service management platforms like ServiceNow.

Qualifications:

  • Bachelor's or Master's degree in Computer Science or a related field.
  • More than 5 years of experience in managing an ITIL/IT4IT process in a multi-vendor ecosystem.
  • Proven experience in IT operations, observability, incident management, and event management.
  • A strong understanding of modern IT Service Management delivery.
  • Strong problem-solving, quantitative, and analytical abilities.
  • Service-oriented, with good communication and presentation skills.
  • Strong knowledge of monitoring tools and platforms (e.g., Dynatrace, Datadog, Splunk).
  • Experience with AI-driven monitoring and automation tools.
  • Excellent problem-solving skills and the ability to work under pressure.
  • Strong communication and collaboration skills.
  • Experience in managing supplier relationships and driving internal team performance.
  • Familiarity with ITIL/IT4IT and modern service management practices.
  • We are an international company with global reach. We expect fluent English, as this is our business language. German language skills are an advantage.