Taulia Inc.NOC/APM Engineering Manager
Oct. 2018 - Nov. 2024Bulgaria, SofiaAs an Engineering Manager at Taulia, I spearhead the APM (Application Performance Monitoring) and Observability initiatives, leading a team dedicated to creating and managing our observability system from inception. Beginning with zero engineers and limited coverage, I strategically built a team of six engineers on rotating shifts, ensuring 24/7 observability and incident management. One of our pivotal achievements has been the transformation of our observability system from a collection of disparate conditions to a unified alert management framework, meticulously deployed and maintained using CaC. Currently, our system monitors over 1500 components, complemented by comprehensive system-level and business process alerting policies. Key responsibilities of our team include providing end-to-end monitoring, encompassing both application and infrastructure layers, and ensuring the continuous surveillance of Taulia's infrastructure and critical business processes. We are vigilant in responding to critical alerts, and swiftly taking action to mitigate any issues that arise. Additionally, we deliver detailed reporting on critical components, services, and business processes, prioritizing the protection of the customer experience above all else. In my role as manager, I focus on fostering strong relationships and collaboration within the engineering department and across various teams, such as customer success management, developer teams, etc. to ensure the success of our Observability team. I am deeply committed to the professional development of engineers within our team and the broader company, as well as the ongoing optimization of our tools and processes to exceed expectations and drive continuous improvement across the organization.