Job Responsibilities
- Research, define, and implement observability standards for the bank and develop associated processes.
- Design and implement standard dashboards and monitors to ensure the availability of critical applications and services.
- Continuously analyze trends and improve visibility and correctness.
- Install and configure monitoring and observability via APM tools (DataDog).
- Collaborate with IT teams and vendors to troubleshoot and resolve complex issues using the DataDog platform.
- Participate actively in the bank's incident management process for IT services covering 24/7 operations.
Applicant's Profile
- Bachelor's degree in Information Technology, Computer Science, or a related field recognized by the University Grants Commission.
- Minimum of 2 years of experience in a similar capacity.
- Understanding of incident management principles, processes, and best practices (DataDog / ITIL certification is a plus).
- Basic understanding of application architectures, microservices, middleware, distributed systems, DEVOPS, application runtimes (JAVA or IIS), and operating systems (Windows or LINUX) is an added advantage.
- Strong problem-solving and result-oriented skills.
- Good interpersonal and communication skills.
Benefits
- Attractive remuneration package commensurate with benchmarked financial institutions.
How to Apply
Interested candidates are invited to apply for the position. All applications should be routed through our corporate website. Please visit www.combank.lk and navigate to Careers > Open Positions > Observability Engineer - Incident Management and Service Reliability.