HSBCJP00052234 (CTO EI DigiBA) Observability Engineer
ref nr: 63/12/2024/TM/89217In Antal we have been dealing with recruitment for over 20 years. Thanks to the fact that we operate in 10 specialised divisions, we have an excellent orientation in current industry trends. We precisely determine the specific nature of the job, classifying key skills and necessary qualifications. Our mission is not only to find a candidate whose competences fit the requirements of the given job advertisement, but first and foremost a position which meets the candidate’s expectations. Employment agency registration number: 496.
Observability Engineer – Banking Sector
This is an opportunity to join a dynamic and innovative team supporting a leading financial institution in delivering cutting-edge observability solutions. The role focuses on implementing and maintaining monitoring frameworks and observability tools to support critical infrastructure and applications within the banking sector.
Your Career Opportunity
The Observability Engineer will work on a global observability project for a mission-critical application in the financial services environment. The role entails ensuring optimal performance, availability, and recoverability of monitoring tools, contributing to the broader technology roadmap, and supporting self-healing solutions for key applications and infrastructure.
Key Responsibilities
- Deliver and implement monitoring and observability strategies for critical platforms.
- Build and maintain observability frameworks that cater to diverse stakeholders.
- Set up dashboards, visualizations, and monitoring solutions, collaborating with application and infrastructure teams.
- Engineer and standardize solutions for tools like Splunk, AppDynamics, and ThousandEyes, focusing on optimization and application tuning.
- Automate deployment processes to minimize operational tasks and integrate observability tools with existing monitoring systems.
- Offer training sessions to promote best practices and tool adoption.
- Enhance monitoring solutions, contributing to a standards-based, self-service, automated platform.
- Adhere to organizational policies and risk management practices while delivering high-quality solutions.
What You Need to Succeed
- 2+ years of experience with observability tools such as Splunk, AppDynamics, and ThousandEyes.
- Knowledge of REST API development and monitoring extensions.
- Experience with enterprise-level application development, particularly in Java, is a plus.
- Familiarity with architecture tech stacks such as Java application servers, Kubernetes, OpenShift, PCF, and cloud platforms (AWS, GCP).
- Understanding of AI/ML concepts is highly desirable.
- Experience using tools like ServiceNow, Confluence, and Jira.
- Practical knowledge of event management tools and operations automation (e.g., AIOps).
- Experience creating and supporting monitoring dashboards for applications running on Unix and Oracle databases.
- Proficiency with monitoring and observability products like Elasticsearch, Grafana, Prometheus, and their associated methodologies.
- Solid understanding of distributed systems design, application performance metrics, and statistical/machine learning concepts.
- 2+ years of experience deploying APM solutions for mission-critical applications.
- Strong interpersonal and communication skills, along with the ability to work independently and manage multiple priorities.
- Technical writing skills for queries, reports, and presentations.