OVERVIEW
Our client, a leading global financial services organization, is seeking a seasoned Observability Engineer to enhance their technology team. With a strong foundation in Splunk and observability technologies, our client aims to bolster their operational and strategic capabilities. This role presents an opportunity to contribute to the day-to-day operations and shape the future architectural landscape. Joining a team dedicated to excellence, the ideal candidate will thrive in a dynamic environment, leveraging their expertise to drive continuous improvement and innovation.
JOB DESCRIPTION
Company Overview:
Our client, part of a renowned global financial group, is committed to digital innovation and excellence in capital markets.
Department Overview:
Within the Global Technology Services (GTS) division, our client focuses on delivering top-tier technology solutions, operational excellence, and efficient resource management. As a client-centric organization, GTS empowers internal business lines while ensuring cost-effective operations across the company.
Role Summary:
As an Observability Engineer, you will play a pivotal role in maintaining and enhancing the operational efficiency of our client's applications and infrastructure. Your responsibilities will span configuring and managing monitoring solutions, optimizing observability tools, and collaborating closely with cross-functional teams. By championing best practices and automation, you will elevate the reliability and performance of their technology stack.
Key Accountabilities:
Develop and maintain robust monitoring solutions utilizing Splunk, Splunk Observability (formerly SignalFX), Grafana, AWS CloudWatch, and Prometheus.
Implement and consult on observability frameworks to meet the needs of internal stakeholders.
Create and manage dashboards to provide actionable insights into system health and performance.
Contribute to Event, Incident, and Operations Escalation Management Policies.
Evangelize observability tools and practices across the organization.
Collaborate with development and operations teams to integrate observability tools into the development lifecycle.
Translate business requirements into technical solutions aligned with strategic goals.
Conduct performance analysis and provide solutions to enhance system reliability and scalability.
Document observability best practices and maintain configuration documentation.
Provide 2nd and 3rd-level systems support and liaise with vendors for problem resolution.
Required Technical Skills:
Proficiency in Splunk, Splunk Observability (formerly SignalFX), Grafana, AWS CloudWatch, and Prometheus.
Strong understanding of cloud environments, particularly AWS.
Experience with scripting and automation (e.g., Python, Bash).
Familiarity with configuration languages such as Ansible and Terraform.
Knowledge of Linux operating systems, preferably RedHat.
Proficiency in source control systems like Git.
Required Non-Technical Skills:
Excellent communication skills for effective stakeholder engagement.
Strong problem-solving abilities for complex troubleshooting.
Self-motivated and capable of working independently or collaboratively.
Nice-to-Have Skills/Experience:
Experience with container platforms and orchestration (e.g., Kubernetes, OpenShift).
Knowledge of virtualization technologies (VMWare, RHV).
Familiarity with OS deployment systems like RedHat Satellite.
Preferred Qualifications:
Bachelor’s degree in Computer Science, Engineering, or a related field.
5+ years of experience in systems engineering, platform/cloud/DevOps engineering, or similar roles.
Relevant certifications in Splunk, AWS, or related technologies.
Experience with additional observability and monitoring tools is advantageous.
Compensation:USD 115000-136000