Climate Impact X (CIX)
is a carbon market solutions provider with a mission to create real impact by turning trust in carbon credits into tangible and actionable outcomes. CIX is aSingapore-based carbon market leader that is in the unique position of meeting needs across primary and secondary carbon markets with its offscreen sourcing services and online platforms for buyers, suppliers, and traders. For more information, please visit www.climateimpactx.com.
Site Reliability Engineer
We are seeking a motivated Site Reliability Engineer (SRE) to join our team. Based in Singapore, the ideal candidate, will ensure the reliability, performance, and scalability of CIX’s technology stack while supporting critical infrastructure needs globally. With a diverse client base across multiple jurisdictions, you will be required to cover London business hours to ensure smooth operations and effective collaboration.This is a hands-on, dynamic role where you’ll collaborate with multiple departments, providing insights and improvements that align with our global growth strategy.
Responsibilities
- Manage scalable and resilient infrastructure using cloud solutions
- Implement and maintain monitoring solutions to track system health and performance across application and data systems.
- Respond to alerts and incidents, troubleshooting issues as they arise to minimize downtime and ensure high availability.
- Conduct post-incident reviews (PIRs) to analyze failures and improve systems and processes, optimizing resource utilization to ensure cost-effectiveness while maintaining performance and reliability.
- Implement CI/CD pipelines for deployment processes to automate and enhance deployment processes.
- Monitor system capacity and usage trends to anticipate future needs, working with the team to identify and resolve performance bottlenecks and ensure timely scaling of resources.
- Set up monitoring and alerting systems to proactively catch issues before they impact users and maintain optimal system performance.
- Manage software deployments, patches, and upgrades for supported applications, ensuring minimal disruption.
- Collaborate with development teams to reproduce issues, gather requirements, and test solutions
- Document processes, create user guides, and deliver training to internal stakeholders.
- Proactively review and improve processes, tools, and practices to identify opportunities for enhancement.
- Stay updated with industry trends, new technologies, and best practices relevant to SRE and fintech.
Requirements
- A minimum of 5 years of hands-on experience in site reliability engineering, infrastructure management, or DevOps.
- Experience with Docker and Kubernetes for container orchestration and management and troubleshooting.
- Experience with at least one major cloud provider, such as AWS, Google Cloud, or Azure (preferable), with a focus on deploying and managing infrastructure.
- Solid understanding of networking principles, including DNS, TCP/IP, VPNs, and load balancing.
- Familiarity with CI/CD pipelines (preferably ArgoCD, Helm Charts, or similar DevOps tools) for automation of deployment and infrastructure processes.
- Strong background in Linux environments, including bash scripting for automation and troubleshooting.
- Experience with monitoring tools for altering and dashboards.
- Experience with Terraform for managing infrastructure as code is preferred.
- Azure Administrator Certification (or equivalent) and Certified Kubernetes Administrator (CKA) certification preferred.
- Strong willingness to continuously learn and adopt new tools, frameworks, and technologies as required by the job.
- A strong team player, with excellent communication skills, who strives for excellence and continuous improvement.
CIX is an equal opportunity employer committed to diversity and inclusion.
*We seek your understanding that only shortlisted candidates will be contacted.