At AccelByte, our mission is to empower game creators by providing them with the backend platform and tools required to make scalable, reliable AAA-quality games. The company was founded in 2016 by industry veterans who have engineered online systems for some of the largest game and distribution platforms in the world including Fortnite, Epic Store, Xbox Live, PlayStation Network, and EA Origin. We are backed by top investors including Softbank, Sony Interactive Entertainment, Galaxy Interactive, NetEase, and Krafton. Our latest Series B funding has firmly solidified our place as a top player in the gaming industry. AccelByte’s talent has decades of experience building and shipping some of the largest game and distribution platforms in the world.
We believe that the best companies empower employees to make decisions, obsess about the best user experience, and are not afraid to make and learn from their mistakes. Our culture is based on humility, openness to feedback, drive, and collaboration, which we feel results in the best performing teams. As a company that values diversity, inclusion, and employee growth, our employees have opportunities to work with and learn from teams all over the world. We offer competitive salaries, a full range of health benefits, social activities, career growth opportunities, and an amazing team.
Come join us!
Position Summary
As a Sr SRE/ Cloud Engineer II, you will play a pivotal role in designing, managing and optimizing our software development infrastructure. You will also be responsible for overseeing the design and maintenance of automated build, deployment, and configuration management systems. The ideal candidate will have hands-on industry experience in the CICD domain along with good coding skills.
Essential Functions/Responsibilities
The Sr SRE/ Cloud Engineer II is accountable for the following functions and responsibilities:
- Mentoring, reviewing, and providing feedback on coworkers' works and changes to support DevOps mentality efforts.
- Collaborate intensely with the team to resolve problems with infrastructure and software deployments.
- Design, execute, and manage infrastructure and operational tasks with scalability and stability considerations in mind.
- Design and manage the development of automation solutions to streamline tasks, enhance efficiency, and minimize manual effort.
- Collaborate closely with the team and assist them in managing and maintaining cloud-based infrastructure and services.
- Design, execute, and manage scalable infrastructure and deployment frameworks utilizing K8s and CNCF projects.
- Design and manage the microservices in the distributed system
- Perform extensive deployment and configuration of technologies and tools used, including building servers, version control systems, and automated testing frameworks.
- Communicate directly with clients, ensuring their needs are understood and providing support as a valued team member.
- The ability to train and mentor less experienced engineers and set the direction for other engineers.
- Model standards for engineering excellence
- Discover requirements by working with PMs and stakeholders
- Proactively seek opportunities for process improvement and innovation, driving initiatives to enhance automation, efficiency, and reliability across the entire infrastructure and operational landscape.
- Participate in hiring activities for SREs as needed.
- Work closely with development and QA teams to ensure seamless integration and deployment of applications.
- Help in defining, measuring, and tracking OKRs related to CICD work.
- Participate in the on-call schedule for the team.
- Conduct Root Cause Analysis of incidents and implement measures to prevent occurrences.
- Perform other duties as assigned.
Qualifications/Experience Required
- Bachelor's Degree background or relevant work experience, certification, or courses
- At least 6 years of experience specializing in operations and automation processes, with a focus on maintaining and optimizing infrastructure.
- Advanced experience in scripting in programming languages such as Python, Bash, GoLang, etc.
- Advanced experience with Jenkins, GitLab CI, or similar tools for automation, CI/CD, and GitOps, with experience in utilizing CI/CD tooling and pipelines, particularly with emphasis on GitLab, Jenkins, and Flux.
- Advanced experience in performing cloud system operations in an AWS environment.
- Advanced experience in using Kubernetes involves familiarity with tools such as Kubectl, Flux, and others for debugging and modifying cluster states. Additionally, understanding Customization and the infrastructure as code (IaC) structure/mechanism within the 'deployments' repository is crucial.
- Advanced experience in containerization technology, particularly Docker and Containers, including its limitations and practical applications within a Kubernetes cluster environment.
- Advanced experience in using Terraform, Terragrunt syntax, and CloudFormation usage, including the ability to apply, modify, or delete modules.
- Experience in cloud monitoring, logging, and APM solutions, with exposure to monitoring tools such as Prometheus & Grafana.
- Experience in operating and understanding microservices within distributed systems, demonstrating proficiency in designing, deploying, and managing microservices architectures.
- Experience in using database design and management, including knowledge of how to scale up databases.
- Advanced experience in using networking concepts and protocols, particularly within cloud ecosystems, with the ability to operate network stacks effectively.
- Advanced experience in using Unix system administration concepts and principles, including the ability to troubleshoot and maintain Unix-based systems.
- Experience in handling software architecture, especially from the infrastructure point of view.
- Advanced experience in developing and handling system architecture for scratch
- Experience in communicating technical concepts through documentation and specifications effectively.
- Experience in managing full-cycle projects and project breakdowns will be a plus
- Experience in similar products or services offered by AccelByte, preferably in a AAA game studio or software product company. Expected to acquire practical knowledge of how AccelByte's products are hosted within the infrastructure upon joining.
- Basic experience in mentoring, managing, and setting the direction for other engineers.
- AWS Certified Solutions Architect is a big plus.
- Experience at a AAA game studio or a software product company is a big plus.
- Experience working in a multinational technology startup is a big plus.
- Eagerness to learn new technologies.
- Proficiency in written and verbal English language.
- Flexibility to adjust to work routines/schedules.
AccelByte Inc is an Equal Employment Opportunity Employer, all qualified candidates and applicants will receive consideration for employment without regard to race, religion, gender, national origin, sexual orientation, marital status, age, or disability. Our culture is innovative and inclusive, and we value our people the highest.Please visit our career page for a complete listing of our open positions https://accelbyte.io/careers