Pison is seeking a talented and motivated Site Reliability Engineer (SRE)/DevOps professional to join our dynamic team. The ideal candidate will play a critical role in ensuring the reliability, scalability, and performance of our groundbreaking neural interface platform. You will collaborate with cross-functional teams to implement best practices in system design, automation, and continuous integration and delivery (CI/CD).
Duties/Responsibilities:
- Design, implement, and maintain scalable and reliable infrastructure solutions.
- Develop and maintain CI/CD pipelines to automate deployments and streamline development workflows.
- Monitor system performance, identify bottlenecks, and implement solutions to optimize system performance.
- Ensure the security and integrity of our systems by implementing best practices in access control, data protection, and incident response.
- Collaborate with software development teams to ensure systems are designed with reliability and scalability in mind.
- Perform capacity planning and demand forecasting to meet future infrastructure needs.
- Troubleshoot and resolve system issues, providing timely and effective support.
- Develop and maintain documentation related to system architecture, processes, and procedures.
- Participate in on-call rotations to provide 24/7 support for critical systems.
Skills/Abilities:
- Strong proficiency in cloud platforms such as AWS, Azure, or Google Cloud.
- Experience with containerization technologies like Docker and orchestration tools like Kubernetes.
- Proficiency in scripting and automation using languages such as Python, Bash, or similar.
- Hands-on experience with CI/CD tools like Jenkins, GitLab CI, or CircleCI.
- Strong understanding of networking concepts, including TCP/IP, DNS, and load balancing.
- Knowledge of infrastructure as code (IaC) tools such as Terraform, Ansible, or CloudFormation.
- Familiarity with monitoring and logging tools such as Prometheus, Grafana, ELK Stack, or similar.
- Strong problem-solving skills and the ability to troubleshoot complex issues.
- Excellent communication and collaboration skills, with the ability to work effectively in a team environment.
Education/Experience:
- Bachelor's degree in Computer Science, Information Technology, or a related field; or equivalent practical experience.
- 3+ years of experience in a Site Reliability Engineer, DevOps, or similar role.
- Proven experience managing and maintaining production systems in a high-availability environment.
- Experience with neural interfaces, IoT, or similar technologies is a plus.
- Certifications in cloud platforms (AWS, Azure, Google Cloud) or relevant technologies are a plus.
Apply for this job