At Insane Cyber, we’re focused on advancing cybersecurity for the better. We’ve developed innovative tools backed by expert support to change how organizations perform deep level proactive and reactive analysis. We partner with our customers to provide cutting-edge solutions and services to help protect our critical infrastructure and critical operations from threats – from the power grid to manufacturing.
Our flagship Valkyrie and Cygnet products provide host and network analysis automation beyond the capabilities of other products on the market. Our Corvus and Aesir product lines deliver managed and professional services to help assess and fill gaps and weaknesses in the security posture of clients' security programs.As we continue to grow, we're looking for a Senior Site Reliability Engineer to leverage their software engineering and system administration skills to support customer deployments and enhance our engineering reliability efforts.Responsibilities
- Manage the reliability of infrastructure and systems, including development and production environments.
- Manage Security Tools: Oversee, optimize, and extend the operation of proprietary and open-source security tools (e.g., Zeek, Suricata) in both development and production environments.
- Monitor and Enhance Reliability: Develop and deploy automation tools to monitor software reliability in both development and production environments, on-prem and in the cloud.
- System Health Monitoring: Develop tools to monitor system health and security in both internal and production instances.
- Problem Solving: Utilize Python and other scripting languages to diagnose and resolve issues.
- Bug Fixing: Discover, triage, and fix bugs in the Valkyrie and Cygnet products to ensure high quality and reliability.
- Automation: Enhance development pipeline and production automation to support internal development and reduce update/patching friction in customer environments.
- Customer Support: Collaborate with customers to diagnose and resolve technical issues.
- Sales POC Support: Provide technical support for sales Proof of Concept (POC) engagements, working closely with the sales team to demonstrate the capabilities of our products to potential clients.
- ICS Integration: Develop and maintain integrations between cybersecurity tools and Industrial Control Systems (ICS) environments, ensuring the unique security needs of critical infrastructure are met.
- Scalability and High Availability: Design and implement solutions to ensure the scalability and high availability of security systems across large, distributed industrial networks.
- Performance Optimization: Optimize the performance of security tools in resource-constrained environments typical of industrial settings.
- Mitigation and Resolution Support: Collaborate with incident response teams to help diagnose problems, mitigate impact, and restore services, particularly when issues affect system reliability or performance.
Qualifications
- Extensive SRE Experience: Proven track record of using SRE concepts to creatively resolve complex issues.
- Production & Development Expertise: Experience managing environments in both cloud and on-premise settings.
- Problem-Solving Skills: Demonstrated ability to handle complex issues with minimal oversight.
- Strong knowledge of Linux system administration, including proactive management, performance tuning, and troubleshooting.
- CI/CD Knowledge: Familiarity with Git platforms, static and dynamic code analysis tools, and other CI/CD tools.
- High Availability & Disaster Recovery: Proven experience designing and implementing high availability and disaster recovery strategies in mission-critical environments.
- Security stack: Hands-on experience with deploying, maintaining, and extending security tools including but not limited to Zeek and Suricata is a major plus.
- Industrial Cybersecurity Experience: Experience working in industrial environments, particularly with ICS, SCADA systems, or OT security tools.
- Regulatory Knowledge: In-depth understanding of regulatory frameworks and standards specific to industrial cybersecurity, such as NERC CIP, IEC 62443, or ISO/IEC 27019.
- Advanced Security Tool Proficiency: Experience with advanced cybersecurity tools used in industrial environments, such as SIEM systems tailored for OT, and anomaly detection tools.
- Startup Experience: Experience in seed or early A-round startups, particularly as a first or early SRE hire, is a major plus.
- Cross-Functional Collaboration: Proven experience working effectively with cross-functional teams, including engineering, professional services, and sales.
Benefits
- Competitive Base Salary
- Equity offering subject to board approval
- Comprehensive medical/dental/vision/life insurance plan
- Retirement plan with employer match
- Flexible working hours and generous time-off policy
Insane Cyber is proud to be an equal-opportunity employer. We celebrate diversity and strive to foster an inclusive environment for all employees. If you're a visionary with a passion for pushing the boundaries of industrial cybersecurity, we'd love to hear from you.Must be eligible for work in the United States without sponsorship. Accepting direct applications. No outside firms or representatives.