Senior Systems Engineer - Openstack

Kaseya
Summary
Join Kaseya, a leading provider of IT infrastructure and security management solutions, as a Senior Systems Engineer. You will ensure the reliability, uptime, and performance of Kaseya's infrastructure. Collaborate with various stakeholders to resolve issues, implement self-healing mechanisms, and improve platform reliability. Align with business units to develop and maintain complex environments, shape monitoring platforms, and scale the infrastructure. Troubleshoot complex issues, participate in on-call rotations, and communicate with various teams during incidents. This role requires a Bachelor's degree in Computer Science or equivalent experience and strong Linux expertise. The position can be based remotely in the US.
Requirements
- Bachelorโs degree in Computer Science or equivalent experience
- Strong grasp of Linux, both from a command-line perspective and operating system fundamentals
- Experience with software development, automation, infrastructure as code, and data-driven analysis
- Experience with configuration management tools such as Puppet, Ansible, or Salt
- Hands-on experience with mainline programming and scripting languages such as Bash, Python, Perl, or Ruby
- Familiar with standard tools and platforms that enable continuous delivery such as GitLab, Jenkins, Kubernetes, Docker, or JIRA
- Strong root cause analysis and troubleshooting competency
- Strong tendency to automate and monitor everything
- Excellent communication skills
- Ability to operate in a fast paced environment
- Self-motivated & willing to learn
- Ability to work independently and as part of a team
Responsibilities
- Align with various business units to develop, deploy, and maintain complex environments that service applications, components, and self-service/internal tooling in Kaseyaโs production and working environments
- Shape monitoring and alerting platforms to get the best quality signals and avoid creating churn for other engineers
- Scale the Kaseya infrastructure platform to maintain health and reduce human intervention as needed by automating any repetitive operational activities and measuring normal operation of the platform
- Collaborate with DevOps, Database Engineering, Security Engineers and SREs from the various software application engineering teams to help ensure proper documentation, process consistency to assure end to end reliability and awareness
- Troubleshoot complex issues quickly and effectively; continually improve processes and understanding based on post-mortem analysis
- Participate in a rotational on-call program and enhance troubleshooting techniques and utilities to ensure quick resolution to service impacting issues
- Communicate with Users, Support, and Development teams in the event of an incident
Preferred Qualifications
Significant experience with virtualized and bare metal infrastructure; KVM and OpenStack experience strongly preferred