Senior Cloud Operations Engineer

Centric Software
Summary
Join Centric Software's Automation and Cloud Services team as a Senior Cloud Operations Engineer and play a crucial role in building and maintaining a cutting-edge cloud platform for the Centric C8 cloud application. You will leverage cloud technologies and automation, using Pulumi, Terraform, and Ansible to automate cloud operations tasks. Strong Python programming skills are essential for developing sophisticated automation and cloud services. This role involves overseeing and maintaining cloud infrastructure across AWS, Azure, and Google Cloud platforms, ensuring optimal performance, cost-efficiency, and security. You will collaborate with various teams to meet operational needs and solve complex challenges, creating comprehensive documentation and advocating for automation best practices. The position requires advanced automation development, Infrastructure as Code (IaC) mastery, Ansible integration, and Python programming expertise.
Requirements
- Bachelor's degree in Computer Science, MIS, or a related technology field, or equivalent practical experience
- 8+ years of experience in cloud operations and infrastructure management in AWS, Azure, or Google Cloud
- 5+ years in incident response and major incident management
- Expertise in Pulumi or Terraform for IaC, with a strong portfolio demonstrating successful deployments
- Proficient in Ansible for configuration management and automation tasks
- Strong programming skills in Python, with experience in developing automation tools and scripts in a cloud environment
- Advanced Linux and Windows experience
- Solid understanding of cloud networking and security
- Expert knowledge of containerization and orchestration technologies (e.g., Docker, Kubernetes, Rancher)
- Experience with version control, CI, and automation tools (e.g., GitHub/Bitbucket, GitHub Actions, Jenkins, Rundeck)
- Experience in deploying and troubleshooting Java-based applications and microservices
- Experience in deploying, configuring, and troubleshooting database technologies such as MSSQL, PostgreSQL, and MongoDB
- Experience in monitoring and logging tools (e.g., Nagios, Prometheus, ELK stack)
- Experience with the following technologies: Virtualization, VPN, RDP, SSO, Kafka
Responsibilities
- Lead the creation of an automation framework using Pulumi, Terraform, Ansible, and Python, and build a catalog of automated services that streamlines cloud operations tasks
- Employ Pulumi or Terraform extensively to design, implement, and manage cloud infrastructure with a focus on scalability, reliability, and security
- Develop Ansible playbooks for configuration management, ensuring seamless, automated provisioning and deployment processes across all cloud environments
- Develop and maintain Python scripts and applications that automate cloud operations tasks, improve system efficiencies, and contribute to the automation catalog
- Oversee and maintain a multifaceted cloud infrastructure, ensuring optimal performance, cost-efficiency, and security across AWS, Azure, and Google Cloud platforms
- Identify and automate routine cloud operations tasks, minimizing manual interventions and promoting operational excellence
- Work alongside cloud, development, and DevOps teams to ensure automation tools and practices meet operational needs and solve complex challenges
- Create comprehensive documentation for the automation processes and the service catalog. Advocate for automation best practices across the organization, providing training and support where necessary
Preferred Qualifications
- Certification in AWS, Azure, or Google Cloud is a plus
- Familiarity with project management tools like Confluence/Jira is beneficial
