πIndia
Associate Principal Engineer, Cloud Infrastructure
Nagarro
πRemote - Mexico
Please let Nagarro know you found this job on JobsCollider. Thanks! π
Summary
Join our dynamic Digital Product Engineering company as a Cloud & Infrastructure Hosting Operations Manager! Lead and mentor a team, overseeing daily operations, ensuring reliability and performance of cloud and infrastructure environments. You will manage incidents, coordinate issue resolution, and communicate performance to stakeholders. Maintain security and compliance, develop operational processes, and lead continuous improvement initiatives. This role requires strong cloud architecture knowledge, experience managing operations teams, and expertise in cloud platforms and automation tools. We offer a fast-paced, high-growth environment.
Requirements
- Must have Skills: Cloud architecture (Capable), Azure DevOps (Strong)
- 5+ years of experience managing cloud & Infrastructure operations, hosting services, or similar IT infrastructure
- Strong knowledge of cloud platforms (AWS, Azure, GCP), virtualization technologies, Kubernetes, and hosting environments
- Proven track record in leading and managing Operations teams in a fast-paced, high growth environment
- Experience with cloud automation tools (e.g., Terraform, Ansible) and monitoring solutions
- Familiarity with ITIL practices, DevOps principles, and Agile methodologies
- Excellent problem-solving skills, with the ability to diagnose complex issues and implement solutions quickly
- Strong communication skills with the ability to present technical concepts to non-technical stakeholders
Responsibilities
- Lead and manage the Cloud & Infrastructure Hosting operations team, providing guidance, mentoring and support
- Ensure team members are well-equipped and trained to handle the evolving cloud and Infrastructure Mgmt. environment
- Perform Technical and capability assessment of team, training needs, onboarding of new team members
- Foster a culture of continuous improvement, accountability, and teamwork
- Oversee the daily operations of cloud and infrastructure environments, ensuring throughput, reliability, and performance
- Manage and monitor cloud and infrastructure environments, troubleshooting incidents, managing outage bridge, coordination between multiple teams for issue resolution, leading root cause analysis for Critical service issues
- Communicate operational performance, incident status, and improvement plans to management and stakeholders
- Ensure service-level agreements (SLAs) are met and reported to key stakeholders
- Knowledgeable of security standards and regulatory requirements for cloud and on collocation environments
- Knowledgeable on security risks/threats, vulnerability management, encryption, High Availability and Disaster Recovery plans
- Act as the primary point of contact for the operations team, collaborating with software development, product, and client services teams
- Coordination between geographically distributed teams across multiple time-zones to ensure service levels and hand-offs are efficiently performed
- Develop and refine operational processes based on ITIL standards, aiming for efficiency, scalability, and reliability
- Lead continuous improvement initiatives to enhance system performance, reduce downtime, and streamline operations
Share this job:
Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.
Similar Remote Jobs
πChina
π°$175k-$234k
πWorldwide
πUnited States
π°$159k-$200k
πUnited States
π°$126k-$262k
πUnited States
πUnited States
πMexico
π°$165k-$226k
πWorldwide