Cloud Engineer
iTMethods
Summary
Join iTMethods, a market-leading Managed DevOps SaaS Platform company, as a Cloud Engineer. Reporting to the Director of Platform Engineering and Operations, you will design, provision, and engineer cloud environments for clients, acting as their SME for DevOps tool deployment and management. You will create reusable code for automation and monitoring, contributing to product development. This client-facing role offers opportunities to leverage your expertise, learn from customer interactions, and collaborate with the product team. The company offers a flexible work environment, competitive compensation and benefits, and opportunities for growth and professional development. iTMethods is committed to fostering an inclusive and accessible work environment.
Requirements
- 4+ years of experience in a Cloud engineering role, with a proven track record of designing, deploying, and maintaining cloud infrastructure solutions
- A degree in Computer Science, Engineering, or a related discipline, with AWS certification preferred
- Direct experience in designing, implementing, and supporting DevOps solutions and cloud infrastructure, focusing on optimizing efficiency through automation and continuous integration practices
- Extensive experience with cloud-native development and a deep understanding of cloud infrastructure architectures that support highly scalable, resilient, and secure systems
- Solid hands-on experience with container technologies like Docker and Kubernetes, including the design and management of highly available systems in Kubernetes clusters
- Experience with complex event processing and event-driven architectures, leveraging modern practices for cloud service integration and automation
- Strong expertise in implementing CI/CD pipelines, with practical experience in tools like Jenkins, GitHub, and SonarSource
- Thorough understanding of microservice design and architectural patterns, and how to apply these patterns within cloud-native environments
- Advanced skills in scripting and automation technologies (e.g., Python, Terraform, CloudFormation) for automating infrastructure provisioning, scaling, and management
- Proven experience in migrating on-premises environments to public cloud platforms, particularly AWS and Azure
- Deep experience in monitoring system performance, recommending cloud infrastructure improvements, and proactively troubleshooting issues to ensure high availability and optimal performance
- Familiarity with open-source technologies, contributing to or utilizing these tools in cloud environments
- Ability to work effectively with cross-functional teams and customers, translating practical feedback into technical requirements, and fostering clear communication between teams
- Strong leadership experience in designing, building, and operating cloud infrastructure solutions that support large-scale, mission-critical applications and environments
Responsibilities
- Take ownership of the design, provisioning, and ongoing management of cloud environments, ensuring they meet customer requirements and industry standards
- Utilize Infrastructure as Code (IaC) tools like Terraform, CloudFormation, or Pulumi to fully automate the deployment, configuration, and operation of customer environments across cloud platforms
- Implement robust monitoring and logging solutions to proactively identify performance bottlenecks and issues, using tools like AWS CloudWatch, Prometheus, or Datadog. Ensure environments are continuously optimized for cost and performance
- Lead the implementation of automated deployment pipelines and promote DevOps best practices for continuous integration, continuous delivery, and version control using tools such as Jenkins, GitLab, or AWS CodePipeline
- Work closely with cross-functional teams to define and evolve technical architectures and cloud infrastructure roadmaps
- Integrate "Secure by Design" principles into every environment you design and maintain, ensuring all deployments meet compliance standards and are fortified against potential vulnerabilities
- Provide high-level expertise in cloud platform services, troubleshooting, and providing escalation support for complex issues. Participate in a rotation to provide on-call escalation support to the Platform Operations team
- Create reusable software solutions and operational processes that can be incorporated into the product for automation and scalability
- Stay up to date with emerging cloud technologies and tools, evaluating their applicability to current and future projects
- Create clear, concise, and well-structured documentation, including detailed how-to guides, troubleshooting steps, and architectural explanations, tailored to both technical and non-technical audiences
Benefits
- Flexible work environment
- Competitive compensation and benefits package
- Learning and development
- Career progression