Remote Senior DevOps Engineer

Logo of impact.com

impact.com

πŸ“Remote - South Africa

Job highlights

Summary

Join a team where your opinion is valued, and enjoy working with fun and talented people from all over the world. As a Senior DevOps Engineer at impact.com, you will lead the evaluation and adoption of new technologies and tools within the DevOps squad, ensuring alignment with the organization's objectives and future needs.

Requirements

  • Extensive experience with Linux operating systems (e.g., Ubuntu, CentOS, Red Hat), including installation, configuration, maintenance, and troubleshooting
  • Understanding of Linux networking concepts, including IP addressing, routing, firewalls (e.g., iptables)
  • Knowledge of Linux security best practices, including user management, permissions, and encryption
  • Proficiency in performance tuning techniques for Linux systems, including optimizing kernel parameters, disk I/O tuning, memory management, and CPU utilization
  • A good understanding of IaC (infrastructure as code) principles and adopting these methods to drive automation and self-service
  • Comprehensive grasp of coding and scripting in common languages, including Python, Perl, PHP, and Ruby
  • Familiarity with at least one primary coding language, like C++ or Java
  • Familiarity with containerization technologies such as Docker and container orchestration platforms like Kubernetes
  • Understanding of high availability concepts and technologies such as Linux clustering, load balancing (e.g., HAProxy, Nginx, F5, Treafik)
  • Ability to identify, evaluate, and integrate diverse open-source technologies and cloud services
  • Proven experience with business and CI/CD tools like Prometheus, GitHub, Atlassian Jira, Confluence, and Jenkins
  • Proven experience with public cloud resources and services, including AWS, Microsoft Azure, and Google Cloud
  • Familiarity with various IT monitoring and management tools like Cloudflare
  • Proficiency in troubleshooting and resolving technical issues across staging, uat and production
  • Strong focus on security, adhere to NIST and CIS standards, ability to implement security hardening measures for Linux servers and environments
  • Integrity and ethical leadership, demonstrating honesty, transparency, and fairness in all interactions, and upholding ethical standards and values in decision-making and communication
  • Ability to lead and mentor junior team members, providing guidance, support, and feedback to help them grow and develop their skills in DevOps practices and technologies
  • Excellent communication skills, both verbal and written, to effectively communicate technical concepts and ideas to non-technical stakeholders, facilitate discussions, and build consensus across teams
  • Strong problem-solving skills and the ability to make sound decisions under pressure, analyzing complex technical issues, evaluating options, and implementing effective solutions that drive results

Responsibilities

  • Lead the evaluation and adoption of new technologies and tools within the DevOps squad, ensuring alignment with the organization's objectives and future needs
  • Take a proactive approach to bug triage, including root cause analysis and fixing, to ensure the stability and reliability of the systems and applications supported by the DevOps team
  • Collaborate with cross-functional teams, including developers, QA engineers, product managers, and stakeholders, to define and achieve departmental and organizational objectives on a quarterly basis
  • Drive the automation of complex and critical manual tasks and optimize repetitive processes across the development, testing, deployment, and monitoring phrases of the software development lifecycle
  • Provide mentorship and technical leadership to junior team members, including participating in pair programming sessions, conducting code reviews, and sharing best practices
  • Lead research and prototyping efforts for emerging technologies, exploring opportunities for innovation and improvement in existing systems and processes
  • Take ownership of creating and maintaining comprehensive documentation, including design documents, user guides, and test plans, to ensure clear communication and knowledge transfer within the team and across departments
  • Drive the implementation of robust software testing and quality assurance processes, including the development and maintenance of automated tests at unit, functional, and integration levels
  • Lead incident management efforts, including responding to alerts, reviewing error messages, and diagnosing and resolving technical issues in a timely manner to minimize impact on system availability and performance
  • Ensure the stability and scalability of the infrastructure and platform by maintaining build- and stage stability, optimizing resource utilization, and implementing infrastructure as code practices
  • Lead efforts to ensure compliance with industry regulations (e.g., GDPR, HIPAA) and internal security policies, including implementing security best practices, conducting security audits, and managing access controls
  • Monitor system performance and usage trends, and conduct capacity planning to anticipate and accommodate future growth and scaling needs, ensuring the infrastructure can support increasing demands
  • Implement cost optimization strategies for cloud infrastructure and services, including rightsizing resources, leveraging reserved instances, and implementing tagging and monitoring to track and optimize costs
  • Provide leadership and support for on-call rotations, including participating in incident response activities, conducting post-mortem analyses, and implementing corrective actions to prevent recurrence
  • Evaluate and manage relationships with third-party vendors and service providers, including cloud providers, software vendors, and infrastructure partners, to ensure the organization receives value and meets its objectives
  • Work closely with business stakeholders to understand their needs, requirements, and technical specifications, translating them into actionable plans and solutions that deliver value to the organization

Benefits

  • Casual work environment, including working from home
  • Flexible work hours
  • Unlimited PTO policy
  • 6 month primary caregiver leave
  • Training & Development
  • Medical Aid and Provident Fund
  • Restricted Stock Units
  • Internet Allowance
  • Fitness club fee reimbursements
  • Technology Stipend

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.

Similar Remote Jobs

Please let impact.com know you found this job on JobsCollider. Thanks! πŸ™