IT Resilience Specialist
Bitso
Summary
Join Bitso's IT team as an IT Resilience Specialist and play a pivotal role in shaping the organization's strategic resilience program for IT services. You will design, enforce, and monitor frameworks and policies to bolster IT resilience. This position ensures resilience principles are embedded in organizational practices, guiding strategic disaster recovery and business continuity planning. The ideal candidate possesses 3-5 years of hands-on experience in multi-cloud environments, advanced knowledge of monitoring and automation tools, and expertise in IT security. A Bachelor's degree in a related field or equivalent experience is required. You will collaborate with various teams, assess disaster recovery plans, and recommend improvements to enhance cloud service reliability and recoverability.
Requirements
- 3 - 5 years of proven hands-on experience in multi-cloud environments (AWS, Azure, GCP), including implementing backups, snapshots, and multi-region strategies
- Documented track record of executing and analyzing disaster recovery and business continuity tests
- Advanced knowledge of monitoring and automation tools like Terraform, Ansible, Prometheus, or similar, with measurable results
- Possesses advanced expertise in IT security and platform hardening, with focus on AWS, Google Suite, Slack, Cloudflare, Okta, GitHub, and related technologies
- Verifiable experience in IT crisis management and resolving incidents in critical timeframes
- Demonstrable familiarity with frameworks such as ITIL, ISO 22301, and/or NIST SP 800-160 for operational resilience
- Ability to generate clear and precise reports for technical and non-technical audiences, including specific examples of prior documentation
- Bachelor´s degree in Systems Engineering, Computer Science, or a related field (or equivalent documented experience)
- Advanced English proficiency, both written and spoken, for technical and global environments
Responsibilities
- Collaborate with IT, cloud service and Business teams to ensure alignment of Disaster Recovery strategies with the organization's business continuity goals
- Continuously assess the effectiveness of Disaster Recovery Plans for critical IT services like cloud and recommend improvements
- Monitor compliance with established Disaster Recovery and Business Continuity standards and frameworks (e.g., ISO 22301, NIST)
- Evaluate Cloud Providers´s SLAs, resilience capabilities, and data recovery solutions to ensure they meet business needs
- Propose enhancements or alternative strategies for cloud service reliability, redundancy, and recoverability
- Ensure integration of multi-cloud or hybrid-cloud environments into the resilience strategy
- Identify and assess risks related to IT disruption and cloud service outages
- Recommend mitigative measures to reduce the likelihood and impact of disruptions
- Partner with IT teams to conduct threat modeling and implement scenario planning
- Act as a bridge between technical teams (e.g., IT, DevOps, Cloud Architects) and business stakeholders
- Facilitate cross functional workshops and updates on resilience progress
- Advocate for the integration of resilience principles into IT projects and initiatives
- Define testing protocols for DR and BC plans, ensuring regular simulations and table-top exercises
- Analyze test results to identify gaps and propose actionable improvements
- Develop KPIs and dashboards to measure the effectiveness of resilience strategies and DR plans
- Provide periodic reports to leadership, highlighting progress, risks, and areas of improvement
- Design and deliver awareness campaigns to educate teams about resilience best practices
- Ensure all stakeholders understand their roles and recovery scenarios
- Keep up to date with emerging trends in cloud resilience, disaster recovery, and business continuity
- Recommend tools, methodologies, or approaches that could enhance organizational resilience
Preferred Qualifications
Relevant certifications, such as Solutions Architect in public cloud environments, are considered a plus
Benefits
- Me Time program, including unlimited paid time off
- Remote-first work environment
- Employee Stock Option program
- Zero trading fees through our Bitso Alpha app
- Extended Family Leave Policy: all birthing parents, non-birthing parents and adopting parents are eligible for a 4-months leave
- Premium health, dental and life insurances in Mexico, Gibraltar, Colombia, USA, Brazil and Argentina
- Volunteering days
- Monthly stipend for gym memberships, relaxation activities, sports equipment, cooking classes, books, entertainment and more