Software Engineer

Xero
Summary
Join Xero's Site Reliability Engineering (SRE) team as an Intermediate or Senior Tooling Engineer and contribute to improving system visibility, reliability, and performance across Xero. You will work closely with engineering teams to enable and empower them with the tools, standards, and best practices needed to build highly reliable and observable systems. You will develop and implement observability solutions, ensuring our products are resilient, scalable, and easy to monitor. Your work will drive best practices in observability, incident management, and chaos engineering, empowering teams to build and operate high-quality software with confidence.
Requirements
- Experience in improving operational outcomes for software systems in production, with a strong understanding of reliability and observability concepts in distributed systems and microservices
- Technical expertise in software development, with proficiency in one or more object-oriented programming languages ( e.g., C#, JavaScript, Golang, or Python )
- Hands-on experience with observability tooling, including instrumenting applications and integrating with solutions such as New Relic, Datadog, Dynatrace, SignalFX, SumoLogic, or Splunk
- Exquisite verbal and written communication skills, and collaboration, with the ability to build relationships, engage with stakeholders, and effectively convey complex ideas both verbally and in writing
- A proactive and adaptable mindset, thriving in fast-paced, high-growth environments with a strong bias for action, initiative, and problem-solving
Responsibilities
- Design and implement observability solutions that enhance system reliability, scalability, and performance while reducing operational toil
- Develop and maintain monitoring, logging, and tracing tools, ensuring efficient and automated observability processes
- Provide expert guidance and support to engineering teams on best practices, tooling, and troubleshooting for observability systems
- Advocate for continuous improvement, driving adoption of SLOs, reliability best practices, and resilient engineering across Xero
- Participate in on-call rotations, proactively addressing issues and improving the reliability of critical systems
Benefits
Offering very generous paid leave to use however youβd like (plus statutory holidays!), dedicated paid leave to care for your physical and mental wellbeing as well as an Employee Assistance Program to access mental health care for you and your family, health insurance, life insurance, and income protection, wellbeing and sports programmes, employee resource groups, 26 weeks of paid parental leave for primary caregivers, an Employee Share Plan, beautiful offices, flexible working, career development, and many other benefits that reflect our human value, youβll do the best work of your life at Xero
Share this job:
Similar Remote Jobs

