πTaiwan
Site Reliability Engineer

Zealogics Inc
πRemote - India
Please let Zealogics Inc know you found this job on JobsCollider. Thanks! π
Summary
Join our team as a seasoned cloud platform engineer, driving automation and operational excellence within our Azure and Microsoft 365 environments. Lead incident investigations and resolutions, optimizing PowerShell, Bicep, and YAML scripts for automated provisioning workflows. Debug and enhance .NET (C#) components within Azure Functions. Analyze telemetry data to identify systemic issues and implement improvements for workflow reliability. Own and evolve the automation framework for Teams and SPO lifecycle operations. Collaborate with stakeholders, conduct post-incident reviews, and mentor junior engineers. Stay current with Azure and Microsoft 365 API updates and automation tooling.
Requirements
- 12+ years of experience in cloud platform engineering, DevOps, or site reliability engineering (SRE) roles with a focus on automation and operational excellence
- Proficiency in PowerShell scripting, including writing reusable modules, automation logic, and error handling for production workloads
- Extensive experience with Infrastructure as Code using Bicep, including authoring, debugging, and deploying templates for complex Azure resources
- Strong understanding of CI/CD processes and YAML pipelines, with hands-on experience in automating build/release workflows in Azure DevOps
- Proficient in .NET (C#) β especially for debugging Azure Functions or working on backend components integrated into M365 automation flows
- In-depth knowledge of Microsoft 365 platform, including API usage, Teams & SharePoint Online provisioning, governance, and permissions management
- Proven ability to troubleshoot and optimize Azure-native services such as API Management, Azure Functions, Storage, Service Bus, Key Vault, and Container Apps
- Skilled in telemetry and observability β leveraging Azure Monitor, Log Analytics, Kusto queries, and custom logging to proactively identify issues
- Experience conducting root cause analysis, post-incident reviews, and implementing system-wide improvements to reduce incident frequency and MTTR
- Experience in mentoring support engineers, contributing to runbook creation, and improving team capability over time
- Strong analytical, documentation, collaboration and stakeholder communication skills
Responsibilities
- Lead investigation and resolution of critical, recurring, or high-impact incidents across Azure and Microsoft 365 automation workflows
- Deep-dive into PowerShell, Bicep, and YAML scripts to identify logic errors, misconfigurations, or scalability limitations within automated provisioning workflows
- Debug and optimize .NET (C#) components within Azure Functions or related application layers used in workflow orchestration
- Analyze usage patterns and telemetry data from Azure Monitor, Application Insights, and Log Analytics to identify systemic issues or opportunities for automation enhancement
- Implement fixes and design improvements to automation logic that reduce manual intervention and improve workflow reliability (e.g., auto-remediation scripts, retry logic)
- Own and evolve the automation framework for Teams and SPO lifecycle operations β including operations like create/delete, external sharing restrictions, and role/ownership changes
- Collaborate with product owners and architects to introduce new automation use cases or extend existing workflows
- Conduct post-incident reviews (PIRs) for high-severity incidents, drive root cause analysis (RCA), and implement corrective actions
- Mentor L1 and L2 engineers, conduct knowledge-sharing sessions, and support onboarding of new team members
- Stay updated with changes in Azure, Microsoft 365 APIs, and automation tooling (PowerShell modules, Bicep schema updates, etc.)
- Provide guidance on architecture and best practices for automation reliability
Share this job:
Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.
Similar Remote Jobs
πChina
πSingapore
πWorldwide
πJapan
π°$60k-$120k
πAsia
πIndia
πIndia
π°$44k-$55k
πFrance