Cloud Site Reliability Engineer

NICE
📍Remote - India
Summary
Join NiCE Public Safety, a global leader in providing state-of-the-art solutions for the Public Safety & Justice market, as a Site Reliability Engineer. This hands-on role involves ensuring the reliability, scalability, and maintainability of our cloud platforms. You will be part of a team acting as gatekeepers of production, managing the work backlog and driving reliability improvements. Lead investigations into outages and performance issues, and develop automation for low-value tasks. Provide technical leadership to Cloud Operations and Support teams, and develop monitoring dashboards and alerts using tools like Grafana and Azure Monitor. This position offers a hybrid work model (2 days in office, 3 days remote).
Requirements
- Must have 2+ years of experience in Site Reliability Engineering
- Excellent technical, analytical and troubleshooting skills
- Experience and in-depth knowledge of databases and data handling (MS-SQL, Elasticsearch, YAML, JSON, XML)
- Significant experience in programming or advanced scripting (C#, PowerShell etc.)
- Experience with infrastructure/configuration as code and version control (ARM, Bicep, Git)
- Experience managing monitoring, alerting and dashboarding platforms (Azure Monitor, Prometheus, Grafana, Elasticsearch)
- Demonstrable experience of supporting live cloud services and platforms
- Production experience with Kubernetes and containerization
- Implementation and support of service level objectives (SLOs)
- Exposure to commercial cloud providers (ideally Azure, others considered)
- Efficient, effective, and respectful communication skills, both with customers and within internal departments, including being a good listener who can identify and validate assumptions
- Able to use effective questioning to confirm understanding of a customer problem and then provide help to solve it
- Methodical troubleshooting, technical skill and attention to detail used in diagnosing problems and reproducing issues in a local environment
- Multi-tasking and time-management skills to prioritize and switch between varied tasks
- Be flexible with working hours when needed to address critical or urgent matters
- Be able to provide on-call services from time to time as needed
Responsibilities
- Act as part of a team of SREs who serve as the ‘gatekeepers’ of production, actively managing the work backlog and developing reliability improvements
- Lead investigations into the root causes of outages, performance problems, and cost issues
- Lead initiatives to develop the automation of low-value tasks balanced against project delivery demands
- Provide technical leadership to the wider Cloud Operations and Support teams, along with oversight of the products and services they support
- Develop and configure monitoring dashboards and alerts in tools like Grafana and Azure Monitor
- Install and configure the observability platform, including tools like Grafana, Prometheus, Azure Monitor, and OpenTelemetry
- Develop and deploy Bicep modules for monitoring infrastructure
Preferred Qualifications
- Exposure to Azure DevOps pipelines is desirable (CI/CD)
- Exposure to test frameworks is desirable (NUnit, Jasmine, Selenium)
Benefits
- Join an ever-growing, market-disrupting, global company where the teams – comprised of the best of the best – work in a fast-paced, collaborative, and creative environment!
- As the market leader, every day at NiCE is a chance to learn and grow, and there are endless internal career opportunities across multiple roles, disciplines, domains, and locations
- Enjoy NiCE-FLEX!
- At NiCE, we work according to the NiCE-FLEX hybrid model, which enables maximum flexibility: 2 days working from the office and 3 days of remote work, each week