Senior Site Reliability Engineer

Daxko Logo

Daxko

📍Remote - United States

Summary

Join Daxko as a Senior Site Reliability Engineer-Windows and become an instrumental part of our TechOps team. You will design, develop, and implement software integrations, troubleshoot production issues, and collaborate with development teams to improve applications. This role requires a deep love for automation, building scalable systems, and embracing new technologies. You will be responsible for system monitoring, disaster recovery planning, and conducting systems tests. Success in this position requires a Bachelor's degree in Computer Science or equivalent experience, along with several years of experience in DevOps/SRE and Windows management. Daxko offers a variety of benefits, including flexible paid time off, affordable health insurance, and remote work options.

Requirements

  • Bachelor Degree in Computer Science, Information Technology, or equivalent experience
  • Three or more (3+) years’ experience in a DevOPS/SRE Role
  • Five or more (5+) Years’ experience in Windows management background
  • Experience in architecting solutions and a deep understanding of application-status monitoring
  • Apply cloud (AWS, VMWare) computing skills to deploy upgrades and fixes
  • Implement automation tools and frameworks (CI/CD pipelines)
  • Strong analytical and problem-solving skills
  • Excellent time management skills with a proven ability to meet deadlines
  • Ability to prioritize tasks and to delegate them when appropriate
  • Ability to function well in a high-paced and at times stressful environment
  • Extensive experience with automation tools such as Terraform, Chef, or Ansible
  • Strong command of software-automation production systems (Jenkins and Selenium)
  • Expertise in software development methodologies
  • Experience working knowledge DevOps tools like Git and GitLab

Responsibilities

  • Be a part of the On-Call Rotation
  • Design, develop, and implement software integrations based on user feedback
  • Troubleshoot production issues and coordinate with the development team to streamline code deployment
  • Analyze code and communicate detailed reviews to development teams to ensure a marked improvement in applications and the timely completion of projects
  • Collaborate with team members to improve the company’s engineering tools, systems and procedures, and data security
  • Working with core components such as load balancers, firewalls, etc
  • Executing our disaster recovery plan; ensuring it is up-to-date and thoroughly tested
  • Monitoring system activity 24x7 as part of an on-call rotation
  • Conduct systems tests for security, business continuity, performance, and availability
  • Develop and maintain design and troubleshooting documentation
  • You can maintain .NET applications

Preferred Qualifications

  • Proficient in AWS and/or VMWare Technologies
  • Understanding of internet technologies (DNS, SNMP, HTTP, TCP/IP, CDNs)
  • Experience with Monitoring Technologies (Logicmonitor, Instana, NewRelic, Rapid7, CloudPassage, etc.)
  • Experience working tickets and managing priorities within issue tracking systems (Jira, etc.)
  • Experience in multiple scripting languages such as Perl/Python/Java/Bash
  • Experience with Containers and Orchestration (Docker, K8s, Rancher, AKS, EKS)
  • Working knowledge of Microsoft SQL, MySQL, and/or Postgres

Benefits

  • Flexible paid time off
  • Affordable health, dental, and vision insurance options
  • Monthly fitness reimbursement
  • 401(k) matching
  • New-Parent Paid Leave
  • Casual work environments
  • Remote work

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.