DevOps/Site Reliability Engineer

Token Metrics

πŸ“Remote - Portugal

Summary

Join Token Metrics as a results-oriented IT administrator to manage the company's IT infrastructure. You will act as cloud system administrator (AWS and Google Cloud), monitor and maintain networks and servers, create and automate alerting and monitoring of system logs, and build tools to mitigate weaknesses in incident management or software delivery. You will also troubleshoot IT issues, upgrade and install hardware and software, implement security protocols, create user accounts, perform diagnostic tests, document processes, develop data retrieval procedures, design efficient end-user feedback systems, supervise IT employees, and stay current with IT advancements.

The ideal candidate has extensive experience in administration, including system administration for cloud infrastructure, process automation, and site reliability. The role requires a Bachelor's degree in a relevant field, applicable professional qualifications, and at least two years of experience.

Requirements

  • Bachelor's degree in Computer Science, Information Technology, Information Systems, or similar
  • Applicable professional qualification, such as a Microsoft, Oracle, or Cisco certification
  • At least two years' experience in a similar role
  • Extensive experience with IT systems, networks, and related technologies
  • Solid knowledge of best practices in IT administration and system security
  • Exceptional leadership, organizational, and time management skills
  • Strong analytical and problem-solving skills
  • Excellent interpersonal and communication skills

Responsibilities

  • Acting as cloud system administrator for AWS and Google Cloud, with knowledge of multi-cloud infrastructure
  • Monitoring and maintaining networks and servers
  • Creating and automating alerting and monitoring of system logs
  • Building tools to mitigate weaknesses in incident management or software delivery
  • Troubleshooting support escalation requests
  • Upgrading, installing, and configuring new hardware and software to meet company objectives
  • Implementing security protocols and procedures to prevent potential threats
  • Creating user accounts and performing access control
  • Performing diagnostic tests and debugging procedures to optimize computer systems
  • Documenting processes, as well as backing up and archiving data
  • Developing data retrieval and recovery procedures
  • Designing and implementing efficient end-user feedback and error reporting systems
  • Supervising and mentoring IT department employees, as well as providing IT support
  • Keeping up to date with advancements and best practices in IT administration
