Site Reliability Specialist (Engineer)


RAPID RTC is seeking an experienced Site Reliability Specialist (DevOps Engineer or Experienced Systems Administrator or Site Reliability Engineer). As part of the Site Reliability Specialist team you will be responsible with availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning.  You will be working a highly skilled team that is constantly seeking to improve existing infrastructure and process.

The position involves…

  • Getting to work closely with Software Developers and Fellow Site Reliability Specialists
  • Making sure production systems stay available with low latency and high performance
  • Routing/switching technology, Cisco firewalls, load balancers, EMC storage systems and Software Defined Networking
  • Working with High Traffic websites
  • Monitoring Production systems with next generation Monitoring; NewRelic and Prometheus
  • Acquiring and demonstrating knowledge of RAPID RTC products from both the client and the development perspective
  • Providing insight from a development and systems perspective
  • Identifying opportunities to eliminate toil
  • Availability for on-call (after hours) rotation, some weekend work for maintenance and understands the obligations of participating on a team that provides multi time-zone support
  • Using metrics to make decisions and validate assumptions
  • Being familiar with security best practices
  • Writing runbooks for production changes
  • Writing and following checklists
  • Capacity planning for future growth

Our ideal candidate…

  • Possesses a University Degree or College Diploma in an applicable field
  • Possesses 3+ year’s experience as a Systems Analyst  or DevOps Engineer or Site Reliability Engineer
  • Has strong communication skills (verbal and written)
  • Is familiar with DevOps and SRE principals
  • Experience with:
    • High Traffic websites
    • Making/using Grafana dashboards
    • Cisco command line configuration
    • Virtualization (VMWare, Hyper V, Nutanix or KVM)
    • Windows and Linux administration
    • Scripting with Powershell, Bash, Puppet
    • Writing runbooks for production changes
    • Automation tooling like Puppet, Ansible, Chef, Salt
    • Windows clustering and Active Directory
    • Webserver configuration IIS, Apache, Nginix
    • Using Docker or Kubernetes in production
    • Monitoring and maintaining cache servers like Redis or Memcached
    • Managing Enterprise Queuing platforms in a Service Oriented Architecture (Kafka, Rabbitmq, ServiceBus)
    • Administrating and monitoring production database servers ( MSSQL, MYSQL, Postgres)
  • Location: Winnipeg, Manitoba

RAPID RTC offers a competitive compensation package including benefits, and a fun yet challenging work environment. We promote continuous improvement in our staff, processes, technological skills, and foster career growth throughout.

If you are ready for the challenge, please apply below or forward your resume to

Limit of 10 MB. Please submit .doc, .docx, .pdf or .txt files only.

Limit of 10 MB. Please submit .doc, .docx, .pdf or .txt files only.

* indicates required field


Back to Jobs