Site Reliability Engineer - ref. e10066623
About Us
Over the last 10 years, Commify has grown and transformed under Private Equity ownership and now operates across 9 countries including the UK, France, Spain, Italy, Romania, Germany, Netherlands, Australia, and the USA.
Our mission is clear: to make business communication brilliant by leveraging our best-in-class products that span SMS, WhatsApp, Email, VoIP, and other communication technologies. We serve over 50,000 businesses globally and send over 5 billion communications annually.
We have ambitious growth plans to expand our already impressive product portfolio. As a values-led business, we recognize that our ‘People’ are our biggest asset. Do you want to be part of our success story?
Role Summary
As a Site Reliability Engineer at Commify, you will be a crucial member of our Operations team. You will ensure that our products and platforms operate at peak performance. You will gain a deep understanding of how our software interacts with both physical and Cloud infrastructure to deliver exceptional messaging solutions to our customers.
Your responsibilities will include:
- Ensuring high levels of system performance through monitoring, analysis, and performance tuning
- Implementing scalability and fault tolerance
- Improving processes through automation or other efficiencies
- Troubleshooting application and middleware issues
- Collaborating with Engineering teams to ensure smooth functioning of their software in high-throughput production environments
- Building deployment pipelines that facilitate high-quality code deployments
Requirements
What essentials are we looking for?
- Experience with Microsoft Azure
- Strong working knowledge of Terraform, App Services, and Kubernetes
- Fluent in written and spoken English
- Passion for reliability
- Experience in creating and modifying Terraform deployments
- Previous experience in an Operations role, preferably as a Site Reliability Engineer
- Ability to work collaboratively across multiple teams, take ownership of tasks, prioritize effectively, and be accountable for your work
- Excellent communication skills
- Experience with monitoring solutions (Datadog, Azure Application Insights, or Log Analytics)
- Scripting/programming skills to automate solutions, particularly in PowerShell (preferred), Bash, C#, Ruby, or Python
- Experience supporting web-based applications
It’s desirable for you to have:
- Familiarity with Azure DevOps pipelines
- Experience with Microsoft Server Operating Systems
- Experience defining service level objectives/operational requirements for cloud-based solutions
- Understanding of Microsoft Azure Cloud offerings, especially in the Platform as a Service category (Web Apps, Storage, Functions)
- Working knowledge of tools such as Terraform, Ansible, VSTS, ARM, Puppet, Chef, Jenkins, ELK, and Grafana
- Understanding of DNS, Load Balancer configuration, Active Directory, and Cloud-based network infrastructure
- Experience working in an agile environment and familiarity with methodologies such as TDD, Scrum, or Kanban
- Knowledge of implementing monitoring and alerting systems for micro-service architectures
- Applied understanding of cloud security best practices
Benefits
- Competitive Salary ($120,000-$130,000)
- Company Bonus Scheme based on company performance
- Private Healthcare & Dental
- 20 days paid time off plus federal holidays
- Birthday off work
- 401(k)
- Monthly/quarterly socials
- Training & Development opportunities