Site Reliability Engineer (SRE)

Dublin, Ireland & Podgorica, Montenegro
Full-Time

The Opportunity:
We are looking to add an enthusiastic, passionate Site Reliability Engineer to our engineering team. If you have experience with SaaS software companies, let’s chat.

This is an amazing opportunity to join a fast-growing restaurant software company, with some of the largest restaurant chains in the U.S. as customers. Backed by top venture capital firms, SynergySuite has expanded over the last 3 years across Europe and the U.S. For the right candidate, this is an opportunity to join a rapidly expanding technology team with strong potential to grow your career. We are currently building out our teams in Lehi, Utah; Podgorica, Montenegro; and Dublin, Ireland.

What you’ll be doing:

  • Maintain a 24×7 production environment with a high level of service availability. Perform quality reviews, manage operational issues
  • Interface with dev and QA to identify root cause analysis and re-instrument triggers to prevent future network degradation and outages
  • Explore and innovate new cloud and HA technologies, features, and tools
  • Partner with development teams in defining and implementing improvements in service architecture
  • Implement automation and orchestration for manual processes required to operate and deploy cloud services, be at the heart of developing new ideas into internal OPS/SRE tools
  • Implement automated tests, automated deployments, and operational tools
  • Collaborate with product and support teams to plan and deploy product releases
  • Ensure services are designed with 24/7 availability and operational readiness and rigor
  • Implementation of proactive monitoring, alerting, trend analysis and self-healing systems
  • Participate in on-call rotations, driving restoration and repair of service-impacting issues
  • Define non-functional requirements as part of the product lifecycle to influence the new designs, standards, and methods for scalable, highly available distributed systems
  • Practice sustainable incident response as well as participate in peer reviews and postmortems

Qualifications:

  • Extensive experience with AWS public cloud technologies is a must
  • Experience maintaining a high-availability cloud infrastructure
  • Experience with Containerization: Docker, Kubernetes, and PaaS services on AWS
  • Track record of developing and implementing innovation strategies, processes and best practices
  • Experience with root cause analysis of critical business and production issues
  • Clear understanding of Agile development methodologies, DevOps best practices, and the product development lifecycle
  • Experience with infrastructure automation and monitoring tools (e.g., CloudFormation, SaltStack, Kubernetes, Terraform, Ansible, Containers, Docker, Puppet, Chef)
  • Familiarity with continuous integration and deployment processes and tools (e.g., VSTS, Jenkins, Maven, Nexus)
Close Menu