Job Description
Felix Recruitment is partnering with an Irish leading company in the Unified Communication Services (UCaaS) industry, offering cloud-based phone systems-as-a-service and online team collaboration software to small and medium businesses. We are looking to recruit an experienced Site Reliability Engineer to join their existing team. This is a permanent opportunity. Our client is based in Maynooth. Full-time remote opportunity from anywhere in the EU, suggested hybrid working model if you're based in Ireland.
The successful candidate will be responsible for the following:
- Design, deployment, monitoring and maintenance of infrastructure components based on Linux systems (collocation and AWS Cloud).
- Monitoring and response to incidents as part of the Incident Response and Business Continuity Team.
- Support the development team by managing the CI/CD processes.
- L3 Tech support to clients.
Additionally, senior professionals will be expected to contribute in the following areas:
- Responsible for extending the deployment automation of infrastructure.
- Setting up and maintaining automated monitoring of infrastructure.
Experience includes:
- Minimum 5 years’ experience in a similar role.
- Experience in working with high-availability, distributed systems and services in a hosting environment including hardware, OS Linux (Centos/ Ubuntu).
- Knowledge in DNS, SSH, HTTP, NTP, TLS, TCP/IP and other common network protocols.
- Knowledge in AWS services: EC2, Code Deploy, S3, Route53, CloudFront, IAM.
- Knowledge of at least one of the following server-side scripting languages: Bash/Shell, Go, Python.
The following experience will be considered a bonus:
- Experience with installation and usage of monitoring tools (Elastic stack, Prometheus/Grafana).
- Strong knowledge of DB administration: MySQL/MariaDB.
- Experience with Message Queuing (RabbitMQ, NATS or similar).
- Experience with virtualization tools like VMWare and Docker.
- SIP, WebRTC Protocols, Asterisk and Kamailio SIP servers.