Our client is a market leader powered by Technology and has engaged IT Search to speak to Site Reliability Engineers with infrastructure experience to join their Dublin based team on a permanent basis. The role is to design and deliver Observability and Monitoring processes to ensure ongoing performance, reliability and scalability of global platforms.
*This is a hybrid role with weekly onsite visits required in Dublin 2.*
Role:
Quickly get up to speed on current Observability processes and audit to focus on areas of immediate improvement
Work with global team to design updates and implement same
Utilise experience on Dashboards, Reporting and Performance Analysis to adapt to automation processes, taking responsibility for developing various metrics
Requirements:
8+ years of commercial Reliability experience
Log Retention processes knowledge
Strength in Prometheus specifically with hands on knowledge of Go/Python Scripting
In depth experience in Linux based OS, kernels etc
Config ideally will include Terraform/Ansible
Opinions on scaling, distributing systems
To learn more about the role, client and process, please forward your CV stating required salary and availability. Please note candidates will require full working rights in Ireland.