Job Specification: Cloud Infrastructure Manager
Who we are
AMCS, a sustainability software specialist, is headquartered in Ireland, with offices in Europe, the USA, and Australasia. With over 1,300 highly skilled employees across 22 countries, we specialise in delivering technology solutions that facilitate a carbon-neutral future.
What we do
Our innovative SaaS solutions increase efficiency and boost sustainability in resource-intensive industries. Over 5,000 customers across 23 countries already benefit from our Performance Sustainability software, which delivers practical solutions for improved profitability and environmental resilience across the globe.
The role
We are seeking an experienced and strategic Cloud Infrastructure Manager to lead and evolve our Site Reliability Engineering, Cloud Operations, and FinOps functions. This role is accountable for ensuring reliable, secure, and cost-efficient infrastructure and operations that support product delivery at scale. You will drive operational strategy, organisational maturity, and cross-functional alignment, owning both the people leadership and technical direction for these key areas.
Key Responsibilities
Strategic Leadership:
Define and execute a unified strategy across SRE, Cloud Operations, Cloud Security and FinOps.
Establish and execute a cloud optimisation roadmap that balances cost efficiency with operational excellence, ensuring reliability and performance remain uncompromised.
Provide leadership, mentoring, and performance management for multi-disciplinary teams.
Collaborate with and support senior engineering leadership to align platform and reliability initiatives with business objectives.
Site Reliability Engineering:
Lead SRE teams responsible for reliability, availability, performance, and operational excellence.
Drive observability strategy and maturity, including metrics, logs, traces, dashboards, and alerting quality.
Champion SRE principles such as SLIs, SLOs, error budgets, and toil reduction.
Cloud Operations:
Oversee cloud infrastructure provisioning, governance, and cost optimisation across Azure, AWS, and GCP.
Promote automation-first operational models, eliminating manual processes wherever possible.
Cloud Security:
Drive secure-by-default cloud architecture, governance, and controls.
Partner with IT security teams to embed policy-as-code and identity-based access models.
Cross-Functional Collaboration:
Work closely with Platform Engineering, Product, Security, Architecture, Engineering Managers, and leadership to align prioritisation and execution.
Act as the primary translator of operational and reliability needs to broader engineering teams.
Operational Excellence:
Improve deployment consistency, stability, and lead time for changes.
Enhance incident response capability across SRE and DevOps domains.
Drive engineering-wide adoption of observability and operational best practices.
Required Skills & Experience
Technical Expertise:
Minimum of 5 years' experience in combined technical and people leadership roles, managing multi-disciplinary engineering teams.
Demonstrated leadership across DevOps, SRE, Platform Engineering, or Cloud Operations.
Expertise with CI/CD, Kubernetes, IaC, cloud architecture, and observability systems.
Deep understanding of reliability engineering, distributed systems, and cloud-native operations.
Security & Governance Knowledge:
Familiarity with cloud security principles, threat modelling, identity & access management, and compliance frameworks.