Your search has found 2 jobs

We’re looking for an experienced engineer to take ownership of CI/CD pipelines and improve developer experience in a fast scaling environment. This role is ideal for someone who thrives on solving complex problems and wants to make a measurable impact on build speed, reliability, and overall development workflows.

 

You’ll work on a large E-Trading Platform supporting multiple languages including TypeScript, Scala, Python, and Golang. Using Bazel, you’ll help optimize builds and deployments while collaborating closely with development teams to streamline processes and tighten feedback loops.

 

What you’ll do:

  • Manage and enhance CI/CD pipelines for efficiency and reliability
  • Work with GitHub Actions, Docker, Kubernetes, and Terraform
  • Leverage cloud platforms such as AWS, Azure, or GCP
  • Utilise Bazel for builds, testing, and deployments
  • Continuously improve developer experience and build performance

 

What we’re looking for:

  • 6+ years of engineering experience with a strong DevOps background
  • Expertise in CI/CD practices and cloud infrastructure
  • Familiarity with containerization and orchestration tools
  • Passion for improving build processes and developer efficiency
  • Strong problem-solving skills and ability to work in a fast-paced environment

 

This is an opportunity to work on challenging technical problems in a collaborative, forward-thinking team. If you’re ready to take on a role where your contributions directly impact productivity and innovation, we’d love to hear from you.

Opportunity Type: Permanent
Job published: 19-11-2025
Job ID: 118641

We're hiring several Senior Site Reliability Engineers to help shape a Centre of Excellence for SRE practices across a global tech estate. This is a high-impact, hands on role where you'll engineer automation frameworks, elevate observability, and transform incident response at scale.

You’ll be the go to expert guiding strategy, influencing culture, and driving adoption of SRE principles across diverse teams. From scripting to architecting resilient systems, your technical leadership will directly improve performance, scalability, and availability.

 

What you’ll do:

System Reliability & Performance: Ensure high availability, optimal performance, and scalability of services through proactive monitoring, maintenance, and capacity planning.

Incident Response & Prevention: Lead resolution and analysis of system outages. Implement preventative measures to reduce recurrence and improve system resilience.

Automation & Tooling: Develop scripts in Python or Go and tools to automate operational processes, reduce manual effort, and enhance efficiency.

Performance Optimization: Monitor system metrics, identify bottlenecks, and apply best practices for performance tuning and resource utilization.

Cross-Team Collaboration: Partner with development and infrastructure teams to embed reliability and scalability into the software development lifecycle.

Opportunity Type: Permanent
Job published: 19-09-2025
Job ID: 122699