Site Reliability Engineer

Hybrid
Mid-level
🇬🇧 United Kingdom
Technology

What you'll do

You will work in the reliability platform team within the Connectivity Platforms and Software team, providing a stable cloud environment into which several cross-functional teams deliver services. The reliability team is also responsible for increasing observability and monitoring around existing services. The successful candidate will work on key Edge and shore-side solutions and will need a good understanding of how to improve the observability and alarms around them. The role demands a cross-functional engineer who is comfortable deploying reliable systems, investigating the root cause of outages, and improving the reliability of existing services.

The day-to-day

The position involves working with the scrum teams to identify and close observability gaps in existing systems, understanding their platform needs, and delivering cloud-based solutions that enable them to deliver faster. The role also requires being on call to respond to and investigate system outages out of hours.

You will be responsible for leading epics, driving low-level design, and delivering user stories as high-quality, deployable solutions that meet the project's definition of done. You will be able to prove concepts and take them through to production-level solutions. You will work in an agile scrum team, delivering software through our CI/CD pipeline whilst also helping to extend its capabilities.

Key Responsibilities:

  • Working as a key member of a platform team with a mandate to enable and support the delivery of services into production
  • Deploying solutions using Infrastructure as Code and maintaining the platform that our software services are built upon
  • Establishing good reliability practices into new and existing software systems, including runbooks and reliability metrics
  • Automating repetitive tasks to reduce team toil
  • Investigating outages across multiple system components to meet our SLOs (Service Level Objectives) and providing long-term fixes to increase reliability
  • Proving concepts using time-boxed technical spikes
  • Solving complex problems using cutting-edge technologies
  • Taking ownership of end-to-end deliverables across the full software development lifecycle
  • Contributing knowledge of best practices through guilds and lunch-and-learns
  • Developing automated tests and tools for CI/CD

Requirements

The successful candidate will understand, interpret, and adopt new technical information rapidly. They must have a demonstrable interest in new technologies and product innovation, a practical understanding of the technology development lifecycle, and the ability to participate at the appropriate point in a matrix development process. A background in IT, internet, or telecoms is desirable.

  • Exposure to distributed systems, container technologies, high availability, and cloud environments (particularly AWS)
  • Excellent understanding of the Linux operating system
  • Programming experience in an OO language (e.g. Python or C++)
  • Good knowledge of observability and techniques for building and deploying reliable software
  • Ability to define, capture and display SLIs / SLOs on Grafana dashboards
  • Good understanding of networking and SDN (Software Defined Networking)
  • Good knowledge of databases (Postgres and MongoDB)
  • Comfortable working in an agile development environment
  • Excellent interpersonal skills
  • Strong problem solver with ability to communicate ideas clearly
  • SD-WAN knowledge desirable

Inmarsat

Inmarsat is a global communications company that believes everyone and everything in the world can be connected.

Telecommunications
Aerospace
