Staff Site Reability Developer

ServiceNow

Hybrid

Mid-level

🇨🇦 Canada

Site Reliability Engineer

Software development

Ansible Configuration management Docker GoLang Grafana Infrastructure as code J2EE Java Kubernetes Linux Prometheus Puppet Python Splunk Terraform

🔥 Apply now

This role is based in Montreal and the work persona for this role is "Flexible" (1, 2 or 3 days per week the employee will work from the office).

What you get to do in this role:

The Advanced Technology Group (ATG) at ServiceNow is a customer-focused innovation group building intelligent software and smart user experiences using existing and latest advanced technologies to enable end-to-end, industry-leading work experiences for customers. We are a group of researchers, applied scientists, engineers, and product managers with a dual mission. We build and evolve the AI platform, and partner with teams to build products and end-to-end AI-powered work experiences. In equal measure, we lay the foundations, research, experiment, and de-risk AI technologies that unlock new work experiences in the future.

Contribute to the design, development and implementation of infrastructure, platform, deployment and observability features that power AI workloads;
Contribute to the continuous improvement of the SRE practice by turning operational use cases into requirements for software tooling;
Contribute to the execution of deployment and support activities for AI/ML developers;
Build high-quality, clean, scalable and reusable code by enforcing best practices around software engineering architecture and processes (Code Reviews, Unit testing, etc.);
Work with the product owners to understand detailed requirements and own your code from design, implementation, test automation and delivery of high-quality product to our users;
Implement software that is simple to use to allow customers to extend and customize the functionality to meet their specific needs;
Be a mentor for colleagues and help promote knowledge-sharing.

Rôle

Contribuer à la conception, au développement et à la mise en œuvre des fonctionnalités d’infrastructure, de plateforme, de déploiement et d’observabilité qui alimentent les charges de travail en intelligence artificielle;
Contribuer à l’amélioration continue de la pratique de l’ingénierie des fiabilités des services (SRE), en transformant les cas d’utilisation opérationnels en exigences pour les outils logiciels;
Contribuer à l’exécution des activités de déploiement et de support les développeurs en intelligence artificielle;
Produire un code de haute qualité, propre, évolutif et réutilisable en faisant respecter les meilleures pratiques en matière d’architecture logicielle et de process;
Travailler avec les responsables de produits pour comprendre les exigences détaillées et prendre en charge votre code, de la conception à la mise en œuvre, en passant par la l’automatisation des tests de livraison d’un produit de haute qualité à nos utilisateurs;
Implémenter un logiciel simple à utiliser pour permettre aux clients d’étendre et de personnaliser fonctionnalité afin de répondre à leurs besoins spécifiques;
Être un mentor pour ses collègues et contribuer à promouvoir le partage des connaissances;

Requirements

To be successful in this role you have:

4+ years of experience with infrastructure and platform operations, deployments, SRE and DevOps;
2+ years of configuration management / DevOps tooling (Ansible/Terraform/Puppet/Prometheus/Grafana/Splunk);
2+ years of development experience with Python, GoLang, Java or similar languages;
Concrete work experience with containerized workloads on Docker & Kubernetes;
Strong working experience operating distributed systems built on Linux and J2EE;
Experience with software-defined networking, infrastructure as code and configuration management;
Ability to manage projects with material technical risk at a team level;
Analytical and design skills;

Pour réussir dans ce rôle, vous devez avoir :

Plus de 4 ans d’expérience dans les opérations d’infrastructure et de plateforme, le déploiement, SRE et le développement DevOps
Plus de 2 ans d’expérience dans la gestion de configuration et l’utilisation d’outils DevOps (Ansible/Terraform/Puppet/Prometheus/Grafana/Splunk)
Plus de 2 ans d’expérience en développement logiciel avec les langages tel que Python, GoLang, Java ou des langages similaires;
Une expérience de travail avec des charges de travail conteneurisées sur Docker et Kubernetes;
Une expérience pertinente en opérations de systèmes J2EE distribués sous Linux;
La capacité de gérer des projets présentant des risques techniques importants;
Des compétences analytiques et de conception.

🔥 Apply now

ServiceNow

At ServiceNow, our technology makes the world work for everyone, and our people make it possible

Artificial Intelligence

Software

servicenow.com

🏭software development

Other jobs at ServiceNow

🇩🇪

Counsel, Global Employment Law - Strategic Projects & M&A

🇪🇸

Counsel, Global Employment Law - Strategic Projects & M&A

🇮🇪

Director, GTM Strategy & Programs, Global Partnerships & Channels

🇳🇱

Counsel, Global Employment Law - Strategic Projects & M&A

🇸🇪

Manager, Solution Consulting

View all ServiceNow jobs

Notifications about similar jobs

Get notifications to your inbox about new jobs that are similar to this one.

🇨🇦 Canada

Site Reliability Engineer

No spam. No ads. Unsubscribe anytime.

Similar jobs

🇨🇦Added 2 days ago

Mechanical Reliability Engineer

Michelin North America - Michelin is a leading mobility company that designs and distributes tires, services, and solutions for various industries

🇨🇦Added 2 days ago

Lead Site Reliability Engineer

USA Thomson Reuters (Tax & Accounting) Inc - Thomson Reuters is a global business that relies on diverse culture and thought to deliver on its goals

cloud providerssoftware developmentData DogNew RelicAWSAzureGCPCI/CDconfiguration managementinfrastructure as code + 10

Remote🇨🇦Added 6 days ago

Site Reliability Engineer

Improving healthcare through innovative technology is at the core of Intelerad’s work

LinuxCentOSPostgres DatabaseAWS cloudPythonGoJavaC/C++VMware EnterpriseWindows Server + 5

Remote🇨🇦Added 4 days ago

Site Reliability Engineer

Intelerad - Become part of our growing community of bright, motivated people who are dedicated and inspired by what they do best(health and human services)

Linux CentOSPostgres DatabaseAWS cloudPythonGoJavaC/C++VMware EnterpriseWindows ServerDatabases + 8

🇺🇸🇨🇦💰Added 6 days ago

Site Reliability Engineering

NVIDIA is a technology company that specializes in AI computing, with a focus on Deep Learning GPUs

Cloudon premDNS architecturecloud service providersISPsnetwork automation

Staff Site Reability Developer

Requirements

ServiceNow

LinkedIn

Other jobs at ServiceNow

Notifications about similar jobs

Similar jobs