Senior SRE Application Engineer

Senior

Bangalore, 🇮🇳 India

💰Equity

Site Reliability Engineer

Technology

AWS Azure Bash Cloud Datadog Docker Elastic Search Falcon Golang Google Cloud Grafana Java Kafka Kubernetes MongoDB New Relic Open Source Python Rancher Yarn

🔥 Apply now

Experience Level : Senior Level

Position Overview :

We are looking for a Senior SRE Application Engineer the role will need to work with a global team responsible for a mission critical business function, and will partner with Infrastructure, DevOps and Core practices (like Security, Identity, ProdOps, Cloud platform and Tools) teams to identify and implement automation opportunities to drive down toil, reduce technical debt and improve system reliability

Roles and Responsibilities:

Own the application, APM and work with Developers and Systems engineers to Build, Release, Monitor and run the services reliability exceeding the agreed SLAs..
Write software to automate to create custom dashboards for APM and infra monitoring tools like New Relic,datadog,grafana, etc.
Write automation to reduce toil and eliminate manual tasks that are repeatable.
Define and accelerate implementation of support processes, tools and best practices.
Maintain services once they are live by measuring and monitoring availability, latency and overall system reliability.
Define, Measure and improve Reliability Metrics (SLO/SLI), Observability (Monitoring, Logging-Tracing solutions), Ops process (Incident, Problem Mgmt).
Work with other SREs in the organization. Coach and mentor junior members in the team.
Understand the current process, system setup and propose the improvements needed in the process,so that the application exceeds the desired Service Level Objective.
Strong believer of automation to bring in sustained continuous improvement by automating Toil, monitoring,risk and SLOs.

Must Have Skills :

The successful candidate will have the following attributes/qualifications:

8 years of experience in Development and Operations of applications/services in production that has uptime over 99.9%.
3+ years of experience as a SRE in handling all the Non-functional requirements of the application and taking care of all the ilities (availability, reliability, security, performance, scalability, etc).
Strong hands-on coding experience in one or more programming languages such as Python, Golang, Java, Bash, etc.
Good understanding of Observability (monitoring, logging, tracing, metrics)
Proficiency in using Observability tools (example: New Relic, Datadog, etc) for monitoring, logging, tracing.
Hands on knowledge in public cloud platform AWS/GoogleCloud/Azure/Private cloud. Professional level certificate on one of the public clouds is highly desirable.
Hands on experience in deploying and managing distributed systems, micro services
Should have used altering systems such as Pager Duty.
Should have implemented solutions around Service Level Indicators (SLIs) and Service Level Objectives (SLOs) for services. Measurement should have been within a system and across systems in distributed systems.
Should have supported Production Incidents (PIs) on critical applications of a company. Troubleshoot, debug, and diagnose operational issues and drive them to closure.
Should have participated in the BCP execution exercises
Understanding of software delivery life cycles, particularly Agile/Lean & DevOps.
Experience as a service owner in managing large – geographically diverse stakeholders .
Ability to work with creative – fast growing engineering teams and motivate them to deliver their best work.
History of driving innovation.

Good to Have Skills :

Familiarity with handling:
Containerization – Kubernetes, Docker, Rancher, etc
Kafka, Yarn, ElasticSearch etc.
Source code management and Implementation of Security best practices.
Tech Stack - Python, Falcon, Elastic Search, MongoDB, AWS/GCP/Azure, Map Reduce.
Networking knowledge
Understanding of software delivery life cycles, particularly Agile/Lean & DevOps
Contribution to open source community

Qualification :

Master’s or Bachelor’s degree in Computer Science Engineering, or a related technical degree.

Website: [https://www.nomiso.io/>

Location:

Bangalore

About NomiSo:

NomiSo is a Product Engineering company currently focussed on Video Stream Engineering, backed by AI and ML. We are a team of Software Engineers, Architects and Cloud Experts with more than 100 years of combined expertise in Technology and Delivery Management.

Our mission is to Empower and Enhance the lives of our customers, through simple solutions for their complex business problems.

At NomiSo we encourage entrepreneurial spirit - to learn, grow and improve. A great workplace, thrives on ideas and opportunities. That is a part of our DNA. We’re in pursuit of colleagues who share similar passions, are nimble and thrive when challenged. We offer a positive, stimulating and fun environment – with opportunities to grow, a fast-paced approach to innovation, and a place where your views are valued and encouraged.

We invite you to push your boundaries and join us in fulfilling your career aspirations!

We are an equal opportunity employer and are committed to diversity, equity, and inclusion. We do not discriminate on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, disability status, or any other protected characteristics.

🔥 Apply now

Nomiso

Product Engineering company currently focussed on Video Stream Engineering, backed by AI and ML

Artificial Intelligence

Engineering

Technology

🌍 nomiso.io All open jobs

🌍 linkedin.com

Other jobs at Nomiso

🇲🇽💰

Quality Engineer

🇲🇽💰

Senior Software Engineer

🇲🇽💰

Fullstack iOS Developer

🇲🇽💰

Fullstack Android Developer

🇲🇽💰

Senior SRE Application Engineer

View all Nomiso jobs

Why OmniJobs?

Rare & hidden jobs
New jobs every day
No expired job posts
All jobs in English

Receive emails about similar jobs

Get alerts to your inbox about new open jobs that are similar to this one.

🇮🇳 India

Site Reliability Engineer

No spam. No ads. Unsubscribe anytime.

Similar jobs

🇮🇳Added 3 days ago

Senior Engineer

Western Digital is a company that provides data-centric solutions, including storage devices and platforms for business and consumers.

NAND flash storageSSD reliability testingSDUSBSATAPCIeAgileMicrosoft Office

Remote🇮🇳Added 3 days ago

Site Reliability Engineer

WEX Brazil Technology Services - WEX is a financial technology company.(Computer Software)

JavascriptNodePythonPowerShellBashPuppetChefAnsibleDockerKubernetes + 12

🇮🇳Added 19h ago

Java Reliability Engineer

Hewlett Packard Enterprise is a global edge-to-cloud company that helps companies connect, protect, analyze, and act on their data.

JavaJ2EESpringApacheTomcatSQL ServerOracleShell ScriptingEnterpriseCloud + 1

🇮🇳Added 4 days ago

Senior Site Reliability Engineer

Roku is the #1 TV streaming platform in the US, and we've set our sights on powering every television in the world.

KubernetesIstioEnvoyPrometheusGrafanaELKJaegerKialiLokiOpenTelemetry + 10

Remote🇮🇳Added 4 days ago

Site Reliability Engineer

A global AI company with offices in San Francisco, Menlo Park, New York, London, and Bangalore, Instabase democratizes access to cutting-edge AI innovation to enable organizations to solve unstructured data problems in their industry (Computer Software)

GoPythonJavaC++DockerKubernetesAnsibleTerraform

🇮🇳🇺🇸Added 3 days ago

Reliability Engineering Co-op

Ingredion Incorporated (NA-US) - Ingredion is an equal opportunity employer seeking to provide a work environment that is free from harassment and discrimination.

Senior SRE Application Engineer

Nomiso

LinkedIn

Other jobs at Nomiso

Why OmniJobs?

Receive emails about similar jobs

Similar jobs