We are looking for a dedicated DevOps Engineer to join our Analytics team and help manage and maintain our data platform. Your primary focus will be our in-memory database (IMF), ClickHouse, and the associated services. Our entire system operates on Google Cloud Platform (GCP) and Kubernetes, and it integrates with Kafka, MongoDB, and other services. Your responsibilities will include ensuring the smooth operation of our databases and services, maintaining reliable production monitoring, and developing quality tools and automation for handling new releases, maintenance, and incident management.
The team works remotely in the Central European time zone. We are more than happy to meet you in Brno (Czechia) or in Bratislava (Slovakia), where our headquarters is located.
If we piqued your interest, you can chat with Milan, the team's Engineering Manager, or go ahead and apply.
Responsibilities
- System Administration: Manage and configure our database systems on GCP within Kubernetes for high availability, reliability, and performance.
- Incident Management: Handle incident responses, perform root cause analysis for critical issues, and participate in a 24/7 on-call rotation.
- Automation and Tools Development: Create and maintain scripts and tools to automate operations and reduce manual tasks.
- Scaling and Resource Planning: Monitor system performance, plan for future scaling, and ensure sufficient resources are available during peak times.
- Monitoring and Logging: Set up and maintain monitoring and logging systems to detect and address issues early.
- Backup and Recovery: Develop and manage strategies for data backup and disaster recovery to ensure business continuity.
- Collaboration: Work closely with development and operations teams to align operations with overall business goals.
Qualifications
- Experience: Proven experience in DevOps or site reliability engineering, preferably with databases on GCP and Kubernetes. Knowledge of CI/CD pipelines and DevOps principles.
- Skills: Expertise in automation and scripting (e.g., Python, Go, Shell), performance tuning, and managing incidents.
- Tools: Familiarity with monitoring, logging, and automation tools.
- Problem-solving: Strong analytical and problem-solving abilities.
- Communication: Excellent communication and collaboration skills for working with remote teams.
- Adaptability: Ability to work independently and handle multiple tasks in a fast-paced environment.
Our stack
- GitLab
- Prometheus, Grafana, InfluxDB, Chronograf
- IMF (our in-memory database written in C++), ClickHouse, Apache Kafka, MongoDB, and more…
- Kubernetes (GKE)
- Google Cloud Platform
- Python, Go
Compensation
- Salary starts at 3800 EUR gross/month, depending on your seniority, and can grow significantly over time based on your performance.
- There's a bonus based on company performance and your salary.
- You will be entitled to restricted stock units that will truly make you a part of Bloomreach.
- You can spend 1500 USD per year on the education of your choice (books, conferences, courses, ...).
- You can count on free access to Udemy courses.
- We have 4 company-wide disconnect days throughout the year during which you will be encouraged not to work and spend a day with your friends and family "disconnected".
- You will have an extra 5 days of paid vacation. Extra days off for extra work-life balance.
- Food allowance!
- Sweet referral bonus of up to 3000 USD, depending on the position.
Your success story
- During the first 30 days, you will get to know the team, the company, and the most important processes. You'll work on your first tasks. We will help you to get familiar with our infrastructure, release process, tools, and product.
- During the first 90 days, you will have an active role in daily operations, including monitoring and incident management. You will begin working on small automation projects to streamline routine tasks and improve operational efficiency. You will contribute to the development and maintenance of internal tools for monitoring, logging, and automation. You'll join the 24/7 on-call rotation, with support from experienced team members.
- During the first 180 days, you'll take ownership of specific operational tasks and projects, working independently and confidently. You'll contribute to scaling and resource planning initiatives, ensuring the system can handle future growth and peak seasons. Finally, you will get a sense of where the team is heading, and you'll help us shape our future.
We are looking for a dedicated and ambitious team player who is not only passionate about DevOps but also up-to-date with the latest trends. This role is ideal for someone who is looking for a challenging yet rewarding role in a fast-growing environment. We are fueling a limitless e-commerce experience. Join us!
Bloomreach
World's #1 Commerce Experience Cloud, empowering brands to deliver customer journeys so personalized, they feel like magic.