- Strong engineering background (at least 6 years), that lean towards SRE roles (at least 3 years)
- Good communication, capable of engaging in deep technical conversations with other engineers and customers, and collaborating across organizational boundaries
- Experience with Kubernetes on any of AWS, GCP, or Azure, and working with Helm charts
- Experience with Site Reliability Engineering, System Design, and Distributed Computing
- Experience with one or more programming languages (e.g. Go, Python, JavaScript, etc)
- Experience with Linux operating systems internals, and some knowledge of networking
- Experience with calmly and actively participating in blame-free Incident Response, following up on actions, and writing high quality PIRs (Post Incident Reviews, a.k.a. post-mortem documents)
- Comfortable working within an engineering team where individuals are encouraged to have a strong sense of autonomy and self-direction
- We highly value those who are kind, intellectually curious, who default to transparency, possess a high bias towards action, and who are also kind (this is important)
- Regular 1:1s to with your manager and colleagues
- Reviewing and creating SLOs, proactively investigating ways in which we can further reduce budget burn for those SLOs, which can be self-directed or as the result of learnings from incidents, and may include improvements to monitoring, automation, increasing self-healing, auto-scaling, etc.
- Improve observability of customers within the High SLA environments
- Configuring systems to increase reliability via Helm and Jsonnet
- Collaborating with our Engineering Leaders to help define and influence product strategy, roadmaps and technical designs
- Participate in PR review and collaborating with other engineers on their Design Docs
- Teach others about Site Reliability Engineering and communicate best practices to be applied early in development of new features and functionality
- Participate in Incident Response when applicable, including investigation through to resolution, PIR, and communication with customers via Bridge calls where necessary
-
Site Reliability Engineer SRE 2
hace 3 semanas
Norconsulting Pozuelo de Alarcón, EspañaPozuelo de Alarcón, MD, Spain Site Reliability Engineer SRE 2 · Job Description: · Norconsulting busa para unos de sus clientes, empresas lider del sector de Seguridad busca Administrador de Sistemas para unirse a su equipo se systemas y desarrollo en sus oficinas en Madrid ...
-
Reliability Engineer
hace 2 semanas
Kenvue Madrid, EspañaReliability Engineer W Description Kenvue is currently recruiting for: · Reliability Engineer · Who We Are · At , we realize the extraordinary power of everyday care. Built on over a century of heritage and rooted in science, we're the house of iconic brands - including NEUTR ...
-
Reliability Engineer
hace 1 día
Mcneil Iberica S.L.U. Madrid, EspañaKenvue is currently recruiting for: Reliability Engineer Who We Are At Kenvue , we realize the extraordinary power of everyday care. · Built on over a century of heritage and rooted in science, we're the house of iconic brands - including NEUTROGENA, AVEENO, TYLENOL, LISTERINE, J ...
-
Site Reliability Engineer
hace 3 semanas
Antal International Network EspañaComo Site Reliability Engineer, tú permites el crecimiento, la mantenibilidad y la escalabilidad de las aplicaciones empresariales de nuestro cliente. Te harás cargo de las aplicaciones que se acercan a la completitud de características. Mejorarás su confiabilidad, resistencia y ...
-
Reliability Engineer
hace 2 semanas
KENVUE MADRID, España De jornada completaSubmission for the position: Reliability Engineer - (Job Number: W) · Kenvue is currently recruiting for: · Reliability Engineer · Who We Are · At Kenvue, we realize the extraordinary power of everyday care. Built on over a century of heritage and rooted in science, we're the ho ...
-
Site Reliability Engineer 1
hace 3 semanas
Norconsulting Pozuelo de Alarcón, EspañaPozuelo de Alarcón, MD, Spain Site Reliability Engineer 1 · Job Description: · Administradores de Sistemas Middleware · Una de las empresas lider del sector de Seguridad busca Administrador de Sistemas Middleware para unirse a su equipo se systemas y desarrollo en sus ofic ...
-
Site Reliability Engineer
hace 4 semanas
BANCO SANTANDER S.A. Madrid, EspañaSite Reliability Engineer - Santander Digital Services · Country: Spain · **WHAT YOU WILL BE DOING** · **Santander Digital Services **está buscando **un/a Site Reliability Engineer **para nuestras oficinas en **ABELIAS (Madrid).** · **POR QUÉ DEBERÍAS CONSIDERAR ESTA OPORTUNIDAD* ...
-
SRE (Site Reliability Engineer)
hace 1 semana
Clicars Madrid, España De jornada completaTu Misión · Buscamos un SRE para incorporarse a un equipo de desarrollo con mucho talento y participar activamente en la evolución de nuestros proyectos a nivel internacional y arquitectura tecnológica. Nuestras tareas tienen tres capas: frontend, backend e las tecnologías más u ...
-
Site Reliability Engineer
hace 1 semana
Antal International Network Madrid, EspañaComo Site Reliability Engineer, tú permites el crecimiento, la mantenibilidad y la escalabilidad de las aplicaciones empresariales de nuestro cliente. Te harás cargo de las aplicaciones que se acercan a la completitud de características. Mejorarás su confiabilidad, resistencia y ...
-
SRE (Site Reliability Engineer)
hace 2 días
Clicars Centro, EspañaTu Misión · Buscamos un SRE para incorporarse a un equipo de desarrollo con mucho talento y participar activamente en la evolución de nuestros proyectos a nivel internacional y arquitectura tecnológica. Nuestras tareas tienen tres capas: frontend, backend, e infraestructura. Las ...
-
Site Reliability Engineer
hace 3 semanas
BestSecret Alcobendas, EspañaAbout BestSecret Group · We are a leading European members-only online destination for premium and luxury off-price fashion. Partnering with over 3,000 international brands, our tech-focused mindset and strong commitment to sustainability drives a truly unique experience for our ...
-
Site Reliability Engineer
hace 3 semanas
SDi Digital Group Alcobendas, España· About BestSecret Group · We are a leading European members-only online destination for premium and luxury off-price fashion. Partnering with over 3,000 international brands, our tech-focused mindset and strong commitment to sustainability drives a truly unique experience for o ...
-
Site Reliability Engineer
hace 3 semanas
Sdi Digital Group Alcobendas, EspañaAbout BestSecret Group We are a leading European members-only online destination for premium and luxury off-price fashion. Partnering with over 3,000 international brands, our tech-focused mindset and strong commitment to sustainability drives a truly unique experience for our me ...
-
Sre (Site Reliability Engineer)
hace 1 semana
Clicars Madrid, España**Tu Misión**: · Buscamos un SRE para incorporarse a un equipo de desarrollo con mucho talento y participar activamente en la evolución de nuestros proyectos a nível internacional y arquitectura tecnológica. Nuestras tareas tienen tres capas: frontend, backend e infraestructura.A ...
-
Site Reliability Engineer Sre 2
hace 2 semanas
Norconsulting Madrid, EspañaPozuelo de Alarcón, MD, Spain Site Reliability Engineer SRE 2 Job Description:Norconsulting busa para unos de sus clientes, empresas lider del sector de Seguridad busca Administrador de Sistemas para unirse a su equipo de sistemas y desarrollo en sus oficinas en Madrid.ADMINISTRA ...
-
Site Reliability Engineer SRE 2
hace 3 semanas
Norconsulting Comunidad de Madrid, España· Pozuelo de Alarcón, MD, Spain Site Reliability Engineer SRE 2 Job Description: · Norconsulting busa para unos de sus clientes, empresas lider del sector de Seguridad busca Administrador de Sistemas para unirse a su equipo de sistemas y desarrollo en sus oficinas en Madrid. · A ...
-
Reliability Engineer
hace 2 semanas
Manning Global Ag Madrid, EspañaJob DescriptionOur client, a leading global ICT company, is recruiting for a Reliability Engineer (SRE) to join their business in Spain.Responsibilities:Ensure data quality and improve efficiency within projects by:Developing BI reportsDeploying automation toolsImplementing indep ...
-
Reliability Engineer
hace 4 días
Kenvue Inc Madrid, EspañaPut your passion to workCome work at the forefront of science - and help the brands you grew up with grow and evolve into the next generation. · Explore Students & Recent Graduates Opportunities Go to the main content section. Welcome. You are not signed in. | My Account Options ...
-
Site Reliability Engineer
hace 3 semanas
Matillion Madrid, EspañaMatillion is The Data Productivity Cloud.We are on a mission to power the data productivity of our customers and the world, by helping teams get data business ready, faster. Our technology allows customers to load, transform, sync and orchestrate their data. We are looking for pa ...
-
Site Reliability Engineer
hace 5 días
ING Madrid, España De jornada completaAt ING we are looking for a Site Reliability Engineer · Your role and work environment: · We are looking for a talented and enthusiastic Site Reliability Engineer (SRE) to join our Team of SRE Expert Unit · The responsibility of this team is to ensure the reliability and sca ...
Senior Site Reliability Engineer - Alcobendas, España - Grafana Labs
Descripción
Senior SRE - Databases
This is a remote position and we're considering candidates in EMEA
About the role:
We are looking for a Senior SRE to help us support our highest value Grafana Cloud customers by increasing the reliability of our Cloud databases that are based on Mimir, Loki, Tempo, and Pyroscope. We provide these databases as a SaaS product from AWS, GCP, and Azure across all regions.
The High SLA SRE team is a new team within the Databases department, that owns the environments (customer and product cells) for our largest customers, and acts as an overlay to existing teams that run the databases within the system. As an SRE within the team, you own the configuration of the software via and , being involved with the for new features, shepherding releases to the environment and ensuring new releases do not degrade the SLOs or user experience for the customer (learn what is special about each of these customers, and mitigate risks that might be produced by a change in the software), directly contributing design docs, code, PR review, and other engineering activities to the databases to further improve reliability for the customer, observability of the customer stack, and making recommendations to customers about their use of the system to further improve reliability.
Like all SRE roles there is an on-call element, unlike other roles this one is a shared pager where "if the Mimir team are paged for this customer, then we are also paged", this allows you to focus your response on the experience the customer has, whilst also being supported by another on-call engineer who will focus on the system. As a company, we hire globally (remote-only) to ensure our on-call is as healthy as possible, and aligned to 12 daylight hours per day as the default.
What we seek:
Your day-to-day will include:
In Spain, the base compensation range for this role is €88,627 - €106,353. Actual compensation may vary based on level, experience, and skillset as assessed in the interview process. Benefits include equity, bonus (if applicable) and other benefits listed .
*Compensation ranges are country specific. If you are applying for this role from a different location than listed above, your recruiter will discuss your specific market's defined pay range & benefits at the beginning of the process