Trabajos
>
Alcobendas

    Senior Site Reliability Engineer - Alcobendas, España - Grafana Labs

    Grafana Labs
    Default job background
    Descripción

    Senior SRE - Databases

    This is a remote position and we're considering candidates in EMEA

    About the role:

    We are looking for a Senior SRE to help us support our highest value Grafana Cloud customers by increasing the reliability of our Cloud databases that are based on Mimir, Loki, Tempo, and Pyroscope. We provide these databases as a SaaS product from AWS, GCP, and Azure across all regions.

    The High SLA SRE team is a new team within the Databases department, that owns the environments (customer and product cells) for our largest customers, and acts as an overlay to existing teams that run the databases within the system. As an SRE within the team, you own the configuration of the software via and , being involved with the for new features, shepherding releases to the environment and ensuring new releases do not degrade the SLOs or user experience for the customer (learn what is special about each of these customers, and mitigate risks that might be produced by a change in the software), directly contributing design docs, code, PR review, and other engineering activities to the databases to further improve reliability for the customer, observability of the customer stack, and making recommendations to customers about their use of the system to further improve reliability.

    Like all SRE roles there is an on-call element, unlike other roles this one is a shared pager where "if the Mimir team are paged for this customer, then we are also paged", this allows you to focus your response on the experience the customer has, whilst also being supported by another on-call engineer who will focus on the system. As a company, we hire globally (remote-only) to ensure our on-call is as healthy as possible, and aligned to 12 daylight hours per day as the default.

    What we seek:

  • Strong engineering background (at least 6 years), that lean towards SRE roles (at least 3 years)
  • Good communication, capable of engaging in deep technical conversations with other engineers and customers, and collaborating across organizational boundaries
  • Experience with Kubernetes on any of AWS, GCP, or Azure, and working with Helm charts
  • Experience with Site Reliability Engineering, System Design, and Distributed Computing
  • Experience with one or more programming languages (e.g. Go, Python, JavaScript, etc)
  • Experience with Linux operating systems internals, and some knowledge of networking
  • Experience with calmly and actively participating in blame-free Incident Response, following up on actions, and writing high quality PIRs (Post Incident Reviews, a.k.a. post-mortem documents)
  • Comfortable working within an engineering team where individuals are encouraged to have a strong sense of autonomy and self-direction
  • We highly value those who are kind, intellectually curious, who default to transparency, possess a high bias towards action, and who are also kind (this is important)
  • Your day-to-day will include:

  • Regular 1:1s to with your manager and colleagues
  • Reviewing and creating SLOs, proactively investigating ways in which we can further reduce budget burn for those SLOs, which can be self-directed or as the result of learnings from incidents, and may include improvements to monitoring, automation, increasing self-healing, auto-scaling, etc.
  • Improve observability of customers within the High SLA environments
  • Configuring systems to increase reliability via Helm and Jsonnet
  • Collaborating with our Engineering Leaders to help define and influence product strategy, roadmaps and technical designs
  • Participate in PR review and collaborating with other engineers on their Design Docs
  • Teach others about Site Reliability Engineering and communicate best practices to be applied early in development of new features and functionality
  • Participate in Incident Response when applicable, including investigation through to resolution, PIR, and communication with customers via Bridge calls where necessary
  • In Spain, the base compensation range for this role is €88,627 - €106,353. Actual compensation may vary based on level, experience, and skillset as assessed in the interview process. Benefits include equity, bonus (if applicable) and other benefits listed .

    *Compensation ranges are country specific. If you are applying for this role from a different location than listed above, your recruiter will discuss your specific market's defined pay range & benefits at the beginning of the process



  • Norconsulting Pozuelo de Alarcón, España

    Pozuelo de Alarcón, MD, Spain Site Reliability Engineer SRE 2 · Job Description: · Norconsulting busa para unos de sus clientes, empresas lider del sector de Seguridad busca Administrador de Sistemas para unirse a su equipo se systemas y desarrollo en sus oficinas en Madrid ...

  • Kenvue

    Reliability Engineer

    hace 2 semanas


    Kenvue Madrid, España

    Reliability Engineer W Description Kenvue is currently recruiting for: · Reliability Engineer · Who We Are · At , we realize the extraordinary power of everyday care. Built on over a century of heritage and rooted in science, we're the house of iconic brands - including NEUTR ...

  • Mcneil Iberica S.L.U.

    Reliability Engineer

    hace 1 día


    Mcneil Iberica S.L.U. Madrid, España

    Kenvue is currently recruiting for: Reliability Engineer Who We Are At Kenvue , we realize the extraordinary power of everyday care. · Built on over a century of heritage and rooted in science, we're the house of iconic brands - including NEUTROGENA, AVEENO, TYLENOL, LISTERINE, J ...

  • Antal International Network

    Site Reliability Engineer

    hace 3 semanas


    Antal International Network España

    Como Site Reliability Engineer, tú permites el crecimiento, la mantenibilidad y la escalabilidad de las aplicaciones empresariales de nuestro cliente. Te harás cargo de las aplicaciones que se acercan a la completitud de características. Mejorarás su confiabilidad, resistencia y ...

  • KENVUE

    Reliability Engineer

    hace 2 semanas


    KENVUE MADRID, España De jornada completa

    Submission for the position: Reliability Engineer - (Job Number: W) · Kenvue is currently recruiting for: · Reliability Engineer · Who We Are · At Kenvue, we realize the extraordinary power of everyday care. Built on over a century of heritage and rooted in science, we're the ho ...


  • Norconsulting Pozuelo de Alarcón, España

    Pozuelo de Alarcón, MD, Spain Site Reliability Engineer 1 · Job Description: · Administradores de Sistemas Middleware · Una de las empresas lider del sector de Seguridad busca Administrador de Sistemas Middleware para unirse a su equipo se systemas y desarrollo en sus ofic ...

  • BANCO SANTANDER S.A.

    Site Reliability Engineer

    hace 4 semanas


    BANCO SANTANDER S.A. Madrid, España

    Site Reliability Engineer - Santander Digital Services · Country: Spain · **WHAT YOU WILL BE DOING** · **Santander Digital Services **está buscando **un/a Site Reliability Engineer **para nuestras oficinas en **ABELIAS (Madrid).** · **POR QUÉ DEBERÍAS CONSIDERAR ESTA OPORTUNIDAD* ...


  • Clicars Madrid, España De jornada completa

    Tu Misión · Buscamos un SRE para incorporarse a un equipo de desarrollo con mucho talento y participar activamente en la evolución de nuestros proyectos a nivel internacional y arquitectura tecnológica. Nuestras tareas tienen tres capas: frontend, backend e las tecnologías más u ...


  • Antal International Network Madrid, España

    Como Site Reliability Engineer, tú permites el crecimiento, la mantenibilidad y la escalabilidad de las aplicaciones empresariales de nuestro cliente. Te harás cargo de las aplicaciones que se acercan a la completitud de características. Mejorarás su confiabilidad, resistencia y ...


  • Clicars Centro, España

    Tu Misión · Buscamos un SRE para incorporarse a un equipo de desarrollo con mucho talento y participar activamente en la evolución de nuestros proyectos a nivel internacional y arquitectura tecnológica. Nuestras tareas tienen tres capas: frontend, backend, e infraestructura. Las ...

  • BestSecret

    Site Reliability Engineer

    hace 3 semanas


    BestSecret Alcobendas, España

    About BestSecret Group · We are a leading European members-only online destination for premium and luxury off-price fashion. Partnering with over 3,000 international brands, our tech-focused mindset and strong commitment to sustainability drives a truly unique experience for our ...

  • SDi Digital Group

    Site Reliability Engineer

    hace 3 semanas


    SDi Digital Group Alcobendas, España

    · About BestSecret Group · We are a leading European members-only online destination for premium and luxury off-price fashion. Partnering with over 3,000 international brands, our tech-focused mindset and strong commitment to sustainability drives a truly unique experience for o ...

  • Sdi Digital Group

    Site Reliability Engineer

    hace 3 semanas


    Sdi Digital Group Alcobendas, España

    About BestSecret Group We are a leading European members-only online destination for premium and luxury off-price fashion. Partnering with over 3,000 international brands, our tech-focused mindset and strong commitment to sustainability drives a truly unique experience for our me ...


  • Clicars Madrid, España

    **Tu Misión**: · Buscamos un SRE para incorporarse a un equipo de desarrollo con mucho talento y participar activamente en la evolución de nuestros proyectos a nível internacional y arquitectura tecnológica. Nuestras tareas tienen tres capas: frontend, backend e infraestructura.A ...


  • Norconsulting Madrid, España

    Pozuelo de Alarcón, MD, Spain Site Reliability Engineer SRE 2 Job Description:Norconsulting busa para unos de sus clientes, empresas lider del sector de Seguridad busca Administrador de Sistemas para unirse a su equipo de sistemas y desarrollo en sus oficinas en Madrid.ADMINISTRA ...


  • Norconsulting Comunidad de Madrid, España

    · Pozuelo de Alarcón, MD, Spain Site Reliability Engineer SRE 2 Job Description: · Norconsulting busa para unos de sus clientes, empresas lider del sector de Seguridad busca Administrador de Sistemas para unirse a su equipo de sistemas y desarrollo en sus oficinas en Madrid. · A ...

  • Manning Global Ag

    Reliability Engineer

    hace 2 semanas


    Manning Global Ag Madrid, España

    Job DescriptionOur client, a leading global ICT company, is recruiting for a Reliability Engineer (SRE) to join their business in Spain.Responsibilities:Ensure data quality and improve efficiency within projects by:Developing BI reportsDeploying automation toolsImplementing indep ...

  • Kenvue Inc

    Reliability Engineer

    hace 4 días


    Kenvue Inc Madrid, España

    Put your passion to workCome work at the forefront of science - and help the brands you grew up with grow and evolve into the next generation. · Explore Students & Recent Graduates Opportunities Go to the main content section. Welcome. You are not signed in. | My Account Options ...

  • Matillion

    Site Reliability Engineer

    hace 3 semanas


    Matillion Madrid, España

    Matillion is The Data Productivity Cloud.We are on a mission to power the data productivity of our customers and the world, by helping teams get data business ready, faster. Our technology allows customers to load, transform, sync and orchestrate their data. We are looking for pa ...


  • ING Madrid, España De jornada completa

    At ING we are looking for a Site Reliability Engineer · Your role and work environment: · We are looking for a talented and enthusiastic Site Reliability Engineer (SRE) to join our Team of SRE Expert Unit · The responsibility of this team is to ensure the reliability and sca ...