Site Reliability Engineer

At Murena SAS, we build privacy-focused cloud services and infrastructure that power the Murena ecosystem. Our mission is to give users a seamless digital experience without compromising their privacy. As part of our Site Reliability Engineering (SRE) operations team, you will play a key role in ensuring our services are highly available, scalable, and secure.

About this Role

We are looking for a Site Reliability Engineer (SRE) to join our core infrastructure team. In this role, you’ll be responsible for ensuring that Murena’s services are reliable, scalable, and observable. You’ll work on improving automation, monitoring, and deployment pipelines, while collaborating with developers and operations engineers to enhance performance and service uptime. You’ll thrive in this role if you enjoy solving complex system challenges, automating manual operations, and building a fully open-source infrastructure that can scale sustainably.

This role combines software engineering, operations excellence, and a deep understanding of distributed systems. You’ll be instrumental in building the foundation that keeps Murena’s services running smoothly for users around the world.

Responsibilities

  • Ensure service reliability and uptime across Murena’s cloud and root server infrastructure by monitoring performance, availability and capacity.
  • Participate in an on-call rotation to ensure round-the-clock reliability of critical services, responding to incidents, performing effective troubleshooting under pressure, leading or contributing to root cause analyses, and driving corrective and preventive actions to improve system resilience and reduce future incidents.
  • Automate operations and deployments using Infrastructure as Code (IaC) and configuration management tools to reduce manual intervention and improve repeatability.
  • Build and maintain observability systems — including logging, metrics and alerting — to proactively detect and resolve issues.
  • Collaborate with development teams to design and deploy scalable, secure, and maintainable architectures.
  • Participate in incident management by troubleshooting production issues, performing root cause analysis, and driving post-incident improvements.
  • Implement CI/CD pipelines and optimize workflows for faster, safer releases.
  • Contribute to capacity planning and performance optimization to ensure our services scale efficiently as user demand grows.
  • Maintain security and compliance standards across systems and infrastructure components.
  • Document processes, runbooks, and architectural decisions to support team-wide knowledge sharing and faster onboarding.

Requirements

  • High degree of autonomy – ability to take ownership, self-manage priorities, and deliver results in a fully remote, distributed team.
  • 3–5 years of experience in Site Reliability Engineering, DevOps, or Systems Engineering roles.
  • Strong experience with Linux system administration (Debian or Ubuntu preferred).
  • Proficiency in automation and Infrastructure as Code (IaC) tools such as Ansible or Terraform.
  • Solid understanding of containers and orchestration (Docker).
  • Hands-on experience with monitoring and observability stacks such as Prometheus, Grafana or ELK.
  • Experience managing CI/CD pipelines (GitLab CI/CD preferred).
  • Familiarity with networking fundamentals, DNS, load balancing, and reverse proxies (HAProxy or Nginx).
  • Experience working with cloud or hybrid infrastructure (e.g., Hetzner or similar environments).
  • Scripting or automation experience in Python, Bash, or Go.
  • Understanding of security best practices in system and infrastructure design.
  • Fluency in English – strong written and verbal communication skills to collaborate effectively across international teams.

Nice to have

  • Familiarity with Keycloak or other identity and access management solutions.
  • Knowledge of MariaDB, PostgreSQL, or object storage operations.
  • Experience with open-source mail server components such as Postfix, Dovecot or Rspamd.
  • Background in disaster recovery, backup, or high-availability architecture.
  • Experience with Ceph or other distributed storage systems.
  • Contributions to open-source projects or privacy-focused technologies.

What We Offer

  • Fully remote position
  • Flexible working hours
  • Exciting challenge: Build and scale a sovereign and efficient infrastructure
  • Potential to impact a large and growing user base
  • Contractor (self-employed) position
  • A fair daily rate, aligned with your skills and experience

How to apply

Please send us:

  • Your CV or links to GitHub/GitLab or similar.
  • A few concrete examples of work related to Infrastructure as Code and Docker environments.
  • A brief note explaining why this topic interests you.