Senior Site Reliability Engineer
Axiom Software Solutions · North Holland
Location: Eindhoven, Netherlands
Work type: Contract role
Role Overview:
In this role you will work in a highly technical software development environment where no two days are the same. The team has a very broad responsibility, maintaining a large number of tools as well as a large set of infrastructure. We are looking for a candidate with broad interests who enjoys working as a software engineer extending our tools one week and diving deep into performance optimization of our infrastructure the next.
Job Responsibilities
- Exp level: JG9
System Reliability & Uptime
• Design and implement strategies to ensure high availability, reliability, and performance of systems and services.
• Define and track Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets.
Incident Management & Troubleshooting
• Respond to system outages and incidents, lead post-mortem investigations, and implement preventive measures.
• Create runbooks and automate recovery processes to reduce manual intervention.
• Participate in the on-call rotation and act as an escalation contact for incidents.
Infrastructure as Code (IaC)
• Build and maintain infrastructure using tools like Terraform.
• Ensure infrastructure is reproducible, version-controlled, and auditable.
Monitoring & Observability
• Implement and manage monitoring tools (preferably Splunk).
• Set up alerts and dashboards to track the health and performance of services.
Automation & Tooling
• Automate operational tasks such as deployments, scaling, backups, and failovers.
• Develop internal tools to support deployment pipelines and team workflows.
Collaboration with Development & Operations
• Work closely with developers to design systems that are scalable and supportable.
• Advocate for and implement best practices around CI/CD, testing, and release management.
Required Skillset
Programming & Scripting
• Proficiency in languages like Python, Bash, or Ruby.
• Ability to build tools, automate tasks, and debug production issues.
Cloud Platforms
• Strong experience with cloud providers (GCP, Azure).
• Knowledge of cloud-native services, networking, and security.
Linux/Unix and Windows Systems
• Deep understanding of system internals, performance tuning, and debugging.
Containers & Orchestration
• Experience with Docker and Kubernetes (or other orchestration platforms).
CI/CD & Automation Tools
• Familiarity with Jenkins, GitHub Actions, ArgoCD, or similar.
• Experience setting up and managing deployment pipelines.
Monitoring & Logging
• Knowledge of observability stacks.
Security & Compliance Awareness
• Understanding of securing systems and managing access control, secrets, and audit logging.
Soft Skills
• Strong communication and collaboration skills.
• Enjoyment of coaching more junior team members.
• Ability to work under pressure during incidents and lead blameless post-mortems.
• Analytical mindset and proactive problem-solving approach.