ScienceLogic

United States of AmericaFull Time150k - 200k USD / YEAR

10d

Senior Site Reliability Engineer

Canonical

Anywhere in the worldFull Time

14d

Senior Site Reliability Engineer

Kontakt.io

PolandFull Time

Senior Site Reliability Engineer

Kontakt.io

PolandFull Time

Senior Site Reliability Engineer

Rackspace

United States of AmericaFull Time144k - 246k USD / YEAR

11d

Site Reliability Engineer

Anywhere365

South AfricaFull Time

14d

Site Reliability Engineer

SwissBorg

Austria +26Full Time

Senior Site Reliability Engineer (GCP)

Rackspace

United States of AmericaFull Time

Staff Site Reliability Engineer

neptune.ai

Armenia +46Full Time

14d

Principal Site Reliability Engineer

Deimos

Nigeria +4Full Time

11d

Site Reliability Engineer / Observability Engineer

Rackspace

EgyptFull Time120k - 180k USD / YEAR

14d

Senior Site Reliability Engineer (Big Data)

Binance

Armenia +48Full Time

Senior Site Reliability Engineer (Big Data)

Binance

Armenia +48Full Time

7 hour

Intermediate Site Reliability Engineer - FinOps

Gitlab

Angola +168Full Time98k - 210k USD / YEAR

14d

Intermediate Site Reliability Engineer, Environment Automation

Gitlab

Angola +116Full Time

13 hour

Associate Project Manager - Site Reliability

ScienceLogic

United States of AmericaFull Time80k - 120k USD / YEAR

15d

Database Reliability Engineer

CloudWalk

BrazilFull Time

Site Controller - Americas

Fluence

United States of AmericaFull Time

Site Controller - Americas

Fluence

United States of AmericaFull Time

Sr Staff Software Engineer - Reliability Engineering

Airbnb

United States of AmericaFull Time

14d

PreviousPage 1 of 103Next

← Back to Job Listings

Senior Site Reliability Engineer

Kontakt.io

PolandFull Time1d

Job Summary

Kontakt.io is seeking a Senior Site Reliability Engineer to ensure the scalability, availability, and security of their cloud-based AI-driven healthcare platform. The ideal candidate will have 3+ years of experience in SRE, expertise in Kubernetes, Docker, and container orchestration, as well as knowledge of machine learning infrastructure and healthcare compliance. As an SRE at Kontakt.io, you will collaborate with software, data, and infrastructure teams to build highly resilient and automated systems, allowing hospitals and care facilities to operate seamlessly and without downtime. You will design and maintain cloud infrastructure, implement SLOs, SLIs, and SLAs, and participate in 24/7 on-call rotation. Kontakt.io offers a competitive salary, stock option plan, flexible remote work options, and a collaborative environment.

Kontakt.io is building the platform that care operations run on.

We reduce waste, cut costs, and improve revenue by improving throughput, asset utilization and staff productivity. Our platform uses AI, RTLS, and EHR data to enable self-learning agents to automate workflows, adapt in real-time, and orchestrate all of care delivery operations.

Easy to deploy and scale, it gives a clear picture of spaces, equipment, and people, eliminating inefficiencies and enhancing the patient experience. With measurable 10X ROI and over 20+ use cases, Kontakt.io is the go-to platform for better and faster care delivery operations.

As a Site Reliability Engineer (SRE) at Kontakt.io, you will be responsible for ensuring the scalability, availability, and security of our cloud-based AI-driven healthcare platform. You will collaborate with software, data, and infrastructure teams to build highly resilient and automated systems, allowing hospitals and care facilities to operate seamlessly and without downtime.

Your expertise in cloud infrastructure, automation, monitoring, and performance optimization will directly impact how healthcare organizations leverage real-time data to enhance patient care and operational efficiency.

If you are passionate about highly available systems, automation, and making an impact in healthcare, join Kontakt.io and help us build the future of smart care operations!

Key Responsibilities:

Design and maintain highly available, fault-tolerant, and scalable cloud infrastructure.
Implement SLOs, SLIs, and SLAs to track system reliability and optimize uptime.
Participate in 24/7 on-call rotation
Oversee production platform deployments
Monitor latency, traffic, errors, and system health using modern observability tools.
Conduct root cause analysis (RCA) and post-mortems to continuously improve system resilience.
Automate infrastructure provisioning using Terraform, Ansible, or Pulumi.
Implement CI/CD pipelines to ensure seamless and safe deployments.
Enable self-healing mechanisms using Kubernetes operators, auto-scaling, and fault detection.
Ensure compliance with HIPAA, GDPR, and other healthcare data regulations.
Define and execute disaster recovery (DR) and business continuity plans.
Manage and optimize AWS environments for cost-efficiency and performance.
Deploy and manage observability tools and build real-time alerting and response frameworks
Establish best practices for logging, debugging, and performance monitoring.
Improve incident response automation through runbooks, AI-based anomaly detection, and predictive analytics.

What You Bring

3+ years of experience as an SRE
Strong expertise in Kubernetes, Docker, and container orchestration.
Experience managing cloud-native environments (AWS).
Experience with event-driven architectures, Kafka, or real-time data streaming.
Knowledge of machine learning infrastructure.
Previous experience in healthcare, compliance (HIPAA), and highly regulated environments.
Proficiency in Infrastructure as Code (IaC) using Terraform.
Deep knowledge of networking, DNS, load balancing, and security best practices.
Experience with CI/CD pipelines (Jenkins, CI, or ArgoCD).
Hands-on experience with monitoring and logging tools (Prometheus, Grafana, ELK, OpenTelemetry).
Strong programming skills in Python, Golang, or Bash for automation.
Knowledge of machine learning infrastructure.

We offer:

Work on a mission-driven platform that improves healthcare operations and patient outcomes.
B2B contract or an employment agreement
Competitive salary and stock option plan
Collaborate with top engineers, data scientists, and AI experts.
Flexible remote or hybrid work options (office in Krakow)
Collaborative and self-organized environment
private medical care, cafeteria system

We Make Things Easy

Easy to Use. Simplicity is harder than complexity. Each of our apps focuses on a single user and a specific problem. We create solutions for everyone to help them get things done.

Easy to Buy. We simplify pricing with a single, per-bed or per-room model that encompasses all the necessary products and services to achieve your desired outcomes.

Easy to Deploy. Using AI, cloud, and mobile technologies, our equipment autonomously communicates and validates itself without the need for human intervention, cutting deployment time from months to weeks or even days.

We Deliver Fast Outcomes.

Industry’s #1 Time To Value. We accelerate your ROI and deliver positive outcomes to users faster than anyone else, thanks to how easy things work with our AI- and cloud-based platform.

Delivered As A Service. Delivering everything from devices to apps to support, our as-a-service model allows you to add new use cases with a simple click. Gain agility and speed like never before.

Outcome Driven. We deliver outcomes, not boxed equipment. From on-site installation to monitoring, all the way to service-level agreements, our approach is uniquely designed to ensure the outcomes you need.

We Ensure Unmatched Scalability

Priced for Scaling. We offer scalable pricing, regardless of your project size. Enabling our customers to create value cost-effectively is a key element of our success.

A Platform for Scaling. Lower TCO, quicker adoption of new use cases, extensive cloud scalability, and future-proofing your IT investments are among the many reasons why Kontakt.io is right for you.

Managed for Scaling. SOC-2 and HIPAA compliant, our platform integrates with your wireless and security infrastructure, allowing you to use your current IT network with confidence and uninterrupted functional

Apply for this job

Apply for this position