Senior DevOps Engineer
Published: 2025-11-12Job details
The Senior DevOps Engineer will be responsible for enhancing and scaling the infrastructure powering Rhino’s Federated Computing Platform (Rhino FCP). This distributed infrastructure supports cutting-edge AI/ML research and development across highly regulated industries, including healthcare, finance, and life sciences, by enabling secure, privacy-preserving data collaboration around the world.
You will oversee a fleet of installations deployed behind the firewalls of partner organizations, alongside a centralized cloud orchestration layer and advanced CI/CD pipelines, monitoring systems, and deployment tooling. You'll collaborate closely with backend engineers, as many infrastructure components are tightly integrated with the FCP’s core capabilities.
This role involves end-to-end ownership: from architecture and design to implementation and support. It’s ideal for someone who thrives in fast-paced environments, loves working with modern infrastructure, and is excited to drive innovation in secure, distributed AI platforms.
Key Responsibilities
- End-to-End Infrastructure Ownership: Take full ownership of infrastructure components - from initial ideation and architectural design through implementation and automated deployment, across diverse technologies and geographically distributed environments.
- Scalability and Enhancement: Continuously enhance and scale the platform’s underlying infrastructure by implementing updates, driving performance improvements, and integrating new platform capabilities to support product growth.
- Maximize Development Velocity: Design, improve, and maintain DevOps tooling and workflows (CI/CD pipelines, configuration management) to significantly boost development velocity and streamline the engineering process.
- Distributed Deployment Management: Oversee and manage all platform deployments, ensuring consistency and reliability across local installations at client sites and within the centralized cloud orchestration layer.
- Product Development: Collaborate closely with Backend and Frontend Engineering teams to co-develop platform features that have significant infrastructure dependencies, ensuring new capabilities are designed and implemented with built-in scalability and operational tooling.
- Establish Foundational Quality: Ensure observability, scalability, and security are deeply integrated and prioritized across the design, implementation, and operation of all infrastructure layers.
About the candidate
Candidates should have 5+ years of professional experience with a mix of the experiences described below:
- 5+ years of experience in DevOps engineering with AWS and GCP/Azure.
- 5+ years of experience designing and developing infrastructure components with Kubernetes
- 5+ years of experience with Linux
- 5+ years of experience with Bash/Python
- 5+ years of experience working with IaC and CM tools (Terragrunt, Ansible)
- 5+ years of experience implementing monitoring solutions (Prometheus/VictoriaMetrics, Grafana, GoAlert, etc) and logging systems (VictoriaLogs or similar, Filebeat/FluentBit/Vector, etc.)
- 3+ years of experience with CI/CD tools like GitHub Actions
- 3+ years of experience with GitOps tools (ArgoCD)
- Deep understanding of networking and experience developing infrastructure with non-trivial networking components (e.g. WireGuard VPN, NGINX passthrough, mTLS, gRPC, confidential computing environments, etc)
- Experience working in a startup environment
- Advantage for experience developing AI/ML-based products or platforms
- Advantage for experience developing distributed systems
- Advantage for experience developing products with a focus on data security and privacy (e.g., PII data protection)
- The role is open to candidates who are based in Israel (hybrid work environment)