Site Reliability Engineer, Cybersecurity/SaaS/AI Security product
Published: 2025-11-16Established in 2016, the group’s vision is to “Secure the Future”, a future that will increasingly be shaped by artificial intelligence. AIFT provides services across key markets in Asia and the Middle East. We are continuously expanding our global footprint and actively recruiting international talent to join our growing team.
Job details
We are seeking a motivated and technically curious Site Reliability Engineer to help build and maintain the reliable and distributed systems that support our business operations.
In this role, you will play a vital part in supporting the following businesses:
- Vulcan: Vulcan is a cybersecurity solution for GenAI, providing red and blue team services to ensure compliance and security.
- Cymetrics: A cybersecurity platform designed specifically for small and medium enterprises in the APAC region.
- IXT: An insurance core system solution for APAC insurance markets.
Tech Blog: https://medium.com/onedegree-tech-blog
Responsibilities
- Implement and enhance system reliability, availability, scalability, performance, and efficiency by leveraging monitoring, alerting, and automation tools on public cloud platforms.
- Participate in capacity planning, analyze software performance, and fine-tune systems to ensure optimal operation.
- Develop and enhance GitLab CI/CD processes and toolset to streamline software delivery and deployment.
- Define and monitor key metrics to assess and enhance system reliability.
- Collaborate closely with the engineering team to improve reliability and operational efficiency at every software development life cycle (SDLC) stage.
- Troubleshoot, optimize infrastructure and automate repetitive tasks to increase efficiency and effectiveness
- At least 1–2 years of experience in cloud technology.
- Familiarity with monitoring solutions like Prometheus, Grafana, ELK (Elasticsearch, Logstash, Kibana).
- Understanding the complete software development life cycle (SDLC).
- Basic knowledge of network concepts, with an interest in infrastructure security.
- Hands-on experience implementing GitLab CI/CD processes.
- Exposure to automation platforms like Ansible and Terraform.
- Knowledge of container technologies like Docker;Kubernetes knowledge is a plus.
- Comfortable using Git for source control in a collaborative environment.
- Interest in or experience with AI pair programming like OpenAI.
- Ability to script in programming languages such as Bash, Python, or Go.
- Fluent in Mandarin; proficiency in English is an advantage.
Interview Process
- HR phone interview: 1 hour
- Onsite Interview: 1.5~2 hours, meet with hiring team and HR
Other Benefits
To us, people are our greatest asset, and we are more than happy to invest in employees! We create a healthy work atmosphere and provide you with the tools and support for doing your job successfully. With a culture of flexibility and transparency, we believe there should be no barriers, and everyone’s contributions matter.
Work Life Balance is a must
- 15 days annual leaves (pro-rata for partial month at first year)
- 5 days full-pay sick leaves, 3 days menstrual leaves
- Health check subsidy
- Ergonomic-design chair and fully-equipped devices for work
Grow together & keep learning
- Conferences & external subsidy
- Learning clubs to share technical skill (e.g: Frontend/Backend tech sharing, Product Management...etc)
Work Hard, Play even Harder
- Various entertainment & sports clubs, attend basketball clubs today, and play board game tomorrow!
- Snacks & beverage to refill your energy anytime