Senior Site Reliability Engineer

Global-Talent-Exchange

Japan
Full time, RemoteWork
5 - 8 Yrs
Job Openings: 1

Required Skills:

AWS

Ich-gcp

Terraform

Ci-cd Setup

Kubernetes

Community Networking

Security

Scripting

Prometheus

Grafana

Elastic Skill

Newrelic

MySQL

MongoDB

Redis

AWS

GCP

Terraform

CI/CD

Kubernetes

Networking

Security

Scripting

Prometheus

Grafana

Elastic

NewRelic

MySQL

MongoDB

Redis

This global SaaS company made it its mission to automate the travel industry. In an industry chronically suffering under labour shortage - made even more severe through the recent pandemic and the current record numbers of international tourists, especially in Japan - their suite of AI products have been massively popular among their hospitality clients. Supporting more than 10,000 clients, they had a successful IPO, acquired several competitors and are actively expanding across APAC, Europe and the US.

They are currently expanding the SRE team to scale infrastructure for growing customer demand, increase automation, and maintain the flexibility needed to support diverse client requirements.

In this role, you will:

  • Improve system reliability and performance through monitoring, testing, and release processes
  • Partner with developers to design, deploy, and operate large-scale production services
  • Lead platform architecture discussions, capacity planning, and operational improvements
  • Automate infrastructure and operations to enable scalable, repeatable service delivery
  • Support cross-functional teams by bridging technical and business perspectives

Requirements:

  • Experience building and operating large-scale web systems in production environments
  • Hands-on experience with cloud platforms (AWS or GCP) and infrastructure automation (Terraform, CI/CD)
  • Kubernetes and container orchestration knowledge
  • Understanding of networking and security concepts (VPN, routing, IAM, firewalls)
  • Scripting skills (Shell, Python, Ruby, Go, or similar)
  • Experience with observability tools (e.g., Prometheus, Grafana, Elastic, NewRelic)
  • Database operations experience (e.g., MySQL, MongoDB, Redis)
  • Strong English communication skills and ability to collaborate across teams

Nice-to-Haves:

  • Fluent Japanese skills
  • DevOps or cloud certifications
  • Experience guiding teams toward modern operational practices
  • Interest in platform strategy and technical leadership

Why join:

  • Work on a global-scale platform supporting international expansion
  • High ownership across architecture, reliability, and platform evolution
  • Business stability + startup agility: Profitable, fast-growing, and product-driven company
  • Work style flexibility: Hybrid model, with an option for full remote
  • International Team and English as internal language

About Company

Global-Talent-Exchange
https://globaltalex.com/
Discover high-impact roles Worldwide
10-20 Employees
Information Technology & Services