Cloud Operations Engineer

Global-Talent-Exchange

Singapore
Full time
2 - 3 Yrs
Job Openings: 1

Required Skills:

Aws Cloud Operations

Windows software development

Infrastructure

Sre Capabilities

DevOps

Go

Python

C11

Logging

Clinical Monitoring

Alerting

Public Cloud Platforms

Kubernetes

Prometheus

Grafana

Infrastructure As Code

Iac- Terraform

Ansible

Cloud Operations

Software Development

Infrastructure

SRE

DevOps

Go

Python

C++

Logging

Monitoring

Alerting

Public Cloud Platforms

Kubernetes

Prometheus

Grafana

Service Level Objectives

Service Level Indicators

Infrastructure as Code

Terraform

Ansible

Role

As a Software Engineer on the newly formed Operations Infrastructure team, you will design and build the foundational systems that ensure our cloud platform runs reliably. You will collaborate closely with infrastructure teams to prototype and architect robust solutions for monitoring, alerting, and dashboarding. In this role, you will define the technical requirements for metrics and playbooks, empowering engineering teams to maintain operational excellence and system health.

How your work moves the mission forward

  • Design and build scalable operations infrastructure, covering monitoring, alerting, and data visualization tools.
  • Ensure the reliability of the cloud platform by creating tools that enable proactive incident detection and response.
  • Collaborate with infrastructure teams to design and prototype new architecture that supports long-term platform stability.
  • Define technical requirements for engineering teams to standardize metrics collection and operational playbooks.

Skills you will need to be successful

  • Bachelor’s degree in Computer Science or equivalent practical experience.
  • 2-3 years of experience in software development with a focus on infrastructure, SRE, or DevOps.
  • Experience with one of Go, Python, or C++.
  • Experience building or working with logging, monitoring, and alerting tooling.
  • Experience with public cloud platforms and Kubernetes.

Skills that will differentiate your candidacy

  • Experience with specific monitoring and visualization tools (e.g., Prometheus, Grafana).
  • Experience defining Service Level Objectives (SLOs) and Service Level Indicators (SLIs).
  • Experience working with "Infrastructure as Code" tools (e.g., Terraform, Ansible).
  • Startup experience—building from 0 to 1 in a rapidly changing technical landscape.

About Company

Global-Talent-Exchange
https://globaltalex.com/
Discover high-impact roles Worldwide
10-20 Employees
Information Technology & Services