Cloud Platform Engineer (K8s & Networking Specialist) | Relocation Offered

Global-Talent-Exchange

Japan

Full time

5+ Yrs

Required Skills:

Kubernetes

AWS

EKS

CloudFront

Networking

Python

Go

Terraform

ArgoCD

Istio

Linkerd

Prometheus

Grafana

Position

We are seeking a highly skilled and experienced Platform Engineer to manage and enhance our entire application delivery platform, from CloudFront to the underlying EKS clusters and their associated components. The ideal candidate will possess deep expertise across cloud infrastructure, networking, Kubernetes, and service mesh technologies, coupled with strong programming skills. This role involves maintaining the stability, scalability, and performance of our production environment, including day-to-day operations, upgrades, troubleshooting, and developing in-house tools.

Main Responsibilities

Perform regular upgrades and patching of EKS clusters and associated components, and oversee the health, performance, and scalability of the clusters.
Manage and optimize related components such as Karpenter (cluster autoscaling) and ArgoCD (GitOps continuous delivery).
Implement and manage service mesh solutions (e.g., Istio, Linkerd) for enhanced traffic management, security, and observability.
Participate in an on-call rotation providing 24/7 support for critical platform issues; monitor the platform for potential issues and implement preventative measures.
Develop, maintain, and automate in-house tools and scripts using programming languages like Python or Go to improve platform operations and efficiency.
Configure and manage CloudFront distributions and WAF policies for efficient, secure content delivery and routing.
Develop and maintain documentation for platform architecture, processes, and troubleshooting guides.

Tech Stack

AWS: VPC, EC2, ECS, EKS, Lambda, CloudFront, WAF, MWAA, RDS, ElastiCache, DynamoDB, OpenSearch, S3, CloudWatch, Cognito, SQS, KMS, Secrets Manager, MSK
Terraform, GitHub Actions, Prometheus, Grafana, Atlantis, ArgoCD, OpenTelemetry

Your Qualifications

Proven experience as a Platform Engineer, Site Reliability Engineer (SRE), or similar role with a focus on end-to-end platform ownership.
At least 4 years of in-depth, hands-on experience with Amazon EKS and Kubernetes.
Strong understanding of and practical experience with Karpenter, ArgoCD, and Terraform.
Solid grasp of core networking concepts and at least 5 years of extensive experience with AWS networking services (VPC, Security Groups, Network ACLs, CloudFront, WAF, ALB, DNS).
Demonstrable experience with SSL/TLS certificate management.
Proficiency in programming languages such as Python or Go for developing and maintaining automation scripts and internal tools.
Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack).
Excellent problem-solving and debugging skills across complex distributed systems.
Strong communication and collaboration abilities.
Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent practical experience).

Preferred Qualifications

Prior experience working with service mesh technologies (preferably Istio) in a production environment.
Experience building or contributing to Kubernetes Controllers.
Experience with multi-cluster Kubernetes architectures.
Experience building AZ-isolated and disaster recovery (DR) architectures.

What we offer

Social Insurance (health insurance, employee pension, employment insurance and compensation insurance)
401K
Translation/Interpretation support
Visa sponsorship and relocation support

Working Conditions

Employment Status

Full Time

Office Location

Hybrid work style (flexible, combining remote and office work)
There are no fixed rules regarding office attendance in the Product group; attendance is at each individual's discretion.

Work Hours

Super Flex Time (No Core Time)
In principle, 9:00am-5:45pm (actual working hours: 7h45m + 1h break)

Holidays

Saturdays, Sundays, national holidays (in Japan), New Year's break, and company-designated special days

Paid leave

Annual leave (up to 14 days in the first year, granted proportionally according to the month of employment; can be used from the date of hire)
Personal leave (5 days each year, granted proportionally according to the month of employment)
Special paid leave system, which can be used to attend to illnesses, injuries, hospital visits, etc., of the employee, family members, pets, etc.

Salary

Annual salary paid in 12 installments (monthly)
Based on skills, experience, and abilities
Reviewed once a year
Special incentive once a year (based on company performance, individual contribution, and evaluation)
Late overtime allowance
Part of your salary, in an amount you set, can be switched to digital salary payment

About Company

Global-Talent-Exchange

https://globaltalex.com/

Discover high-impact roles Worldwide

10-20 Employees

Information Technology & Services
