
Cloud Platform Engineer

PayPay Corporation
Summary
Join PayPay as a Platform Engineer to manage and enhance our application delivery platform. You will possess deep expertise in cloud infrastructure, networking, Kubernetes, and service mesh technologies, along with strong programming skills. Maintain the stability, scalability, and performance of our production environment, including day-to-day operations, upgrades, troubleshooting, and developing in-house tools. Responsibilities include managing EKS clusters, implementing service mesh solutions, providing 24/7 support, and developing automation tools. The ideal candidate will have extensive experience with AWS services and Kubernetes, along with strong problem-solving and communication skills. This is a full-time, hybrid workstyle position offering various benefits.
Requirements
- Proven experience as a Platform Engineer, Site Reliability Engineer (SRE), or similar role with a focus on end-to-end platform ownership
- In-depth knowledge and hands-on experience of at least 4 years with Amazon EKS and Kubernetes
- Strong understanding and practical experience with Karpenter, ArgoCD, Terraform
- Solid grasp of core networking concepts and extensive experience of at least 5 years with AWS networking services (VPC, Security Groups, Network ACLs, CloudFront, WAF, ALB, DNS)
- Demonstrable experience with SSL/TLS certificate management
- Proficiency in programming languages such as Python or Go for developing and maintaining automation scripts and internal tools
- Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack)
- Excellent problem-solving and debugging skills across complex distributed systems
- Strong communication and collaboration abilities
- Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent practical experience)
Responsibilities
- Perform regular upgrades and patching of EKS clusters and associated components & oversee the health, performance, and scalability of the EKS clusters
- Manage and optimize related components such as Karpenter (cluster autoscaling) and ArgoCD (GitOps continuous delivery)
- Implement and manage service mesh solutions (e.g., Istio, Linkerd) for enhanced traffic management, security, and observability
- Participate in an on-call rotation to provide 24/7 support for critical platform issues and monitor the platform for potential issues and implement preventative measures
- Develop, maintain, and automate in-house tools and scripts using programming languages like Python or Go to improve platform operations and efficiency
- Configure and manage CloudFront distributions, WAF Policies for efficient & secure content delivery & routing
- Develop and maintain documentation for platform architecture, processes, and troubleshooting guides
Preferred Qualifications
- Prior experience working with service mesh technologies (preferably Istio) in a production environment
- Experience building or contributing to Kubernetes Controllers
- Experience with multi-cluster Kubernetes architectures
- Experience building AZ isolated, DR architectures
Benefits
- Social Insurance (health insurance, employee pension, employment insurance and compensation insurance)
- 401K
- Translation/Interpretation support
- VISA sponsor + Relocation support
Share this job:
Similar Remote Jobs

