Senior Site Reliability Engineer

Job DescriptionJob Overview

At tripla, we make travelers happy. Whether it be for booking the perfect hotel room, dining at the trendiest restaurant, or partaking in an exciting excursion, we build smart, AI driven products to allow travelers plan the perfect trip. In the short amount of time that we’ve been around, tripla chatbot and tripla hotel booking has become Japan’s most popular services in the industry, and the pace of growth is only increasing. We’re currently expanding our services into many other areas, but we need a talented DevOps and Site Reliability Engineer to help us build the next generation of new features and functionality for our highly available, highly scalable and highly secure platform.

The ideal candidate is experienced, self-directed, comfortable supporting multiple teams, systems, and products. Perhaps most importantly, the candidate should want to be a part of a team. Nothing at Tripla is accomplished by one person. The candidate must have strong communication skills and the desire to tackle challenges as a group.

We are looking for someone

With serious desire to build great Well Architected cloud software
Drive towards automating everything
Drive towards building secure, efficient, scalable architectures
That takes ownership of the solution and deliver
With a team mindset, highly collaborative and enjoys a fast-paced, team environment
That enjoys learning new technologies, concepts, and areas of business
Champions the causes of uptime, reliability, compliance, and security

To be considered, you must have following:

A degree in computer science, software engineering or similar education
5+ years of professional DevOps or Site Reliability Engineering experience in fast paced work environment.
Proven track record of securely architecting and managing AWS (e.g. IAM, EC2, VPC, ELB, ALB, Autoscaling, Lambda) using Infrastructure as Code techniques such as CloudFormation,Terraform
Clear understanding of Networking concepts (e.g Firewalls, NAT, Port, Subnetting, VPC, VPNs, DNS, etc)
Experience with managing CI/CD environment
Experience with designing and owning production Unix container ecosystems (Docker, EKS, Fargate/ECS, Kubernetes, service discovery, service registry)
Experience in designing and running a predictive alerting platform using monitoring tools such as Cloudwatch, NewRelic, DataDog or Pingdom
Proficiency with APIs / microservices architecture
Knowledge of data backup/recovery
Ability to create scripts using Bash, Python or other language
Excellence in Analytical and problem-solving skills
Excellent oral and written communication
Ability to work under indirect supervision

Nice to haves

Certified AWS solution architect professional

APPLY

Senior Site Reliability Engineer

About Us

Service Operation

Terms & Conditions

언어