Site Reliability Engineer

Splyt
Europe

About Splyt

Splyt is the leading global network for mobility and lifestyle services. Our technology integrates ridehailing, airport transfers, micromobility, public transport and food delivery into our partners’ platforms, so they can unlock the world for their customers. Splyt has offices in London, Singapore, Tokyo, and Kuala Lumpur, but we feel at home anywhere. Our amazing partners are the largest in the business, with over 2 billion users in total, and we are constantly expanding our footprint beyond the 150 countries we already cover.

Working with us

We are a remote-first company, with a fun, diverse and enthusiastic bunch from all corners of the world (with 25 nationalities and 20 languages spoken amongst our employees). We live and breathe our values of Ingenuity, Independence, Collaboration, and Laughter. Our exceptional culture is underpinned by an empowering, collaborative learning environment where our people are constantly stretched and growing to be high-performing and, most importantly, happy.

Splyt maintains a strong cloud-native approach to infrastructure, which allows us to easily scale our global operations. Many of the initiatives we have planned have a focus on optimising the availability and performance of our services to our customers around the world.

The role

We are hiring a Site Reliability Engineer into a newly formed team, the monitoring team, to ensure that our platform is as dependable as can be.

You will be involved in setting best practices right from the beginning and will make major contributions to the infrastructure and the wider business. As part of our team, you will evangelise Splyt’s mission, culture, and values whilst being technically hands-on in building the world’s leading lifestyle marketplace.

What you’ll be doing:

  • Define and recommend SRE best practices
  • Determine service-level agreements, service-level indicators, and service-level objectives on our platform
  • Create new processes so that relevant stakeholders (internal and external) are notified of any incidents
  • Produce reports of how the platform is performing and make recommendations on how to ensure maximum availability of our products
  • Create new features and processes in order to scale our system
  • Take the lead on implementing automation as much as possible
  • Conduct incident reviews on any interruptions to our service
  • Monitor our infrastructure and identify any bottlenecks
  • Collaborate with various engineering teams, particularly the platform team, in order to ensure the overall Splyt ecosystem is as robust as possible
  • Configure and provide input on bespoke tools for monitoring purposes
  • Fix support escalation cases when required
  • Take part in architecture design exercises, helping to define our long-term operational strategy
  • Keep up to date with the latest technologies through training and personal development
  • Become an integral part of our growing scale-up by contributing on both a technical and a personal level

Requirements

Who we are looking for:

  • Good visual communication skills
  • Experience with monitoring tools such as DataDog
  • Strong experience of cloud infrastructure (GCP/AWS)
  • Solid grasp of building CI/CD pipelines
  • Knowledge of cloud networking concepts such as VPC, Firewall, Load Balancing
  • A passion for learning and for sharing knowledge with the team
  • Excellent working knowledge of Docker and Kubernetes
  • Basic understanding of microservices and how they interact within a larger tech ecosystem
  • Familiarity with Linux system administration
  • Working experience with Terraform (nice to have)
  • Basic knowledge of deploying single page applications (considered advantageous)
  • Someone based in Europe (for timezone purposes)
  • A stable and reliable internet connection

Benefits

What we can offer you:

  • Competitive salary
  • Stock Options
  • Fully remote working
  • All the equipment you need
  • Home office setup
  • Lots of discounts on everyday lifestyle items
  • Annual training allowance
  • Fun, regular team events and a host of other benefits
