(Remote Work) Site Reliability Engineer – Float

Job Expired

Job Overview

  • Job Title Site Reliability Engineer
  • Hiring Organization Float
  • Company Website http://float.com/
  • Remote Locations Worldwide
  • Job Type  Remote, Full-Time

Float is the world’s leading software for teams to plan their time. Launched in 2012, we’ve grown every year since, and remain proudly independent, self-funded and profitable. As a certified B Corporation, we’re committed to making a positive contribution to our team, customers, the environment, and the remote community. We’re a team of 50 working 100% remotely who believe in living our Best Work Life. You’ll. partner with team members globally, including Australia, Mexico, Italy, Nigeria, Canada, and the USA. Hear what our team has to say by browsing our blog, or reading our Glassdoor reviews. Check out what our customers think of Float from our G2 reviews.

We’re on a scale up journey, and we’re seeking people who thrive in this stage, given the autonomy, and the opportunity, to do the best work of their career.

Why We’re Hiring For This Role

The role of Site Reliability Engineers at Float is to increase the autonomy of the product and engineering teams by growing their capabilities to focus on solving problems. SRE makes sure our engineers get scalable infrastructure to build software on top of, making sure pipelines from idea to customer run smoothly and are easily built upon, and we also deal with broad areas of security around our network and defining internal security policy and practices.

Our goals for the Engineering team are to increase the pace with which they deliver improvements for our customers, provide an increasingly sophisticated and reliable service from our teams, and mitigate external threats as we grow.

You will help us tackle those problems by increasing reliability of our services to support larger clients joining Float, and increasing the robust security systems we’ve implemented to continue protecting our growing customer base.

You’ll be working asynchronously with a bright, dedicated team from across the globe, with a strong focus on taking complex problems and creating solutions that feel simple and intuitive for our customers.

Job Responsibilities

  • Continuing to support the regular maintenance of all the engineering systems supporting Float’s customers
  • Identifying areas requiring support to scale
  • Identifying areas for improving service resilience, ultimately delivering the ability to be resilient within the product and engineering teams themselves
  • Optimizing our monitoring and observability stack, building on the knowledge to create a standard set of tools and configurations for the product and engineering teams
  • Understanding Float’s SLOs in context, and building out SLO patterns and procedures for product and engineering teams

Once you are settled, we expect that you will jump into the following projects:

  • Building a repeatable and trustworthy disaster recovery program using chaos engineering techniques
  • Migrating all of our deployment configurations to a global single source of truth
  • Expanding Float’s infrastructure across multiple regions to create a global network

Job Requirements

We want you to love your work and believe that these skills will allow you to succeed in the role.

  • An senior-level understanding of how SRE operates as an enabling team
  • A very good understanding of Service Level Objectives
  • Extensive knowledge of Kafka administration
  • Working experience with Terraform, Bash, and a go-to language which ideally would be one of PHP, NodeJS, Python
  • Experience with Kubernetes and GCP would be highly valued

As a fully remote team, we’re looking for someone comfortable with asynchronous communication as the default, which means you have previous remote experience and are comfortable using tools like Slack, Loom, and Linear to communicate as needed. Don’t worry—you will have significant deep work time since we have very few meetings.

How To Apply

Click “Apply” below to fill in the application form!

More Information

  • This job has expired!