AWSazurecloudelasticsearch

Senior Site Reliability Engineer (SRE)

Terminus Capital Partners (TCP)
USA Only

We are looking for an experienced, self-motivated, highly productive Site Reliability Engineer (SRE) to build and scale services in a cloud environment within our Infrastructure team.

Requirements

Key Responsibilities

  • Building, deploying, improving, and maintaining infrastructure (in on-premise, AWS, Azure, and GCP)
  • Managing operations and tooling around compute infrastructure
  • Building/optimizing monitoring and alerting
  • Managing operations on additional infrastructure components such as monitoring, alerting and databases
  • Build tools and automations
  • Be on call, respond to incidents and conduct root-cause analysis on customer-impacting issues
  • Define and manage SLO, SLI and error budgets
  • Leading new projects and initiatives around site reliability, developing or finding net-new solutions, evaluating products, and leading discussions on technology topics
  • Mentoring team members, showing thought-leadership, and helping educate the team on best practices
  • Other duties as assigned

Candidate Criteria

 

Required Experience

  • Either a B.S. degree or equivalent in Computer Science or a minimum of 7 years’ experience in Infrastructure-as-code, deployment systems and have experience writing automation in a modern programming language
  • Experience with monitoring, metrics, logs
  • Cloud computing (one or more of AWS, GCP, Azure)
  • Understanding of distributed systems and their commonly associated problems
  • Experience with CI/CD systems (Preferred)
  • Experience writing infrastructure as a code (Terraform, Ansible, Puppet, etc.) (Preferred)
  • Experience working with containers and Kubernetes (Preferred)
  • Experience utilizing system monitoring tools (i.e., Grafana, Prometheus, Elasticsearch) (Preferred)

 

Critical Skills & Qualifications

  • Strong networking fundamentals
  • Belief in automating the problems
  • Strong communication & analytical skills.
  • Curiosity, adaptability, and a willingness to learn.
  • Experience with managing measurable goals and metrics.
  • Previous experience in remote work settings preferred.

Benefits

Salary: $100k/y

Payment: Monthly

Paid days off: 10/y

50% overlap with ET time zone mandatory

© 2020 RemoteJobs.store. Built using NextJS and Vercel.
Uses RemoteOK and Remotive APIs.