Senior Site Reliability Engineer
fivetran
Job Description
Technologies You’ll Use
- Cloud Service Providers(CSPs): AWS, Azure Google Cloud
- Kubernetes: Managed Kubernetes services (EKS, AKS and GKE)
- Continuous Integration tools: github actions, Buildkite
- Continuous Delivery: ArgoCd
- Databases: Postgres and all the major Databases
- Languages: Go, Java
- Scripts: Typescript, Python, Shell
- IaC: Terraform and Pulumi
- RESTful API: FastAPI
- Cloud networking: reverse ssh tunnels, privatelinks in Azure and AWS and Private service connect in GCP & VPN tunnels.
What You’ll Do
- Responsible for ongoing reliability and robustness of Fivetran’s production infrastructure by monitoring availability, capacity, and throughput.
- Evolve systems by adding reliability into our product roadmap
- Coordinate the re-prioritize or fix critical bugs for support or sales requirements as needed
- Make recommendations to production infrastructure by interfacing with engineering to ensure 100% availability
- Ensure scalable artifacts deployment to all environments by automation scripts
- Constantly monitor infrastructure vulnerabilities and remedy them by working with the security team
Skills We’re Looking For
- 5+ years of experinece working with SaaS products at scale
- Working knowledge of Kubernetes
- Knowledge of Cloud Platforms and related tooling: AWS, Azure, GCP, Terraform, Ansible, Buildkite, Pulumi, Ansible and ArgoCD
- Experience in Python/Shell scripting. Bonus if you have Java, Go etc
- Experience with Linux operating systems internals and administration
- Experience with cloud networking like VPNs, Privatelinks and Private Service connect (gcp)