Site Reliability Engineer
docusign
Job Description
Responsibility
- Build and automate tools for incident impact analysis
- Work with Operations and Incident Command teams during and post incidents to drive excellence in Incident Management Process
- Compose and analyze dashboard to highlight areas of the business that need attention and help drive organizational KPI
- Create and respond to system generated alerts to maintain system health
- Work with Operations and Engineers to ll any gaps in alerting and telemetry
- Learn and grow in all key technologies in DocuSign and be a partner to Eng and Operations teams
Job Designation
Hybrid:Employee divides their time between in-office and remote work. Access to an office location is required. (Frequency: Minimum 2 days per week; may vary by team but will be weekly in-office expectation)
Positions at Docusign are assigned a job designation of either In Office, Hybrid or Remote and are specific to the role/job. Preferred job designations are not guaranteed when changing positions within Docusign. Docusign reserves the right to change a position's job designation depending on business needs and as permitted by local law.
What you bring
Basic
- BS in CS or equivalent work experience
- 3+ years delivering highly available services in a DevOps team, SRE role, or similar role including on-call rotations
- 5+ years’ coding and debugging
- Experience in monitoring and telemetry tools
- Experience in building dashboards and metrics analysis
- Experience with deploy and automation frameworks (Chef, Jenkins, etc.)