DevOps/Site Reliability Engineering (SRE)
Qualcomm
Job Description
Responsibilities
-
Serve as an advocate for quality practices, including the development of automated testing to improve business processes.
-
Act as a critical part of a multi-team effort to deliver, manage and maintain configuration automation to meet business needs.
-
Create and maintain configuration standards for software and infrastructure.
-
Manage CI/CD tools and pipelines as a partner to development and QA teams.
-
Develop and socialize operational standards for teams throughout engineering.
-
Recommend, develop and implement system enhancements that improve performance and reliability, including installation, upgrades and patching, monitoring, problem resolution, configuration management and security.
-
Oversee critical incidents and major system escalations from initiation to resolution.
-
Create mechanisms/architectures that enable fault tolerance and rapid recovery from failure.
-
Participate in a rotating on-call escalation service.
-
Carry out capacity planning and chaos engineering.
-
Strong communication skills, verbal and written.
Qualifications
-
Bachelor’s degree in a technical field, or equivalent experience
-
1 to 3 years’ experience in an operational environment, preferred