Site Reliability Engineer
intel
Job Description
Qualifications
The candidate must have a Bachelor's degree in Computer Science, Electrical Engineering, or related fields with 6+ years of industry experience or Master's degree with 4+ years of industry experience.Minimum qualifications:
-
5+ years of experience in the following areas:
-
Experience with Linux fundamentals, System administration scripting, performance tuning, scalability and troubleshooting.
-
Experience working with Datacenter/Cloud Hardware and infrastructure.
-
Experience with developing and deploying Cluster/Datacenter/Cloud solutions.
-
Experience with Kubernetes deployment and/or operation.
-
Experience with Elastic (ELK) deployment, operation, and optimization.
-
Experience with infrastructure automation tooling (example: Ansible and/or Jenkins).
Preferred qualifications:Experience in the following areas:
-
Knowledge of server platforms as demonstrated by hands on bring-up of systems geared towards cloud/architectures.
-
Experience with Intel Data Center platform hardware.
-
Preferred, if you have Certifications based on Kubernetes, Devops, Elastic Deployment.
-
Experience working with monitoring and visualization tools (Prometheus, Grafana)
-
Experience in scripting with Python.
-
Experience working with containerization tooling, deployment and/or support.
-
Experience in system and network performance analysis and optimization.
-
Experience managing clusters with AI accelerators