Technical Systems Engineer
Cisco
Job Description
- Technical hands-on role in building and supporting NVIDIA-based artificial intelligence platforms.
- Plan, build and install/upgrade new systems that support NVIDIA DGX hardware and software.
- Automate configuration management, software updates, and the maintenance and monitoring of GPU system availability using modern DevOps tools (Ansible, GitLab, etc.).
- Lead the advancement of artificial intelligence platforms and practices.
- Administer Linux systems, ranging from powerful GPU-enabled servers to general-purpose compute systems.
- Collaborate closely with internal Cisco Business Units, application teams and cross-functional technical domains.
- Create written technical designs, documents, and presentations.
- Stay up to date with AI industry advancements and cutting-edge technologies.
- Accelerate the delivery of AI capabilities across our portfolio.
- Design new monitoring and alerting tools that help discover failures or issues before our customers do.
- Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.