Senior Systems Operations Engineer
wellsfargojobs
Job Description
In this role, you will:
- Lead or participate in managing all installed systems and infrastructure within the Systems Operations functional area
- Contribute to increasing system efficiencies and reducing human intervention time on related tasks
- Review and analyze moderately complex operational support systems, application software, and system management tools to ensure the highest levels of systems and infrastructure availability
- Work with vendors and other technical personnel for problem resolution
- Lead the team to meet technical deliverables while leveraging a solid understanding of technical process controls or standards
- Collaborate with vendors and other technical personnel to resolve technical issues and achieve the highest levels of systems and infrastructure availability
Required Qualifications:
- 4+ years of Systems Engineering, Technology Architecture experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
Desired Qualifications:
- Must have 4+ years of experience as a Site Reliability Engineer
- Knowledge of or experience with Python/shell scripting
- Hands-on knowledge of LLMs, including leveraging and supporting LLM-based solutions
- Knowledge/experience of Puppet/Ansible.
- 4+ years of big data experience (BigQuery, Hadoop)
- 4+ years of experience with Linux OS capabilities
- 3 years of experience in AI/ML (MLOps)
- 3+ years of PySpark experience
- 3-5+ years of experience with Tableau, MicroStrategy, or similar BI tools
- Strong experience with monitoring systems such as Splunk and AppDynamics
- Working knowledge of AutoML technologies such as H2O Driverless AI, DataRobot, and Vertex AI, as well as Elastic and vector databases
- Good understanding of and hands-on experience with GCP
- Excellent verbal, written, and interpersonal communication skills. Ability to articulate technical solutions to both technical and business audiences
- Recent and demonstrated ability to influence management on technical or business solutions
- Working knowledge of designing and building CPU and GPU grid computing to support AI/ML and NLP
- Working knowledge of high-performance storage technologies along with Object Storage
- Knowledge and understanding of network infrastructure to support high throughput and low latency grid computing
- Willing to work in shifts
- 1+ year of experience in LLMs and generative AI (dev/ops)
- 1+ year of experience with Elasticsearch, vector databases, or model development would be an added benefit
- Experience with data processing technology (Ab Initio, Informatica, IBM DataStage)
- Experience with large data technology (Hadoop, Teradata, Elasticsearch, etc.)
- Understanding of Agile practices and ability to work with Agile teams to define and track user stories
- Experience implementing complex F5 or other load balancer technologies
- Working knowledge of building high-resiliency grid/cloud computing infrastructure supporting AI/ML and NLP workloads
- Knowledge and understanding of cloud computing, PaaS design principles, microservices, and containers
- Working knowledge/experience with Azure and/or GCP
- Working knowledge/experience with on-premises and public cloud technologies, such as Cloud Foundry, Kubernetes, Docker
- Experience in facilitating analysis of current systems, problem identification, and resolution
- Ability to facilitate technically complex discussions and working sessions in person or via teleconference