Compute, VMware, and Linux Engineer
hpe
Job Description
What you will do:
Acts as a senior technical expert in Compute infrastructure, VMware virtualization, and Linux-based operating systems, providing advanced support and strategic guidance.
• Leads complex troubleshooting, root cause analysis, and performance tuning across enterprise environments.
• Provides architectural input and contributes to the design and implementation of infrastructure solutions.
• Supports transition and transformation initiatives, including migrations, upgrades, and automation efforts.
• Ensures compliance with ITIL processes and industry best practices.
• Acts as a technical liaison between internal teams, customers, and third-party vendors.
• Mentors junior engineers and contributes to knowledge sharing and process improvement.
What you will bring:
- Resolve customer’s issues via the telephone, email or remote sessions.
- Reproduce issues in-house and responding back in a timely manner.
- Regular follow ups with customers with recommendations, updates and action plans.
- Identify and escalate issues in a timely manner to vendor according to Standard Operating Procedures.
- Leverage internal technical expertise, including peers, mentors, knowledge base, community forums and other internal tools, to provide the most effective solutions to customer issues.
- Collaborate with other CoE/HW teams in diagnosing and isolating the cause of complex issues.
- Maintain quality on case documentation, SLA timeframes and operational metrics.
- Performs within the Productivity Measure of the team (scorecard)
- Incident Management: Resolve single and cross technology incidents independently. Lead the team members to resolve complex or cross technology incidents.
- Escalation Management: Identify, manage, and lead technical escalations. Participate in formal Escalation when required to support escalation especially during crisis.
- Problem Management: Proactively and reactively look for solutions to prevent problems from occurring in team/technology area. Perform Trend and Root cause analysis.
- Change Management/Implementation: Independently prepare, review, implement, rollback and test plan for change records. Perform risk and impact analysis for changes, May lead or participate in a Change Advisory Board.
- Patch and Security Management: Apply patch and security changes per policy. Proactively monitor the environment for patch compliance. Analyze patches for compatibility with each customer or internal infrastructure environment.
- Configuration Management: Ensure Configuration Management Database (CMDB) entries are complete and accurate.
-
Lead resolution of critical incidents and escalations, ensuring minimal business impact.
• Perform in-depth analysis of system logs, kernel dumps, and performance metrics.
• Design and implement automation for routine tasks using Ansible, Shell, Python, etc.
• Lead patch management, vulnerability remediation, and compliance reporting.
• Maintain and implement high availability (HA) and disaster recovery (DR) solutions.
• Conduct capacity planning, performance tuning, and infrastructure optimization.
• Own and drive problem management processes, including RCA documentation and preventive measures.
• Participate in Change Advisory Boards (CAB) and lead complex change implementations.
• Maintain and audit CMDB for accuracy and completeness.
• Provide technical leadership in customer meetings and strategic planning sessions.
Must Have:
• Deep expertise in HPE Compute platforms (C7000, Synergy, Virtual Connect, ProLiant).
• Advanced Linux administration (RHEL, SUSE) including kernel tuning, system hardening, and troubleshooting.
• Strong virtualization experience in VMware (vSphere, SRM, Horizon), KVM, and Hyper-V.
• Proficient in VMware infrastructure management: VM lifecycle operations, cluster management, performance monitoring, capacity planning, patching, backup/restore, and snapshot handling.
• Skilled in analyzing logs (VM-support, HPSreport, SOSreport) and performing root cause analysis.
• Solid understanding of storage technologies (SAN/NAS/DAS) and protocols (FC, iSCSI, FCoE).
• Experience with Red Hat Satellite, SUSE Manager, and patch lifecycle management.
• Expertise in HA/DR solutions using Serviceguard, Pacemaker, and Linux clustering.
• Familiarity with networking fundamentals (VLANs, MTU, flow control) and troubleshooting.
•