Sr. Engineer II - Infrastructure (Hybrid)
HashiCorp
Job Description
What you’ll do (responsibilities)
- Collaboration and Planning: Work closely with engineering and product teams to adopt infrastructure changes. Contribute to infrastructure planning, capacity management, and architectural improvements.
- Infrastructure Optimization: Develop and implement strategies to enhance the consumption interface, performance, scalability, and reliability of HashiCorp's infrastructure.
- Software Development: Develop software solutions, and design and refine processes that improve the usability of infrastructure resources, increasing efficiency and reducing manual overhead.
- Monitoring and Incident Response: Implement comprehensive monitoring solutions to proactively identify and address issues. Lead the response to infrastructure incidents, minimizing impact on service availability and performance.
- Knowledge Sharing: Serve as a subject matter expert in infrastructure technologies and practices.
What you’ll need (basic qualifications)
- 8+ years of experience in site reliability engineering, infrastructure engineering, or a closely related field, with a proven track record in system administration and in managing complex, cloud-based infrastructure at scale.
- Advanced technical expertise in designing and implementing large-scale systems and infrastructure solutions, with a deep understanding of cloud platforms (AWS, Azure, GCP), container orchestration (Nomad, Kubernetes), and infrastructure as code (Terraform, Ansible).
- Hands-on experience with HashiCorp tools (Terraform, Vault, Consul, Nomad) and other key technologies, including cloud services, DevOps tooling, and automation platforms.
- Proficiency in one or more programming languages (e.g., Python, Go), with experience writing production-level code and integrating it into complex systems.
- Proven ability to lead high-level technical projects, drive innovation, and mentor junior engineers in the design and implementation of complex systems and infrastructure solutions.
- A strong understanding of software development principles, including agile methodologies and CI/CD pipelines.
- Experience working with cloud-based services, including managed services, IaaS, and PaaS.
- Excellent problem-solving skills, with a strong track record of leading incident response efforts, driving root cause analysis and resolution, and collaborating with cross-functional teams to resolve technical issues.
- Strong technical leadership skills, with experience managing teams and influencing technical decisions at an executive level.
- A commitment to continuous learning and improvement, staying abreast of the latest industry trends and technologies.