Cloud Operations Engineer 1
icims
Job Description
Responsibilities
Description
A Cloud Operations engineer is responsible for designing, deploying, managing, and optimizing cloud infrastructure and services. They ensure the availability, performance, and security of cloud-based applications and resources while monitoring and responding to incidents and issues. This role requires expertise in cloud platforms, automation, and best practices.
Your day-to-day job will consist of:
- Execution of day-to-day tasks related to monitoring and managing cloud infrastructure to ensure service availability and data security
- Execute ticket triage, investigation and resolution of reported incidents
- Participate in 24x7 on-call rotations to resolve incidents in support of production systems
- Monitor and configure systems resources (memory, disks space & CPU utilization)
- Improve customer experience with infrastructure optimization
- Manage and perform operating system and application software patches and updates
- Performs root cause analysis on trended incidents and major outages up through the application stack
- Develops automation that can trigger off a variety of industry standard monitoring tools to resolve common issues in the environment or maintain operating levels
Qualifications
Minimum Qualifications
-
Minimum of 2 years of relevant professional experience.
-
2+ years of experience with AWS or equivalent cloud technologies, including VMs, NSGs, VPC/VNETs, load balancers, DNS, and certificate management.
-
1+ year of experience in server administration and cloud deployment across both Windows and Linux environments.
-
1+ year of experience with Infrastructure as Code (IaC) tools such as Terraform, Bicep, Ansible, Chef, or Puppet (demonstrable proficiency preferred).
-
Familiarity with AI tooling or cloud‑based AI services.
-
Bachelor’s degree in a related field or equivalent years of relevant work experience.
- Travel occasionally, up to 5-10%, for key moments such as team summits, training, conferences, etc., with increased frequency during peak periods based on business demands.
- This position is subject to company on call policies which constitutes working hours outside of the normal workday as needed.
- Problem-solving abilities, effective communication skills, and attention to detail.
Preferred Qualifications
-
Strong understanding of IAM security best practices and identity federation.
-
Experience with observability tools such as CloudWatch, Prometheus, Grafana, or Datadog.
-
Experience with AWS cost‑optimization tools and resource right‑sizing strategies.
- Basic knowledge of cloud security controls using AWS WAF, GuardDuty, Security Hub, or Inspector
- AWS certification (Associate‑level or higher).