DevOps Engineer Job | Yulys

Job Title: DevOps Engineer

Company Name: ITR
Salary: USD 0.00 - USD 0.00 Hourly
Job Industry: Program Development
Job Type: Full time
Workplace Type: Remote
Location: United States
Required Candidates: 3 Candidates
Skills:
Vendor management
Job Description:

DevOps Engineer – American Science Cloud (AmSC) Project

Experience Level: Mid-level to Senior

Work Location: Remote

Project Overview: American Science Cloud – A Platform for Transformative Science

The American Science Cloud (AmSC) is a secure, federated, and science-optimized cloud environment that brings together the Department of Energy’s leading computing systems, experimental facilities, data resources, and high-performance networks.

The platform enables DOE scientists to create, access, and integrate AI-ready datasets, run scalable model training on advanced systems, perform distributed simulations, control scientific instruments, and move data efficiently across multiple sites.

This initiative is a multi-lab, public-private partnership working alongside the Models Consortium (ModCon), which will deploy advanced AI models and services onto the platform.

The Team

As a Cloud/DevOps Engineer, you will join the L2 Infrastructure Services group within AmSC. You will support the multi-cloud central hub infrastructure across development, staging, pre-production, and production environments.

You will collaborate with other L2 science service teams building on top of this infrastructure, including teams focused on data catalogs, large-scale HPC compute services, user interfaces and APIs, and AI/MLOps operations.

Most contributors currently support AmSC on a part-time basis. You will be among the first full-time engineers fully dedicated to the project. Your primary responsibility will be enabling science teams by building foundational infrastructure and developing CI/CD pipelines for service deployment.

Major Duties/Responsibilities

  1. Administer Kubernetes clusters and support application deployments across environments
  2. Build and maintain pipelines for cloud infrastructure and science service deployment
  3. Manage container image registries such as Harbor
  4. Develop and maintain automation for provisioning and CI/CD using tools like Terraform, GitOps, and Python
  5. Implement security controls in alignment with DevSecOps practices
  6. Configure instrumentation for infrastructure and services to support monitoring and alerting
  7. Provide operational support and engineering for production applications
  8. Define KPIs, improve processes, and drive continuous optimization
  9. Troubleshoot and resolve platform issues efficiently
  10. Participate in an on-call rotation, including 24/7 support and scheduled maintenance
  11. Deploy and manage Kubernetes clusters (EKS, AKS, GKE, or equivalent), including upgrades, node lifecycle, networking, and multi-environment promotion
  12. Collaborate with vendors to resolve hardware and software issues
  13. Align work with core values: Impact, Integrity, Teamwork, Safety, and Service
  14. Foster a culture of diversity, equity, inclusion, and accessibility

Basic Qualifications

  1. Bachelor’s degree in Computer Science or a related field
  2. Minimum of 2 years of experience as a DevOps Engineer or Cloud Engineer (or equivalent combination of education and experience)

Preferred Qualifications

  1. Experience leading or managing DevOps or Cloud Engineering teams
  2. Strong communication and collaboration skills
  3. Knowledge of cloud architecture patterns and managed services (preferably AWS or another major provider)
  4. Experience with Kubernetes administration, CRDs, and deployment strategies such as GitOps and Helm
  5. Solid understanding of Unix systems and networking protocols
  6. Strong grasp of cloud networking concepts
  7. Ability to identify performance issues and recommend improvements
  8. Experience gathering requirements and implementing solutions
  9. Strong organizational and time management skills, with the ability to work under minimal supervision
  10. Experience with CI/CD methodologies and tools
  11. Familiarity with version control platforms like GitHub or GitLab
  12. Experience with monitoring tools such as Nagios, Grafana, and Prometheus
  13. Experience with Terraform or OpenTofu in multi-account AWS environments (including AWS Organizations, SCPs, IRSA)
  14. Hands-on experience with ArgoCD, including App of Apps and ApplicationSets
  15. Familiarity with Tanka, Jsonnet, or similar configuration-as-code tools
  16. Experience with API gateways such as Kong in Kubernetes environments
  17. Knowledge of secrets management solutions like AWS Secrets Manager or External Secrets Operator
  18. Exposure to research networks such as ESnet or Internet2 is a plus

Special Requirement

This position requires the ability to obtain and maintain a federal public trust clearance. It is classified as a Workplace Substance Abuse Program (WSAP) testing position, requiring a pre-employment drug test and participation in random drug testing. Employees must also report any drug-related arrests, convictions, or positive test results as required by ORNL policies.

