AI Infrastructure

DevOps System Administrator (AI/ML Infrastructure) - TS/SCI with CI Poly – Scottsdale, AZ

DevOps System Administrator (AI/ML Infrastructure)

πŸ“ Location: Scottsdale, Arizona
🏒 Work Environment: 100% Onsite
πŸ’Ό Employment Type: Full-Time
πŸ›‘οΈ Security Clearance Required: Active DoD TS/SCI with Polygraph
✈️ Travel: Occasional
🚫 Visa Sponsorship: Not Available

πŸ’° Compensation & Benefits

  • Base Salary: $126,000 – $150,000

  • Full Benefits Package

  • Relocation Assistance Available

  • No commission or overtime eligibility

πŸš€ Position Overview

A leading defense technology organization is seeking an experienced DevOps System Administrator to support and optimize advanced Artificial Intelligence and Machine Learning (AI/ML) infrastructure within a highly secure classified environment.

This role will focus on building, maintaining, and automating enterprise-scale infrastructure supporting AI model development, training, testing, and deployment. The successful candidate will work closely with Data Scientists, ML Engineers, and Infrastructure Teams to ensure scalable, secure, and high-performing environments.

This is an excellent opportunity to work on mission-critical technologies supporting national security initiatives while leveraging cutting-edge AI, cloud-native, and automation technologies.

πŸ”§ Key Responsibilities

AI/ML Infrastructure Management

  • Design, implement, and maintain scalable infrastructure for AI/ML model training and inference

  • Manage GPU resources and high-performance computing environments

  • Support machine learning development and deployment workflows

  • Collaborate with Data Scientists and ML Engineers to streamline model development through production

DevOps & Automation

  • Develop and manage CI/CD pipelines for AI applications and ML models

  • Automate:

    • Infrastructure provisioning

    • Configuration management

    • Deployment processes

  • Utilize Infrastructure as Code (IaC) tools including:

    • Terraform

    • Ansible

Containerization & Orchestration

  • Deploy and manage containerized environments using:

    • Docker

    • Kubernetes

  • Scale and optimize AI/ML services within secure enterprise environments

Linux & Systems Administration

  • Administer Linux-based operating systems and virtualized server environments

  • Perform:

    • Server configuration

    • Performance tuning

    • Patch management

    • Troubleshooting

    • Monitoring

  • Support both physical and virtual infrastructure environments

Monitoring & Operational Support

  • Implement monitoring, logging, and alerting solutions

  • Analyze system alerts and perform root-cause investigations

  • Create scripts to automate repetitive administrative tasks

  • Troubleshoot networking, storage, and server-related issues

Technical Leadership

  • Serve as a subject matter expert for server and infrastructure operations

  • Mentor team members and provide technical guidance

  • Support system design, analysis, and continuous improvement initiatives

  • Identify opportunities to leverage AI and automation for operational efficiency

βœ… Required Qualifications

Education

  • Bachelor’s Degree in:

    • Computer Science

    • Related technical field

    • OR equivalent experience

OR

  • Master’s Degree with 6+ years of relevant experience

Experience

  • 7–10 years of relevant systems administration, infrastructure, or DevOps experience

  • Strong enterprise server administration background

  • Experience supporting large-scale production environments

  • Advanced Linux administration skills

  • Experience supporting virtualized infrastructure environments

πŸ”’ Clearance Requirements (Mandatory)

βœ” Active DoD TS/SCI with Polygraph required at time of hire
βœ” U.S. Citizenship required
βœ” Clearance must be active and current
❌ No sponsorship available
❌ No clearance reinstatement candidates

πŸ› οΈ Required Technical Skills

βœ” Linux Administration
βœ” Docker
βœ” Kubernetes
βœ” AI/ML Infrastructure Support
βœ” CI/CD Pipeline Management
βœ” Infrastructure as Code (Terraform, Ansible)
βœ” Virtualization Technologies
βœ” Enterprise Server Support
βœ” Networking & Storage Troubleshooting
βœ” Automation & Scripting

⭐ Preferred Qualifications

  • Experience supporting AI/ML development environments

  • GPU resource management experience

  • High-performance computing environments

  • Cloud-native infrastructure experience

  • Container orchestration at enterprise scale

  • Experience within classified or defense environments

🎯 Ideal Candidate

The ideal candidate is a highly technical infrastructure professional with deep Linux expertise, strong containerization skills, and experience supporting AI/ML environments in secure enterprise settings.

They will bring:

  • Active TS/SCI Poly clearance

  • Strong Docker and Kubernetes experience

  • Advanced troubleshooting capabilities

  • Enterprise systems administration expertise

  • Automation and DevOps mindset

  • Ability to work independently and collaboratively

  • Passion for emerging technologies and AI-driven innovation

πŸ“‹ Screening Questions

  1. Do you have hands-on Linux administration experience?

  2. Do you have experience with Docker, Kubernetes, or similar container platforms?

  3. Are you comfortable working onsite full-time in Scottsdale, AZ?

  4. Do you currently hold an active DoD TS/SCI with Polygraph clearance?

πŸ“Œ Candidate Snapshot

Requirement

Details

Experience

7–10 Years

Seniority

Mid-Senior

Industry

Defense / Technology

Education

Bachelor’s Degree or Equivalent

Clearance

Active TS/SCI Poly Required

Work Model

100% Onsite

Travel

Occasional

Relocation

Available

🌟 Why Join

Award-Winning Culture

  • Recognized as a Gallup Exceptional Workplace

  • Strong employee engagement and retention

  • Average employee tenure of 12 years

Work-Life Balance

  • Flexible 9/80 schedule

  • Every other Friday off

  • Generous PTO and parental leave

Outstanding Benefits

  • 401(k) with company match and immediate vesting

  • Comprehensive medical, dental, and vision coverage

  • Tuition assistance and professional development support

Meaningful Work

  • Support cutting-edge technologies including:

    • Artificial Intelligence

    • Cloud Native Platforms

    • Advanced Cyber Systems

    • National Security Programs

Collaborative Environment

  • Strong engineering culture

  • Low micromanagement

  • High autonomy and ownership

  • Opportunities to innovate and contribute to mission-critical solutions

This role offers the opportunity to combine advanced DevOps expertise with next-generation AI technologies while supporting some of the most important missions in the defense and intelligence community.