Job Description:

Diverse Agile Solutions is seeking a Junior SRE Engineer to establish and maintain site reliability engineering (SRE) operations on multi-user High Performance Computing (HPC) systems, using a variety of configuration management, IT monitoring, and automation tools within a Linux environment (Red Hat, CentOS). The candidate will create a new Nagios alerting database and a new SRE database, and will develop an effective, consistent SRE automation protocol.

Required Clearance:

TS/SCI with Poly

Responsibilities:

  • Experience with, or exposure to, automation tools including Puppet, Salt, Ansible, and Chef.
  • Experience with scripting in Bash, Python, and/or Perl.
  • Experience with, or exposure to, XFS/ZFS file systems and NFS/block storage file system sharing; SSH, TMUX, PDSH, and CLUSH for system access; VI, EMACS, AWK/SED, and CRON for system editing; and Nagios, Ganglia, and SNMP for IT monitoring.

Required Education:

DoD 8570 IAT Level II Certification required.

Bachelor's degree in Computer Science or a related field. Five (5) years of demonstrable experience in systems administration and support of a large client-server-based IT enterprise.