Assistant Director of Research Computing
University of Maryland
Baltimore, MD
ID: 7293587
Posted: 1 month ago
Application Deadline: Open Until Filled
Job Description
Department:
The Division of Information Technology (DoIT) has approximately 85 full-time staff members and 100 undergraduate or graduate students who work each day to support the mission of UMBC through the delivery of IT services to the campus. DoIT provides robust, secure IT environments that enable solutions for advancing the UMBC community. We do this through a staff of technology professionals who are connected both nationally and to UMBC.
Learn more about our work at https://doit.umbc.edu/eis/unix-and-windows/
Position Overview:
The Assistant Director of Research Computing is part of a team that is responsible for the day-to-day operations of the research computing infrastructure managed by the Division of Information Technology.
The position will help guide the future of high performance and research computing at UMBC. In addition, the person will lead efforts to best utilize our computing resources to make our researchers successful.
Why Work at UMBC?
UMBC offers competitive compensation. This role starts at $104,000 and includes over 4 weeks of vacation for regular full-time roles. Tuition remission is also available.
What is it like to work at UMBC? Check out Glassdoor or Indeed.
For 15 years in a row UMBC has been recognized as a Great College to Work For!
Telework:
A hybrid telework schedule is available!
Responsibilities:
Support the wide range of disciplines that use UMBC’s RC/CI offerings. Collaborate with researchers and colleagues at other research institutions in applying information technology to research problems. Work with researchers from a variety of disciplines to understand their applications and application performance. Help researchers leverage the parallel CPU and GPU architectures of the computing resources supported by the UMBC DoIT RC Group. Coordinate with cluster administrators concerning the needs of the UMBC research computing community and facilitate user-friendly access to those resources. Respond to reported problems and provide solutions.
Provide guidance to existing and potential users on how to use the cluster for their research projects through both one-on-one consultations and training sessions, covering workflow management as well as data management. Facilitate and develop specific training programs aligned to UMBC’s advanced cyberinfrastructure resources. Work closely with DoIT and campus stakeholders to actively pursue funding opportunities and partnerships related to infrastructure and research computing.
Design and implement solutions to support research collaborations, including solutions involving both software and hardware components and different functional and performance requirements. Design and architect logical solutions to administer the HPCF and its resources. Build tools and support architectures used to facilitate computational workflows and organize experiments in conjunction with best practices.
Serve as the UMBC NSF Campus Champion. As Campus Champion: Maintain familiarity with current and emerging techniques and technologies in research computing. Participate in training, education, and development activities to improve knowledge and performance and enhance professional development. Attend relevant academic and professional conferences.
Perform additional duties as assigned.
Required Minimum Qualifications:
Master's degree, preferably in Computer Science, Information Systems, or a related field, with at least three years of experience working with HPC or parallel computing clusters or three years of experience in RC/CI administration.
Experience with NVIDIA Bright Cluster Manager or other cluster management software
Experience with versioning tools such as Git or Subversion
Installation and/or configuration of Ceph, parallel, or high-performance file systems
Slurm or other cluster computing job management experience
Experience supporting customer requests and working with stakeholders to gather and fulfill project requirements
Capable of managing time effectively, working both independently and as part of a team
Ability to help develop project plans, effectively create and update issues, and keep team members and management informed of changes, impediments, and updates
Enthusiasm for learning new skills and adapting to a dynamic environment
Strong interpersonal skills, enthusiasm for customer service, and the ability to work with students, staff, and faculty from diverse backgrounds
Excellent written and verbal communication skills
Preferred Qualifications:
Master's or coursework equivalent in a traditionally non-computing but STEM-related field (e.g., Applied Mathematics, Engineering, Biology, Geology, Chemistry, or Physics).
Five or more years of experience in RC/CI administration.
HPC knowledge around cluster builds, software, parallel computing, workload management, and cluster management.
Three years of advanced programming in Python and/or C/C++
Familiarity with Linux & modern computing tools (e.g., Jupyter Ecosystem, MATLAB, and/or similar).
Familiarity with machine learning workflows/pipelines.