MECH - 98 - Hiring_ HPC Administrator_6+ Years Bangalore

India (Karnataka) Bangalore G.P.O.

IT Digital

1 month ago

Role: HPC Administrator

Experience: 6+ Years

Location: Bangalore

Notice period: 0-15 days

Job description:

The HPC Administrator is responsible for designing, implementing, and operating enterprise infrastructure and systems automation for HPC (High Performance Computing) clusters.

Must be capable of working with minimal supervision and of applying the necessary technical expertise effectively and efficiently while communicating with peers, customers, and leadership.

Key Responsibilities:

  • Setup, configuration, general maintenance, and troubleshooting of the HPC cluster for the CAE department.
  • Manage a large and diverse HPC environment, including design, build, and capacity planning
  • Knowledge of High Performance Computing (HPC), such as managing CAE software and troubleshooting failed HPC jobs; knowledge of PBS/SLURM/LSF/SGE or any other scheduler will be an added advantage
  • Integrate new CAE applications into the existing HPC cluster
  • Knowledge of CAE applications such as STARCCM, Abaqus, Numeca, LS-DYNA, Preonlab, Converge, Console
  • Working experience with Altair applications such as ANSA, Hypermesh, Hyperworks, Medina
  • Knowledge of Altair PBS and license server management
  • Evaluate and recommend CAE software and hardware for enterprise systems.
  • Work with core production support personnel in IT and Engineering to automate deployment and operation of the infrastructure
  • LDAP configuration and integration
  • Manage and maintain monitoring to ensure uptime and SLA levels.
  • Manage, deploy and configure infrastructure with Puppet / Ansible or other automation tools
  • Knowledge of operating systems such as CentOS, Ubuntu, Red Hat
  • Support, interface, and cooperate with cross-functional teams. Work closely with end users to understand their needs and provide guidance.
  • Knowledge of scripting languages, including Shell and Python.
  • Document the processes and procedures followed in day-to-day operations as well as in new implementations
  • Direct and coordinate assigned work with the respective teams; possess excellent communication and problem-solving skills.

Required Skills:

  • Minimum of 6 years of HPC experience (required).
  • Bachelor’s degree in Computer Science, Information Systems, or equivalent education
  • Hands-on experience in HPC infrastructure
  • Working knowledge of HPC schedulers such as PBS and SLURM
  • Experience providing application support for CAE applications such as STARCCM, Abaqus, Numeca, LS-DYNA.
  • Troubleshooting knowledge of HPC jobs
  • Work closely with the CAE department to gather requirements and provide the best solutions to end users
  • Must be able to work with and provide support for cross-functional groups and technical areas (compute, storage, network, applications)
  • Must have a firm understanding of Linux internals and experience automating system builds, patching, and configuration management
  • Knowledge of systems management automation using industry-standard and open-source tools such as Python, Bash, Puppet, Ansible.
  • Good understanding of the server technologies available for deploying servers in the data center, and of vendor management
  • Excellent communication, team coordination, and interpersonal skills

Must-Have Skills: HPC Cluster Management, CAE Software Management, HPC Scheduler Knowledge (PBS, SLURM, LSF, SGE), Altair Applications (ANSA, Hypermesh, Hyperworks, Medina), License Server Management, Systems Automation (Puppet, Ansible), Operating Systems (CentOS, Ubuntu, Red Hat), Scripting Knowledge (Shell, Python), Linux Internals, Capacity Planning, LDAP Configuration & Integration, Cross-Functional Support

Secondary Skills: HPC Job Troubleshooting, Automation Tools (Python, Bash, Puppet, Ansible), System Deployment in Data Centers, Vendor Management, Communication Skills, Documentation

Keywords: HPC Administrator, CAE Applications, HPC Cluster Management, HPC Scheduler (PBS, SLURM, LSF, SGE), Altair Applications, License Server Management, Systems Automation, Ansible, Puppet

  • Experience

    6-9 years

  • Primary Skills

    HPC Cluster Management, CAE Software Management, HPC Scheduler Knowledge (PBS, SLURM, LSF, SGE), Altair Applications (ANSA, Hypermesh, Hyperworks, Medina), License Server Management, Systems Automation (Puppet, Ansible), Operating Systems (CentOS, Ubuntu, Red Hat), Scripting Knowledge (Shell, Python), Linux Internals, Capacity Planning, LDAP Configuration & Integration, Cross-Functional Support

  • Number of Positions

    1
