We are seeking an experienced
CAE HPC Specialist to manage, support, and optimize our on premises high-performance computing (HPC) infrastructure for the Virtual Engineering Group. The ideal candidate will have significant experience with Linux HPC systems and a deep understanding of CAE engineering software, including Ansys and Altair CFD, FEA and multi-physics applications.
The CAE HPC Specialist will be responsible for performance tuning, user support, license management, troubleshooting, automation and system configuration.
The CAE HPC Specialist will support a global engineering team across multiple locations, requiring strong expertise in remote system administration and software deployment.
Responsibilities
Key Responsibilities:
- HPC System Administration & Performance Optimization:
- Lead HPC performance tuning
- Ensure efficient job scheduling and resource allocation using Open PBS.
- Optimize parallel processing and remote visualization workflows (VNC).
- Monitor system health, usage, and troubleshoot hardware/software issues.
- CAE Software Support & Troubleshooting (Global Team):
- Troubleshoot job submission failures, simulation crashes, and visualization issues.
- Assist users with best practices for simulation workflows and HPC utilization.
- Maintaining software compatibility across updates and system upgrades
- Install and maintain CAE software on Workstations and ensure version alignment with HPC
- License & Software Asset Management:
- Assign and prioritize CAE license access for global users
- License tracking, usage reports, and cost-benefit analysis.
- HPC Support & Ticketing Coordination:
- Act as the primary contact for HPC-related support tickets.
- Maintain an internal knowledge base for common troubleshooting issues.
Qualifications
Qualifications & Skills:
- 10+ years of experience in CAE, CFD, FEA and HPC systems
- Strong expertise in Linux (Red Hat 8.7), HPC job scheduling (Open PBS), and remote visualization (VNC).
- Hands-on experience with Ansys Fluent, Maxwell, Altair Hypermesh, OptiStruct, and Radioss.
- Familiarity with license server management (FlexNet, LM-X, etc.).
- Strong scripting skills (Bash, Python) for automation and troubleshooting.
- Experience with performance tuning and parallel computing for CAE applications.
- Excellent problem-solving, communication, and global user support skills.
Essential Skills
Preferred Qualifications:
- Experience in automotive systems, thermal and electromotive systems and their simulation software.
- Knowledge of containerization tools like Docker and Kubernetes.
- Familiarity with AI/ML workflows in HPC environments.
- Experience with benchmarking and performance analysis of HPC systems.
- Familiarity with cloud-based HPC solutions and hybrid computing environments.
- Knowledge of IT security policies, data encryption, and access control management.