Higher Throughput, More Secure, and Greener Computing!"
What is the reason for this maintenance?
Oscar's current operating system, RHEL-7, and its maintenance support phase will come to an end in June 2024. So Oscar is being upgraded to latest RedHat Enterprise Linux RHEL-9.2
Due to the new kernel and glibc majority of applications will break.
We are also introducing a bunch of new features:
Upgraded OS - The OS has been upgraded with a newer kernel and improved security patches
Power Saving Mode: In the batch partition, idle nodes now enter a power-saving mode, consuming only about 40W. They seamlessly transition to high-performance mode just before job execution begins
GPU Direct Storage: GDS enables a direct data path for direct memory access (DMA) transfers between GPU memory and storage, which avoids a bounce buffer through the CPU. This direct path increases system bandwidth and decreases the latency and utilization load on the CPU
SLURM Upgrade: We have tuned the scheduler to provide much higher throughput. Now supports json
and yaml
formatting for all slurm commands
SPACK & LMOD - Newer industry standard for installing and managing applications on Oscar. We now support multple shells bash,zsh & fish
etc
Increased-core core count for GPU accounts:
Account | Partition | Current core-limit | New core-limit |
---|---|---|---|
What are exact version changes?
How to access the new cluster?
We will provide detailed instructions in coming weeks. Thank you for your patience.
Component | Current Version | New Version |
---|---|---|
Operating System
RHEL-7.9
RHEL-9.2
Kernel
3.10.0-1160.76.1
5.14.0-284.11.1
GLIBC
2.17-326
2.34-60
SLURM
22.05.7
23.02.6
Nvidia Driver
535.54.03
535.113.01
Package Manager
PyModules
SPACK
Exploratory
gpu
4-cores
12-cores
Standard GPU Priority
gpu
16-cores
24-cores
Standard GPU Priority+
gpu
32-cores
48-cores
High-End GPU Priority
gpu-he
16-cores
24-cores