HPC Cluster News
HPC CENVAL-ARC symposium (Mar. 7th 2025)
posted on: 03/01/2025
The University of California, Merced invites you to the 2025 Central Valley Accessible Research and Computational Hub (CENVAL-ARC) Symposium, proudly sponsored by the National Science Foundation’s Campus Cyberinfrastructure (CC) program*. https://cenval-arc.ucmerced.edu/
HPC clusters maintenance (Sep. 23 - 27th, 2024)
posted on: 09/02/2024
HPC clusters MERCED and Pinnacles will be under a maintenance for critical maintenance and for installing hardware from 6am Sep. 23 - 5pm Sep. 27, 2024. During this time, users will not be able to:
- Login to the clusters and access their data
- Run jobs on the cluster
Note that the Slurm reservations will be set in place to make sure jobs do not run after 6am on Sep 23. Please make sure that you are submitting your jobs with a wall-clock time does not exceed 6m on Sep. 23.
During this maintenance, CIRT team along with cluster vendors will perform the following tasks:
- Physical installation of CENVAL-ARC compute nodes
- Upgrading Slurm version
- Regular maintenance
Emergency Maintenance Notification for MERCED and Pinnacles HPC Clusters - 06/17/2024
posted on: 06/02/2024
We are writing to inform you of emergency fire management system maintenance scheduled by Facilities on Monday, June 17th, from 1:00 PM through 1:30 PM that will impact the MERCED and Pinnacles HPC clusters.
During the maintenance window, the clusters will be offline to ensure the safety and integrity of our systems. Any jobs running or scheduled to run during this period will be lost.
Please plan accordingly if you have any critical tasks requiring cluster access during this time.
For up-to-date information on the status of the clusters during this maintenance, please visit status.ucmerced.edu. We anticipate that the clusters will be back online before the end of the business day.
Thank you for your understanding and cooperation
COMPLETED: HPC cluster maintenance - 1/16/24
posted on 01/16/2024
The MERCED and Pinnacles clusters are back online. The CIRT team has completed several updates, including security advisories, bug fixes, and product enhancements. Upgrades encompassed storage server firmware, storage chassis firmware, IB and Data network expansion, and nodes' BIO and BMC firmware. Currently, the default CUDA version for GPU nodes (gnode) is 12.3.
Please feel free to resume