Overview
The purpose of this page is to inform users of ongoing issues known to ARCC. Here we detail what each problem is and what we are doing to remedy it. Users who come across an issue that is not detailed on this page should inform ARCC by emailing us at arcc-help@uwyo.edu.
Beartooth MPI Fail
6 Jan. 2023: Symptoms:
--------------------------------------------------------------------------
WARNING: Open MPI failed to TCP connect to a peer MPI process. This should not happen.
Your Open MPI job may now hang or fail.
This is an intermittent issue with MPI that ARCC is working to resolve. If you encounter it, the current workaround is to restart your job.
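As a rough illustration of the restart workaround, the sketch below checks a job's output file for the warning text quoted above and resubmits the job if it is found. It assumes the job was submitted through Slurm with sbatch. The file names passed in are hypothetical placeholders, and the helper functions are illustrative, not an ARCC-provided tool.

```python
import subprocess
from pathlib import Path

# Text taken from the warning shown in the symptoms above.
MPI_TCP_WARNING = "Open MPI failed to TCP connect to a peer MPI process"


def hit_mpi_tcp_warning(log_text: str) -> bool:
    """Return True if the job output contains the Open MPI TCP connect warning."""
    return MPI_TCP_WARNING in log_text


def resubmit_if_needed(log_path: str, batch_script: str) -> bool:
    """Resubmit batch_script with sbatch if the warning appears in log_path.

    Both paths are hypothetical; substitute your own job's files.
    Assumes the job is a Slurm batch job.
    """
    text = Path(log_path).read_text(errors="replace")
    if not hit_mpi_tcp_warning(text):
        return False
    # sbatch is the standard Slurm batch submission command.
    subprocess.run(["sbatch", batch_script], check=True)
    return True
```

For example, resubmit_if_needed("slurm-12345.out", "my_job.sh") would resubmit my_job.sh only when the warning is present in that output file. Both names are placeholders.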
Credential Caching
Description
Occasionally, some users' credentials get cached on Beartooth, which prevents them from logging in.
Solution
We have a script in place that runs every hour, on the hour, and clears the cache on the system. We can also run it manually for users who were previously able to log in to Beartooth, now cannot, and cannot wait for the hourly script.
arccquota Error
25 Oct. 2022: Symptoms:
Traceback (most recent call last):
  File "/apps/s/arcc/latest/bin/arccquota", line 336, in <module>
    for each_p in _user_projs[each_u]:
This is a known issue that ARCC is working to resolve. It should have no impact on your work.
SouthPass
16 Mar. 2022: An issue has been found with running salloc and srun on SouthPass/OnDemand interactive desktops. ARCC is working to resolve this.