Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 19 Next »

Overview

The purpose of this page is to inform users of ongoing issues known to ARCC. Here we detail what each problem is and what we are doing to remedy it. Any issues that users come across that are not detailed on this page should inform ARCC by emailing us at arcc-help@uwyo.edu.

Beartooth MPI Fail

6 Jan. 2023: Symptoms:

--------------------------------------------------------------------------
WARNING: Open MPI failed to TCP connect to a peer MPI process.  This
should not happen.

Your Open MPI job may now hang or fail.

This is an issue with MPI. ARCC is working to resolve this intermittent issue. If you encounter it, simply restarting your job is the workaround at this time.

Credential Caching

Description

Occasionally some users' credentials are getting cached on Beartooth that prevents them from logging in.

Solution

We have a script in place that clears the cache on the system that runs every hour on the hour. We can also run this manually if users run into trouble with logging into Beartooth when they have previously been able to login before and can’t wait for the hourly script.

arccquota Error

25 Oct. 2022: Symptoms:

Traceback (most recent call last):
  File "/apps/s/arcc/latest/bin/arccquota", line 336, in <module>
    for each_p in _user_projs[each_u]:

This is a known issue that ARCC is working to resolve. It should have no impact on your work.

SouthPass:

3/16/22: An issue has been found with running salloc and srun on Southpass/Ondemand interactive desktops. ARCC is working to resolve this.


  • No labels