Overview

The purpose of this page is to inform users of ongoing issues known to ARCC. Here we detail what each problem is and what we are doing to remedy it. Users who come across an issue that is not detailed on this page should inform ARCC by emailing arcc-help@uwyo.edu.

SouthPass

3/16/22: An issue has been found with running salloc and srun on SouthPass/OnDemand interactive desktops. ARCC is working to resolve this.

Teton

8/11/21: Intel MPI 2018.x.y and Earlier Libraries and Related Applications Failing

Description

Since the last maintenance, during which Slurm was updated, we have noticed a backward-incompatibility issue with the Intel MPI libraries. Any application that uses the intel-mpi 2018.x.y or earlier libraries will fail to run.

For example, this affects a local build of VASP that was built with, and uses, module load intel-mpi.x.y.

Solution

To resolve the issue, add the --mpi=pmi2 option directly after the srun command in your command line or submission script: srun --mpi=pmi2
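As a minimal sketch of the fix above, assuming a Bash submission script, the wrapper below places --mpi=pmi2 directly after srun for every launch that goes through it. The function name srun_pmi2 and the application name in the usage note are hypothetical and not something ARCC provides.

```shell
#!/bin/bash
# Hypothetical helper (not provided by ARCC): launches its arguments
# through srun with --mpi=pmi2 placed directly after the srun command.
srun_pmi2() {
    srun --mpi=pmi2 "$@"
}
```

In a submission script, a line such as srun ./my_app (a placeholder application) would become srun_pmi2 ./my_app. Writing srun --mpi=pmi2 ./my_app directly has the same effect; the wrapper only saves repeating the option when a script launches several steps.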

...

Jupyter Lab

18 Jan. 2023. Symptoms:

There is an issue with Jupyter that prevents users from deleting files from within Jupyter. ARCC is working on this.
In the meantime, users can:

  1. Go to the SouthPass window.

  2. Click ‘Files’ -> /project/atsc5009.

  3. Select the offending files and click ‘Delete’.

Beartooth MPI Fail

6 Jan. 2023. Symptoms:

--------------------------------------------------------------------------
WARNING: Open MPI failed to TCP connect to a peer MPI process.  This
should not happen.

Your Open MPI job may now hang or fail.

This is an intermittent MPI issue that ARCC is working to resolve. If you encounter it, the current workaround is to restart your job.
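If restarting by hand becomes tedious, the workaround can be sketched as a small retry helper. This is only an illustration under stated assumptions: the function name run_with_retry and the attempt limit are hypothetical, and it assumes that rerunning the launch inside the same job is acceptable for your workload. It reacts only to a failed (non-zero) exit; a job that hangs still has to be cancelled and restarted manually.

```shell
#!/bin/bash
# Hypothetical helper (not provided by ARCC): rerun a command until it
# succeeds or the attempt limit is reached. Only failures (non-zero
# exit status) trigger a retry; a hung process is not detected.
run_with_retry() {
    local max_attempts=$1
    shift
    local attempt=1
    while [ "$attempt" -le "$max_attempts" ]; do
        # Run the command; stop as soon as one attempt succeeds.
        if "$@"; then
            return 0
        fi
        echo "Attempt $attempt of $max_attempts failed." >&2
        attempt=$((attempt + 1))
    done
    # Every attempt failed.
    return 1
}
```

A launch line would then look like run_with_retry 3 srun ./my_app, where ./my_app is a placeholder for your own executable.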

Credential Caching

Description

Occasionally, some users' credentials get cached on Teton/Beartooth, which prevents them from logging in.

Solution

We have a script in place that clears the cache on the system every hour, on the hour. We can also run it manually for users who could previously log in to Teton/Beartooth but now cannot, and who can’t wait for the hourly run.

arccquota Error

25 Oct. 2022. Symptoms:

Traceback (most recent call last):
  File "/apps/s/arcc/latest/bin/arccquota", line 336, in <module>
    for each_p in _user_projs[each_u]:

This is a known issue that ARCC is working to resolve. It should have no impact on your work.

...