In this section of the workshop we will discuss data management for research workflows and why it’s important, and introduce how you can use ARCC resources to manage your data. This page gives background information for future topics. If you are looking for specific examples, please head back to the main Data Management page to navigate to other pages.

...

The ARCC Data Portal (Storage)

  • Free for UWyo researchers up to a default limit

  • Accessible via the UWyo network or VPN

  • Includes backups and snapshots

MedicineBow HPC system (Analysis)

  • Home (for configuration and profiles)

  • Project (for shared data during analysis)

  • gscratch (for data actively read and written during analysis)

    • MedicineBow is NOT backed up, but includes snapshots

Pathfinder (Storage)

  • Cloud-like backend

  • Web-enabled S3 buckets for data storage, data transfer, etc.

  • Is NOT backed up

...

High Performance Computing is another core ARCC service, and we offer an assortment of support for this type of work. Along with administering the MedicineBow HPC system, we provide documentation, troubleshooting consultations, software management, and workshops. Additionally, we facilitate use of, and provide technical support for, the NCAR-Wyoming Supercomputing Center’s Derecho system.

...

This phase of the Research Data Life-cycle usually occurs after the work has been completed but before other work (such as a manuscript) is published. What exactly it involves depends on the requirements of the funding agencies and/or scientific journals that you are working with. For example, if your work was funded by the NSF, the resulting data must be made publicly available, and if you want to publish in the Journal of Science, your data must be available before your manuscript is published. Good scholarly metadata (described in the next section) will be key to completing this phase. Other key concepts in this phase are:

  • Discipline-specific data repositories

  • General or institutional data repositories

  • Digital identifiers, such as Digital Object Identifiers (DOIs)

  • Personal scholarly identifiers, such as an ORCID

...

How ARCC Can Help With the Publishing Phase

...

...

Together with the Data Librarians at the University of Wyoming Libraries, ARCC supports some of the systems used in publishing research data. The Data Librarians will be the primary points of contact during this phase and can seek ARCC’s assistance if needed. Additionally, some larger datasets will require ARCC to host or move them on the researcher’s behalf. Lastly, if the data to be published are already stored on one of ARCC’s systems, ARCC can assist in moving them to the appropriate place for publishing.

...

Next Steps