Named after one of Wyoming’s reservoirs on the North Platte River, Pathfinder is a low-cost storage solution that enables a cloud-like presence for research data hosted by ARCC. The system is built to be expandable and provides data protection. Its core functionality is hosting onsite backups as well as enabling data sharing and collaboration.

Contents

Glossary

Frequently Asked Questions


How it Works

Pathfinder uses the Simple Storage Service (S3) protocol, originally developed by Amazon, which Amazon describes as “storage for the Internet”. On Pathfinder, S3 is served from object storage provided by Ceph, an open-source storage platform available from Red Hat.

Unique Characteristics of S3

ARCC's S3 services, like Pathfinder, do not function like Windows file shares or other traditional storage systems. Below is a list of a few unique characteristics of S3.
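One widely cited difference, offered here as an illustration rather than as ARCC's own list, is that an S3 bucket holds a flat set of object keys. What looks like a folder in a client is only a shared key prefix. The sketch below emulates a delimiter-based listing in plain Python; the function name `list_level` and the sample keys are hypothetical and no S3 library is involved.

```python
from typing import Iterable, List, Tuple


def list_level(keys: Iterable[str], prefix: str = "",
               delimiter: str = "/") -> Tuple[List[str], List[str]]:
    """Emulate a delimiter-based listing over a flat set of object keys.

    Returns (objects, common_prefixes) found directly under `prefix`.
    """
    objects: List[str] = []
    prefixes: List[str] = []
    for key in sorted(keys):
        if not key.startswith(prefix):
            continue
        rest = key[len(prefix):]
        if delimiter in rest:
            # Everything up to the first delimiter is shown as a "folder".
            common = prefix + rest.split(delimiter, 1)[0] + delimiter
            if common not in prefixes:
                prefixes.append(common)
        else:
            objects.append(key)
    return objects, prefixes


# Hypothetical bucket contents: plain strings, no real directories.
bucket_keys = [
    "readme.txt",
    "run1/input.csv",
    "run1/output/result.csv",
    "run2/input.csv",
]

top_objects, top_folders = list_level(bucket_keys)
run1_objects, run1_folders = list_level(bucket_keys, prefix="run1/")
```

Because the "folders" are derived from key names at listing time, there is nothing to create or delete separately from the objects themselves.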

Purpose of System

The Pathfinder S3-Ceph storage architecture is designed and hosted by ARCC to serve two primary purposes:

  1. The system will act as an onsite backup target for other ARCC services such as the petaLibrary or Teton.

  2. The system will act as a publicly accessible data transfer platform via the S3 protocol.

    1. Users will be able to host their own 'bucket' to share data.

    2. Users can also obtain data from external collaborators.

The system also serves a wide variety of supplementary functions:

Use Cases

Host data that end users can download either directly (publicly) or with credentials.

Back data up to Pathfinder as a second (or third) copy of your critical research, using a wide variety of open-source tools.

This space is a stand-alone entity, and will not be mounted directly on other ARCC resources.

This system is *NOT* backed up. Data that reside on this system should also be available in at least one other location. This system is intended as a secondary backup and a temporary repository for data transfers ONLY.
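The backup use case above can be sketched in Python as a two-step process: first map local files to object keys, then upload them. This is a minimal sketch under stated assumptions, not an ARCC-provided tool. The helper names `plan_backup` and `run_backup` are hypothetical, and `client` is assumed to be a boto3 S3 client (or any object with a compatible `upload_file` method).

```python
import os
from typing import Dict


def plan_backup(local_dir: str, key_prefix: str) -> Dict[str, str]:
    """Map every file under local_dir to an object key under key_prefix."""
    prefix = key_prefix.strip("/")
    plan: Dict[str, str] = {}
    for root, _dirs, files in os.walk(local_dir):
        for name in files:
            path = os.path.join(root, name)
            rel = os.path.relpath(path, local_dir)
            # Object keys use forward slashes regardless of the local OS.
            rel_key = rel.replace(os.sep, "/")
            plan[path] = f"{prefix}/{rel_key}" if prefix else rel_key
    return plan


def run_backup(client, bucket: str, plan: Dict[str, str]) -> int:
    """Upload each planned file and return the number of files sent.

    Assumes `client` exposes upload_file(Filename, Bucket, Key),
    as a boto3 S3 client does.
    """
    for path, key in plan.items():
        client.upload_file(path, bucket, key)
    return len(plan)
```

Separating the plan from the upload makes it possible to review which keys will be written before any data is transferred. Since Pathfinder itself is not backed up, the local copy should be kept after the upload.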

S3 Clients

The S3 protocol requires a client to connect to the server. A variety of Graphical User Interface (GUI) and Command Line Interface (CLI) clients can be used to connect to Pathfinder. With so many S3 clients available, ARCC has not tested them all; the few that have been tested are detailed in the table below.

| Client Name | Operating System | GUI or CLI | Free? | ARCC recommended/supported |
|---|---|---|---|---|
| MSP360 Explorer (Cloudberry) | Windows, macOS | GUI | Yes, but larger transfers will require a license | Yes |
| Cyberduck | Windows, macOS | GUI | Yes | Best Effort |
| Transmit | macOS | GUI | No | Best Effort |
| Dragon Disk | Windows, macOS, Linux | GUI | Yes | No |
| rclone | Windows, macOS, Linux | CLI | Yes | Yes |
| s3cmd | macOS, Linux | CLI | Yes | Best Effort |

Instructions for using Pathfinder with MSP360 Explorer (Cloudberry)

Instructions for using Pathfinder with rclone

Scripting/Programming Packages

Some programming languages provide software packages that can use the S3 protocol to access data. A few of these packages are detailed in the table below, along with whether ARCC has tested them.

| Package Name | Language | ARCC Tested |
|---|---|---|
| boto3 | Python | Yes |
| aws.s3 | R | Yes |
| AWS | C# | No |
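As an illustration of how a package such as boto3 can be pointed at an S3-compatible service instead of Amazon, the following is a minimal sketch. The endpoint URL is a hypothetical placeholder (the real address comes from ARCC along with your keys), and the helper names `client_settings`, `make_client`, and `list_keys` are assumptions made for this example, not ARCC-provided code.

```python
from typing import Dict, List

# Hypothetical placeholder; use the endpoint ARCC provides with your keys.
EXAMPLE_ENDPOINT = "https://s3.example.org"


def client_settings(endpoint_url: str, access_key: str,
                    secret_key: str) -> Dict[str, str]:
    """Keyword arguments for boto3.client("s3", ...) against a non-AWS endpoint."""
    return {
        "endpoint_url": endpoint_url,
        "aws_access_key_id": access_key,
        "aws_secret_access_key": secret_key,
    }


def make_client(endpoint_url: str, access_key: str, secret_key: str):
    """Create an S3 client; boto3 is imported here so the other helpers work without it."""
    import boto3  # third-party package: pip install boto3
    return boto3.client("s3", **client_settings(endpoint_url, access_key, secret_key))


def list_keys(client, bucket: str, prefix: str = "") -> List[str]:
    """Return all object keys in a bucket, following pagination."""
    keys: List[str] = []
    paginator = client.get_paginator("list_objects_v2")
    for page in paginator.paginate(Bucket=bucket, Prefix=prefix):
        # Pages for an empty bucket or prefix carry no "Contents" entry.
        for obj in page.get("Contents", []):
            keys.append(obj["Key"])
    return keys
```

A typical call would be `client = make_client(EXAMPLE_ENDPOINT, access_key, secret_key)` followed by `list_keys(client, "my-bucket")`, where the bucket name is again hypothetical. Reading the keys from environment variables or a credentials file is generally preferable to hard-coding them in a script.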

Cost

Price Structure for S3

This price structure is based on actual hardware costs and does not include personnel or infrastructure (network/datacenter) costs. Those have been subsidized by ARCC and the Office of Research and Economic Development.

Requesting Access

To request access to Pathfinder and receive an access key and secret key pair, email arcc-help@uwyo.edu with the subject “Pathfinder access request”.