Skip to content
Merged
Show file tree
Hide file tree
Changes from 1 commit
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
10 changes: 10 additions & 0 deletions docs/alps/clusters.md
Original file line number Diff line number Diff line change
Expand Up @@ -43,4 +43,14 @@
[:octicons-arrow-right-24: Santis][ref-cluster-santis]
</div>

## Other systems

<div class="grid cards" markdown>
- :fontawesome-solid-mountain: __Porting and Development__

Besso is a small system for selected customers to do code portina and development with AMD and NVIDIA GPUs.

Check failure on line 51 in docs/alps/clusters.md

View workflow job for this annotation

GitHub Actions / Check Spelling

`portina` is not a recognized word. (unrecognized-spelling)

Check failure on line 51 in docs/alps/clusters.md

View workflow job for this annotation

GitHub Actions / Check Spelling

`Besso` is not a recognized word. (unrecognized-spelling)

[:octicons-arrow-right-24: Besso][ref-cluster-besso]

Check failure on line 53 in docs/alps/clusters.md

View workflow job for this annotation

GitHub Actions / Check Spelling

`Besso` is not a recognized word. (unrecognized-spelling)

Check warning on line 53 in docs/alps/clusters.md

View workflow job for this annotation

GitHub Actions / Check Spelling

`Besso` is not a recognized word. (unrecognized-spelling)
</div>


92 changes: 92 additions & 0 deletions docs/clusters/besso.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,92 @@
[](){#ref-cluster-besso}
# Besso

Check failure on line 2 in docs/clusters/besso.md

View workflow job for this annotation

GitHub Actions / Check Spelling

`Besso` is not a recognized word. (unrecognized-spelling)

Check warning on line 2 in docs/clusters/besso.md

View workflow job for this annotation

GitHub Actions / Check Spelling

`Besso` is not a recognized word. (unrecognized-spelling)

Besso is an Alps cluster that provides development resources for porting software for selected customers.

Check failure on line 4 in docs/clusters/besso.md

View workflow job for this annotation

GitHub Actions / Check Spelling

`Besso` is not a recognized word. (unrecognized-spelling)

Check warning on line 4 in docs/clusters/besso.md

View workflow job for this annotation

GitHub Actions / Check Spelling

`Besso` is not a recognized word. (unrecognized-spelling)

!!! note
Besso is a small system for development work for selected customers.

Check failure on line 7 in docs/clusters/besso.md

View workflow job for this annotation

GitHub Actions / Check Spelling

`Besso` is not a recognized word. (unrecognized-spelling)

Check warning on line 7 in docs/clusters/besso.md

View workflow job for this annotation

GitHub Actions / Check Spelling

`Besso` is not a recognized word. (unrecognized-spelling)

### Storage and file systems

Besso uses the [HPCP filesystems and storage policies][ref-hpcp-storage].

## Getting started

### Logging into Besso

To connect to Besso via SSH, first refer to the [ssh guide][ref-ssh].

!!! example "`~/.ssh/config`"
Add the following to your [SSH configuration][ref-ssh-config] to enable you to directly connect to besso using `ssh besso`.

Check failure on line 20 in docs/clusters/besso.md

View workflow job for this annotation

GitHub Actions / Check Spelling

`besso` is not a recognized word. (unrecognized-spelling)

Check failure on line 20 in docs/clusters/besso.md

View workflow job for this annotation

GitHub Actions / Check Spelling

`besso` is not a recognized word. (unrecognized-spelling)

Check warning on line 20 in docs/clusters/besso.md

View workflow job for this annotation

GitHub Actions / Check Spelling

`besso` is not a recognized word. (unrecognized-spelling)
```
Host besso
HostName besso.vc.cscs.ch
ProxyJump ela
User cscsusername
IdentityFile ~/.ssh/cscs-key
IdentitiesOnly yes
```

### Software

[](){#ref-cluster-besso-uenv}
#### uenv

CSCS does not provide officialy supported applications on Besso.

Check failure on line 35 in docs/clusters/besso.md

View workflow job for this annotation

GitHub Actions / Check Spelling

`officialy` is not a recognized word. (unrecognized-spelling)
Basic programming environments are provided for the a100 and mi200 node types.

<div class="grid cards" markdown>

- :fontawesome-solid-layer-group: __Programming Environments__

Provide compilers, MPI, Python, common libraries and tools used to build your own applications.

* [prgenv-gnu][ref-uenv-prgenv-gnu]
</div>

[](){#ref-cluster-besso-containers}
#### Containers

Besso supports container workloads using the [Container Engine][ref-container-engine].

To build images, see the [guide to building container images on Alps][ref-build-containers].

#### Cray Modules

!!! warning
The Cray Programming Environment (CPE), loaded using `module load cray`, is no longer supported by CSCS.

CSCS will continue to support and update uenv and the Container Engine, and users are encouraged to update their workflows to use these methods at the first opportunity.

The CPE is still installed on Besso, however it will receive no support or updates, and will be [replaced with a container][ref-cpe] in a future update.

## Running jobs on Besso

### Slurm

Besso uses [Slurm][ref-slurm] as the workload manager, which is used to launch and monitor workloads on compute nodes.

There are multiple [Slurm partitions][ref-slurm-partitions] on the system:

* the `a100` partition contains [NVIDIA A100 GPU][ref-alps-a100-node] nodes
* the `mi200` partition contains [AMD Mi250x GPU][ref-alps-mi200-node] nodes
* the `normal` partition contains all of the nodes in the system.

| name | max nodes per job | time limit |
| -- | -- | -- |
| `a100` | 2 | 24 hours |
| `mi200` | 2 | 24 hours |
| `normal` | 4 | 24 hours |

See the Slurm documentation for instructions on how to [run jobs][ref-slurm].

### FirecREST

!!! todo
add the correct API endpoint

Besso can also be accessed using [FirecREST][ref-firecrest] at the `https://api.cscs.ch/hpc/firecrest/v2` API endpoint.

## Maintenance and status

There is no regular scheduled maintenance for this system.
Loading