CTE-AMD

System Overview

CTE-AMD is a cluster based on AMD EPYC processors, running a Linux operating system and connected by an InfiniBand network.

It has the following configuration:

  • 1 login node and 33 compute nodes, each with:
    • 1 x AMD EPYC 7742 @ 2.25 GHz (64 cores, 2 threads per core, 128 threads per node)
    • 1024 GiB of main memory in 16 x 64 GiB DIMMs @ 3200 MHz
    • 1 x 480 GB SSD as local storage
    • 2 x AMD Radeon Instinct MI50 GPU with 32 GB
    • Single-port Mellanox InfiniBand HDR100
    • GPFS via two 10 Gbit copper links

The operating system is Rocky Linux release 8.5 (Green Obsidian).

Current basic software stack

Caution: The most recent installed version of ROCm is 5.1.1. Installing newer versions would require a cluster-wide update of the operating system, which is not currently planned.

  • GCC (Red Hat) 8.5.0-4
  • Radeon Open Compute (ROCm) Runtime software stack 5.1.1.50101
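
To confirm which versions are active in your session, a check along the following lines may help. This is a minimal sketch: it assumes the compilers are already on your PATH (the guide does not say whether an environment module has to be loaded first), and the function name show_stack_versions is hypothetical.

```shell
# Hypothetical helper: print the first version line of each tool found on PATH.
# Assumes gcc and hipcc (the ROCm HIP compiler driver) are directly reachable.
show_stack_versions() {
    for tool in gcc hipcc; do
        if command -v "$tool" >/dev/null 2>&1; then
            # Both tools accept --version; keep only the first line of output.
            "$tool" --version | head -n 1
        else
            echo "$tool: not found on PATH"
        fi
    done
}

show_stack_versions
```

If a tool is reported as missing, it may simply not be part of the default environment on the node where you ran the check.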

Connecting to CTE-AMD

To connect, you first need your username and password. Once you have them, you can access the cluster through the following login node:

  • amdlogin1.bsc.es

This gives you a shell on the login node, where you can compile and prepare your applications.

You must use Secure Shell (ssh) tools to log in to the cluster or to transfer files to it. Incoming connections over protocols such as telnet, ftp, rlogin, rcp, or rsh are not accepted. For security reasons, you cannot make outgoing connections once you are logged in to the cluster.
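
The connection and transfer rules above can be sketched as a few shell helpers to define on your local machine. This is an illustrative sketch, not an official tool: the function names and the your_username placeholder are hypothetical, and only the host name amdlogin1.bsc.es comes from this guide. Because the cluster does not allow outgoing connections, both upload and download are started from your own machine.

```shell
# Hypothetical placeholder: set CTE_USER to the username you were given.
CTE_USER="${CTE_USER:-your_username}"
CTE_HOST="amdlogin1.bsc.es"   # login node listed in this guide

# Open an interactive shell on the login node.
cte_login() {
    ssh "${CTE_USER}@${CTE_HOST}"
}

# Copy a local file ($1) to your home directory on the cluster.
cte_upload() {
    scp "$1" "${CTE_USER}@${CTE_HOST}:"
}

# Copy a remote file ($1, relative to your home directory) into the
# current local directory. Run locally; the cluster cannot connect out.
cte_download() {
    scp "${CTE_USER}@${CTE_HOST}:$1" .
}
```

For example, after setting CTE_USER, running cte_upload input.dat would copy input.dat to your home directory on the cluster, and cte_login would then open a shell there (input.dat is an illustrative file name).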