Notice
Recent Posts
Recent Comments
Link
일 | 월 | 화 | 수 | 목 | 금 | 토 |
---|---|---|---|---|---|---|
1 | 2 | 3 | 4 | |||
5 | 6 | 7 | 8 | 9 | 10 | 11 |
12 | 13 | 14 | 15 | 16 | 17 | 18 |
19 | 20 | 21 | 22 | 23 | 24 | 25 |
26 | 27 | 28 | 29 | 30 | 31 |
Tags
- LUSTRE
- patch
- Singularity
- conda
- 1.9
- hpcm패치
- 1.10
- infiniband
- HPFSS
- ubuntu
- Docker
- Kernel
- gpfs
- nvidia
- SLURM
- version
- CUDA
- Cray
- Linux
- top500
- HPCM
- CPU
- AMD
- GPU
- PFSS
- rhel
- Source
- build
- java
- HPE
Archives
- Today
- Total
HPE CRAY 자료 공유
HPE Apollo 6500 Gen9(XL270d Gen9) 본문
1. Front panel
Item | Description | 비고 |
1 | Drive bays | |
2 | Slot 9 PICe3 x 16 | |
3 | Slot 10 PICe3 x 16 | |
4 | NIC port 2 | |
5 | NIC port 1 | |
6 | Dedicated iLO port(optional) | |
7 | Serial number and iLO label pull tab | |
8 | USB 3.0 connector | |
9 | SUV connector |
- Chassis and Accelerator Trays
Item | Description | 비고 |
1 | HPE Apollo 6500 Chassis (4U) | |
2 | Low profile PCIe Gen3 x16 slot | |
3 | Embedded 1Gb NIC 2 | |
4 | Embedded 1Gb NIC 1 | |
5 | Dedicated iLO Port (Optional) Low profile PCIe Gen3 x16 | |
6 | Unit Identification (UID) LED/button | |
7 | Server serial label pull tab | |
8 | Power Button | |
9 | USB 3.0 Connector | |
10 | SUV(Serial/USB/Video) Connector | |
11 | Low profile PCIe Gen3 x16 slot | |
12 | 8 SFF SAS/SATA Drive Bays slot | |
13 | HPE ProLiant XL270d Accelerator Trays (2U/tray) |
- Back of the Chassis
Item | Description | 비고 |
1 | Unit Identification (UID) LED | |
2 | Pass through power connections to Accelerator Tray | |
3 | HPE Apollo 6500 Chassis | |
4 | Power Shelf Data Connection | |
5 | ILO connector | |
6 | ILO connector | |
7 | HPE Advanced Power Module ILO connector | |
8 | HPE APMI 1.0 connector | |
9 | Power Shelf Data Connection | |
10 | Fan- 4 per Accelerator tray (required), 8 total per 6500 chassis with two Accelerator Trays |
- GPU accelerator numbering
(1) Server left GPU accelerator numbering
Item | Description | 비고 |
1 | GPU 1 | P40 or P100 |
2 | GPU 2 | P40 or P100 |
3 | GPU 3 | P40 or P100 |
4 | GPU 4 | P40 or P100 |
(2) Server right GPU accelerator numbering
Item | Description | 비고 |
5 | GPU 5 | P40 or P100 |
6 | GPU 6 | P40 or P100 |
7 | GPU 7 | P40 or P100 |
8 | GPU 8 | P40 or P100 |
2. Board
Item | Description | 비고 |
1 | Power riser connector | |
2 | DIMMS for processor 2 | |
3 | DIMMS for processor 1 | |
4 | Right PCIriser module connector (PCIex40) | |
5 | System maintenance switch | |
6 | Mini-SAS connector 1 (SATA x4) | |
7 | Internal USB 3.0 connector | |
8 | Mini-SAS connector 2 (SATA x4) | |
9 | Right PCIriser module connector (PCIex24) | |
10 | Dedicated iLO port connector | |
11 | NMI header | |
12 | Left PCI riser module connector (PCIex16) | |
13 | microSD slot | |
14 | System battery | |
15 | TPM Connector | |
16 | Processor 1 | |
17 | Processor 2 |
- 참고#2 : https://h20195.www2.hpe.com/v2/GetPDF.aspx/c05069179.pdf
- 참고#3 : https://support.hpe.com/hpesc/public/docDisplay?docLocale=en_US&docId=emr_na-c05318490
3. GPU 서버 BIOS 설정 및 OS 구성
3.1. BIOS 설정
1.System Options 1-1. Processor Options -> Intel (R) Hyperthreading Options -> Disabled 1-2. SATA Controller Options -> Embedded SATA Configuration -> Enable SATA AHCI Support 1-3. Virtualization Options Virtualization Technology -> Disabled Intel (R) VT-d -> Disabled SR-IOV -> Disabled 2. Configuring Boot Options -> Boot Mode -> Legacy BIOS Mode 3. Power Management options -> Power Profile -> Static HighPerformance Mode 4. Server Availability options -> ASR Status -> Disabled 5. Configuring advanced platform configuration options Advanced Options Fan and Thermal Options -> Increased Cooling Setting the fan failure policy -> Allow Operation with Critical Fan Failures |
3.2. OS 설정
- GPU Driver 설치를 위해 "rd.driver.blacklist=nouveau" 를 /etc/default/grub 에 추가
# vi /etc/default/grub GRUB_CMDLINE_LINUX 맨 뒷줄에 "rd.driver.blacklist=nouveau" 추가 후 저장. |
- grub23-mkconfig –o /boot/grub/grub.cfg 로 커널에 변경사항 적용.
- 부팅 시 "nouveau" 드라이버를 로드 하지 않기 위해 파일 생성.
# vi /etc/modprobe.d/nvidia-installer-disable-nouveau.conf blacklist nouveau options nouveau modeset=0 |
- 부팅 시 multiuser 모드로 부팅 하도록 설정 변경.
# systemctl set-default multiuser.target |
- 리부팅 후 NVIDIA-LINUX-450.102.04 와 CUDA 11.0 을 설치.
# sh NVIDIA-LINUX-450.102.04.run # sh cuda-11.0.3.XXX.XXX.XXX.run #설치 시 GPU 드라이버는 선택 해제 후 설치. |
'SYSTEMS > HPE' 카테고리의 다른 글
[H/W] HPE Cray XD2000 (0) | 2024.04.04 |
---|---|
[H/W]HPE Cray Supercomputing XD665 (0) | 2024.02.26 |
[CRAY] Lustre 2.15.2 Client OS 지원 목록 (0) | 2023.11.29 |
[H/W]HPE Cray Supercomputing XD670 (1) | 2023.11.14 |
[Intel Board] CPG MEMORY SEL LOG DECODER (0) | 2022.05.29 |