8000w网格算力测试结果
-
第一台设备:NVIDIA DGX-1 8xV100 32GB
CPU:8/40 2x Xeon E5 2698 v4
GPU:Nvidia Tesla V100 x8
超线程:关闭
RAM:500GB
系统:Ubuntu 20.04 LTS
Fluent版本:Fluent 2023R1 Linux64
打开Native GPU 模式
GPU数量:8
迭代步数:20parallel timer usage Performance Timer for 20 iterations on 8 compute nodes Average wall-clock time per iteration: 1.030 sec Global reductions per iteration: 0 ops Global reductions time per iteration: 0.000 sec (0.0%) Message count per iteration: 0 messages Data transfer per iteration: 0.000 MB LE solves per iteration: 0 solves LE wall-clock time per iteration: 0.000 sec (0.0%) LE global solves per iteration: 0 solves LE global wall-clock time per iteration: 0.000 sec (0.0%) LE global matrix maximum size: 0 AMG cycles per iteration: 0.000 cycles Relaxation sweeps per iteration: 0 sweeps Relaxation exchanges per iteration: 0 exchanges LE early protections (stall) per iteration: 0.000 times LE early protections (divergence) per iteration: 0.000 times Total SVARS touched: 366 Total wall-clock time: 20.607 sec Simulation wall-clock time for 20 iterations 173.04942 sec