Junior Member
|
The A100 Tensor Core GPU implementation of the GA100 GPU includes the following units:
7 GPCs, 7 or 8 TPCs/GPC, 2 SMs/TPC, up to 16 SMs/GPC, 108 SMs
64 FP32 CUDA Cores/SM, 6912 FP32 CUDA Cores per GPU
4 third-generation Tensor Cores/SM, 432 third-generation Tensor Cores per GPU
5 HBM2 stacks, 10 512-bit memory controllers
Figure 4 shows a full GA100 GPU with 128 SMs. The A100 is based on GA100 and has 108 SMs.
Tensor cores 就佔了將近三分之一的面積了.
__________________
宜靜默 宜從容 宜謹嚴 宜儉約 居安慮危 處治思亂
|