NVIDIA A40
POWERFUL DATA CENTER GPU FOR VISUAL COMPUTING
Manufacturer Part Number: 900-2G133-0000-000
Features and Benefits:
The NVIDIA A40 GPU is an evolutionary leap in performance and multi-workload capabilities from the data center, combining best-in-class professional graphics with powerful compute and AI acceleration to meet today’s design, creative, and scientific challenges. Driving the next generation of virtual workstations and server-based workloads, NVIDIA A40 brings state-of-the-art features for ray-traced rendering, simulation, virtual production, and more to professionals anytime, anywhere.
Specifications:
GPU architecture: NVIDIA Ampere architecture
GPU memory: 48 GB GDDR6 with ECC
Memory bandwidth: 696 GB/s
Interconnect interface: NVIDIA® NVLink® 112.5 GB/s (bidirectional), PCIe Gen4 31.5 GB/s (bidirectional)
NVIDIA Ampere architecture-based CUDA Cores: 10,752
NVIDIA second-generation RT Cores: 84
NVIDIA third-generation Tensor Cores: 336
Peak FP32 TFLOPS (non-Tensor): 37.4
Peak FP16 Tensor TFLOPS with FP16 Accumulate: 149.7 | 299.4*
Peak TF32 Tensor TFLOPS: 74.8 | 149.6*
RT Core performance TFLOPS: 73.1
Peak BF16 Tensor TFLOPS with FP32 Accumulate: 149.7 | 299.4*
Peak INT8 Tensor TOPS: 299.3 | 598.6*
Peak INT4 Tensor TOPS: 598.7 | 1,197.4*
Form factor: 4.4" (H) x 10.5" (L) dual slot
Display ports: 3x DisplayPort 1.4**; supports NVIDIA Mosaic and Quadro® Sync
Max power consumption: 300 W
Power connector: 8-pin CPU
Thermal solution: Passive
Virtual GPU (vGPU) software support: NVIDIA vPC/vApps, NVIDIA RTX Virtual Workstation, NVIDIA Virtual Compute Server
vGPU profiles supported: See the Virtual GPU Licensing Guide
NVENC | NVDEC: 1x | 2x (includes AV1 decode)
Secure and measured boot with hardware root of trust: Yes
NEBS ready: Level 3
Compute APIs: CUDA, DirectCompute, OpenCL™, OpenACC®
Graphics APIs: DirectX 12.0, Shader Model 5.1, OpenGL 4.6, Vulkan 1.1
MIG support: No
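The listed peak figures can be cross-checked against one another with a little arithmetic. The Python sketch below is illustrative only and is not NVIDIA tooling. It assumes that each CUDA core performs one fused multiply-add (2 FLOPs) per clock, which the listing does not state, and it treats the starred values simply as "the second figure in each pair", because the footnote text for the asterisk is not included here (in NVIDIA datasheets this marker usually denotes structured sparsity, which is an assumption on our part). All function and constant names are hypothetical.

```python
"""Sanity checks on the A40 figures listed above (a sketch under stated assumptions)."""

# Values copied from the specification list.
CUDA_CORES = 10_752
PEAK_FP32_TFLOPS = 37.4
MEMORY_BANDWIDTH_GB_S = 696.0
MAX_POWER_W = 300.0

# Tensor peaks as listed: (unstarred value, starred value).
TENSOR_PEAKS = {
    "FP16": (149.7, 299.4),
    "TF32": (74.8, 149.6),
    "BF16": (149.7, 299.4),
    "INT8": (299.3, 598.6),
    "INT4": (598.7, 1197.4),
}

# ASSUMPTION (not in the listing): one fused multiply-add per core per clock,
# counted as 2 floating-point operations.
FLOPS_PER_CORE_PER_CLOCK = 2


def implied_clock_ghz(peak_tflops: float, cores: int) -> float:
    """Clock in GHz that would produce `peak_tflops` under the FMA assumption."""
    flops_per_clock = cores * FLOPS_PER_CORE_PER_CLOCK
    return peak_tflops * 1e12 / flops_per_clock / 1e9


def starred_ratios(peaks: dict) -> dict:
    """Ratio of the starred figure to the unstarred figure for each format."""
    return {name: starred / plain for name, (plain, starred) in peaks.items()}


def gflops_per_watt(peak_tflops: float, watts: float) -> float:
    """Peak FP32 throughput per watt at the listed maximum board power."""
    return peak_tflops * 1e3 / watts


def bytes_per_flop(bandwidth_gb_s: float, peak_tflops: float) -> float:
    """Memory bytes available per peak FP32 operation (a rough balance metric)."""
    return bandwidth_gb_s * 1e9 / (peak_tflops * 1e12)


if __name__ == "__main__":
    print("implied clock (GHz):", round(implied_clock_ghz(PEAK_FP32_TFLOPS, CUDA_CORES), 3))
    print("starred/unstarred ratios:", starred_ratios(TENSOR_PEAKS))
    print("FP32 GFLOPS per watt:", round(gflops_per_watt(PEAK_FP32_TFLOPS, MAX_POWER_W), 1))
    print("bytes per FP32 FLOP:", round(bytes_per_flop(MEMORY_BANDWIDTH_GB_S, PEAK_FP32_TFLOPS), 4))
```

Under these assumptions the FP32 figure implies a clock of roughly 1.74 GHz, and every starred Tensor figure is twice its unstarred counterpart. The low bytes-per-FLOP value is a reminder that peak numbers are only reachable by workloads that reuse data on chip.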
This is a special-order product: it takes longer to process and cannot be returned.