This document gives an overview of IBM's reference architecture for deep learning clusters. It covers the hardware and software components: POWER-based servers with NVIDIA GPUs, connected by Mellanox InfiniBand switches, and a storage architecture that uses IBM Spectrum Scale to provide a shared filesystem. The software stack is built on Red Hat Enterprise Linux, CUDA, nvidia-docker, and IBM PowerAI, with containers orchestrated by either Kubernetes or IBM Spectrum LSF. The document also presents operational models and workflows that support the experimentation, scaling, and production phases of deep learning.