The latest news from Dell Technologies World is a high-end machine learning server for the data center that has four, eight, or even 10 Nvidia Tesla V100 GPUs for processing power.
The Dell EMC DSS 8440 is a two-socket server with two of the new Xeon Scalable processors and is specifically designed for machine learning applications and other demanding workloads. Each Tesla is capable of more than 100 teraflops, so the 10 GPU machine is one petaflop of processing power. Dell claims the DSS 8440 is almost on par with performance by the DGX-1, which is also Tesla-powered.
[ Read also: What is quantum computing (and why enterprises should care) ]
Obviously this is not a machine for beginners. That would be Dell EMC’s 740 and 7425 servers, which support up to three GPUs, and the 4140, which supports up to four GPU cards.
The DSS 8440 can support up to 10 2.5-in. devices, which translates to up to 32 terabytes of NVMe storage. It’s built on a high-performance switched PCIe fabric for rapid I/O. This allows it to use accelerators, storage, and network cards from other vendors.
Machine learning involves two distinctly different workloads: training and inference. The initial release of the DSS 8440 is specifically targeted at complex, training workloads. Training for complex workloads such as image recognition, facial recognition, and natural language translation is the hard part and requires the bulk of computing.
Training a model is done by iteration, where you runs massive amounts of data through a weighted, multi-layered algorithm thousands of times, compare it to a specifically targeted outcome and iteratively adjusting the model/weights to ultimately result in a “trained” model.
For example, training image recognition to recognize a cat or a car would involve thousands of training images. Once the model is trained, the inference (the question of is this image a cat or a car or not) is much easier and requires much less processing power.
And Dell EMC isn’t done pumping up this server. It has partnered with a start-up accelerator company called Graphcore to develop machine learning-specific, graph-based technology for inference workloads. Future versions of the DSS 8440 will come with the Graphcore processor, although for now it has no release date.