Erwan Gallen

Dec 10, 2019

Machine Learning benchmarking with OpenStack and Kubernetes, Shanghai OpenInfra summit

I gave a talk, “Machine Learning benchmarking with OpenStack and Kubernetes”, at the Open Infrastructure Summit Shanghai on November 4, 2019.

Abstract

Deep Learning and Cloud Platforms are transforming the field of Machine Learning from theory to practice. However, implementation differences across frameworks and inference engines make the comparison of benchmark results difficult. SPEC and TPC-C benchmarks are not accurate for Machine Learning workloads because of the complex interactions between implementation choices such as batch size, hyperparameters, or numerical precision. Addressing this complexity requires systematic benchmarking that is both representative of real-world use cases and valid across different software/hardware platforms.

This talk will present the best Machine Learning benchmarking tools to use with OpenStack and Kubernetes. We will show how MLPerf and Thoth help data scientists improve their system performance and fully benefit from their CPUs, GPUs, or FPGAs. We will share insights and lessons learned while selecting key Machine Learning training and inference use cases.

https://www.openstack.org/summit/shanghai-2019/summit-schedule/events/24196/machine-learning-benchmarking-with-openstack-and-kubernetes

Video

Slides

