All Posts
Unlock the Power of Mistral AI with Red Hat OpenShift AI and NVIDIA DGX H100
I will guide you through the process of deploying Red Hat OpenShift AI on the NVIDIA DGX H100 system and run the Mistral AI model. This blog post details the process of deploying and managing a fully automated MLOps solution for a large language model (LLM) presented in three main parts:
OpenShift Bare Metal provisioning with NVIDIA GPU
TL;DR
The bare metal installation of OCP is only this installer command:
$ openshift-baremetal-install --dir ~/clusterconfigs create cluster
but I’ll take time in this post to explain how to prepare your platform and how to follow the installation.
CodeReady Containers with GPU for Data Science
Lots of Data Scientists want to focus on model building.
Just using a local Jupyter Notebook can be a limitation if you want to:
- create scalable Machine Learning systems
- test local private data ingestion
- contribute to Kubeflow
- tune your model serving pipeline
You can build an All-in-One Kubernetes environment with NVIDIA GPU for Data Science on your local PC or one bare-metal cloud server, let’s see how CodeReady Containers works.
Open Data Hub v0.5.1 release
Open Data Hub v0.5.1 was released February 16, 2020.
Release node: https://opendatahub.io/news/2020-02-16/odh-release-0.5.1-blog.html
Open Data Hub includes many tools that are essential to a comprehensive AI/ML end-to-end platform. This new release integrate some bug fixes that resolve issues when deploying on OpenShift Container Platform v4.3. JupyterHub deployment and Spark cluster resources have now a greater customization. Let’s try the Data science tools JupyterHub 3.0.7 on OpenShift 4.3.
NVIDIA GPU Operator with OpenShift 4.3 on Red Hat OpenStack Platform 13
The NVIDIA GPU Operator has been available as a Beta since 2020, Jan 27, it’s a technical preview release: https://github.com/NVIDIA/gpu-operator/release
The GPU Operator manages NVIDIA GPU resources in an OpenShift cluster and automates tasks related to bootstrapping GPU nodes. Since the GPU is a special resource in the cluster, it requires a few components to be installed before application workloads can be deployed onto the GPU, these components include:
OpenShift 4.2 on Red Hat OpenStack Platform 13 + GPU
Red Hat OpenShift Container Platform 4.2 introduces the general availability of full-stack automated deployments on OpenStack. With OpenShift 4.2, containers can be managed across multiple public and private clouds, including OpenStack. Red Hat and NVIDIA are working to provide the best platform for Artificial Intelligence and Machine Learning workloads.