Hewlett Packard Enterprise (HPE) has announced an expanded collaboration with NVIDIA focused on generative artificial intelligence (GenAI). The partnership is intended to give enterprises a streamlined path to deploying AI solutions.
The joint effort has produced a pre-configured AI tuning and inferencing solution. It allows enterprises of different sizes to customize foundation models using private data and to deploy production applications quickly across multiple platforms, from edge to cloud. The objective is to simplify the intricate process of building and deploying GenAI infrastructure.
Enterprises using GenAI models for conversational search, business process automation, and content creation need an integrated software and infrastructure stack. The collaboration between HPE and NVIDIA introduces an enterprise computing solution tailored to these needs. It integrates HPE’s Machine Learning Development Environment, HPE Ezmeral Software, HPE ProLiant Compute, and HPE Cray Supercomputers with NVIDIA’s AI Enterprise software suite, including the NVIDIA NeMo framework.
Antonio Neri, President and CEO of HPE, expressed confidence in the collaboration, highlighting its potential to address the challenges faced by customers embarking on AI-driven transformations. Neri emphasized, “Together, HPE and NVIDIA aim to simplify the journey to develop and deploy AI models with a portfolio of pre-configured solutions.”
Jensen Huang, Founder and CEO of NVIDIA, echoed this sentiment, emphasizing the acceleration of the generative AI era. Huang stated, “Our expanded collaboration with HPE aims to drive productivity through AI applications that connect with business data to power accurate assistants, informed chatbots, and semantic search.”
The new enterprise computing solution for generative AI is a purpose-built data center solution for AI tuning and inferencing, intended as an accessible starting point for enterprises of all sizes. It lets them combine pretrained foundation models with private data to build production applications such as AI chatbots, improving the quality and accuracy of their output through retrieval-augmented generation (RAG) workflows.
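As a rough illustration of the RAG idea mentioned above, the following minimal Python sketch retrieves the most relevant private document for a question and places it in the prompt that would be sent to a foundation model. It is not HPE's or NVIDIA's implementation and does not use the NeMo API. All names (PRIVATE_DOCS, retrieve, build_prompt) are hypothetical, and a simple word-count cosine similarity stands in for the learned embeddings and vector database a production system would typically use.

```python
import math
import re
from collections import Counter

# Hypothetical stand-in for an enterprise's private data.
PRIVATE_DOCS = [
    "Support tickets are answered within four business hours.",
    "The cafeteria serves lunch from noon until two.",
    "Refunds are processed within ten days of a return.",
]


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two word-count vectors (Counter objects)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def retrieve(query, documents, k=1):
    """Return the k documents most similar to the query."""
    q = Counter(tokenize(query))
    ranked = sorted(
        documents,
        key=lambda d: cosine(q, Counter(tokenize(d))),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(query, documents, k=1):
    """Assemble a prompt that grounds the model in the retrieved context."""
    context = "\n".join(retrieve(query, documents, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"


if __name__ == "__main__":
    # The resulting prompt would be passed to a foundation model of choice.
    print(build_prompt("How long do refunds take?", PRIVATE_DOCS))
```

The design point is that the model answers from retrieved private data supplied at query time, so the underlying foundation model does not need to be retrained whenever that data changes.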
The solution is built on a rack-scale architecture comprising HPE ProLiant Compute DL380a servers with NVIDIA L40S GPUs, NVIDIA BlueField-3 DPUs, and the NVIDIA Spectrum-X Ethernet Networking Platform, a combination aimed at hyperscale AI needs. It also integrates HPE AI software, including the Machine Learning Development Environment and HPE Ezmeral Software, along with NVIDIA AI software, including the NVIDIA NeMo framework and other tools for enterprise GenAI.
HPE Services now offers AI-focused consulting, workforce training, and deployment services. These guide customers through AI adoption, helping them develop operating models and hybrid cloud data strategies. The offerings are supported by new Global Centers of Excellence for AI and Data.
Separately, at SC23, HPE unveiled a turnkey supercomputing solution powered by NVIDIA, designed for large enterprises, research institutions, and government organizations that develop and train foundation models. The enterprise computing solution for generative AI, by contrast, is aimed at enterprise customers focused on tuning and inferencing.
The enterprise computing solution for generative AI is expected to be available to order in the first quarter of calendar year 2024 (Q1 CY24).
The collaboration underscores the two companies' joint commitment to advancing AI technology for businesses across diverse industries.