dcasota / vmware-generative-ai-reference-architecture

This project forked from vmware-private-ai/vmware-generative-ai-reference-architecture

VMware GenAI reference architecture. A set of companion assets (Python scripts and YAML config files) intended to help customers set up compute and networking accelerators in vSphere and Tanzu Kubernetes to run GenAI workloads.

License: Apache License 2.0

Shell 1.99% Python 1.14% Jupyter Notebook 96.87%

VMware-generative-ai-reference-architecture

Overview

This repository contains Python scripts and configuration files that complement the white paper Deploying Enterprise-Ready Generative AI on VMware Cloud Foundation.

Disclaimer

The scripts in this repository are intended for educational purposes, not for production applications. Be aware that LLMs carry inherent vulnerabilities and risks, as illustrated by the OWASP Top 10 for Large Language Model Applications. We strongly encourage customers to follow the OWASP guidance and the NIST AI Risk Management Framework when building safe and robust AI systems.

Directory Structure

The repository is organized as follows:

  • The vSphere-and-TKG-config-files directory provides configuration files to set up the Tanzu Kubernetes cluster and the NVIDIA GPU and Network Kubernetes Operators that provide hardware acceleration services to VMware Tanzu Kubernetes clusters.
  • The Examples/LLM-fine-tuning-example directory provides the steps to configure a Python virtual environment suitable for LLM fine-tuning tasks, based on a series of Hugging Face libraries. It also includes a Python notebook that walks through all the steps required to fine-tune the Falcon LLMs on a custom dataset so that the model learns to follow instructions.
  • The Examples/LLM-serving-wt-vLLM-and-RayServe-example directory provides the configuration steps, configuration files, and Python scripts to set up a Ray cluster that serves the Falcon LLMs via vLLM running as a Ray Serve application. The Ray cluster is deployed on Tanzu Kubernetes using KubeRay.
  • We also include Starter Packs, which provide code examples for the following use cases:
    • Intro to RAG (retrieval augmented generation with LangChain and Gradio)
    • AI coding assistance via StarCoder (Code_Assistant)
    • Improved RAG (via LlamaIndex + PGVector + DeepEval)
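To make the RAG use cases listed above more concrete, here is a minimal sketch of the retrieve-then-prompt pattern. It is not taken from the Starter Packs and uses only the Python standard library: a bag-of-words cosine similarity stands in for the embedding model and vector store, and no LLM is called. The names `tokenize`, `cosine`, `retrieve`, `build_prompt`, and the sample `DOCS` texts are hypothetical, chosen for illustration only.

```python
import math
import re
from collections import Counter

# Hypothetical sample corpus; a real pipeline would load and chunk documents.
DOCS = [
    "Ray Serve exposes a model as an HTTP endpoint.",
    "PGVector stores embeddings in PostgreSQL.",
    "Gradio builds simple web interfaces for demos.",
]


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two token-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, documents, k=2):
    """Return up to k documents with non-zero similarity, best first."""
    q = Counter(tokenize(query))
    scored = [(cosine(q, Counter(tokenize(d))), d) for d in documents]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [doc for score, doc in scored[:k] if score > 0]


def build_prompt(query, context_docs):
    """Assemble a prompt that places the retrieved context before the question."""
    context = "\n".join(f"- {doc}" for doc in context_docs)
    return (
        "Answer using only the context below.\n"
        f"Context:\n{context}\n"
        f"Question: {query}\n"
    )


QUERY = "Where are embeddings stored?"
PROMPT = build_prompt(QUERY, retrieve(QUERY, DOCS))
```

In a fuller pipeline, the word-count vectors would be replaced by learned embeddings held in a vector store, and `PROMPT` would be sent to the served LLM rather than simply kept as a string.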

Contributing

The VMware-generative-ai-reference-architecture project team welcomes contributions from the community. Before you start working with VMware-generative-ai-reference-architecture, please read our Contributor License Agreement. All contributions to this repository must be signed as described on that page. Your signature certifies that you wrote the patch or have the right to pass it on as an open-source patch. For more detailed information, refer to CONTRIBUTING.md.

License

The project is licensed under the terms of the Apache 2.0 license.

Contributors

vecorro, cwei44, vmwghbot, qoofyk, vfddservice, fmanning07, rosenhouse
