Coder Social home page Coder Social logo

arthur-r-oliveira / llm-in-action_k8s Goto Github PK

View Code? Open in Web Editor NEW

This project forked from cncf/llm-in-action

0.0 0.0 0.0 63 KB

๐Ÿค– Discover how to apply your LLM app skills on Kubernetes!

License: Apache License 2.0

Shell 41.72% Python 46.26% Dockerfile 12.01%

llm-in-action_k8s's Introduction

LLMs in Action: A Cloud Native Story, running on microshift

Using Ollama UBI as internal k8s service

https://github.com/williamcaban/ollama-ubi

with Ollama UBI pod, pull the required model

This command will download the mistral model for ollama to use

~~ $ ollama pull mistral ~~

$ oc get pods -A |egrep "keynote|ollama"
ollama                     ollama-serve-6b77c4df5-rq8k8               1/1     Running                    0               3h6m
test                       keynote-66c595b94b-6z6zn                   1/1     Running                    2               2d12h
$ microshift version
MicroShift Version: 4.14.18
Base OCP Version: 4.14.18
$ oc get nodes
NAME         STATUS   ROLES                         AGE     VERSION
node-nvidia   Ready    control-plane,master,worker   3d17h   v1.27.11
$ oc version
Client Version: 4.14.0-202401111553.p0.g286cfa5.assembly.stream-286cfa5
Kustomize Version: v5.0.1
Kubernetes Version: v1.27.11

$ oc get routes -A
NAMESPACE   NAME           HOST                            ADMITTED   SERVICE      TLS
chat        chat-route     chat.apps.example.com           True       chat-svc     
ollama      ollama-route   ollama.apps.example.com         True       ollama-svc   
test        keynote        keynote-test.apps.example.com   True       keynote      

$ oc get pods
NAME                           READY   STATUS    RESTARTS   AGE
ollama-serve-6b77c4df5-rq8k8   1/1     Running   0          3h9m
$ oc rsh ollama-serve-6b77c4df5-rq8k8 
sh-5.1$ ollama list
NAME              	ID          	SIZE  	MODIFIED   
falcon:7b-instruct	4280f7257e73	4.2 GB	2 days ago	
llava:latest      	8dd30f6b0cb1	4.7 GB	2 days ago	
mistral:latest    	61e88e884507	4.1 GB	2 days ago	
sh-5.1$ 

Chat front end

https://github.com/arthur-r-oliveira/chat_application_k8s

llm-in-action_k8s's People

Contributors

onlydole avatar pritianka avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.