Version Command-line (Python) version Suggestio

Yikes.. getting an error <div class="snippet-clipboard-content notranslate positio

Yikes.. getting an error <div class="snippet-clipboard-content notran

[Enhancement]: Chunk & save task gen process? Auto execute option? #invalid JSON #local LLM about gpt-pilot HOT 7 OPEN

mcchung52 commented on June 3, 2024

[Enhancement]: Chunk & save task gen process? Auto execute option? #invalid JSON #local LLM

from gpt-pilot.

Comments (7)

phalexo commented on June 3, 2024

I am running gpt-pilot with Llama-3-70B-Instruct.Q5_K_M It does not seem to suffer from malformed JSON issues, although it has other problems.

I changed the source to have 3000 timeout for reading instead of 300 and changed the code to use Llama tokenizer instead of tiktoken from OpenAI, Llama tokenizer uses tiktoken internally, so they are pretty close.

from gpt-pilot.

mcchung52 commented on June 3, 2024

Thanks for sharing that. I guess I'd need a memory upgrade then. on 32gb. will 64gb do?
I already changed api timeout to 30 min because I still got "api timeout" error w/ 10min connecting to LM Studio (llm on cpu).
Curious how you changed to Llama tokenizer. Do you mind sharing changes?
Thanks

from gpt-pilot.

phalexo commented on June 3, 2024

Thanks for sharing that. I guess I'd need a memory upgrade then. on 32gb. will 64gb do? I already changed api timeout to 30 min because I still got "api timeout" error w/ 10min connecting to LM Studio (llm on cpu). Curious how you changed to Llama tokenizer. Do you mind sharing changes? Thanks

import re
import requests
import os
import sys
import time
import json
import tiktoken
from prompt_toolkit.styles import Style

from jsonschema import validate, ValidationError
from utils.style import color_red, color_yellow
from typing import List
from const.llm import MAX_GPT_MODEL_TOKENS, API_CONNECT_TIMEOUT, API_READ_TIMEOUT
# alexo: Slow LLM, override
API_READ_TIMEOUT=3000
from const.messages import AFFIRMATIVE_ANSWERS
from logger.logger import logger, logging
from helpers.exceptions import TokenLimitError, ApiKeyNotDefinedError, ApiError
from utils.utils import fix_json, get_prompt
from utils.function_calling import add_function_calls_to_request, FunctionCallSet, FunctionType
from utils.questionary import styled_text

from .telemetry import telemetry

#tokenizer = tiktoken.get_encoding("cl100k_base")

# alexo: Should Llama-3 tokenizer be used?
from transformers import AutoTokenizer
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Meta-Llama-3-70B", revision="refs/pr/6")

--------------------------------------------------- just a few top lines from the file.
this file is in utils folder, llm_connection.py

from gpt-pilot.

mcchung52 commented on June 3, 2024

Yikes.. getting an error

Cannot access gated repo for url https://huggingface.co/meta-llama/Meta-Llama-3-8B/resolve/main/config.json.
Access to model meta-llama/Meta-Llama-3-8B is restricted. You must be authenticated to access it.

from gpt-pilot.

phalexo commented on June 3, 2024

Yikes.. getting an error

Cannot access gated repo for url https://huggingface.co/meta-llama/Meta-Llama-3-8B/resolve/main/config.json.
Access to model meta-llama/Meta-Llama-3-8B is restricted. You must be authenticated to access it.

Probably. It is free though.

from gpt-pilot.

mcchung52 commented on June 3, 2024

Are you storing your token somewhere? how is it pulling?

from gpt-pilot.

phalexo commented on June 3, 2024

Not sure what you mean by storing token. If you navigate to Hugging Face and try to access Llama models it will ask you to go through a quick process. You can pretty much invent the info. At some point in time, I think, I also set something up with ssh keys and HF account.

from gpt-pilot.

[Enhancement]: Chunk & save task gen process? Auto execute option? #invalid JSON #local LLM about gpt-pilot HOT 7 OPEN

Comments (7)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent