Coder Social home page Coder Social logo

milvus standalone时不时崩溃 about history_rag HOT 4 OPEN

wxywb avatar wxywb commented on August 14, 2024
milvus standalone时不时崩溃

from history_rag.

Comments (4)

flash201524 avatar flash201524 commented on August 14, 2024

RPC error: [insert_rows], <MilvusException: (code=1100, message=the length (74246) of dynamic field exceeds max length (65536): invalid parameter[expected=valid length dynamic field][actual=length exceeds max length])>, <Time:{'RPC start': '2024-04-11 23:49:01.457415', 'RPC error': '2024-04-11 23:49:02.505110'}>
Traceback (most recent call last):
File "E:\OneDrive\history_rag-master\cli.py", line 120, in
cli.run()
File "E:\OneDrive\history_rag-master\cli.py", line 53, in run
self.parse_input(command_text)
File "E:\OneDrive\history_rag-master\cli.py", line 65, in parse_input
self.build_index(path=commands[1], overwrite=False)
File "E:\OneDrive\history_rag-master\cli.py", line 92, in build_index
self._executor.build_index(path, overwrite)
File "E:\OneDrive\history_rag-master\executor.py", line 186, in build_index
self.index = VectorStoreIndex(nodes, storage_context=storage_context, show_progress=True)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "E:\software\Anaconda\envs\pytorch\Lib\site-packages\llama_index\indices\vector_store\base.py", line 53, in init
super().init(
File "E:\software\Anaconda\envs\pytorch\Lib\site-packages\llama_index\indices\base.py", line 75, in init
index_struct = self.build_index_from_nodes(
^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "E:\software\Anaconda\envs\pytorch\Lib\site-packages\llama_index\indices\vector_store\base.py", line 274, in build_index_from_nodes
return self._build_index_from_nodes(nodes, **insert_kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "E:\software\Anaconda\envs\pytorch\Lib\site-packages\llama_index\indices\vector_store\base.py", line 246, in _build_index_from_nodes
self._add_nodes_to_index(
File "E:\software\Anaconda\envs\pytorch\Lib\site-packages\llama_index\indices\vector_store\base.py", line 200, in _add_nodes_to_index
new_ids = self._vector_store.add(nodes_batch, **insert_kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "E:\software\Anaconda\envs\pytorch\Lib\site-packages\llama_index\vector_stores\milvus.py", line 199, in add
self.collection.insert(insert_list)
File "E:\software\Anaconda\envs\pytorch\Lib\site-packages\pymilvus\orm\collection.py", line 508, in insert
return conn.insert_rows(
^^^^^^^^^^^^^^^^^
File "E:\software\Anaconda\envs\pytorch\Lib\site-packages\pymilvus\decorators.py", line 147, in handler
raise e from e
File "E:\software\Anaconda\envs\pytorch\Lib\site-packages\pymilvus\decorators.py", line 143, in handler
return func(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^
File "E:\software\Anaconda\envs\pytorch\Lib\site-packages\pymilvus\decorators.py", line 182, in handler
return func(self, *args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "E:\software\Anaconda\envs\pytorch\Lib\site-packages\pymilvus\decorators.py", line 122, in handler
raise e from e
File "E:\software\Anaconda\envs\pytorch\Lib\site-packages\pymilvus\decorators.py", line 87, in handler
return func(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^
File "E:\software\Anaconda\envs\pytorch\Lib\site-packages\pymilvus\client\grpc_handler.py", line 514, in insert_rows
check_status(response.status)
File "E:\software\Anaconda\envs\pytorch\Lib\site-packages\pymilvus\client\utils.py", line 60, in check_status
raise MilvusException(status.code, status.reason, status.error_code)
pymilvus.exceptions.MilvusException: <MilvusException: (code=1100, message=the length (74246) of dynamic field exceeds max length (65536): invalid parameter[expected=valid length dynamic field][actual=length exceeds max length])>

from history_rag.

wxywb avatar wxywb commented on August 14, 2024

这不是崩溃的,因为milvus存储text的字符串长度是有限制的,如图是65536,而由于你的切分方式导致,切出了一个74246的长文本,事实上,长文本的embedding效果一般也不好,因为信息过多让特征不再明显。因为history_rag的文本切分是针对史料,所以不一定适合你的文本,使用更通用的文本切分方式,请参考#63

from history_rag.

flash201524 avatar flash201524 commented on August 14, 2024

这不是崩溃的,因为milvus存储text的字符串长度是有限制的,如图是65536,而由于你的切分方式导致,切出了一个74246的长文本,事实上,长文本的embedding效果一般也不好,因为信息过多让特征不再明显。因为history_rag的文本切分是针对史料,所以不一定适合你的文本,使用更通用的文本切分方式,请参考#63

明白了,但是我没办法把已经切好的文本删除了,输入remove 文件夹名之后说有多少条但是删除0条,有没有什么其他删除的指令呢

from history_rag.

wxywb avatar wxywb commented on August 14, 2024
from pymilvus import Collection, connections
connections.connect("default", host="localhost", port="19530")
#在cfgs/config.yaml中的默认值
col_name = "history_rag" 
col = Collection(col_name)
col.load()
col.drop()

from history_rag.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.