Coder Social home page Coder Social logo

doctorglm's People

Contributors

absterzhu avatar dasklarleo avatar jamesqfreeman avatar xionghonglin avatar zhaozh10 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

doctorglm's Issues

评估指标

能否询问这个问诊模型的评估指标的计算方法是什么吗

prompt designer

您好,我在您的论文中读到您prompt designer模块提取用户输入的关键词,比如疾病名称和症状,然后基于专业的疾病知识库生成简要描述。
请问这个知识库是否可以在github上获得 或者通过gpt进行信息抽取

多轮对话的微调

请问多轮对话的微调与数据集是怎么样的?readme里面可以放出来么?

No module named 'load_quantization'

Gradio运行后,会出现“No module named 'load_quantization”,安装pip install load_quantization 后,又会报错“ERROR: Could not find a version that satisfies the requirement load_quantization (from versions: none)”、“ERROR: No matching distribution found for load_quantization”,请问怎么办呢?谢谢!

希望取得联系

尊敬的DoctorGLM应用开发者,我是 InternLM 社区开发者&志愿者尖米, 大佬开源的工作对我的启发很大,希望可以探讨使用 InternLM 实现DoctorGLM的可能性和实现路径,我的微信是mzm312,希望可以取得联系进行更深度的交流

cpu

如果没有那么大的显存,可以使用cpu跑吗,如果可以,应该怎么运行

使用项目里面的数据集之后 输出乱码 应该怎办

\u56de\u7b54\uff1a Σ¬¡×Ñ£¬ÊÒÏÉÓÖÖĶ¶º¬ Á  Á θ׻µÖÔ׺»µÇµÓ¡×¿ÓµÏгǿԹÒÔµ²£¬Ôµ³ÊÄÊÈÈ¿µºÊÐÔÎÒ£¬³ÊÒ¡£¬³×µÇÈȴֿȲԹ¿É¿¡Ä£¬¿Ç»»ÒÈÖ¿ÐÉ¿ÖÒ¡ÔÏÊ¡¸ÒIJ²Ò¸Öлɣ¬Ö¡ÇÄÒÖÊ£¬¢Ô´ÒµÏÔÏ£¬Ö

无法加载'load_sequential_sampler'

在跑gradio.ipynb时遇到这个问题
ImportError: cannot import name 'load_sequential_sampler' from 'numpy_io.pytorch_loader.dataloaders' (/home/arthur/miniconda3/lib/python3.10/site-packages/numpy_io/pytorch_loader/dataloaders.py)

看起来是deep_learning跟numpy_io库不匹配,numpy_io里没有load_sequential_sampler这个函数。但这两个库我都是用pip安装的最新版。它俩的版本要求是怎样的?

请问Prompt Designer是通过哪个函数实现的

我没有很确定的找到在你的论文中提到的Prompt Designer功能,请问这个功能是在哪个文件中实现的?是通过get_disease_info()向模型提问,还是在知识库里搜索匹配?

训练时间

readme里列出来的数据量很大,请问你们训了多久,几个epoch

群链接过期

朋友你好!你们的群链接好像过期了,请问是否有新的?谢谢?

feat: 增加qa_generation.py中的加载器,以支持结构化数据的问答对生成

DoctorGLM/DoctorGLM/qa_generation.py非常好用!!

我更改了qa_generation.py,用其他领域的结构化数据生成了问答对json。如果也有人遇到这个情况,可以使用如下代码。需要更改file_path和templ

(似乎不能算pr于是写在issue里了,如有不妥欢迎交流!)

from langchain.document_loaders import DataFrameLoader

file_path = 'data.csv'

df = pd.read_csv(file_path)
df.head()

loader = DataFrameLoader(df,page_content_column="from_name")

docs = loader.load()

idx = 0
qa_dict ={}

for d in docs:
	idx += 1
	# text = d.page_content
	# text = d.page_content
	text = d
	= f"""你是一个聪明的助理。

		给你一段xx相关的技术标准,你必须依据表格想出一个问题和一个对应的答案。

		你想出的问题可以被用来测试xx的专业能力。

		你想出的问题和答案必须和所给文本相关。

		当你想出问题和答案后,你必须用以下格式回复:

		```
		[
			"问题": "$你想出的问题放在这",
			"答案": "$你想出的答案放在这"
		]
		```

		所有在 ``` 中间的内容就是你要回答的格式。

		请想出一个问题与一个答案,用以上指定的列表回复,对于以下文本:
		----------------
		{text}"""

	response, history = model.chat(
		 tokenizer, templ, history=[], max_length=2048)

	while_count = 0
	if_good = True
	while ('以下哪' in response) or ('语言模型' in response) or ('文本' in response) or ('以下是' in response):
			response, history = model.chat(
				tokenizer, templ, history=[], max_length=2048)
			while_count += 1
			if while_count > 10:
				if_good = False
				break
	print(response)

	try:
		if if_good:
			question = response.split('答案:')[0][3:]
			answer = response.split('答案:')[1]
			qa = {}
			qa['问题'] = question
			qa['答案'] = answer
			qa_dict[idx] = qa
		else:
			pass
	except:
			pass
	json.dump(qa_dict, open('qa_dict.json', 'w', encoding='utf-8'),
				  indent=4, ensure_ascii=False)
	
print("json加载完成")

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.