Comments (1)
1 train, test, validation的格式如下
这 O
里 O
是 O
清 B
华 M
大 M
学 E
。 O
中 B
国 M
政 M
府 E
。 O
B代表该字符是某个entity的开始
M代表该字符是某个entity的中间
E代表该字符是某个entity的结尾
O代表该字符不属于某个entity
这里一共有两个样本(两个句话),每个样本中间用空行分割
第一列是字符,第二列是标记,第一列与第二列用\t分割
2 embedding的格式
假设有一共有2个单字,每个单字是3维的向量,格式如下:
2 3
你 1 0 1
好 0 0 1
embedding的格式是gensim的word2vec的模型输出格式,调用的函数就是model.save_word2vec_format(output_path, binary=False)
整个embedding文件可以看出是一个2x3的矩阵,行代表单字,列代表字向量的某个维度
例如:“好”这个字映射到了[0, 0, 1]这个3维向量
from sequence-labeling.
Related Issues (20)
- 请问下,HMM的predict这块的path和W指的什么? HOT 2
- 您好,请问几个代码调试过程中遇到的问题
- 在初始化的时候选择了is_crf=False,结果结果就是没有初始化loss,出错了 HOT 1
- confused about the task HOT 2
- 能帮忙提供下您实现的crf部分以及viterbi部分的一个理论上的教程吗? HOT 1
- 计算point score问题 HOT 1
- 请教您tf版本更新后的代码的变化 HOT 6
- 关于label数多于四个的问题 HOT 2
- 关于数据预处理的问题 HOT 3
- About datasets HOT 1
- 代码中带的那个NBA例子,训练后没找到存储的模型
- 关于y_train_weight_batch
- ImportError: No module named models.rnn HOT 1
- tf implemented CRF compared with API crf HOT 1
- why my loss decrease under 0 HOT 2
- word-embedding file
- 关于停用词被去掉的问题。
- 关于模型中CRF层的几个小疑问 HOT 7
- transitions reshape的问题 HOT 1
- Why my loss is every small and under 0?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from sequence-labeling.