Comments (9)

tumashu commented on August 18, 2024

Your dictionary file was not built correctly. Open the dictionary file in Emacs and run pyim-update-file to sort the dictionary.

from pyim.

et2010 commented on August 18, 2024

Do you mean pyim-update-dict-file? I tried it, and the dictionary still doesn't work. I'm baffled.

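et2010 commented on August 18, 2024

When I cleaned up the dictionary this time, I did the following:

  • Removed mixed Chinese-English entries
  • Removed the non-Han characters in the original file (they are not ASCII either; I have no idea what they are)
  • Removed the Ext-ABCDE extension Han characters

Finally, I converted the dictionary with pyim's built-in conversion feature (single characters and words were converted separately, then joined with cat; the result is the file I uploaded).

Doing it this way shouldn't break the dictionary, should it? Or did I step on a landmine without noticing?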
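
For readers who want to reproduce the cleanup steps listed above, here is a minimal sketch in Python. It is not the script used in this thread. It assumes that "ordinary" Han characters are those in the basic CJK Unified Ideographs block (U+4E00 to U+9FFF), so that Extension A-E characters, Latin letters, and other symbols all fall outside it. The function names `is_plain_han` and `filter_words` are hypothetical.

```python
import re

# Assumption: an entry is acceptable only if every character lies in the
# basic CJK Unified Ideographs block (U+4E00..U+9FFF). Extension A-E
# characters, ASCII letters, and other symbols are outside this range.
BASIC_HAN = re.compile(r"[\u4e00-\u9fff]+")


def is_plain_han(word: str) -> bool:
    """Return True if word is non-empty and made only of basic-block Han characters."""
    return BASIC_HAN.fullmatch(word) is not None


def filter_words(words):
    """Keep entries made entirely of basic-block Han characters.

    Single characters are kept as well: the filter looks at the character
    set only, never at the length of the entry.
    """
    return [w for w in words if is_plain_han(w)]


if __name__ == "__main__":
    # Mixed entry, Extension A character, Extension B character, plain entries.
    sample = ["你好", "T恤", "\u3400", "\U00020000", "中"]
    print(filter_words(sample))
```

Filtering on the character set alone, rather than on word length, avoids accidentally dropping single-character entries.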

tumashu commented on August 18, 2024

Single characters and words can't be kept separate...

tumashu commented on August 18, 2024

After you ran that command, was your dictionary sorted by pinyin?

et2010 commented on August 18, 2024

Yes, after running the command the dictionary is sorted by pinyin.

tumashu commented on August 18, 2024

Add me on QQ: 329985753

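et2010 commented on August 18, 2024

I redid the whole thing from scratch, and this time it seems to work.

Lessons learned:

  • The second time, I did not add the 7000 common characters
  • This time I fixed the function that generates the dict from words, so it no longer deletes single characters
  • Don't tinker when there is no need to

I think the key problem was that the first time, when I merged the files with cat, I did not check whether the 7000-character file and the word dictionary file were both UTF-8 encoded. As a result, the merged dictionary file was corrupted, and pyim cannot process a corrupted dictionary.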
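
The encoding check described above can be sketched as follows. This is a minimal illustration, not part of pyim: it assumes each dictionary is a plain text file with one entry per line, and the function names `read_utf8_lines` and `merge_dicts` are hypothetical. It refuses to merge a file that does not decode as strict UTF-8, which is the check a plain cat does not perform.

```python
from pathlib import Path


def read_utf8_lines(path):
    """Read path as strict UTF-8 and return its non-blank lines.

    Raises ValueError naming the offending file if the bytes are not
    valid UTF-8 (for example, a GBK-encoded file).
    """
    data = Path(path).read_bytes()
    try:
        # "utf-8-sig" is strict UTF-8 that also drops a leading BOM if present.
        text = data.decode("utf-8-sig")
    except UnicodeDecodeError as err:
        raise ValueError(f"{path} is not valid UTF-8: {err}") from err
    return [line for line in text.splitlines() if line.strip()]


def merge_dicts(paths, out_path):
    """Merge several dictionary files into one UTF-8 file.

    Duplicate lines are dropped and the result is sorted by code point.
    Assumption: this ordering is only a convenience; the file should still
    be sorted with pyim's own command afterwards.
    """
    lines = []
    for path in paths:
        lines.extend(read_utf8_lines(path))
    merged = sorted(set(lines))
    Path(out_path).write_text("\n".join(merged) + "\n", encoding="utf-8")
    return merged
```

Calling `merge_dicts(["chars.txt", "words.txt"], "merged.txt")` (placeholder file names) either writes a clean UTF-8 file or stops with an error that names the badly encoded input, so a mixed-encoding merge cannot pass unnoticed.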

et2010 commented on August 18, 2024

With this problem solved, #53 was resolved along the way as well.
