rime-dictionary's Introduction

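Rime Extended Dictionaries

Best practice

pinyin_plus.dict.yaml for personal use

Create an extended directory to hold the dictionaries and reference them in the configuration. Note that paths use \ on Windows and / on Unix.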

name: pinyin_plus
version: "2022.11.22"
sort: by_weight
use_preset_vocabulary: true
import_tables:
  # pinyin_plus goes on the first line and holds user-defined words. It must match `name`.
  - pinyin_plus
  - luna_pinyin
  - extended\userdict
  - extended\xhzd
  - extended\tsinghua.ocl
  - extended\qqpy.default
  - extended\qqpy.usually
  - extended\sougou.chengyu
  - extended\sougou.netword
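
On Unix-like systems the same import list would be written with forward slashes. A minimal sketch of that variant, assuming the same extended directory and the same dictionary file names as in the Windows example (only the separator changes):

```yaml
# pinyin_plus.dict.yaml, Unix-style paths (sketch mirroring the Windows example)
name: pinyin_plus
version: "2022.11.22"
sort: by_weight
use_preset_vocabulary: true
import_tables:
  # Listed first so it can hold user-defined words; must match `name`.
  - pinyin_plus
  - luna_pinyin
  - extended/userdict
  - extended/xhzd
  - extended/tsinghua.ocl
  - extended/qqpy.default
  - extended/qqpy.usually
  - extended/sougou.chengyu
  - extended/sougou.netword
```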

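Notes

According to the Rime documentation, the file name and the name field in dict.yaml should match.

  • About the code table format
  • Exporting and importing text code tables
    • Three Tab-separated columns: the text, the code, and the usage frequency. The code is the full form defined in the code table, with syllables separated by spaces.
    • <input method language code>.userdb/ - the user dictionary that the input method program creates to store the user's input habits.
  • Code tables and dictionaries
    • The usage frequency matters mostly for polyphonic characters (characters with more than one reading).

  1. Except for the Xinhua Dictionary (新华字典) and the Contemporary Chinese Dictionary (现代汉语词典), single-character entries have been removed from all dictionaries.
  2. use_preset_vocabulary controls whether the preset vocabulary (八股文, Rime's essay) is imported. The extended dictionaries set it to false by default.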
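
The notes above can be pictured with a small sketch of a single extended dictionary file. Everything in it is illustrative: the path extended/xhzd.dict.yaml and the name value are assumptions inferred from the import list, the version string is a placeholder, and the three entries with their weights are invented for the example rather than taken from the real dictionary. The columns after the header are separated by a single Tab character.

```yaml
# extended/xhzd.dict.yaml (hypothetical sketch, not the real file contents)
# Entry columns: text<Tab>code<Tab>usage frequency.
# Syllables inside the code are separated by spaces.
---
name: xhzd                    # assumed to match the file name
version: "0.1"                # placeholder value
sort: by_weight
use_preset_vocabulary: false  # extended dictionaries skip the preset vocabulary
...

长	chang	100
长	zhang	50
新华字典	xin hua zi dian	10
```

The two entries for 长 show where the frequency column earns its place: the same character carries two readings, and the weight decides which one ranks first under sort: by_weight.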
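  • xhzd.dict.yaml

    • Xinhua Dictionary, with word frequencies, source
  • qqpy.default.dict.yaml

    • QQ Pinyin default lexicon, no word frequencies, source
  • qqpy.usually.dict.yaml

    • Common chat phrases, no word frequencies, source
  • sogou.default.dict.yaml

    • Sogou Pinyin default lexicon, no word frequencies, source; see Supplement for other details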
  • sougou.chengyu.dict.yaml

    • Sogou Pinyin idiom (chengyu) lexicon, no word frequencies, source; see Supplement for other details
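  • sougou.netword.dict.yaml

    • Sogou Pinyin internet neologisms, no word frequencies, source; see Supplement for other details
  • tsinghua.ocl.dict.yaml

    • Tsinghua University Open Chinese Lexicon, no word frequencies, source; see the project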

Supplement

For conversion, tools, and other material, see the branches.

TODO

  • [ ] Use Oxford as the blueprint

rime-dictionary's People

Contributors

swoiow
