Coder Social home page Coder Social logo

readability-algorithm's Introduction

a text readability algorithm on the Chinese texts for children's recognizing characters and learning to read.

儿童识字用汉语文本的难度算法。

思路

以儿童阅读入门兼识字为假设情境,主要估测文本的识读难度,而不是理解难度

因而主要依赖字频表,以及对识字过程的经验与先验假设(实测-调参)。详见代码注释。

输出每篇文本的5个值:字数;字种;文难度;句难度;分句难度;单字难度。如下,两首诗及其所得数值:

文本 字数 字种 文难度 句难度 分句难度 单字难度
解落三秋叶,能开二月花。过江千尺浪,入竹万竿斜。 20 20 2273.82 1136.91 568.46 113.69
灞原风雨定,晚见雁行频。落叶他乡树,寒灯独夜人。 20 20 2290.09 1145.05 572.52 114.50

你可以使用这些值,根据实际需要构建排序算法。

实测效果还不错,目前可用于辅助文本难易排序。(实际上没有做多少实测-调参工作,也就是说,预计不会有严重的过拟合问题。)

参考

readability-algorithm's People

Contributors

fusyong avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.