Coder Social home page Coder Social logo

0xqq's Projects

text-mining icon text-mining

近年来,随着微信、微博、市长信箱、阳光热线等网络问政平台逐步成为政府了解民意、汇聚民智、凝聚民气的重要渠道,各类社情民意相关的文本数据量不断攀升,给以往主要依靠人工来进行留言划分和热点整理的相关部门的工作带来了极大挑战。同时,随着大数据技术的发展,建立基于自然语言处理技术的智慧政务系统已经是社会治理创新发展的新趋势,对提升政府的管理水平和施政效率具有极大的推动作用。 本文针对“智慧政务”中的居民投诉建议文本评论数据,基于向量空间模型算法提取了文本关键词并我们采用了多种机器学习分类模型进行测试,从最终得到线性支持向量回归算法相对较优的结果,F1-Score评价指标达0.86。 在挖掘热点问题的前期处理上,使用了余弦相似度计算整理出文本相似的同类主题并加以筛选,通过在SPSS中建立基于因子分析法的热度评价指标模型,给出得分前五的主题样本作为Top5热点问题,分析比较了相关类问题的热度体现在各个指标上的具体表现。 为建立留言的答复意见的评价体系,我们定义了相关性、完整性、可解释性和及时性四个指标。答复意见和留言详情相关性的计算是基于LDA主题模型的中文编辑距离得到的,另外答复意见的可解释性使用了哈工大中文篇章关系的关联词表以及自定义的可解释性词典来判别。通过将这四项指标的得分相加得到某条答复意见的综合评分,分数越高,该答复的质量就越高,从而为决策者提供一个较为清晰完善的参考意见。

textinfoexp icon textinfoexp

文本处理实践相关资料,包含文本特征提取(TF-IDF),文本分类,文本聚类,word2vec训练词向量及同义词词林中文词语相似度计算、文档自动摘要,信息抽取,情感分析与观点挖掘等。

tfx icon tfx

TFX is an end-to-end platform for deploying production ML pipelines

thingsboard icon thingsboard

Open-source IoT Platform - Device management, data collection, processing and visualization.

threat-intelligence icon threat-intelligence

收集的一些国外能提供提供威胁情报的公司,涵盖网络安全、工控安全、终端安全、移动安全等领域

tidb-operator icon tidb-operator

TiDB operator creates and manages TiDB clusters running in Kubernetes

tidb-tools icon tidb-tools

tidb-tools are some useful tool collections for TiDB.

tikv icon tikv

Distributed transactional key-value database, originally created to complement TiDB

time-nlp icon time-nlp

中文语句中的时间语义识别。即通过分析中文语句,识别出话语中提到的时间。

tipdm icon tipdm

TipDM建模平台,开源的数据挖掘工具。

tispark icon tispark

TiSpark is built for running Apache Spark on top of TiDB/TiKV

titandataoperationsystem icon titandataoperationsystem

《Titan数据运营系统》,本项目是一个全栈闭环系统,我们有用作数据可视化的web系统,然后用flume-kafaka-flume进行日志的读取,在hive设计数仓,编写spark代码进行数仓表之间的转化以及ads层表到mysql的迁移,使用azkaban进行定时任务的调度,使用技术:Java/Scala语言,Hadoop、Spark、Hive、Kafka、Flume、Azkaban、SpringBoot,Bootstrap, Echart等;

tj-idriver icon tj-idriver

An android application which detects cars and warn people of possible danger. 车辆检测;测距;预警;安全分析;

tokenizers icon tokenizers

💥Fast State-of-the-Art Tokenizers optimized for Research and Production

trill icon trill

Trill is a single-node query processor for temporal or streaming data.

txtai icon txtai

Build AI-powered semantic search applications

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.