Coder Social home page Coder Social logo

nlp's Introduction

NLP

  1. LDA主题提取 运用gensim构造LDA模型,在没有对生成的dictionary进行过滤时,dictionary较大,将文档表示为向量的维度太高,调整不同主题个数时得到的困惑度曲线始终呈上升趋势。对idctionary进行过滤后,维度降低,困惑度曲线随主题个数增加而下降,可以选出合适的主题个数。 困惑度曲线

  2. 中文幽默类型
    尝试使用卷积神经网络对中文笑话进行分类,可能是特征处理的原因,也可能是这种网络不适用于文本分析,模型效果不佳。

  3. 微博情感分析
    gensim.word2vec构建词向量模型,sklearn随机森林分类器、XGBoost分类器

  4. 搜狗新闻分类
    预处理:分词、去停词
    划分训练集、测试集(7:3)
    特征提取:词袋模型、TFIDF模型、平均词向量模型、TFIDF加权平均词向量模型
    模型训练:多项式朴素贝叶斯模型、支持向量机模型
    表现:最佳80%精确度

  5. 搜狗新闻聚类--Kmeans

  6. 电影评论极性分析

  7. 基于情感词典的情感分析
    对微博进行情感分析,使用的情感词典在文件夹中。先使用pre.py对文本进行预处理,再使用DictSentiment.py计算情感得分,最后用draw.py画出情感得分分布的饼状图和随时间变化的折线图。 情感得分随时间变化的折线图

评论情感极性饼图

nlp's People

Contributors

saturn-opposition avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.