kesci-urdu-sentiment-analysis

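This repository records two approaches to the Kesci NLP practice competition [Roman Urdu DataSet]: a classical machine-learning baseline and a deep-learning LSTM baseline.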

Competition link: https://www.kesci.com/home/competition/5c77ab9c1ce0af002b55af86/content/0

Some notes:

1. lstm.ipynb:

The LSTM submission scores around 0.83 to 0.84. A companion blog post explaining the LSTM approach: https://blog.csdn.net/ssswill/article/details/88533623

Training for 1 to 5 epochs is enough.

2. SGD.ipynb:

SGD classifier baseline, leaderboard (LB) score 0.8651.
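A minimal sketch of what such a baseline might look like, assuming TF-IDF word n-gram features fed into scikit-learn's `SGDClassifier`. The notebook's actual features, hyperparameters, and data loading may differ. The toy sentences, the binary labels, and the helper name `build_sgd_baseline` are invented here for illustration only.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import SGDClassifier
from sklearn.pipeline import make_pipeline

# Toy Roman Urdu-style sentences, invented for illustration;
# the real competition data is not reproduced here.
train_texts = [
    "bohat acha hai", "zabardast kaam kiya", "mujhe pasand aya", "bohat khoob",
    "bilkul bekar hai", "bohat bura laga", "pasand nahi aya", "ghatiya kaam",
]
# Assumed labels: 1 = positive, 0 = negative.
train_labels = [1, 1, 1, 1, 0, 0, 0, 0]


def build_sgd_baseline(seed=42):
    """Hypothetical helper: TF-IDF unigrams and bigrams into a linear SGD model."""
    return make_pipeline(
        TfidfVectorizer(ngram_range=(1, 2)),
        SGDClassifier(loss="hinge", max_iter=1000, tol=1e-3, random_state=seed),
    )


model = build_sgd_baseline()
model.fit(train_texts, train_labels)

# Predict labels for two unseen toy sentences.
predictions = model.predict(["acha kaam", "bekar kaam"])
print(predictions)
```

Wrapping the vectorizer and classifier in one pipeline keeps the TF-IDF vocabulary fitted on training data only, which matters once cross-validation is added.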

3. lgb.ipynb:

LightGBM baseline, leaderboard (LB) score 0.8447. It uses Bayesian optimization to find hyperparameters for LightGBM.

You can improve your score based on this method.

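4. Some takeaways:

1. Try some simple models such as naive Bayes or logistic regression; they may work better, at least judging by the results so far.
2. TF-IDF results can be improved further, for example by cleaning the sentences in finer detail. I have seen some papers on cleaning Urdu text via Google; search for the keyword "roman urdu".
3. I think adding emoticon handling will help.
4. Try models such as CNN, BiLSTM, and attention.
5. Since this is a practice competition, I did not put much effort into it, but with some digging I believe a score above 0.9 is achievable. Good luck~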
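The cleaning and emoticon suggestions above could be sketched roughly as follows. This is only an illustration under stated assumptions: the emoticon table, the placeholder tokens `emo_pos` and `emo_neg`, and the function name `clean_roman_urdu` are all hypothetical and not taken from the notebooks.

```python
import re

# Hypothetical emoticon table, for illustration only; extend it for real data.
EMOTICON_TOKENS = {
    ":-)": " emo_pos ",
    ":)": " emo_pos ",
    ":D": " emo_pos ",
    ":-(": " emo_neg ",
    ":(": " emo_neg ",
    "😊": " emo_pos ",
    "😡": " emo_neg ",
}


def clean_roman_urdu(text):
    """Normalize one sentence before TF-IDF (assumed cleaning steps)."""
    # Map emoticons to word-like tokens first, before lowercasing
    # (which would turn ":D" into ":d") and before punctuation removal.
    for emoticon in sorted(EMOTICON_TOKENS, key=len, reverse=True):
        text = text.replace(emoticon, EMOTICON_TOKENS[emoticon])
    text = text.lower()
    # Collapse letters repeated three or more times: "achaaaa" becomes "acha".
    text = re.sub(r"([a-z])\1{2,}", r"\1", text)
    # Replace anything that is not a lowercase letter, digit,
    # underscore, or whitespace with a space.
    text = re.sub(r"[^a-z0-9_\s]", " ", text)
    # Collapse runs of whitespace into single spaces.
    return " ".join(text.split())


print(clean_roman_urdu("Bohat  ACHAAAA hai!!! :)"))
print(clean_roman_urdu("bekar 😡"))
```

Turning emoticons into ordinary tokens lets a TF-IDF vectorizer treat them as features instead of discarding them with the punctuation.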
