Coder Social home page Coder Social logo

d2klab / twitpersonality Goto Github PK

View Code? Open in Web Editor NEW
26.0 10.0 9.0 2.41 MB

TwitPersonality: Computing Personality Traits from Tweets using Word Embeddings and Supervised Learning

License: Apache License 2.0

Python 99.42% Shell 0.58%
python svm regression wordembeddings personality-traits big5 lasso-regression natural-language-processing twitter tweepy

twitpersonality's Introduction

twitpersonality

twitpersonality implements a supervised learning approach to compute personality traits by only relying on what an individual tweets about publicly. The approach segments tweets in tokens, then it learns word vector representations as embeddings that are then used to feed a supervised learner classifier.

How to Use

twitpersonality needs to learn from examples, first thing is to read training.

Once the models are created, then you can do some tests, please refer to test

How to Cite

Carducci G., Rizzo G., Monti D., Palumbo E., Morisio M. (2018) TwitPersonality: Computing Personality Traits from Tweets Using Word Embeddings and Supervised Learning. In Information volume 9 (5), 127

Licence

Apache License, Version 2.0. More details see LICENSE

twitpersonality's People

Contributors

giulioc avatar giusepperizzo avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

twitpersonality's Issues

scripts flow procedure

Can you tell me the scripts flow procedure for training dataset. and what is this fasttext dataset refers to???

Accuracy on tweets is much lower than reported in paper

HWhen reproducing the experiments using this code when trained on the ~10000 row dataset and testing against a users tweets, each user I test always get the ENFJ personality with all scores +/- 0.15 from:
Score for O is: 4.3
Score for C is: 3.4
Score for E is: 3.4
Score for A is: 3.6
Score for N is: 2.6
Jungian type is ENFJ

Do you have any idea why this may be?

I am using a mypersonality dataset with 9918 rows (excluding header) compared to your dataset with 9913 rows, however I think we may be using the same dataset and you removed a few rows?

Documentation: Feature Comparison between repos

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.