Coder Social home page Coder Social logo

iyatomilab / newspaper_word_analysis Goto Github PK

View Code? Open in Web Editor NEW
0.0 16.0 0.0 244 KB

CLCNNを用いたWeb上の新聞記事の解析とモデルの可視化 / https://www.anlp.jp/proceedings/annual_meeting/2018/pdf_dir/B5-2.pdf

iyatomilab newspaper word-analysis character-level-cnn visualization

newspaper_word_analysis's Introduction

newspaper_word_analysis

Web上の読売、朝日、毎日、産経新聞の記事をcharacter-level convolutional neural network (CLCNN) により解析。 CLCNNが得た各新聞社の特徴を表していると考えられる部位についてのヒートマップを作成。

ヒートマップ例[^fig]

マスクすると予測値が大きく低下する N 文字をfive_charsに、その中の単語群をfive_chars/listsに示す。
ヒートマップの強く発火した文字群をhot_pointsのテキストファイルに示す。

Reference

  • 宗里駿, 小谷龍ノ介, 彌冨仁. Character-level Convolutional Neural Networks を用いた新聞社間の記事の違いの解析の試み. 言語処理学会第24回年次大会, 2018.
  • Daiki Shimada, Ryunosuke Kotani, and Hitoshi Iyatomi. Document classification through image-based character embedding and wildcard training. 2016 IEEE Inter- national Conference on Big Data, pp. 3922–3927, 2016.
  • Joshua Saxe and Konstantin Berlin. A character-level convolutional neural network with embeddings for detecting malicious urls, file paths and registry keys. CoRR arXiv:1710.09435, 2017.
  • Edwaed Roff, Jon Barker, Jared Sylvester, Robert Barndon, Bryan Catanzaro, and Charles Nicholas. Malware detection by eating a whole exe. CoRR arXiv:1702.08568, 2017.
  • Matthew D Zeiler and Rob Fergus. Visualising and understanding convolutional net- works. CoRR arXiv:1311.2901, 2013.

newspaper_word_analysis's People

Contributors

hayaosato avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.