Coder Social home page Coder Social logo

ecb-speechs-nlp's Introduction

ecb-speechs-nlp

Description

In this repo, you'll find three scripts to scrap, process and (briefly) analyze the content of the European Central Bank (ECB) communication. We focus on the introductory statement of the President of the ECB. This statement is given during the press conferences following the Governing Council's meetings. All the press conferences can be found here.

Note that until December 4, 2014, statements are only available in English.

Usage

  • utils/scraping.py contains the code needed to scrap the statements (thanks to titigmr for his help);
  • utils/process_speechs.py proposes a processing of the statements (only if the statements are in English);

To scrap all English statements between 2006 and 2020 and store them in a file named raw_data.csv containing the dates, the language(s), the urls and the content of the statements:

python main.py --scrap -n raw_data.csv -l en -y 2006 2021

To process raw_data.csv and store the result in a file named processed_data.csv containing an additional column for processed statements:

python main.py --prep -i raw_data.csv -o processed_data.csv

Analysis

utils/base_analysis.py contains the code to conduct a summary analysis of statements content (words count, top words, readability scores). The notebook basic_analysis.ipynb provides an illustration.

ecb-speechs-nlp's People

Contributors

aflatoune avatar

Stargazers

Pithēkos avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.