Coder Social home page Coder Social logo

scrape-twitter-data's Introduction

Scrape Twitter Data for Data Analytics

Just download the .otd file, import into Octoparse - a free automatic web scraping tool.

You can get the twitter data you want by simply clicking.

twitter scraper_gif_1


Data Extracted in the Scraper

  • Name
  • Publish time
  • Content
  • Image URL
  • Tweet URL
  • Numbers of comments, retweets, and likes

twitter scraper_image_1


Change the scraping URL of the crawler

Search on Twitter in your browser ๐Ÿ‘‰ copy the URL of the result page ๐Ÿ‘‰ paste the URL into the Page URL of the first step: Go To Web Page, click "OK". Now you could run the scraper to get the data.

scrape-twitter-data's People

Contributors

octoparse avatar

Stargazers

 avatar

Watchers

 avatar  avatar

Forkers

gsusi santonioni

scrape-twitter-data's Issues

Infinite scrolling limitations

I'm assisting a University research team extracting tweets from a hashtag which has quite a lot of tweets (circa 5,000-10,000). However, I'm finding some difficulties with the infinite scrolling

I tried making something more dynamic, where everytime I extract the data of one item in the loop, I do some scrolling, this way, I don't need to predict in advance the number of scrolls. However, I've been unsuccessful here.

I think the key could be in a nested loop, where the parent loop goes through all items, and the child loop extracts the data of each item and performs a few scrolls. However, I found a few issues here: parent loop is not dynamically finding new items, and I cannot make the child loop to only process the relevant item.

Could you provide some guidance here?

Thank you so much!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.