Coder Social home page Coder Social logo

redfinscraper's Introduction

An idea that has gone unseized,
Is a forest without any trees,
It exists as a ghost,
With no one to host,
Lost like a breath to the breeze,

An idea that rests in the hand,
Is a seed to be sown in the land,
As a sprout it will grow,
With a spark melting snow,
Shone brightly from your lantern of gold,

Soon trees will dawn,
Bark thick and of brawn,
A forest built to be old

A tall lantern over the forest

redfinscraper's People

Contributors

ryansherby avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

redfinscraper's Issues

API Results Limited to 350 Entries Regardless of Time Span

`
scraper = RedfinScraper()
scraper.setup(zip_database_path, multiprocessing=False)
zip_codes = ['85255']

for zip_code in zip_codes:
        data = scraper.scrape(zip_codes=[zip_code], sold=True, sale_period='1yr') # change to 3yr
        data.to_csv(os.path.join(output_dir, f'scraped_data_{zip_code}.csv'), index=False)`

--
Steps to Reproduce:

  1. Set up the scraper with a specific zip code.
  2. Scrape data for a 1-year time span.
  3. Observe the number of results (350 in my case).
  4. Increase the time span to 3 years.
  5. Scrape data again for the same zip code.
  6. Observe the number of results (still 350).

Any insights or workarounds would be appreciated. Thank you!

Time On Market

Great project. For Sold properties, is is possible to obtain List Price?

Is there documentation for the Stingray API anywhere?

Additional fields

Is it possible to have this additionally scrape the listing text?

Downloads partial data:

Sample code block where RedfinScraper does not work:

from redfin_scraper import RedfinScraper
scraper = RedfinScraper()

scraper.setup('zip_code_database.csv', False)
print(scraper.scrape(zip_codes=['78704']))
print(scraper.scrape(city_states=['Austin, TX']))

Config changes

Can the base URLs in the rsc config be changed? Meaning, if we wanted to view sold properties vs just for sale properties, could we change the base url? I think it needs to be of the shape https://www.redfin.com/zipcode/<zip_code>/filter/include=sold-<some_number>mo

reddit API limit? (building a stream)

the smallest time unit supported by this app is 1mo...is that a limit built into the reddit API?

for a class, I'm trying to find examples of real data coming in a stream; so, it would be grand if I could grab lots of zip codes, but for only a short period of time (day, hour, ??).

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.