altr's People
altr's Issues
Figure out the logistics for keeping a history of audits that way the changes can be tracked and visualized.
Add support for searching the URL of the property via the Google API.
See how things look with `hover-tab`.
Build an architecture diagram.
Consider making custom components to support better visualization of the data + for a better UI experience.
Adding a new general method to scrape Zillow.
Figure out how to remove the `.config` file and API keys/sensitive info from the commit history.
Keep trying to scrape when ScraperAPI fails
Don't proceed to parsing the output until the proper response is returned.
Reinforcement learning for smart HTML tree traversal?
Write an API to easily verify the outputs of an audit for any single attribute on a specific property.
Use XPaths to parse instead
- Xpaths might be a better choice for navigating the HTML tree.
- Use
parsel
or look for a different package - Read https://scrapfly.io/blog/parsing-html-with-xpath/
- Research if anyone has used machine learning to train a smart XPath traversal algorithm ... any smart tree traversal algorithm might be applicable in the XPath context
Add capability to generate a report after the audit.
In the same way, see if we can transition to using an st.expander
for displaying the results after the audit. That way, if someone runs an audit with like 100 properties, their browser isn't overwhelmed.
Add scraping support for Redfin, Trulia, etc.
Investigate refactoring in order to reduce database queries.
Make sure all sections have `logout` capability.
Add support for better statistics.
i.e. add accuracy statistics on address, price, bedrooms, bathrooms, sqft, acre, year_built. See if there's anything else that might demonstrate how this misreporting impacts the sale of the property.
Create a testing framework.
Check out the link attached for apply machine learning to web scraping
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. ๐๐๐
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google โค๏ธ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.