edologgerbird / sfyr-data-pipeline Goto Github PK
View Code? Open in Web Editor NEWLicense: GNU General Public License v3.0
License: GNU General Public License v3.0
Differentiate between Stocks, ETFs, DLCs, SWs, REITs, Business Trust, Company Warrants and ADRs.
Requirement: Ingest Data from the SGX Scrape and output the relevant information
Currently the scraped messages do not allow us to determine which stocks are mentioned in the posts.
To allow them to be stored and queried efficiently, we would need to automatically extract the mentioned stocks and store them in a format that is suitable for SQL query
As a user I should be able to call the function to scrape the latests posts
And it should automatically identify the stocks mentioned from those posts
Then the information should be uploaded into BigQuery
And be easily extracted out from BigQuery based on the stocks
Currently the scraped messages do not allow us to determine which stocks are mentioned in the telegram messages.
To allow them to be stored and queried efficiently, we would need to automatically extract the mentioned stocks and store them in a format that is suitable for SQL query
As a user I should be able to call the function to scrape the latest messages from the telegram groups
And it should automatically identify the mentioned stocks
Then the information should be uploaded into BigQuery
And be easily extracted out from BigQuery based on the mentioned stocks
Currently SGX Data Scrape output is stored in a csv file. However, the requirements of the assignment is to store all information in a data warehouse - in this case BigQuery.
The output should be automatically uploaded to BigQuery at the end of the extraction
Currently the scraped messages do not allow us to determine which stocks are mentioned in the posts.
To allow them to be stored and queried efficiently, we would need to automatically extract the STI Movement and store them in a format that is suitable for SQL query
As a user I should be able to call the function to scrape the latests posts
And it should automatically identify the STI Movements from those posts
Then the information should be uploaded into BigQuery
And be easily extracted out from BigQuery
GBQ Ingest API should check to ensure data input is in the form of a pandas dataframe before calling the ingest API
Exploring if it is possible to ensure SBR scraper does not scrape information that occur before last scraped date
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.