kashyapdevesh / manga-newsletter Goto Github PK
View Code? Open in Web Editor NEWWeekly Manga Newsletter Telegram Bot
Home Page: https://t.me/+7qn27AYiREo3M2Y1
Weekly Manga Newsletter Telegram Bot
Home Page: https://t.me/+7qn27AYiREo3M2Y1
In my project, I'm currently using a text summarization model and sentiment analysis model provided by the hugging face hub.
I intend to replace it with a custom model with comparatively less latency and faster processing time.
If you have any suggestions on how to proceed with respect to the nature of the work, please do comment or make a pull request with a sample work/prototype.
Also if you need the dataset, for training the model or for simple observation, do comment.
The python codes in the project are written while following the functional programming paradigm. The functions written can easily be converted into python class object format and be made modular.
The given issue is good to get a rough estimate of the project and is a beginner-friendly issue as the codebase is still growing and this issue can easily be resolved at this stage.
Currently, the flow of execution of the program is sequential and it's taking lots of time while computing a single pic.
I intend to run this program on a real-time feed and on such a system such a processing time would suck. I'm currently stuck with another aspect of the project, so I'm not able to focus my time here.
Any form of help regarding this issue that could reduce the overall processing time would be very beneficial and highly appreciated.
The entire code was written over a very short period of time, and I didn't pay much attention to code commenting styles and documentation paradigms.
Now that the scope of the project is increasing, I'm having issues with keeping logs and track of the project.
I need help with ordering the comments in the code files and further, if you have any ideas or suggestions, please do share
The entire structure of the project is as follows:
I am facing a critical issue with implementing the multithreaded queue data structure which should be connected with the Manganelo Scraper, Page Scraper, Sentiment Analysis, and Summary Generator and final Newsletter Generation files.
The idea behind using a multithreaded queue data structure is such that the whole pipelined process(from scraping to newsletter generation) can be treated as a transaction and if the process fails at any step due to any issue we can push it into a queue shared over the running processes.
This would give me a running solution that I could use as a placeholder and move forward with the project. I know an optimal solution may be more complex or even require writing a custom multithreaded priority queue, but I need a working solution at the moment.
PS:
This is the first time I working with concurrency control in python, and I don't have much prior knowledge of the same, If you have another solution, suggestion or idea, please please do comment.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.