Coder Social home page Coder Social logo

zayd-jamadar / text_summarizer Goto Github PK

View Code? Open in Web Editor NEW

This project forked from harsh188/text_summarizer

0.0 0.0 0.0 14.88 MB

This project is a part of my semester long 'mini-project' course at PES University. With the guidance of Dr. S Natarajan and the help of my fellow colleague Harshith Mohankumar we were able to deploy google's state of the art abstractive text summarization model, PEGASUS, onto the internet for anyone to utilize.

License: MIT License

text_summarizer's Introduction

Text_Summarizer

Licence

This project is a part of my semester long 'mini-project' course at PES University. With the guidance of Dr. S Natarajan and the help of my fellow colleague Zayd J we were able to deploy google's state of the art abstractive text summarization model, PEGASUS, onto the internet for anyone to utilize.

Deploying Abstractive Text Summarizer using GCP

Website: (https://textsummarization-ddf77.web.app/) Note: The website will depriciate on June 30th 2021 due to budget limitations.

Started: Jan 2021

Finished: May 2021

Abstract:

Although emerging technology and a surplus of data are resulting in ground breaking research in the field of deep learning, deploying state of the art models into products is still a challenging task. In this work, we aim to expose the difficulties in deployment and operation to help familiarize individuals to the ML pipeline by utilizing various services on the Google Cloud Platform. We have implemented a state of the art transformer model, PEGASUS, in an end-to-end abstractive text summarizer product which has been deployed onto the internet for anyone to utilize.

Description:

This project is a fully scalable end-to-end abstractive summarizer built on the state of the art PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization (ICML 2020) model. This deep learning transformer based model was built last year [2020] by incredible people at Google. Our product takes this state of the art model and puts into a production environment for any user on the internet to utilize.

The main goal of our project is to familiarize developers to the process of deploying a state of the art deep learning model into production.

Branches:

This github repository contains 3 different branches, each of which contain a different segment of our end-to-end product. The React_Website branch contains, well you can guess, the code for our website which is programmed using REACT js. The api branch contains the code for the docker container which exists on cloud run. Finally the model branch contains all of the experiements we performed with different transformer based models during our initial testing phase.

What we have learned:

This project is one of the first full scale products that we have created and deployed onto the internet. It is also our first time deploying an ML model. We learned a lot about the transformer/attention models, cloud technologies, ML pipeline, and much more.

Here is a neat list of all the technologies/skills/knowledge that we picked up with this project:

  • Technologies
    • Front End
      • React JS
      • REST API
      • Firebase Web Hosting
    • Back End
      • Docker
      • Google Cloud Run
      • Node JS
      • Flask
    • Model Traning
      • Python
      • TensorFlow
      • Jupyter
      • Hugging Face Transformers
      • Local training done using RTX 3080
  • Research
    • RNNs
    • LSTMs
    • GRUs
    • Transformers
      • Self-Attention
      • GPT
      • BERT
      • PEGASUS

text_summarizer's People

Contributors

harsh188 avatar zayd-jamadar avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.