Coder Social home page Coder Social logo

dialog-summarization-samsum's Introduction

Dialog-Summarization-SamSum

Dialog Summarization modelling using Topic modelling and similar sentence reduction

Project

This project has been carried as part of a course project which will be extended to further research down the line. The primary objective of this summarization project is to augment information available in each utterance with additional modules that extract relevant features from the dialogue set. With the advent of transformer models, several augmentation steps can help improve the generation task in hand.

Contributions of this project

We have made of use of two modules mainly:

We have noticed that certain sentences in dialogue set contribute more to the dialogue summary than others. Hence topic classification is one of the important steps that helps to segregate utterances such as greetings, fillers, affirmations, acknowledgements from questions, explanations, responses. This marked difference classified at utterance level helps to augment the dataset.
Also, in many dialogue sets, there can be back and forth explanations around intent/topic that makes it redundant to be included in a summary. Hence, identifying similar sentences across the dataset which ensures inclusion of topics that are looped back at a later point in conversation, helps to avoid them being included in the summary.

How to run

The python notebook in experiments folder is self explanatory. Different experiments are marked with sub-headings which can be run directly in colab. Even the dataset is attached only for reference purpose and they can also be directly imported in the python environment.

Baselines and model

T5 is the baseline chosen on which the above mentioned modules have been implemented to augment the dataset. These are available in the python notebook.

Members of the project

  • Tharun Suresh
  • Aseem Srivastava

dialog-summarization-samsum's People

Contributors

tharunsuresh-code avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.