๐Ÿคณ๐Ÿฝ SehatQA

This repository serves to document my undergraduate thesis project.

SehatQA is a web-based answer recommendation system trained on question-answer data collected from Alodokter between 2014 and 2020. For every question a user submits, the system performs three tasks:

1. Topic classification

This task assigns the user's question to one or more topics (multi-label classification). To keep development manageable and because of limited data, a question must fall under the 10 topics specified in labels.csv. Classification is performed by a neural network model; BiLSTM-CNN and BiGRU-CNN were among the best-performing architectures.
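As a rough illustration of the multi-label step, the sketch below turns one raw score per topic into a set of predicted topics. It assumes the classifier emits one logit per topic and that a sigmoid with a 0.5 threshold is used to decide membership; the topic names, the `decode_topics` helper, and the threshold are hypothetical and are not taken from this repository.

```python
import math

# Hypothetical topic names; the real ones are listed in labels.csv.
TOPICS = ["topic_a", "topic_b", "topic_c"]


def sigmoid(x: float) -> float:
    """Map a raw score to a value between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-x))


def decode_topics(logits, topics=TOPICS, threshold=0.5):
    """Return every topic whose sigmoid score reaches the threshold.

    Each topic is decided independently, so a question can receive
    zero, one, or several topics (multi-label output).
    """
    if len(logits) != len(topics):
        raise ValueError("expected one logit per topic")
    scores = [sigmoid(z) for z in logits]
    return [topic for topic, score in zip(topics, scores) if score >= threshold]


if __name__ == "__main__":
    # Assumed example logits, not real model output.
    print(decode_topics([2.0, -1.0, 0.3]))
```

Because every topic is thresholded on its own, lowering the threshold trades precision for recall without retraining the model.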

2. Similar questions selection

This task selects the 10 dataset questions most similar to the input question. Similarity between the input question and each dataset question is measured with cosine similarity, with text represented as vectors built from the pretrained FastText word vectors for Bahasa Indonesia.
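The selection step could look roughly like the following sketch. It assumes a question is represented by the average of its word vectors, which is one common choice but is not confirmed by this README. The helper names and the tiny two-dimensional vectors are made up for illustration; the real system would load the FastText vectors instead.

```python
import math


def mean_vector(tokens, word_vectors):
    """Average the vectors of the known tokens; return None if none are known."""
    known = [word_vectors[t] for t in tokens if t in word_vectors]
    if not known:
        return None
    dim = len(known[0])
    return [sum(v[i] for v in known) / len(known) for i in range(dim)]


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k_similar(query, questions, word_vectors, k=10):
    """Return up to k dataset questions, most similar to the query first."""
    q_vec = mean_vector(query.lower().split(), word_vectors)
    if q_vec is None:
        return []
    scored = []
    for question in questions:
        d_vec = mean_vector(question.lower().split(), word_vectors)
        if d_vec is not None:
            scored.append((cosine_similarity(q_vec, d_vec), question))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [question for _, question in scored[:k]]


if __name__ == "__main__":
    # Toy vectors standing in for the pretrained FastText vectors.
    toy_vectors = {
        "demam": [1.0, 0.0],
        "batuk": [0.9, 0.1],
        "kulit": [0.0, 1.0],
        "gatal": [0.1, 0.9],
    }
    dataset = ["kulit gatal", "batuk demam", "batuk"]
    print(top_k_similar("demam", dataset, toy_vectors, k=2))
```

For a large dataset, precomputing the question vectors once would avoid recomputing them on every request.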

3. Answer recommendation extraction

The answers to the selected similar questions are summarized extractively and presented to the user as recommended answers.
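The README does not say which extractive method is used, so the sketch below shows only one simple possibility: score each sentence by the average frequency of its words in the answer and keep the highest-scoring sentences in their original order. The `summarize` function and its `max_sentences` parameter are hypothetical.

```python
import re
from collections import Counter


def split_sentences(text):
    """Split text after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def summarize(text, max_sentences=2):
    """Keep the max_sentences highest-scoring sentences, in original order."""
    sentences = split_sentences(text)
    if len(sentences) <= max_sentences:
        return sentences
    # Word frequencies over the whole answer.
    freq = Counter(re.findall(r"\w+", text.lower()))

    def score(sentence):
        words = re.findall(r"\w+", sentence.lower())
        return sum(freq[w] for w in words) / len(words) if words else 0.0

    ranked = sorted(range(len(sentences)),
                    key=lambda i: score(sentences[i]), reverse=True)
    keep = sorted(ranked[:max_sentences])
    return [sentences[i] for i in keep]


if __name__ == "__main__":
    # Invented answer text, used only to exercise the function.
    answer = ("Fever often comes with infection. Rest and fluids help with "
              "fever. Please see a doctor if the fever lasts three days.")
    print(summarize(answer, max_sentences=2))
```

An extractive summary only selects existing sentences, so the recommended answer never contains wording that was absent from the original answer.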

Requirements

  1. Download Python 3.8.10 here
  2. Install the packages from requirements.txt
     pip install -r requirements.txt
  3. Upgrade scikit-learn to v1.1.1
     pip install scikit-learn==1.1.1
  4. Install FastAPI, Uvicorn, Jinja2, and python-multipart to run the system locally
     pip install fastapi uvicorn jinja2 python-multipart
  5. Download the Pickle file for the FastText Bahasa Indonesia word vectors here. Put the file in your local project directory.
  6. Start the web server locally by running this in a terminal
     uvicorn main:app --reload
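Before starting the server, it may help to confirm that the packages from the steps above can be imported. The script below is an optional sketch using only the standard library; `missing_modules` is a hypothetical helper, and the import names are the usual ones for these packages (`sklearn` for scikit-learn). python-multipart is left out because its import name differs between versions.

```python
import importlib.util
import sys


def missing_modules(module_names):
    """Return the top-level module names that cannot be found for import."""
    return [name for name in module_names
            if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    print("Python", sys.version.split()[0])
    required = ["sklearn", "fastapi", "uvicorn", "jinja2"]
    missing = missing_modules(required)
    print("Missing:", ", ".join(missing) if missing else "none")
```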

Contributors

arommal
