MultiTask Learning in Intent Detection and Slot Prediction for Tamil Conversational Dialogues using Multilingual Pretrained Models

The aim of this project is to develop an intent detection and slot prediction system for the Tamil language. An open-source dataset from the paper "TamilATIS: Dataset for Task-Oriented Dialog in Tamil (S et al., DravidianLangTech 2022)" was used. Both single-task learning and multi-task learning approaches are tested. In multi-task learning, we use Random Loss Weighting for intent detection to account for class imbalance.
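
Random Loss Weighting is commonly described as drawing fresh task weights at every training step, normalizing them with a softmax, and summing the weighted per-task losses. The snippet below is a minimal sketch under that assumption, not the code used in this repo. The function names (`random_loss_weights`, `combine_losses`) and the loss values are hypothetical, and plain floats stand in for the tensors a real training loop would use.

```python
import math
import random


def random_loss_weights(num_tasks, rng):
    """Sample one raw weight per task from N(0, 1) and normalize with softmax."""
    raw = [rng.gauss(0.0, 1.0) for _ in range(num_tasks)]
    peak = max(raw)  # subtract the max before exponentiating, for numerical stability
    exps = [math.exp(r - peak) for r in raw]
    total = sum(exps)
    return [e / total for e in exps]


def combine_losses(task_losses, rng):
    """Return the randomly weighted sum of the task losses and the weights used."""
    weights = random_loss_weights(len(task_losses), rng)
    combined = sum(w * loss for w, loss in zip(weights, task_losses))
    return combined, weights


# Hypothetical per-step losses for the two tasks (assumed values, not real results).
intent_loss, slot_loss = 0.80, 0.35

rng = random.Random(0)
for step in range(3):
    total, weights = combine_losses([intent_loss, slot_loss], rng)
    print(f"step {step}: weights={[round(w, 3) for w in weights]} total={total:.4f}")
```

Because the weights are resampled at each step and always sum to 1, the combined loss stays between the smallest and largest task loss while the emphasis on each task varies from step to step.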

Getting Started

Dataset

The TamilATIS dataset was created by translating the ATIS dataset into Tamil and then annotating the slots manually. To get the dataset, email the authors of the paper "TamilATIS: Dataset for Task-Oriented Dialog in Tamil (S et al., DravidianLangTech 2022)".

Demo

The demo for this project is available here

Usage

As a first step, clone this repo.

To run the multi-task learning experiments, edit the configurations in the configs folder. This project uses Hydra as a configuration management tool. If you are new to Hydra, I would recommend this tutorial.

After setting up all the configs, run:

python3 main.py

The single-task learning experiments are inside the stl folder.

Results

To experiment with this dataset, the multilingual pretrained models XLM-Roberta-Base and XLM-Align-Base are used; the single-task tables also report MuRIL-Base and MuRIL-Large. Both single-task learning and multi-task learning approaches are used.

Single Task Learning

Intent-Detection

Model	Accuracy	Macro F1	Weighted F1
XLM-ROBERTA-BASE	0.9252	0.4080	0.9166
XLM-ALIGN-BASE	0.9013	0.2506	0.8839
MURIL-BASE	0.8146	0.0498	0.7314
MURIL-LARGE	0.8968	0.2290	0.8715
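
The three columns weight classes differently, which is one plausible reason accuracy and weighted F1 can be high while macro F1 stays low on an imbalanced label set. The sketch below uses a hypothetical toy example (the labels and predictions are made up, not taken from TamilATIS) to show how the three numbers are computed; the helper names are assumptions, and the evaluation code in this repo may use a library instead.

```python
from collections import Counter


def per_class_f1(y_true, y_pred):
    """Compute F1 for every label seen in the gold labels or the predictions."""
    scores = {}
    for label in sorted(set(y_true) | set(y_pred)):
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == label and p == label)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t != label and p == label)
        fn = sum(1 for t, p in zip(y_true, y_pred) if t == label and p != label)
        denom = 2 * tp + fp + fn
        scores[label] = 2 * tp / denom if denom else 0.0
    return scores


def summarize(y_true, y_pred):
    """Return (accuracy, macro F1, weighted F1)."""
    f1 = per_class_f1(y_true, y_pred)
    support = Counter(y_true)  # number of gold examples per label
    n = len(y_true)
    accuracy = sum(1 for t, p in zip(y_true, y_pred) if t == p) / n
    macro = sum(f1.values()) / len(f1)                  # every class counts equally
    weighted = sum(f1[c] * support[c] for c in f1) / n  # classes count by frequency
    return accuracy, macro, weighted


# Hypothetical imbalanced data: one frequent intent and two rare ones.
gold = ["flight"] * 8 + ["airfare", "ground_service"]
pred = ["flight"] * 10  # a model that always predicts the majority intent

acc, macro, weighted = summarize(gold, pred)
print(f"accuracy={acc:.4f} macro_f1={macro:.4f} weighted_f1={weighted:.4f}")
# accuracy=0.8000 macro_f1=0.2963 weighted_f1=0.7111
```

In this toy case the two rare intents each contribute an F1 of 0 to the macro average, while they barely affect accuracy or weighted F1.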

Slot Filling

Model	Accuracy	Macro F1	Weighted F1
XLM-ROBERTA-BASE	0.9507	0.6298	0.9205
XLM-ALIGN-BASE	0.9428	0.57	0.89
MURIL-BASE	0.6437	0.0635	0.5456
MURIL-LARGE	0.96	0.7611	0.9548