Coder Social home page Coder Social logo

jordandeklerk / oraclecoder Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 6.68 MB

A QLoRA+ LLM Ensemble with Schema-Linking for Text-to-SQL Generation

License: MIT License

Jupyter Notebook 99.30% Python 0.70%
artificial-intelligence deep-learning llms lora machine-learning peft-fine-tuning-llm

oraclecoder's Introduction

OracleCoder: Enhancing Text-to-SQL Generation with QLoRA+ Ensemble

NOTE: This project is on-going. The results here are not complete. Stay tuned for future updates on the evaluation of OracleCoder.

OracleCoder is a cutting-edge project aimed at advancing text-to-SQL generation capabilities. It leverages a novel approach, combining schema linking and SQL generation tasks, to fine-tune open-source language models efficiently. This project introduces an innovative ensemble method using QLoRA+ to achieve high performance while addressing data privacy and cost concerns, democratizing access to state-of-the-art text-to-SQL technologies.

Key Features

  • Efficient Fine-Tuning: Utilizes LoRA+ for efficient adaptation of smaller models, enabling performance on par with larger models without significant computational resources.
  • Ensemble Approach: Employs a novel ensemble strategy with QLoRA+ to enhance model performance and reliability, showcasing a path forward for achieving high accuracy in text-to-SQL generation.
  • Schema Linking and SQL Generation: Decomposes the complex text-to-SQL task into schema linking and SQL generation sub-tasks, allowing for more focused model training and improved accuracy.
  • Community-Driven and Open-Source: Emphasizes the importance of community-driven, open-source innovation in advancing natural language processing and database interaction technologies.

Project Structure

  1. Schema Linking: The first phase involves identifying relevant tables and columns from a database schema based on a given natural language question. This step is crucial for understanding the context and narrowing down the focus for SQL generation.

  2. SQL Generation: In the second phase, the model generates SQL queries based on the identified schema elements and the user's question. This step involves fine-tuning language models to accurately translate natural language questions into SQL queries.

  3. Ensemble Model: Multiple models with varying learning rates for the LoRA adapters are trained and then merged into a single ensemble model. This approach enhances model performance and reliability, demonstrating a novel way to leverage ensemble methods in LLMs.

  4. Evaluation: The project includes a comprehensive evaluation process, utilizing the Spider dataset and the BIRD benchmark to assess the performance of the developed models in schema linking and SQL generation tasks.

See my website for the full project.

oraclecoder's People

Contributors

jordandeklerk avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.