Coder Social home page Coder Social logo

deep_quadric_learning's Introduction

Deep Quadric Learning - A tale in 3 episodes

What is quadric learning? If digital technological evolution taught us one thing then it's this: Moore's 1st law will not always be on our side, his 2nd law never was.

The current hype concerning Large Language Models (LLM), the huge amount of data and parameters to build and train them clearly show that the trend of ever bigger data and ever bigger models can't be sustained and clearly negatively impacts

available AI compute power AI costs the democratization of access to AI technology the AI carbon foot-print & sustainability of deep learning models Everything that contributes positively in a meaningful way in order to improve the situation is absolutely crucial.

AI needs a paradigm shift because it is an illusion that much more progress can be achieved just by simply inflating model size and data. The JEPA approach of Yann LeCun seems to be a very promising next step but real world applicability is not imminent to say the least.

In the meantime, reducing the number of parameters is always a good idea.

This project of Deep Quadric Learning wants to be a small and humble contribution to the above mentioned dilemma by introducing quadric decision hypersurfaces, second order separability and enable the study of its impact on model size, parameters and interconnectivity of deep learning models. The idea for this project is based on my research thesis back in the day:

https://www.researchgate.net/publication/221582251_Using_Quadratic_Perceptrons_to_Reduce_Interconnection_Density_in_Multilayer_Neural_Networks

While attempting to directly reduce net model size itself, quadric learning at the same time still allows for various subsequent methods like hyper parameter optimization, model distillation, sparse activation, evolutionary architecture search etc. etc.

In 3 episodes this project will try to introduce the concept of quadric learning from the motivation (1) the single quadric neuron (2) to quadric layers and their application in a little hybrid model (3).

The proof of concept implementation in PyTorch does in no way affect the applicability of Deep Quadric Learning to other frameworks.

In fact you soon might find more application examples of deep quadric learning in this repo with PyTorch, Keras 3.0 and tinygrad

deep_quadric_learning's People

Contributors

diro5t avatar

Stargazers

jist avatar Mehmetcan Budak avatar Alexander avatar Antonio Castro Gutierrez  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.