Coder Social home page Coder Social logo

Mudit Agarwal's Projects

automation-of-slide-matching icon automation-of-slide-matching

With the onset of of internet era, there is a boom in online learning. So, for better visual experience, along with the video of the lecture, soft copy of the slides is also being embedded into the video. But most of this slide matching process is done manually which is a laborious task. So to automate the task of slide matching, here is a small contribution from my side.

delaytron-efficient-learning-of-multiclass-classifiers-with-delayed-bandit-feedbacks icon delaytron-efficient-learning-of-multiclass-classifiers-with-delayed-bandit-feedbacks

In this paper, we present online algorithm called {\it Delaytron} for learning multi class classifiers using delayed bandit feedbacks. The sequence of feedback delays $\{d_t\}_{t=1}^T$ is unknown to the algorithm. At the $t$-th round, the algorithm observes an example $\mathbf{x}_t$ and predicts a label $\tilde{y}_t$ and receives the bandit feedback $\mathbb{I}[\tilde{y}_t=y_t]$ only $d_t$ rounds later. When $t+d_t>T$, we consider that the feedback for the $t$-th round is missing. We show that the proposed algorithm achieves regret of $\mathcal{O}\left(\sqrt{\frac{2 K}{\gamma}\left[\frac{T}{2}+\left(2+\frac{L^2}{R^2\Vert \W\Vert_F^2}\right)\sum_{t=1}^Td_t\right]}\right)$ when the loss for each missing sample is upper bounded by $L$. In the case when the loss for missing samples is not upper bounded, the regret achieved by Delaytron is $\mathcal{O}\left(\sqrt{\frac{2 K}{\gamma}\left[\frac{T}{2}+2\sum_{t=1}^Td_t+\vert \mathcal{M}\vert T\right]}\right)$ where $\mathcal{M}$ is the set of missing samples in $T$ rounds. These bounds were achieved with a constant step size which requires the knowledge of $T$ and $\sum_{t=1}^Td_t$. For the case when $T$ and $\sum_{t=1}^Td_t$ are unknown, we use a doubling trick for online learning and proposed Adaptive Delaytron. We show that Adaptive Delaytron achieves a regret bound of $\mathcal{O}\left(\sqrt{T+\sum_{t=1}^Td_t}\right)$. We show the effectiveness of our approach by experimenting on various datasets and comparing with state-of-the-art approaches.

dqn_agent icon dqn_agent

Implement a Deep Q-Network (DQN) on the game of Atari Breakout from the OpenAI Gym

fighter-jet icon fighter-jet

Try to emulate a fighter jet game in 3-D using OpenGL in C++

gan icon gan

Designed and trained a GAN to generate data from the given normal distribution.

geometry icon geometry

Boost.Geometry - Generic Geometry Library

jetpack-joyride icon jetpack-joyride

An attempt to replicate the famous JetPack-JoyRide Game using openGl in cpp

learning-multiclass-classifier-under-noisy-bandit-feedback-code icon learning-multiclass-classifier-under-noisy-bandit-feedback-code

This algorithm is formulated to addresses the problem of multiclass classification with corrupted or noisy bandit feedback. In this setting, the learner maynot receive true feedback. Instead, it receives feedback that has beenflipped with some non-zero probability. We propose a novel approachto deal with noisy bandit feedback, based on the unbiased estimatortechnique. This algorithm can also efficiently estimate the noise rates, and thus providing an end-to-end framework. The proposed algorithm enjoys mistake bound of the order ofO(√T) in the highnoise case and of the order ofO(T^2/3) in the worst case.

mario- icon mario-

An attempt to make famous Mario Game

reinforcement-learning icon reinforcement-learning

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

sql-engine- icon sql-engine-

A mini sql-engine which will run a subset of SQL queries using command line interface

xtreme-tictactoe-bot icon xtreme-tictactoe-bot

AI bot for Xtremem TicTacToe (a slight variant of ultimate tictactoe with 2 big boards) using alpha beta pruning, winning heuristic and quiescence search

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.