mudit-1999 Goto Github PK
Name: Mudit Agarwal
Type: User
Location: IIIT Hyderabad
Name: Mudit Agarwal
Type: User
Location: IIIT Hyderabad
Tasks
An algorithm which can learn a mutliclass classifier under bandit feedback setting actively
With the onset of of internet era, there is a boom in online learning. So, for better visual experience, along with the video of the lecture, soft copy of the slides is also being embedded into the video. But most of this slide matching process is done manually which is a laborious task. So to automate the task of slide matching, here is a small contribution from my side.
Rippling Acad
Assignment
A basic ticket booking appp
Implemented a functional compiler which converts C language to x86
In this paper, we present online algorithm called {\it Delaytron} for learning multi class classifiers using delayed bandit feedbacks. The sequence of feedback delays $\{d_t\}_{t=1}^T$ is unknown to the algorithm. At the $t$-th round, the algorithm observes an example $\mathbf{x}_t$ and predicts a label $\tilde{y}_t$ and receives the bandit feedback $\mathbb{I}[\tilde{y}_t=y_t]$ only $d_t$ rounds later. When $t+d_t>T$, we consider that the feedback for the $t$-th round is missing. We show that the proposed algorithm achieves regret of $\mathcal{O}\left(\sqrt{\frac{2 K}{\gamma}\left[\frac{T}{2}+\left(2+\frac{L^2}{R^2\Vert \W\Vert_F^2}\right)\sum_{t=1}^Td_t\right]}\right)$ when the loss for each missing sample is upper bounded by $L$. In the case when the loss for missing samples is not upper bounded, the regret achieved by Delaytron is $\mathcal{O}\left(\sqrt{\frac{2 K}{\gamma}\left[\frac{T}{2}+2\sum_{t=1}^Td_t+\vert \mathcal{M}\vert T\right]}\right)$ where $\mathcal{M}$ is the set of missing samples in $T$ rounds. These bounds were achieved with a constant step size which requires the knowledge of $T$ and $\sum_{t=1}^Td_t$. For the case when $T$ and $\sum_{t=1}^Td_t$ are unknown, we use a doubling trick for online learning and proposed Adaptive Delaytron. We show that Adaptive Delaytron achieves a regret bound of $\mathcal{O}\left(\sqrt{T+\sum_{t=1}^Td_t}\right)$. We show the effectiveness of our approach by experimenting on various datasets and comparing with state-of-the-art approaches.
Implement a Deep Q-Network (DQN) on the game of Atari Breakout from the OpenAI Gym
Try to emulate a fighter jet game in 3-D using OpenGL in C++
Designed and trained a GAN to generate data from the given normal distribution.
Boost.Geometry - Generic Geometry Library
An attempt to replicate the famous JetPack-JoyRide Game using openGl in cpp
This algorithm is formulated to addresses the problem of multiclass classification with corrupted or noisy bandit feedback. In this setting, the learner maynot receive true feedback. Instead, it receives feedback that has beenflipped with some non-zero probability. We propose a novel approachto deal with noisy bandit feedback, based on the unbiased estimatortechnique. This algorithm can also efficiently estimate the noise rates, and thus providing an end-to-end framework. The proposed algorithm enjoys mistake bound of the order ofO(√T) in the highnoise case and of the order ofO(T^2/3) in the worst case.
An attempt to make famous Mario Game
memory game
A fully functional quiz portal
An algorithm to generate random point inside all types of polygon
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
Reinforcement Learning with a corrupted reward channel
Application of Policy and Value Iteration in real world
A mini sql-engine which will run a subset of SQL queries using command line interface
Attempt to writer shader or lightning module in Webgl
Predicting Times Series and leraning temporal dependencies using vanilla as well as stacked RNN and LSTM models.
A basic C-shell
AI bot for Xtremem TicTacToe (a slight variant of ultimate tictactoe with 2 big boards) using alpha beta pruning, winning heuristic and quiescence search
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.