STAT 453: Intro to Deep Learning @ UW-Madison (Spring 2021)
License: MIT License
Hello @rasbt,
First of all, thanks for making all this material available online, along with your video lectures. A really helpful resource!
A small issue and fix: the classic softmax regression implementation in L08/code/softmax-regression_scratch.ipynb has a small error in the bias computation (I think). The training output (cell 8) shows the same value for all bias terms:
```
Epoch: 049 | Train ACC: 0.858 | Cost: 0.484
Epoch: 050 | Train ACC: 0.858 | Cost: 0.481

Model parameters:
  Weights: tensor([[ 0.5582, -1.0240],
                   [-0.5462,  0.0258],
                   [-0.0119,  0.9982]])
  Bias: tensor([-1.2020e-08, -1.2020e-08, -1.2020e-08])
```
whereas the second implementation, which uses the `nn.Module` API, gives distinct bias terms.
The problem lies in the `torch.sum` call in `SoftmaxRegression1.backward`: it computes a single sum over all residuals, which is later broadcast across all bias terms. You can fix this by changing
```python
def backward(self, x, y, probas):
    grad_loss_wrt_w = -torch.mm(x.t(), y - probas).t()
    grad_loss_wrt_b = -torch.sum(y - probas)
    return grad_loss_wrt_w, grad_loss_wrt_b
```
to
```python
def backward(self, x, y, probas):
    grad_loss_wrt_w = -torch.mm(x.t(), y - probas).t()
    grad_loss_wrt_b = -torch.sum(y - probas, dim=0)
    return grad_loss_wrt_w, grad_loss_wrt_b
```
With this change, it learns the toy problem slightly better.
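For what it's worth, here is a small NumPy sketch (toy numbers, not from the notebook) of why the unfixed version stalls: each row of `y - probas` sums to zero, because the one-hot `y` and the softmax `probas` each sum to 1 per sample, so summing over everything is numerically ~0. That matches the biases sitting at about -1.2e-08 in the output above.

```python
import numpy as np

# Toy residuals (y - probas) for 2 samples and 3 classes. Each row sums to 0,
# since both the one-hot labels and the softmax outputs sum to 1 per sample.
resid = np.array([[ 0.2, -0.1, -0.1],
                  [-0.4,  0.3,  0.1]])

grad_b_scalar = -np.sum(resid)             # one number, broadcast to all biases
grad_b_perclass = -np.sum(resid, axis=0)   # one gradient per class bias

print(grad_b_scalar)    # ~0.0, so the biases barely move
print(grad_b_perclass)  # [ 0.2 -0.2 -0.0], a distinct gradient per class
```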
Why is `train_dp_list` passed as the first argument to every DataLoader here, including the validation and test loaders?
```python
train_loader = DataLoader(train_dp_list,
                          batch_sampler=BatchSamplerSimilarLength(dataset=train_dp_list,
                                                                  batch_size=BATCH_SIZE),
                          collate_fn=collate_batch)
valid_loader = DataLoader(train_dp_list,
                          batch_sampler=BatchSamplerSimilarLength(dataset=valid_dp_list,
                                                                  batch_size=BATCH_SIZE,
                                                                  shuffle=False),
                          collate_fn=collate_batch)
test_loader = DataLoader(train_dp_list,
                         batch_sampler=BatchSamplerSimilarLength(dataset=test_dp_list,
                                                                 batch_size=BATCH_SIZE,
                                                                 shuffle=False),
                         collate_fn=collate_batch)
```
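In case it helps to see why the first DataLoader argument matters, here is a minimal plain-Python sketch of the hazard the question points at: a batch sampler built over one dataset yielding indices that are then used to index a different dataset. (`train_data`, `valid_data`, and `index_batches` are made-up stand-ins for the datapipes and `BatchSamplerSimilarLength`.)

```python
train_data = ["t0", "t1", "t2", "t3", "t4", "t5"]
valid_data = ["v0", "v1", "v2"]

def index_batches(sampler_dataset, batch_size):
    # Yields batches of indices over sampler_dataset, as a batch_sampler would.
    idxs = list(range(len(sampler_dataset)))
    for i in range(0, len(idxs), batch_size):
        yield idxs[i:i + batch_size]

# The sampler is built over valid_data, but the loader indexes train_data:
batches = [[train_data[i] for i in b] for b in index_batches(valid_data, 2)]
print(batches)  # → [['t0', 't1'], ['t2']], i.e. training samples, not validation ones
```

So if the loaders are meant to iterate over the validation and test sets, presumably `valid_dp_list` and `test_dp_list` should also be the first argument of the corresponding DataLoaders.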
More of an FYI: I tried to reproduce the L17 4_VAE_celeba-inspect notebook. When loading the dataset, I got the error "Unable to load CelebA dataset" with "BadZipFile: File is not a zip file". TorchVision issue #2262 identified the problem as exceeding the daily download quota on Google Drive, punted the issue back to the dataset authors, and was closed. A future version of TorchVision should give a more descriptive error message.
So, FYI to your students. The work-around is to...
Hi!
I'd like to ask: why should we use avgpool instead of maxpool?
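To make the question concrete, here is a tiny toy sketch (made-up numbers, plain Python) of what each pooling operation keeps from a window of activations:

```python
# Hypothetical activations in one pooling window (not from the lecture code).
window = [0.1, 0.9, 0.2, 0.0]

avg = sum(window) / len(window)  # every activation contributes to the summary
mx = max(window)                 # only the single strongest activation survives

print(avg)  # 0.3
print(mx)   # 0.9
```

Roughly, average pooling summarizes the whole window (e.g. global average pooling before a classifier head), while max pooling keeps only the strongest response; which one is preferable depends on the architecture being discussed in the lecture.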