Too many parameters (fc layers) in both CNN encoder and RNN decoder, causing dramatic overfitting! #51

Open
mashijie1028 opened this issue Aug 18, 2021 · 2 comments

mashijie1028 commented Aug 18, 2021

There are too many fc layers in both the CNN encoder and the RNN decoder; one is enough. When I trained the CRNN, I got over 70% test accuracy with only one fc layer in each of the CNN and the LSTM (although there is still severe overfitting). As num_fc_layers increases, performance degrades.
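For illustration, a minimal sketch of an encoder whose head is a single fc layer (PyTorch; this is not the repo's actual code, and `CNNEncoder` / `embed_dim` are names I made up):

```python
import torch.nn as nn
import torchvision.models as models

# Sketch only: a CNN encoder with a single fc head. `embed_dim` is a
# hypothetical name for the per-frame embedding size fed to the LSTM.
class CNNEncoder(nn.Module):
    def __init__(self, embed_dim=512):
        super().__init__()
        resnet = models.resnet18(pretrained=True)
        # Keep the convolutional trunk + global average pool; drop ResNet's classifier.
        self.backbone = nn.Sequential(*list(resnet.children())[:-1])
        # One fc layer maps pooled features to the embedding - no fc1/fc2/fc3
        # stack, which is where most of the surplus parameters live.
        self.fc = nn.Linear(resnet.fc.in_features, embed_dim)

    def forward(self, x_seq):                        # x_seq: (batch, time, C, H, W)
        b, t = x_seq.shape[:2]
        feats = self.backbone(x_seq.flatten(0, 1))   # (b*t, 512, 1, 1)
        return self.fc(feats.flatten(1)).view(b, t, -1)
```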

Also, BatchNorm probably conflicts with dropout: dropout shifts the activation statistics that BN estimates during training, and BN already acts as a regularizer. Maybe no dropout is better.
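Under the same assumptions, the matching decoder side could look like this (again just a sketch, reusing the imports above; one fc layer and no dropout, and `num_classes=101` assumes UCF101):

```python
# Sketch only: an LSTM decoder with a single fc head and no dropout,
# leaving the BN layers in the ResNet trunk as the regularizer.
class RNNDecoder(nn.Module):
    def __init__(self, embed_dim=512, hidden_dim=256, num_classes=101):
        super().__init__()
        self.lstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True)
        self.fc = nn.Linear(hidden_dim, num_classes)  # the only fc layer

    def forward(self, x):            # x: (batch, time, embed_dim)
        out, _ = self.lstm(x)
        return self.fc(out[:, -1])   # classify from the last time step
```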

mashijie1028 commented Aug 18, 2021

I was wondering how you got 85.68% test accuracy with ResNet-152 + LSTM. Could you please share the hyper-parameters? Thanks!
@HHTseng

mashijie1028 (Author) commented

I used ResNet-18 (pretrained) + LSTM and got over 80% test accuracy, but only 40% test accuracy when training ResNet-18 + LSTM from scratch. It seems that pretraining the ResNet CNN encoder on ImageNet is essential.
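For reference, the only difference between the two runs is whether the trunk starts from ImageNet weights. A sketch of the switch (`build_encoder` is an illustrative helper, using the torchvision API that was current in 2021):

```python
import torch.nn as nn
import torchvision.models as models

def build_encoder(pretrained, embed_dim=512):
    # pretrained=True loads ImageNet weights; False gives random init.
    resnet = models.resnet18(pretrained=pretrained)
    backbone = nn.Sequential(*list(resnet.children())[:-1])  # drop classifier
    return nn.Sequential(backbone, nn.Flatten(1),
                         nn.Linear(resnet.fc.in_features, embed_dim))

encoder = build_encoder(pretrained=True)     # >80% test acc in my runs
# encoder = build_encoder(pretrained=False)  # only ~40% from scratch
```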
