- The KTH Dataset: Recognition of human actions (year 2004)
- The Weizmann Dataset (year 2005)
- HMDB: a large human motion database (year 2011)
- Moments in Time Dataset
- Charades Challenge (Recognize and locate activities taking place in a video)
- ActivityNet: A Large-Scale Video Benchmark for Human Activity Understanding
- Kinetics Dataset [paper link: The Kinetics Human Action Video Dataset]
- UCF101 (year 2012) [blog introduction][blog CSDN]
- (Kaggle) Violent and Normal Behavior Videos Dataset
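- (Bilibili) [ValseWebinar] Video Action Recognition
- (Bilibili) Artificial Intelligence | Skeleton-Based Action Recognition
- (Zhihu) An In-Depth Look at Computer Vision Techniques: Video Action Recognition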
- (github) Awesome Action Recognition
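- (CSDN blog) Introduction to the Kinetics-600 Dataset (Including ActivityNet)
- (CSDN blog) An In-Depth Look at Computer Vision Techniques: Video Action Recognition
- (CSDN blog) A Survey of Video Action Recognition and Detection: iDT, TSN, CNN-LSTM, C3D, CDC, R-C3D
- (CSDN blog) A Roundup of Action Recognition Datasets
- (CSDN blog) CVPR 2020 Paper Roundup: Action Recognition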
- KTH(ICPR2004) Recognizing human actions: a local SVM approach [paper link]
- Weizmann(ICCV2005) Actions as space-time shapes [paper link]
- UCF101(arxiv2012) UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild [arxiv link]
- Kinetics(arxiv2017) The Kinetics Human Action Video Dataset [arxiv link]
- EPIC-Kitchens(ECCV2018) Scaling Egocentric Vision: The EPIC-KITCHENS Dataset [project link]
- HACS(arxiv2019) HACS: Human Action Clips and Segments Dataset for Recognition and Temporal Localization [arxiv link]
- Moments-in-Time(TPAMI2019) Moments in Time Dataset: one million videos for event understanding [project link]
- FineGym(CVPR2020) FineGym: A Hierarchical Video Dataset for Fine-grained Action Understanding [project link]
- Histogram of oriented gradients HOG(CVPR2005) Histograms of Oriented Gradients for Human Detection [paper link]
- Space-time interest point detection(IJCV2005) On Space-Time Interest Points [paper link]
- Histogram of optical flow(CVPR2008) Learning Realistic Human Actions from Movies [paper link]
- Dense trajectory features DT(CVPR2011) Action Recognition by Dense Trajectories [paper link][project link][Codes|official C++]
- Dense trajectory features iDT(ICCV2013) Action Recognition with Improved Trajectories [paper link][CSDN blog1][CSDN blog2]
- RepresentationFlows(CVPR2019) Representation Flow for Action Recognition [arxiv link][project link][Codes|PyTorch(official)]
- Two-Stream(NIPS2014) Two-Stream Convolutional Networks for Action Recognition in Videos [arxiv link]
- two-stream+LSTM(CVPR2015) Long-term Recurrent Convolutional Networks for Visual Recognition and Description [arxiv link][project link][Codes|official]
- two-stream+LSTM(CVPR2015) Beyond short snippets: Deep networks for video classification [paper link]
- two-stream fusion(CVPR2016) Convolutional Two-Stream Network Fusion for Video Action Recognition [arxiv link][Codes|official Matlab MatConvNet]
- TSN(ECCV2016) Temporal Segment Networks: Towards Good Practices for Deep Action Recognition [arxiv link][project link][Codes|PyTorch(official)]
- Co-occurrence+LSTM(+pose)(AAAI2016) Co-occurrence Feature Learning for Skeleton based Action Recognition using Regularized Deep LSTM Networks [arxiv link]
- RNN-based(+pose)(ECCV2016) Online Human Action Detection using Joint Classification-Regression Recurrent Neural Networks [arxiv link]
- TSN-based improved 1(CVPR2017) Deep Local Video Feature for Action Recognition [arxiv link]
- ST+Attention+LSTM(+pose)(AAAI2017) An End-to-End Spatio-Temporal Attention Model for Human Action Recognition from Skeleton Data [arxiv link]
- TRN(TSN-based improved 2)(ECCV2018) Temporal Relational Reasoning in Videos [arxiv link]
- ST-GCN(+openpose)(AAAI2018) Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition [arxiv link]
- Dense dilated network(TIP2019) Dense Dilated Network for Video Action Recognition [paper link]
- C3D(ICCV2015) Learning Spatiotemporal Features with 3D Convolutional Networks [arxiv link][paper link][project link][Codes|official Caffe]
- 3D-ResNets(CVPR2018) Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet? [arxiv link][paper link][Codes|PyTorch(official)]
- I3D(DeepMind, uses Inception-V1)(CVPR2017) Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [arxiv link][Codes|TensorFlow(official)][Codes|PyTorch(unofficial v1)][Codes|PyTorch(unofficial v2)]
- T3D(CVPR2017) Temporal 3D ConvNets: New Architecture and Transfer Learning for Video Classification [arxiv link][Codes|official PyTorch]
- P3D(MSRA)(ICCV2017) Learning Spatio-Temporal Representation with Pseudo-3D Residual Networks [arxiv link][CSDN blog]
- TPC(based on CDC)(AAAI2018) Exploring Temporal Preservation Networks for Precise Temporal Action Localization [arxiv link]
- 3D-ResNets(arxiv2020) Would Mega-scale Datasets Further Enhance Spatiotemporal 3D CNNs? [arxiv link][Codes|PyTorch(official)]
- GAN-based(IJCAI2018) Exploiting Images for Video Recognition with Hierarchical Generative Adversarial Networks [arxiv link]