Machine learning and deep learning have achieved many successes in problem-solving applications in many areas in recent years. Deep learning in which the Convolutionary Neural Network (CNN) have a powerful model to solve the problem of image recognition in decades of image recognition. With the internet and intelligent device growth speed up, video data on media, storage and cloud are explosive. The data information so enormous that video must be analyzed, translated and understood for many progress such as video processing, recommendation, content identification, filter content, etc. While normal method is not good for classifying action, video type and video analysis, the CNN model with a straight network can reverse the act into a simple process. CNN model research focuses on how to classify video or analyze video.

One of these is the C3D model that uses convolution 3D to obtain spatio-temporal data feature. Image below interpretation of the convolution operation 1d, 2d and 3d in the field of image processing.

Using the 3D convolutions in video, we can extract features from the video dataset. The model would perform better if the action was classified in combination with linear SVM.

With the purpose of the model, we can recognize the type of video. At next step, we will then process transcoding to get specific variable parameters to improve the quality of the video. As a result, we can profoundly change the bitrate of the video following each type of video.

Link project:
http://vlg.cs.dartmouth.edu/c3d/

Link video demo :  
https://www.youtube.com/watch?v=LslfD4V_bMs
https://www.youtube.com/watch?v=zT-cd4vrZfA

However, the model is not the only one can get benefit from this technology. There are also other ones which can also gets some advantage at some point. Assuming you have a decent amount of expertise in CNN model knowledge, the C3D model can be understood in logical manner.

Reference:

http://blog.qure.ai/notes/deep-learning-for-videos-action-recognition-review#singlestream
https://arxiv.org/pdf/1412.0767.pdf https://www.tensorflow.org/api_docs/python/tf/layers/conv3d
https://cs.stanford.edu/people/karpathy/deepvideo/
https://static.googleusercontent.com/media/research.google.com/en//pubs/archive/42455.pdf
https://code.google.com/archive/p/sports-1m-dataset/
http://crcv.ucf.edu/data/UCF101.php

Credit to: Van Hoang Cong (vanhc@uiza.io)