TY - GEN
T1 - Video classification using compacted dataset based on selected keyframe
AU - Rachmadi, Reza Fuad
AU - Uchimura, Keiichi
AU - Koutaki, Gou
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2017/2/8
Y1 - 2017/2/8
N2 - Shared human actions in the video are the biggest problem for video classification system. For example, long jump sports video will share a running action with the long jump or running sports video. In this paper, we present a video classification system by combining the keyframe extractor system and convolutional neural network (CNN) classifier. The visual attention modeling was used to build the keyframe extractor system and top k frames with the highest saliency value is chosen for the classification process. By using the top k keyframe with the highest saliency value, it may reduce the shared action of the video and makes the classifier easier to classify the video by using only the spatial features. The keyframe extracted from video summarization method was used for training process, which in our system proved very efficient and speed up the training process. As a result, our system is effective and the average accuracy is increased compared with the system without using the keyframe extractor system. Our proposed method also outperforms the system using video summarization method as keyframe extractor system by around 3%.
AB - Shared human actions in the video are the biggest problem for video classification system. For example, long jump sports video will share a running action with the long jump or running sports video. In this paper, we present a video classification system by combining the keyframe extractor system and convolutional neural network (CNN) classifier. The visual attention modeling was used to build the keyframe extractor system and top k frames with the highest saliency value is chosen for the classification process. By using the top k keyframe with the highest saliency value, it may reduce the shared action of the video and makes the classifier easier to classify the video by using only the spatial features. The keyframe extracted from video summarization method was used for training process, which in our system proved very efficient and speed up the training process. As a result, our system is effective and the average accuracy is increased compared with the system without using the keyframe extractor system. Our proposed method also outperforms the system using video summarization method as keyframe extractor system by around 3%.
UR - http://www.scopus.com/inward/record.url?scp=85015404790&partnerID=8YFLogxK
U2 - 10.1109/TENCON.2016.7848130
DO - 10.1109/TENCON.2016.7848130
M3 - Conference contribution
AN - SCOPUS:85015404790
T3 - IEEE Region 10 Annual International Conference, Proceedings/TENCON
SP - 873
EP - 878
BT - Proceedings of the 2016 IEEE Region 10 Conference, TENCON 2016
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2016 IEEE Region 10 Conference, TENCON 2016
Y2 - 22 November 2016 through 25 November 2016
ER -