Understanding and Predicting Interestingness of Videos

Understanding and Predicting Interestingness of VideosYu-Gang Jiang , Yanran Wang , Rui Feng , Hanfang Yang, Yingbin Zheng, Xiangyang Xue

School of Computer Science, Fudan University, Shanghai, ChinaAAAI 2013Bellevue, USA

Applications:• Web Video Search• Video Recommendation System

Related Work:• There is a few studies about predicting Aesthetics and

Interestingness of Images

Key Idea is building computational model to predict which video is more interesting, when given two videos.

Contributions:• Conducted a pilot study on video interestingness• Built two new datasets to support this study• Evaluated a large number of features and get interesting

observations

Can a computational model automatically analyze video contents and predict the interestingness of videos?

We conduct a pilot study on this problem, and demonstrates a simple method to identify more interesting videos.

The problem

Key Idea

Two New DatasetsFlickr Dataset:• Source: Flickr.com• Video Type: Consumer Videos• Video Number: 1200 • Categories: 15 (basketball, beach…)• Duration: 20 hrs in total• Label: Top 10% as interesting videos;

Bottom 10% as uninteresting

YouTube Dataset:• Source: YouTube.com• Video Type: Advertisements• Video Number: 420• Categories: 14 (food, drink…)• Duration: 4.2 hrs in total• Label: 10 human assessors to compare

video pairs

Prediction & EvaluationComputational Framework: • Aim: train a model to compare the interestingness of two videos

Feature:

Prediction:• Adopt Joachims’ Ranking SVM (Joachims 2003) to train prediction models• For both datasets, we use 2/3 of the videos for training and 1/3 for testing• Use Kernel-level Fusion & Equal Weights to fuse multiple features.

Evaluation：• Accuracy (the percentage of correctly ranked test video pairs)

Visual features

Audio features

High-level attribute features

Ranking SVM

resultsMulti-modal fusionVS.

Multi-modal feature extraction

Visual features Color Histogram SIFT HOG SSIM GIST

Audio features MFCC Spectrogram SIFT Audio-Six

High-level attribute features

Classemes Objectbank Style

ResultsVisual Feature Results:

• Overall the visual features achieve very impressive performance on both datasets• Among five features, SIFT and HOG are very effective, and their combination performs best

Audio Feature Results:

• The three audio features are effective and complementary. Comparing them gets best performance

Attribute Feature Results:

• Attribute features do not work as well as we expected. Especially style performs poorly. It is a very interesting observation since in the prediction of image interestingness, style is claimed effective

Visual+Audio+Attribute Fusion Results:

• Fusing visual and audio features leads to substantial performance gains with 2.6% increase on Flickr and 5.4% increase on YouTube. While adding Attribute features is not that effective

HOGSSI

M GIST

Color Hist

rogram

SIFT+H

OG+SSIM

SIFT+H

OG+GIST

SIFT+H

OG+Color

80 74.2

HOGSSI

M GIST

Color Hist

rogram

SIFT+H

OG+SSIM

SIFT+H

OG+GIST

SIFT+H

OG+Color

50556065707580 76.4

50556065707580

Classemes

Objectbank

Style+Classe

Classemes+

Objectbank

50607080

Classemes

Objectbank

Style+Classe

Classemes+

Objectbank

50607080

Visual(S

IFT+HOG)

Audio(MFCC+SS+Audio-Six)

Attribute(O

bjectbank+

Classeme)

Visual+Audio

Visual+Audio+Attrib

ute50607080

50607080

Flickr YouTube

Datasets are available at: www.yugangjiang.info/research/interestingness

76.6 68.074.567.0 67.1

65.764.874.7

64.5 56.8

71.778.676.6

2.6% 5.4%

ConclusionWe conducted a study on predicting video interestingness. We also built two new datasets. A great number of features have been evaluated, leading to interesting observations:• Visual and Audio features are effective in predicting video interestingness• A few features useful in image interestingness do not extend to video domain

(Style…)

Understanding and Predicting Interestingness of Videos

Documents

Predicting, speculation, and assessing

Understanding Rating Behaviour and Predicting Ratings by … · 2015. 11. 29. · Understanding Rating Behaviour and Predicting Ratings by Identifying Representative Users Rahul Kamath

Predicting the Future - english6uftm20102.wikispaces.comenglish6uftm20102.wikispaces.com/file/view/predicting+the+future.pdf · Predicting the Future. Horary Astrology. Chinese Astrology

Observational Methods for Predicting Embankment Settlement

Predicting consumer sentiments from online text

Predicting RNA Secondary Structures

#5 Predicting Machine Translation Quality

Artificial intelligence algorithm for predicting mortality

Predicting outcomes of rectus femoris transfer surgery

Changes and Bugs: Mining and Predicting Development Activitiesthomas-zimmermann.com/publications/files/zimmermann... · 2008. 9. 21. · 6 Predicting Defects for Subsystems 75

Understanding Understanding Source Code with …ckaestne/pdf/icse14_fmri.pdf · Understanding Understanding Source Code with Functional Magnetic Resonance Imaging Janet Siegmundˇ,

Predicting Site Response

Predicting Outcomes from Patient-Clinician Dialoguecis.eecs.qmul.ac.uk/201303_Kyoto_presentations/MatthewPurver... · Predicting Outcomes from Patient-Clinician Dialogue ... • Existing

The Interestingness of Images...The Interestingness of Images Michael Gygli1,2 Helmut Grabner1,2 Hayko Riemenschneider1 Fabian Nater2 Luc Van Gool1,2,3 1Computer Vision Laboratory

Predicting Lymph Node Metastasis Using Histopathological

Predicting system trustworthyness

Dynamic perfusion digital radiography for predicting pulmonary … · 2021. 2. 9. · RESEARCH Open Access Dynamic perfusion digital radiography for predicting pulmonary function

Usefulness of semiquantitative elastography in predicting

predicting preeclampsia.pdf

Predicting the winner of C.Y. award