skip to main content
research-article

Enriching media fragments with named entities for video classification

Published: 13 May 2013 Publication History

Abstract

With the steady increase of videos published on media sharing platforms such as Dailymotion and YouTube, more and more efforts are spent to automatically annotate and organize these videos. In this paper, we propose a framework for classifying video items using both textual features such as named entities extracted from subtitles, and temporal features such as the duration of the media fragments where particular entities are spotted. We implement four automatic machine learning algorithms for multiclass classification problems, namely Logistic Regression (LG), K-Nearest Neighbour (KNN), Naive Bayes (NB) and Support Vector Machine (SVM). We study the temporal distribution patterns of named entities extracted from 805 Dailymotion videos. The results show that the best performance using the entity distribution is obtained with KNN (overall accuracy of 46.58%) while the best performance using the temporal distribution of named entities for each type is obtained with SVM (overall accuracy of 43.60%). We conclude that this approach is promising for automatically classifying online videos.

References

[1]
L. Bentivogli, P. Forner, B. Magnini, and E. Pianta. Revising the wordnet domains hierarchy: semantics, coverage and balancing. In Workshop on Multilingual Linguistic Resources, 2004.
[2]
D. Brezeale and D. J. Cook. Automatic video classification: A survey of the literature. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews, 38(3):416--430, 2008.
[3]
K. Filippova and K. B. Hall. Improved video categorization from text metadata and user comments. In 34th International ACM SIGIR Conference on Research and development in Information Retrieval, pages 835--842, 2011.
[4]
B. Haslhofer, W. Jochum, R. King, C. Sadilek, and K. Schellner. The LEMO annotation framework: weaving multimedia annotations with the web. International Journal on Digital Libraries, 10(1):15--32, 2009.
[5]
C. Huang, T. Fu, and H. Chen. Text-based video content classification for online video-sharing sites. Journal of the American Society for Information Science and Technology, 61(5):891--906, 2010.
[6]
P. Katsiouli, V. Tsetsos, and S. Hadjiefthymiades. Semantic video classification based on subtitles and domain terminologies. In Workshop on Knowledge Acquisition from Multimedia Content (SAMT'07), 2007.
[7]
S. Kotsiantis, I. Zaharakis, and P. Pintelas. Supervised machine learning: A review of classification techniques. Frontiers in Artificial Intelligence and Applications, 160:3, 2007.
[8]
Y. Li, G. Rizzo, R. Troncy, M. Wald, and G. Wills. Creating enriched YouTube media fragments with NERD using timed-text. 11th International Semantic Web Conference, Demo Session, 2012.
[9]
Y. Li, M. Wald, T. Omitola, N. Shadbolt, and G. Wills. Synote: Weaving Media Fragments and Linked Data. In 5th International Workshop on Linked Data on the Web (LDOW'12), 2012.
[10]
A. Y. Ng. Feature selection, l1 vs. l2 regularization, and rotational invariance. In 21st International Conference on Machine learning, 2004.
[11]
J. Niebles, C.-W. Chen, and L. Fei-Fei. Modeling temporal structure of decomposable motion segments for activity classification. Computer Vision--ECCV 2010, pages 392--405, 2010.
[12]
G. Rizzo and R. Troncy. NERD: A Framework for Unifying Named Entity Recognition and Disambiguation Extraction Tools. In 13th Conference of the European Chapter of the Association for computational Linguistics (EACL'12), 2012.
[13]
B. Schopman, D. Brickly, L. Aroyo, C. Van Aart, V. Buser, R. Siebes, L. Nixon, L. Miller, V. Malaise, M. Minno, et al. Notube: making the web part of personalised tv, 2010.
[14]
T. Steiner. SemWebVid - Making Video a First Class Semantic Web Citizen and a First Class Web Bourgeois. In 9th International Semantic Web Conference (ISWC'10), Demo Session, 2010.
[15]
J. Waitelonis, N. Ludwig, and H. Sack. Use what you have: Yovisto video search engine takes a semantic turn. In 5th International Conference on Semantic and digital media technologies (SAMT'10), 2011.
[16]
J. R. Zhang, Y. Song, and T. Leung. Improving video classification via youtube video co-watch data. In Workshop on Social and behavioural networked media access, 2011.

Cited By

View all
  • (2023)A Smart Movie Suitability Rating System Based on SubtitleGazi Üniversitesi Fen Bilimleri Dergisi Part C: Tasarım ve Teknoloji10.29109/gujsc.114635211:1(252-262)Online publication date: 25-Mar-2023
  • (2019)Web video classification with visual and contextual semanticsInternational Journal of Communication Systems10.1002/dac.399432:13Online publication date: 23-Jun-2019
  • (2016)Machine-Crowd Annotation Workflow for Event Understanding Across Collections and DomainsProceedings of the 13th International Conference on The Semantic Web. Latest Advances and New Domains - Volume 967810.1007/978-3-319-34129-3_50(813-823)Online publication date: 29-May-2016
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
WWW '13 Companion: Proceedings of the 22nd International Conference on World Wide Web
May 2013
1636 pages
ISBN:9781450320382
DOI:10.1145/2487788

Sponsors

  • NICBR: Nucleo de Informatcao e Coordenacao do Ponto BR
  • CGIBR: Comite Gestor da Internet no Brazil

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 May 2013

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. concept extraction
  2. media annotation
  3. media fragment
  4. named entity extraction
  5. nerd
  6. video classification

Qualifiers

  • Research-article

Conference

WWW '13
Sponsor:
  • NICBR
  • CGIBR
WWW '13: 22nd International World Wide Web Conference
May 13 - 17, 2013
Rio de Janeiro, Brazil

Acceptance Rates

WWW '13 Companion Paper Acceptance Rate 831 of 1,250 submissions, 66%;
Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)3
Reflects downloads up to 23 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2023)A Smart Movie Suitability Rating System Based on SubtitleGazi Üniversitesi Fen Bilimleri Dergisi Part C: Tasarım ve Teknoloji10.29109/gujsc.114635211:1(252-262)Online publication date: 25-Mar-2023
  • (2019)Web video classification with visual and contextual semanticsInternational Journal of Communication Systems10.1002/dac.399432:13Online publication date: 23-Jun-2019
  • (2016)Machine-Crowd Annotation Workflow for Event Understanding Across Collections and DomainsProceedings of the 13th International Conference on The Semantic Web. Latest Advances and New Domains - Volume 967810.1007/978-3-319-34129-3_50(813-823)Online publication date: 29-May-2016
  • (2015)LinkedCultureProceedings of the 8th International Conference on Personalized Access to Cultural Heritage - Volume 135210.5555/3001323.3001332(37-40)Online publication date: 29-Mar-2015
  • (2015)The Concentric Nature of News Semantic SnapshotsProceedings of the 8th International Conference on Knowledge Capture10.1145/2815833.2815836(1-8)Online publication date: 7-Oct-2015
  • (2015)Delivering related web content synchronized to online television: The LinkedTV solution2015 IEEE 5th International Conference on Consumer Electronics - Berlin (ICCE-Berlin)10.1109/ICCE-Berlin.2015.7391270(323-327)Online publication date: Sep-2015
  • (2015)Generating Semantic Snapshots of Newscasts Using Entity ExpansionProceedings of the 15th International Conference on Engineering the Web in the Big Data Era - Volume 911410.1007/978-3-319-19890-3_26(410-419)Online publication date: 23-Jun-2015
  • (2014)Describing and contextualizing events in TV news showProceedings of the 23rd International Conference on World Wide Web10.1145/2567948.2579326(759-764)Online publication date: 7-Apr-2014
  • (2014)Survey of Semantic Media Annotation Tools for the Web: Towards New Media Applications with Linked MediaThe Semantic Web: ESWC 2014 Satellite Events10.1007/978-3-319-11955-7_9(100-114)Online publication date: 16-Oct-2014
  • (2014)A Companion Screen Application for TV Broadcasts Annotated with Linked Open DataThe Semantic Web: ESWC 2014 Satellite Events10.1007/978-3-319-11955-7_27(241-244)Online publication date: 16-Oct-2014

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media