Skip to main content
Log in

SportLight: statistically principled crowdsourcing method for sports highlight selection

  • Research Article
  • Published:
Journal of the Korean Statistical Society Aims and scope Submit manuscript

Abstract

Sports highlight selection has traditionally required expert opinions and manual labor of video editors. To automate this laborious task, crowdsourcing viewers’ live comments has recently emerged as a promising tool, which can remove the burden of extracting semantic information by computer vision. However, popular crowdsourcing methods based on peak-finding are sensitive to noise and may produce deviant highlights from the expert choice. To increase the accuracy of automated selection of sports highlight, we introduce a statistically sound crowdsourcing method, SportLight. In this work, we take a statistical approach that combines multiple hypothesis testing and \(\ell _1\)-trend filtering (fused lasso), supported by a computationally inexpensive algorithm. By analyzing 29 baseball games played in the 2016 and 2017 seasons, we demonstrate that our approach properly reduces the risk of false alarm and generates the results closer to expert-chosen highlights than that of the peak-finding method.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Notes

  1. Adding the \(\ell _1\) penalties to the Poisson log-likelihood function may be plausible, but the theory of Son and Lim (2019) only supports normal models, and our empirical experience favors the transformation approach.

References

  • Assfalg, J., Bertini, M., Colombo, C., Del Bimbo, A., & Nunziati, W. (2003). Semantic annotation of soccer videos: Automatic highlights identification. Computer Vision and Image Understanding, 92(2–3), 285–305.

    Article  Google Scholar 

  • Assfalg, J., Bertini, M., Del Bimbo, A., Nunziati, W., & Pala, P. (2002). Soccer highlights detection and recognition using HMMs. In Proc. 2002 IEEE international conference on multimedia and expo (ICME’02) (Vol. 1, pp. 825–828). IEEE.

  • Babaguchi, N., Kawai, Y., Ogura, T., & Kitahashi, T. (2004). Personalized abstraction of broadcasted American football video by highlight selection. IEEE Transactions on Multimedia, 6(4), 575–586.

    Article  Google Scholar 

  • Benjamini, Y., & Hochberg, Y. (1995). Controlling the false discovery rate: A practical and powerful approach to multiple testing. Journal of the Royal Statistical Society. Series B (Methodological), 57, 289–300.

    Article  MathSciNet  Google Scholar 

  • Bernstein, M.S., Little, G., Miller, R.C., Hartmann, B., Ackerman, M.S., Karger, D.R., Crowell, D., & Panovich, K. (2010). Soylent: A word processor with a crowd inside. In Proceedings of the 23nd annual ACM symposium on User interface software and technology (pp. 313–322). ACM.

  • Bettadapura, V., Pantofaru, C., & Essa, I. (2016). Leveraging contextual cues for generating basketball highlights. In: Proceedings of the 2016 ACM on Multimedia Conference, MM ’16 (pp. 908–917). ACM. https://doi.org/10.1145/2964284.2964286

  • Bland, J. M., & Altman, D. G. (1995). Multiple significance tests: The bonferroni method. BMJ, 310(6973), 170.

    Article  Google Scholar 

  • Chao, C.Y., Shih, H.C., & Huang, C.L. (2005). Semantics-based highlight extraction of soccer program using DBN. In Proc. 2005 IEEE international conference on acoustics, speech, and signal processing (ICASSP’05) (Vol. 2, p. ii-1057). IEEE.

  • Chu, W. T., & Chou, Y. C. (2017). On broadcasted game video analysis: Event detection, highlight detection, and highlight forecast. Multimedia Tools and Applications, 76(7), 9735–9758.

    Article  Google Scholar 

  • Cormen, T. H., Leiserson, C. E., Rivest, R. L., & Stein, C. (2009). Introduction to algorithms. Cambridge: MIT Press.

    MATH  Google Scholar 

  • D’Orazio, T., & Leo, M. (2010). A review of vision-based systems for soccer video analysis. Pattern Recognition, 43(8), 2911–2926.

    Article  Google Scholar 

  • Ha, S., Kim, D., & Lee, J. (2013). Crowdsourcing as a method for digital media interaction. In HCI 2013 (pp. 153–154). The HCI Society of Korea.

  • Ha, S., Kim, D., & Lee, J. (2013). Crowdsourcing as a method for indexing digital media. In CHI’13 extended abstracts on human factors in computing systems (pp. 931–936). ACM.

  • Hannon, J., McCarthy, K., Lynch, J., & Smyth, B. (2011). Personalized and automatic social summarization of events in video. In Proceedings of the 16th international conference on intelligent user interfaces (pp. 335–338). ACM

  • Hoefling, H. (2010). A path algorithm for the fused lasso signal approximator. Journal of Computational and Graphical Statistics, 19(4), 984–1006.

    Article  MathSciNet  Google Scholar 

  • Jacobson, V. (1988). Congestion avoidance and control. In ACM SIGCOMM computer communication review (Vol. 18, pp. 314–329). ACM.

  • Kapetanakis, A. (2018). IBM Watson: Inside the ’black box’. US Open News. Accessed 29 Apr 2019.

  • Liu, C., Huang, Q., Jiang, S., Xing, L., Ye, Q., & Gao, W. (2009). A framework for flexible summarization of racquet sports video using multiple modalities. Computer Vision and Image Understanding, 113(3), 415–424.

    Article  Google Scholar 

  • Marcus, A., Bernstein, M.S., Badar, O., Karger, D.R., Madden, S., & Miller, R.C. (2011). Twitinfo: Aggregating and visualizing microblogs for event exploration. In Proceedings of the SIGCHI conference on Human factors in computing systems (pp. 227–236). ACM.

  • Metulini, R. (2017). Filtering procedures for sensor data in basketball. Statistica & Applicazioni, 15(2), 133–150.

  • Moyer, S. (2013). In America’s pastime, baseball players pass a lot of time. The Wall Street Journal. https://www.wsj.com/articles/SB10001424127887323740804578597932341903720.

  • Paxson, V., Allman, M., Chu, J., & Sargent, M. (2011). Computing TCP’s retransmission timer. RFC 6298.

  • Qian, X., Wang, H., Liu, G., & Hou, X. (2012). HMM based soccer video event detection using enhanced mid-level semantic. Multimedia Tools and Applications, 60(1), 233–255.

    Article  Google Scholar 

  • Quinn, A.J., & Bederson, B.B. (2011) Human computation: A survey and taxonomy of a growing field. In Proceedings of the SIGCHI conference on human factors in computing systems (pp. 1403–1412). ACM.

  • Rui, Y., Gupta, A., & Acero, A. (2000). Automatically extracting highlights for TV baseball programs. In Proceedings of the eighth ACM international conference on Multimedia (pp. 105–115). ACM.

  • Shih, H. C. (2018). A survey of content-aware video analysis for sports. IEEE Transactions on Circuits and Systems for Video Technology, 28(5), 1212–1231.

    Article  MathSciNet  Google Scholar 

  • Son, W., & Lim, J. (2019). Modified path algorithm of fused lasso signal approximator for consistent recovery of change points. Journal of Statistical Planning and Inference, 200, 223–238.

    Article  MathSciNet  Google Scholar 

  • Tang, A., & Boring, S. (2012). #EpicPlay: Crowd-sourcing sports video highlights. In Proceedings of the SIGCHI conference on human factors in computing systems (pp. 1569–1572). ACM.

  • Thompson, C. (2010). What is I.B.M.’s Watson? The New York Times Magazine. https://www.nytimes.com/2010/06/20/magazine/20Computer-t.html. Accessed 29 Apr 2019.

  • Tibshirani, R., Saunders, M., Rosset, S., Zhu, J., & Knight, K. (2005). Sparsity and smoothness via the fused lasso. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 67(1), 91–108.

    Article  MathSciNet  Google Scholar 

  • Von Ahn, L., & Dabbish, L. (2004). Labeling images with a computer game. In Proceedings of the SIGCHI conference on Human factors in computing systems (pp. 319–326). ACM.

  • Von Ahn, L., Maurer, B., McMillen, C., Abraham, D., & Blum, M. (2008). recaptcha: Human-based character recognition via web security measures. Science, 321(5895), 1465–1468.

    Article  MathSciNet  Google Scholar 

  • Xiong, Z., Radhakrishnan, R., & Divakaran, A. (2004). Method and system for extracting sports highlights from audio signals. US Patent App. 10/374,017

  • Xiong, Z., Radhakrishnan, R., Divakaran, A., & Huang, T.S. (2003). Audio events detection based highlights extraction from baseball, golf and soccer games in a unified framework. In Proc. 2003 IEEE International Conference on Multimedia and Expo (ICME’03) (Vol. 3, p. III-401). IEEE.

Download references

Acknowledgements

Won Son was supported by the National Research Foundation of Korea(NRF) grant funded by the Korea government (MSIT) (No. 2020R1F1A1A01051039). Joong-Ho Won was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2019R1A2C1007126).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Joong-Ho Won.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Jung, J., Ha, S., Son, W. et al. SportLight: statistically principled crowdsourcing method for sports highlight selection. J. Korean Stat. Soc. 51, 127–148 (2022). https://doi.org/10.1007/s42952-021-00128-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s42952-021-00128-2

Keywords

Navigation