Abstract
This paper describes a methodology for the statistical modeling of music works. Starting from either the representation of the symbolic score or the audio recording of a performance, a hidden Markov model is built to represent the corresponding music work. The model can be used to identify unknown recordings and to align them with the corresponding score. Experimental evaluation using a collection of classical music recordings showed that this approach is effective in terms of both identification and alignment. The methodology can be exploited as the core component for a set of tools aimed at accessing and actively listening to a music collection.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Bartsch, M.A., Wakefield, G.H.: Audio thumbnailing of popular music using chroma-based representations. IEEE Transactions on Multimedia 7(1), 96–104 (2005)
Cano, P., Loscos, A., Bonada, J.: Score-performance matching using HMMs. In: Proceedings of the International Computer Music Conference, pp. 441–444 (1999)
Choi, F.Y.Y.: Advances in domain independent linear text segmentation. In: Proceedings of the Conference on North American chapter of the Association for Computational Linguistics, pp. 26–33 (2000)
Cont, A.: Realtime audio to score alignment for polyphonic music instruments using sparse non-negative constraints and hierarchical HMMs. In: IEEE International Conference in Acoustics and Speech Signal Processing, pp. V245–V248 (2006)
Dannenberg, R.B., Mukaino, H.: New techniques for enhanced quality of computer accompaniment. In: Proceedings of the International Computer Music Conference, pp. 243–249 (1988)
Dixon, S., Widmer, G.: MATCH: a music alignment tool chest. In: Proceedings of the International Conference of Music Information Retrieval, pp. 492–497 (2005)
Doraisamy, S., Rüger, S.: A polyphonic music retrieval system using N-grams. In: Proceedings of the International Conference on Music Information Retrieval, pp. 204–209 (2004)
Durbin, R., Eddy, S., Krogh, A., Mitchison, G.: Biological Sequence Analysis. Cambridge University Press, Cambridge (2000)
Fujishima, T.: Realtime chord recognition of musical sound: a system using common Lisp music. In: Proceedings of the International Computer Music Conference, pp. 464–467 (1999)
Gómez, E., Herrera, P.: Estimating the tonality of polyphonic audio files: Cognitive versus machine learning modelling strategies. In: Proceedings of the International Conference on Music Information Retrieval, pp. 92–95 (2004)
Grubb, L., Dannenberg, R.B.: A stochastic method of tracking a vocal performer. In: Proceedings of the International Computer Music Conference, pp. 301–308 (1997)
Harte, C., Sandler, M., Abdallah, S., Gómez, E.: Symbolic representation of musical chords: a proposed syntax for text annotations. In: Proceedings of the International Conference on Music Information Retrieval, pp. 66–71 (2005)
Hu, N., Dannenberg, R.B., Tzanetakis, G.: Polyphonic audio matching and alignment for music retrieval. In: Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp. 185–188 (2003)
Herrera, P., Serrá, J., Gómez, E., Serra, X.: Chroma binary similarity and local alignment applied to cover song identification. IEEE Transactions on Audio, Speech, and Language Processing 16(6), 1138–1151 (2008)
Kurth, F., Müller, M.: Efficient index-based audio matching. IEEE Transactions on Audio, Speech, and Language Processing 16(2), 382–395 (2008)
Middleton, R.: Studying Popular Music. Open University Press, Philadelphia (2002)
Miotto, R., Orio, N.: Automatic identification of music works through audio matching. In: Proceedings of 11th European Conference on Digital Libraries, pp. 124–135 (2007)
Miotto, R., Orio, N.: A music identification system based on chroma indexing and statistical modeling. In: Proceedings of the International Conference on Music Information Retrieval, pp. 301–306 (2008)
Mohri, M.: Finite-state transducers in language and speech processing. Computational Linguistics 23(2), 269–311 (1997)
Montecchio, N., Orio, N.: Automatic alignment of music performances with scores aimed at educational applications. In: AXMEDIS 2008: Proceedings of the 2008 International Conference on Automated solutions for Cross Media Content and Multi-channel Distribution, Washington, DC, USA, pp. 17–24. IEEE Computer Society, Los Alamitos (2008)
Nattiez, J.-J.: Musicologie générale et sémiologie. Christian Bourgois éditeur, Paris, FR (1987)
Orio, N.: Alignment of performances with scores aimed at content-based music access and retrieval. In: Proceedings of European Conference on Digital Libraries, pp. 479–492 (2002)
Orio, N., Déchelle, F.: Score following using spectral analysis and hidden Markov models. In: Proceedings of the International Computer Music Conference, pp. 125–129 (2001)
Orio, N., Schwarz, D.: Alignment of monophonic and polyphonic music to a score. In: Proceedings of the International Computer Music Conference, pp. 129–132 (2001)
Orio, N., Zattra, L.: Audio matching for the philological analysis of electroacoustic music. In: Proceedings of the International Computer Music Conference, pp. 157–164 (2007)
Peeters, G.: Chroma-based estimation of musical key from audio-signal analysis. In: Proceedings of the International Conference of Music Information Retrieval, pp. 115–120 (2006)
Rabiner, L.R.: A tutorial on hidden Markov models and selected application. Proceedings of the IEEE 77(2), 257–286 (1989)
Raphael, C.: Automatic segmentation of acoustic musical signals using hidden Markov models. IEEE Transactions on Pattern Analysis and Machine Intelligence 21(4), 360–370 (1999)
Reynar, J.C.: Topic Segmentations: Algorithms and Applications. PhD Thesis, Computer and Information Science, University of Pennsylvania, USA (1998)
Shifrin, J., Pardo, B., Meek, C., Birmingham, W.: HMM-based musical query retrieval. In: Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, pp. 295–300 (2002)
Stammen, D.R., Pennycook, B.: Real-time recognition of melodic fragments using the dynamic timewarp algorithm. In: Proceedings of the International Computer Music Conference, pp. 232–235 (1993)
Turetsky, R.J., Ellis, D.P.W.: Ground-truth transcriptions of real music from force-aligned MIDI syntheses. In: Proceedings of the International Conference of Music Information Retrieval, pp. 135–141 (2003)
Typke, R., Wiering, F., Veltkamp, R.C.: A search method for notated polyphonic music with pitch and tempo fluctuations. In: Proceedings of the International Conference of Music Information Retrieval, pp. 281–288 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Miotto, R., Montecchio, N., Orio, N. (2010). Statistical Music Modeling Aimed at Identification and Alignment. In: Raś, Z.W., Wieczorkowska, A.A. (eds) Advances in Music Information Retrieval. Studies in Computational Intelligence, vol 274. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-11674-2_9
Download citation
DOI: https://doi.org/10.1007/978-3-642-11674-2_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-11673-5
Online ISBN: 978-3-642-11674-2
eBook Packages: EngineeringEngineering (R0)