slides - Department of Computer Science, Hong Kong Baptist
Download
Report
Transcript slides - Department of Computer Science, Hong Kong Baptist
Content-based
Music Information Retrieval
Student: DENG Jie. Supervisor: Prof. LEUNG, Clement
Department of Computer Science
Hong Kong Baptist University
March, 2010
1
Outline
What will be Covered:
Introduction
A Brief Review of Music
MIR in the Real World and Challenges
Current Content-based MIR Key Techniques
Evaluation of MIR
Conclusion and Future Work
2
Introduction
Music Information Retrieval (MIR) is the interdisciplinary
science of retrieving information from music.
– Mainly based on three-filed subjects: traditional
information retrieval, musicology and digital audio.
Content-based MIR is the science of extracting features
from musical content, such as melody, rhythm and tempo
and so on to facilitate tasks such as analysis and music
retrieval.
Aim:
– To better understand “music” in the music work
– To really search music by “music”
3
Motivation - Music Discovery
Music Channels
Chart Shows
Record store
Gigs
+
Radio
Mate’s
recommendation
4
Music IR Application
Digital Music Libraries / Sound Archives
–
–
Seeking for content-based access music libraries
Combined with metadata search of existing catalogues
Music Education
–
Voice or instrumental teaching
Music Related Legal and Copyright
–
Is the creative content of this music work based on
something for which others hold the rights?
Musicology
–
–
–
Is this piece of music work similar to any other works?
Dose any part of this piece closely resemble any part of
any other works?
Is this piece of music work is based on others?
5
A Brief Review of Music
Music Concepts
Three basic features of a musical sound
–
–
–
Pitch
Intensity / Dynamics
Timbre / Tone color
There are many other terms describing music
–
–
–
–
Tempo
Tonality
Time Signature
Key Signature
6
A Brief Review of Music
Music Characteristics
Music can be defined as the art of disposing and producing
sounds and silences in time
–
Has horizontal and vertical dimensions
The main dimensions of music can be used for music
retrieval (Reference Nicola Orio)
–
–
–
–
–
–
–
Timbre : Quality of the produced sound
Orchestration : Sources of sound production
Acoustics : Quality of the recorded sound
Rhythm : Patterns of sound onsets
Melody : Sequence of notes
Harmony : Sequence of chords
Structure : Organization of the musical work
7
A Brief Review of Music
Music Representation
–
–
–
–
Visual (musical scores, manuscripts)
Aural (digital music)
Text
Hybrid (visual representation of an audio music file )
__________________________________________________________________________________
Example: Visual Representation
Common
Music
Notation
Tablature
E---------------0---------3-3------3--1-1-------------3-1-1---------------B---1-------------3-2-----2-2------2--3-3-----3-2---2---3-3------------1--G---0--0--0h1-------0-----0-0------0--2-2---2---0-0-----2-2------------2--D---2--2------------------------------0---0-------------0----0-1-2-3-3---3A-3-------------------0-0-----0--0--------------0-------------------------E-------------0------------------------------------------------------------
8
MIR in the Real World
There are mainly three-category USERS in MIR
“Amateur”
“Professional”
Librarians
Just about
anyone!
Publishers
“Academic”
Musicologists
Educators
Producers
Performers
Composers
Lawyers ...
Vast numbers
Very many
Significant
numbers
9
MIR in the Real World
Common Music Data and Format
Audio recordings
–
Sampled sound
–
Wave, MP3, AAC, etc.
Symbolic recordings
–
Abstract musical instructions, MusicXML
–
Scores, MIDI, Humdrum, etc.
10
MIR in the Real World
Overview of some existing music search systems
Search by music related metadata: (artists, albums, tracks, music
reviews, new release, etc.) Yahoo! Music and Allmusic are the examples of
this search type
Search by music lyrics: Lyrics.com and SongLyrics.com
Music Media Management and Track Identification: Identify metadata for
music tracks, for example Gracenote and MusicIP
Recommend similar music: by mining some music feature elements
(melody, rhythm, tone color, etc) to recommend user some similar music
Recommend personalized music: by mining some users’ information to
recommend them some their favorite music
11
Challenges in MIR
Began in the 1950’s, still an emerging discipline
Subjectivity and Versioning
Many levels of music knowledge
Lack of bibliographic control and data quality
________________________________________________________________________
MIR Pipeline
Hearing
Representation
Signal Processing
Understanding
Analysis
Machine Learning
Reacting
Interaction
Human Computer Interaction 12
A Simplified MIR Map
This very schematic diagram highlights trends
Extracted
or produced
information
Actions
External
data
integration of audio visual, symbolic and textual data
13
Basic Steps of Content-based MIR
Representation of music contents
– Features: melody, rhythms, etc.
Feature extraction from music data
Feature indexing
Query interface
Matching query features against the feature
index
14
Content-based MIR
Representation of Music Content
– Most music features used to represent music are
always melody.
– Rhythm feature only consider the rhythm
omitting the melody.
– Melody contour method uses three characters to
express the contour of melody.
15
Content-based MIR
Feature extraction from music data
– There are two category algorithms: time domain
(Autocorrelation function, Average magnitude
difference function and Simple inverse filter
tracking) and frequency domain models
(Spectrum and Cepstrum)
– Common extracted tools in the following:
• Short-term Fourier Transform features (FFT)
• Mel-Frequency Cepstral Coefficients (MFCC)
• Daubechies Wavelet Coefficient Histogram (DWCH)
– Pitch is the main feature extracted in practice
16
Content-based MIR
Feature Indexing
– Index terms: play a similar role of words in textual
documents.
– Sequence matching techniques: consider both the
query and the documents as sequences of symbols
and model the possible difference between them.
– Geometric methods: cope with polyphonic scores and
also exploit the properties of continuous distance
measures.
– Based on the above methods, there are mainly three
category music search on the melody feature.
• Melodic retrieval based on index terms (N-grams)
• Melodic retrieval based on sequence matching
• Melodic retrieval based on geometric methods
17
Content-based MIR
Query Interface
– Query by text (keywords: album, artist, track, etc.)
– Query by aural (singing or humming)
• Wave input (sing the whole or part of the songs)
• Music notes segmentation
• Thematic melodies are extracted, translated into text
representations of intervals, pith, and harmony
• Comparison procedure
– Query by tapping
• Wave input by tapping
• Compute the duration of each note
• Similarity comparison
18
Content-based MIR
Matching Query Features against the Feature Index
–
–
–
–
–
Approximate/Partial matching
Similarity measure (MFCC, GMM, KNN)
Precision: how many of the answers are in fact correct
Recall: how many of the correct answers are in fact retrieved
Relevance feedback
Vector space model
– Documents and queries are presented by vectors
– Each element in a vector is determined by an indexing scheme (Ngrams or others)
– The value of each element is determined by a weight scheme
– The similarity between document di and query qj :
19
Evaluation of MIR
The community has established an array of software
tools to support this work
– see http://music-ir.org/evaluation
In 2004, Audio Description Contest first attempted
to build comparative benchmark of MIR algorithms.
Downie has already given us the foundations and
future of the scientific evaluation of MIR systems.
Traditional information retrieval evaluation can also
be adopted in MIR, for example precision and recall
measures.
20
Conclusion
Music is a complicated art form of information and requires
special retrieval systems
MIR technology is improving, but the real application is still lacking
Basic music concepts and characteristics
Basic steps and models of MIR
Current Content-based MIR Key Techniques
Scientific evaluation of MIR
___________________________________________________________
Future Work
Mining the semantic information in multimedia, especially in digital
audio music, and then propose a comprehensive and adaptive method
to automatically analysis and retrieve the high level semantic
information of music, for example, emotion, mood, and style, etc.
21
Related Researches and Projects
ISMIR since 2000
– International Symposium on Music Information
Retrieval
WOCMAT since 2005
– Workshop On Computer Music and Audio Technology
Digital archive application
– Data mining in digital music archive
Free music audio, sound processing tools and musicrelated visualization and mining tools
– http://www.music-ir.org/evaluation/tools.html
Music IR evaluation since 2005
– http://www.music-ir.org/mirexwiki/index.php/Main_Page
– Test collection: music documents, query sets, and
judgment
– Major handle: copyright issue
22
Reference
•
•
•
•
•
•
•
•
•
Michael S. LEW, Nicu Sebe. Content-Based Music Information Retrieval: Current
Directions and Feature Challenges. Proceedings of the IEEE, April 2008
Nicola Orio. Music Retrieval: A Tutorial and Review. Foundations and Trends in
Information Retrieval, Volume1, Issue 1, Pages 1-96, 2006.
J. T. Foote, "An Overview of Audio Information Retrieval." In ACMSpringer Multimedia Systems, vol. 7 no. 1, pp. 2-11, ACM Press/Springer-Verlag,
January 1999
Remco C. Veltkamp, Frans Wiering, Rainer Typke. Content Based Music Retrieval.
In B. Furht (Ed.), Encyclopedia of Multimedia. Springer, 2006.
Giovanna Neve, Nicola Orio: A Comparison of Melodic Segmentation Techniques
for Music Information Retrieval. ECDL 2005: 49-56.
Hwei-Jen Lin, Hung-Hsuan Wu. Efficient geometric measure of music similarity.
Information Processing Letters, Volume 109, Issue2, Page 116-120, 2008.
Iman S. H. Suyoto, Alexandra L. Uitdenbogerd, and Falk Scholer. Searching
Musical Audio Using Symbolic Queries.IEEE. Transactions onAudio, Speech and
Language Processing, 16(2):372–381, 2008.
The Scientific Evaluation of Music Information. Retrieval Systems: Foundations
and Future. Computer Music Journal, Computer Music Journal, 28:2, pp. 12–23,
Summer 2004.
Michael Fingerhut. Real music libraries in the virtual future: for an integrated
view of music and music information. Digitale bibliotheken voor muziek, 2005.
23
Thank you!
Questions?
24