Musicology, Music Cognition and Musical Similarity
Download
Report
Transcript Musicology, Music Cognition and Musical Similarity
Melodic Features and
Retrieval
ISMIR Graduate School, Barcelona 2004
Musicology 3-4
Frans Wiering, ICS, Utrecht University
Outline
yesterday’s assignment
demo: MIR outside academia (7:20; 44:10)
one-dimensional melody retrieval
Gestalt view of melody
advanced melody retrieval
assignment
one-dimensional melody retrieval
common assumption is (was?) pitch-only retrieval is
sufficient
e.g. CCGGAAGGFFEEDDEC
mechanisms for fuzzy matching
variants
interval (distance between 2 pitches)
pitch-contour
same/up/down (Parson’s Code)
RURURDRDRDRDRUD
examples:
www.musipedia.com (Rainer Typke)
www.themefinder.org (CCARH)
Results from Musipedia
query is ranked 3
other hits are
very unlikely
unfortunately no
notation/sound
available
Haydn: evident
false positive
why?
Themefinder
Several 1-dimensional
search options, e.g.
pitch
interval
contour
rhythm
wildcards
each theme stored as a
number of strings
matching by regular
expressions
ca. 40.000 themes
Barlow and Morgenstern
(1948)
ESAC encodings
Lincoln, 16th Century Motet
(DARMS project)
results from Themefinder
Query: +m2 +M2 P1 -M2 -m2 -M2
Example from Byrd &
Crawford (2001)
other hits
not as far-fetched as
musipedia’s
different rhythm
different meter
still not very similar
is this what people have in
mind?
Nice one we’ve just discovered
www.tuneteller.com
Pitch-only search of
MIDI on the internet
many more MIR
systems in Rainer
Typke’s survey.
URL is in your
mailbox
Why pitch-only retrieval is unsatisfactory
information contribution of other 3 parameters
(estimate for Western music; Byrd & Crawford
2001)
pitch: 50%
rhythm: 40%
timbre + dynamics: 10%
melodic confounds (Selfridge-Field 1998):
rests
repeated notes
grace notes, ornamentation
Mozart example
Why pitch-only retrieval is unsatisfactory
information contribution of other 3 parameters
(estimate for Western music; Byrd & Crawford
2001)
pitch: 50%
rhythm: 40%
timbre + dynamics: 10%
melodic confounds (Selfridge-Field 1998):
rests
repeated notes
grace notes, ornamentation
Mozart example
Gestalt and melody
melody: coherent succession of pitches
coherence important for similarity: creates musical
meaning
from New Harvard Dictionary of Music
bottom-up (pitches and durations)
top-down: segmenting, Gestalt
Gestalt theory of perception
late 19th/early 20th century, Germany, later US
perception of wholes rather than parts
explanations: Gestalt principles of grouping
application in visual and musical domain
Low-level Gestalt principles
Snyder mentions:
proximity
similarity
duration
articulation
continuity
rhythmic
intervallic
melodic
these produce closure of
wholes
Example: Beethoven 5th
symphony: beginning 1st
movement
also illustrates high-level
principles
from Snyder (2001)
Low-level Gestalt principles
Snyder mentions:
proximity
similarity
duration
articulation
continuity
rhythmic
intervallic
melodic
these produce closure of
wholes
Example: Beethoven
also illustrates high-level
principles
from Snyder (2001)
High-level Gestalt principles
parallellism
very strong in Mozart, Ah
vous, second half of
melody
intensification
important organisational
principle in variations and
improvisations
Mozart’s last variation
from Snyder (2001)
Application in analysis and retrieval
Gestalt reduces memory
overload: we can ignore the
details
Analytical: Schering (1911)
14th century Italian songs
basic melodic shape
might be nice for retrieval
Problem with Gestalt
principles:
many different formulations
overlap; no rules for conflict
intuitive, cannot be
successfully formalized
from New Grove, Music analysis
The cognitive interpretation: chunking
what creates a boundary
interval leap
long duration
tonality (stable chords)
etc
Example of quantification: Melucci & Orio (2004)
using local boundary detection (Cambouropoulos 1997)
apply weight to intervals and durations
boundary after maximum
chunks forther processed for indexing
Organising chunks
STM problem: max. 5-7
different elements
very short span
solution: hierarchical
grouping
melody schemas
contours of melody
cf. Schering ex.
examples: axial, arch, gapfill
Mozart begins with gap-fill
next level: form
A-B-A
from Snyder (2001)
mental model of a song
Ah, vous dirai-je maman
melody level
A
A
B
analysis
chunk level
synthesis
phrase level
subchunk level
analysis: from ear to LTM
(sub) chunks created by similarity and
continuity
synthesis: from LTM to focus of attention
recollection
a lot of parallellism
boundaries by leaps and harmony
chunks may have a harmonic aspect too
(I, V, V->I)
using general characteristics of phrases and
chunks
performance
notes are reconstitued through some musical
grammar
Problems of melody retrieval
People remember high-level concepts, not notes
melodic variability and change
often confused with poor performance abilities
theme-intensive music (fugues) stimulate formation of such
concepts
transposition
augmentation/diminution
ornamentation
variation
compositional processes: inversion, retrograde
other factors
polyphony
harmony
Set-based approaches to melody retrieval
in polyphony
General idea:
Clausen, Engelbrecht, Meyer, Schmidt (2000):
compare note sets: find supersets, calculate distance
usually take rhythm and pitch into account
hopefully more tolerant agains some of the problems of melodic variety
PROMS
matches onset times; wildcards
elegant indexing
Lemström, Mäkinen, Ukkonen, Turkia (several articles, 2003-4)
C-Brahms
algorithms for matching line segments
P1: onsets
P2: partial match onset times
P3: common shared time
attention to time complexity
Typke, Veltkamp, Wiering (2003-2004)
Orpheus system
Earth Mover’s Distance
The Earth Mover’s
Distance (EMD)
measures similarity by
calculating a minimum
flow that would match
two set of weighted
points. One set emits
weight, the other one
receives weight
Y. Rubner (1998); S.
Cohen (1999)
Application to music
represent notes as
weighted point sets
in 2-dimensional
space (pitch, time)
weight represents
duration
other possibilities
contour/metric
position etc
other possible
application:
pitch event +
acoustic feature(s)?
here, the ‘earth’ is only moved along the temporal axis
Another example
interesting
properties
tolerant against
melodic
confounds
suitable for
polyphony
continuous
partial matching
disadvantage
triangle inequality
doesn’t hold
less suitable for
indexing:
after alignment, the ‘earth’ is moved both along the
temporal axis and along the pitch axis
Test on RISM A/II
Matching polyphony with the EMD
EMD’s partial matching property is essential
MIDI example used as query for RISM database
gross errors in playing are ironed out
Proportional Transportation Distance (PTD)
Giannopoulos &
Veltkamp (2002)
EMD, weigths of sets
normalised to 1
suitable for indexing
triangle inequality
holds
no partial matching
Test on RISM A/II
only hits with
approximately
same length
need 4 queries to
find all known
items
False positive (EMD)
problems arise when length and/or number of notes differs
considerably
Segmenting
overlapping segments of 69 consecutive notes
not musical units
search results are
combined
better Recall-Precision
averages
Example of new search
http://teuge.labs.cs.uu.nl/Rntt.cgi/mir/mir.cgi
Concluding remarks about melodic retrieval
lots of creativity go into melody; difficult to give rules
not a ‘basic musical structure’ (Temperley 2001)
essential to use multiple features
pitch, rhythm
harmony
segmentation
finding perceptually relevant chunks is not easy
finding complete melodies may be harder
arbitrary segments may also work
indexing strategies for melody
melodic change over time
several projects have tentative results for polyphony
gut feeling: false positives are big issue
notion of salience (Byrd and Crawford)