Page 1 |
Save page Remove page | Previous | 1 of 141 | Next |
|
small (250x250 max)
medium (500x500 max)
large ( > 500x500)
Full Resolution
All (PDF)
|
This page
All
Subset |
DIGITAL SIGNAL PROCESSING TECHNIQUES FOR
MUSIC STRUCTURE ANALYSIS
by
Yu Shiu
A Dissertation Presented to the
FACULTY OF THE GRADUATE SCHOOL
UNIVERSITY OF SOUTHERN CALIFORNIA
In Partial Fulfillment of the
Requirements for the Degree
DOCTOR OF PHILOSOPHY
(ELECTRICAL ENGINEERING)
December 2007
Copyright 2007 Yu Shiu
Object Description
| Title | Digital signal processing techniques for music structure analysis |
| Author | Shiu, Yu |
| Author email | atoultaro@gmail.com |
| Degree | Doctor of Philosophy |
| Document type | Dissertation |
| Degree program | Electrical Engineering |
| School | Viterbi School of Engineering |
| Date defended/completed | 2007-08-20 |
| Date submitted | 2007 |
| Restricted until | Unrestricted |
| Date published | 2007-09-24 |
| Advisor (committee chair) | Kuo, C.-C. Jay |
| Advisor (committee member) |
Narayanan, Shrikanth Chew, Elaine |
| Abstract | Automatic music structure analysis from audio signals is an interesting topic that receives much attention these days. Its objective is to find the music structure by decomposing the music audio signals into sections and detect the repetitive parts. The technique will benefit music data analysis, indexing, retrieval and management. In this research, a three-level framework of music structure analysis is proposed. The first level is the beat level. Musical audio signals are analyzed via tempo analysis and the beat is derived as the basic temporal unit for each music piece. Then, feature vectors are extracted for each basic unit. The second level is the measure level, a similarity matrix between measures can be constructed based on multiple feature vectors in one measure. Then, the third level is the structure level, whose elements, for example, in a pop or rock song consists of the repetitive parts such as verses and choruses and the non-repetitive parts such as intro, outro and bridge. A technique based on dynamic programming is proposed to search similar parts of a song. With post-processing, the musical sections of the song can be extracted and their boundaries are estimated.; Many digital signal processing (DSP) techniques are proposed to address low-level music signal processing problems in the thesis. They are used to analyze the characteristics of musical audio signals. Specifically, techniques based on the phase-locked-loop (PLL), Kalman filter and the Hidden Markov Model (HMM) are developed for musical beat tracking. The beat locations are estimated on-line for the first two techniques while off-line for the last technique. To further tackle the incorrect measurements of beats, an enhanced probabilistic data association (PDA) that considers both information of prediction residual and music onsets' intensities is applied to original Kalman filter. On the other hand, for HMM-based musical beat tracking, a special state space is built to model the beats' periodic progression and Viterbi algorithm is used to estimate the beats' locations by decoding the musical audio signal into a sequence of beat states and non-beat states. Moreover, dynamic time warping (DTW) is used to calculate the optimized distance between two segments of music signals and thus helps building the measure-level similarity matrix. Finally, the measure-level similarity matrix is analyzed and repetitive parts of a song such as verses and choruses are identified via dynamic-programming-based technique. These are pioneering efforts in the music signal processing field, which appears to be a new frontier in digital signal processing. |
| Keyword | musical information retrieval; music segmentation; dynamic programming; Kalman filter; hidden Markov model |
| Language | English |
| Part of collection | University of Southern California dissertations and theses |
| Publisher (of the original version) | University of Southern California |
| Place of publication (of the original version) | Los Angeles, California |
| Publisher (of the digital version) | University of Southern California. Libraries |
| Type | texts |
| Legacy record ID | usctheses-m826 |
| Rights | Shiu, Yu |
| Repository name | Libraries, University of Southern California |
| Repository address | Los Angeles, California |
| Repository email | http://www.usc.edu/isd/libraries/services/ask_a_librarian/email/ |
| Filename | etd-Shiu-20070924 |
| Archival file | uscthesesreloadpub_Volume14/etd-Shiu-20070924.pdf |
Description
| Title | Page 1 |
| Full text | DIGITAL SIGNAL PROCESSING TECHNIQUES FOR MUSIC STRUCTURE ANALYSIS by Yu Shiu A Dissertation Presented to the FACULTY OF THE GRADUATE SCHOOL UNIVERSITY OF SOUTHERN CALIFORNIA In Partial Fulfillment of the Requirements for the Degree DOCTOR OF PHILOSOPHY (ELECTRICAL ENGINEERING) December 2007 Copyright 2007 Yu Shiu |
Comments
Post a Comment for Page 1

