A method and apparatus are provided for segmenting and summarizing a music
video (507) in a multimedia stream (505) using content analysis. A music
video (507) is segmented in a multimedia stream (505) by evaluating a
plurality of content features that are related to the multimedia stream.
The plurality of content features includes at least two of a face
presence feature; a videotext presence feature; a color histogram
feature; an audio feature; a camera cut feature; and an analysis of key
words obtained from a transcript of the music video (507). The
plurality of content features is processed using either a pattern recognition
engine (1000), such as a Bayesian Belief Network, or one or more video
segmentation rules (1115) to identify the music video (507) in the
multimedia stream (505). A chorus is detected in at least one music video
(507) using a transcript (T) of the music video (507) based upon a
repetition of words in the transcript. The extracted chorus may be
employed for the automatic generation of a summary of the music video
(507).
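
The repetition-based chorus detection described above can be illustrated with a minimal sketch. It assumes the transcript (T) is available as plain text with one lyric line per text line, and it treats the chorus as the longest run of consecutive lines that each recur elsewhere in the transcript. The function names `normalize` and `detect_chorus` and the `min_repeats` parameter are hypothetical and chosen only for illustration; the source does not specify a particular algorithm.

```python
import re
from collections import Counter


def normalize(line: str) -> str:
    """Lowercase a transcript line and drop punctuation so repeats match."""
    return " ".join(re.findall(r"[a-z0-9']+", line.lower()))


def detect_chorus(transcript: str, min_repeats: int = 2) -> list[str]:
    """Return the longest run of consecutive lines that each repeat.

    Assumption (not from the source): a line belongs to the chorus if its
    normalized form occurs at least `min_repeats` times in the transcript.
    """
    lines = [ln.strip() for ln in transcript.splitlines() if ln.strip()]
    keys = [normalize(ln) for ln in lines]
    counts = Counter(keys)

    best_start, best_len = 0, 0
    start = None
    # A trailing None acts as a sentinel that closes a run ending on the last line.
    for i, key in enumerate(keys + [None]):
        repeated = key is not None and counts[key] >= min_repeats
        if repeated and start is None:
            start = i
        elif not repeated and start is not None:
            if i - start > best_len:  # strict '>' keeps the earliest longest run
                best_start, best_len = start, i - start
            start = None
    return lines[best_start:best_start + best_len]
```

The returned lines could then serve as the text portion of a summary. One limitation of this sketch is that a chorus sung twice back to back would be reported as a single, doubled run; a fuller implementation would likely compare word sequences rather than whole lines.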