The system according to the invention comprises a text-to-speech
conversion processing unit, and a phrase dictionary as well as a waveform
dictionary, connected independently from each other to the conversion
processing unit. The conversion processing unit is for converting any
Japanese text inputted from outside into speech. In the phrase
dictionary, voice-related terms representing the reproduced sounds of
actually recorded sounds, for example, notations of terms such as
onomatopoeic words, background sounds, lyrics, music titles, and so
forth, are previously registered. Further, in the waveform dictionary,
waveform data obtained from the actually recorded sounds, corresponding
to the voice-related terms, are previously registered. Furthermore, the
conversion processing unit is constituted such that as for a term in the
text matching the voice-related term registered in the phrase dictionary
upon correlation of the former with the latter, actually recorded speech
waveform data corresponding to the relevant voice-related term matching
the term in the text, registered in the waveform dictionary, is outputted
as a speech waveform of the term.