Voice conversion based on static speaker characteristics
Voice conversion has recently emerged as an interesting branch of speech processing that deals with the modification of a speaker's perceived identity. This technology has applications in speech recognition and in the entertainment and security industries. This paper provides a brief introduction to current voice conversion approaches and discusses the development of the PASS system, a parametric voice conversion algorithm based on static speaker characteristics. The system is easy to implement, requires no phonetic transcription of the speech data, and is shown to be valuable when very little training data is available. Particular attention is given to the pitch extraction subsystem, which uses a novel pitch determination algorithm to ensure robust estimation of pitch statistics.