Factbites
 Where results make sense
About us   |   Why use us?   |   Reviews   |   PR   |   Contact us  

Topic: PSOLA


Related Topics

  
  A new approach to the evaluation of vocal effort by PSOLA method
PSOLA is a method used in voice synthesis to create speech material while retaining a good level of naturalness.
The PSOLA method, in fact, simulates the natural concatenation between successive elementary units by overlapping and adding them, but the synthetic sequences do not sound natural if the overlap is too heavy (about 80% overlap).
This may be attributed to some limitations of the PSOLA method, which is sensitive to the accuracy of the pitch marks, and which cannot compensate for too large gaps between the sequences to process.
www.essex.ac.uk /web-sls/papers/00-01/00-01.html   (4329 words)

  
 Changing Pitch with PSOLA for Voice Conversion   (Site not responding. Last check: 2007-10-17)
PSOLA (“Pitch-Synchronous Overlap and Add”) is a method used to manipulate the pitch of a speech signal to match it to that of the target speaker.
PSOLA deals with diphones, which are the units of speech that extend from the middle of one region of steady-state sound to the middle of the next; thus, they represent transitions between speech sounds.
The original PSOLA is now often referred to as the TD-PSOLA, for “time domain.” An alternative is referred to as the LP-PSOLA.
cnx.rice.edu /content/m12474/latest   (324 words)

  
 [No title]
PSOLA works directly on the signal waveform, so it is computationally inexpensive and does not lose any detail of the signal.
Two main advantages of the PSOLA method are preservation of phase even when the length of the sound is modified and preservation of the spectral envelope (formant positions) even when pitch is shifted.
PSOLA was found adequate to correct the prosody by changing pitch evolution and duration and energy of phonemes.
www.cnmat.berkeley.edu /SDIF/ICMC2000/results/CD/software/IRCAM/RodetCIM2000.rtf   (5083 words)

  
 Discussion
The basis of this subjective impression may be related to the observation in the present study that PSOLA tended to produce both the best and the poorest speech in the experiment.
This was true independent of processing method (although PSOLA produced slightly more degradation than HYBRID LPC for the adult talker) and independent of direction of F0 shift.
While the Hybrid LPC method we've described is not overall better at preserving speech intelligibility than the PSOLA method, it may produce slightly better sounding speech, and seems to have advantages for some talkers and/or when F0 is increased.
www.asel.udel.edu /speech/reports/mohonk/node10.html   (545 words)

  
 SPRUCE
Examples of low-level synthesisers are the Holmes parallel formant device, the Klatt hybrid parallel and cascade formant device, and the PSOLA concatenated waveform system developed by the Centre National d'Etude des Télécommunications.
This is a diphone-based concatenated waveform synthesiser requiring for its input an allophone string indexed with duration and fundamental frequency for each individual segment.
The low-level PSOLA system consists of two parts: a set of recorded and marked-up diphones and a set of algorithms for concatenating these and altering their overall durations and pitch.
www.cs.bris.ac.uk /~eric/research/spruce97.html   (3376 words)

  
 Search the lexicon   (Site not responding. Last check: 2007-10-17)
PHONETICS: The PSOLA algorithm (Pitch-Synchronous-Overlap-Add) is used for the manipulation (F0 and duration) of the speech signal as well as for speech synthesis.
The PSOLA manipulations are applied directly to the waveform.
First, the fundamental frequency is detected by a separate algorithm and a window is placed around every pulse.
www2.let.uu.nl /UiL-OTS/Lexicon/zoek.pl?lemma=PSOLA&lemmacode=1268   (125 words)
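
The windowing and overlap-add steps described in this entry can be sketched as follows. This is a minimal illustration, not code from the cited lexicon: the pitch marks are assumed to be already detected by a separate algorithm, and the function names (`hann`, `psola_pitch_shift`) and the `factor` parameter are hypothetical. Each grain is a Hann window two local pitch periods long, centred on a pitch mark, and the grains are re-spaced to change the pitch while the duration stays the same.

```python
import math

def hann(n):
    """Symmetric Hann window of length n (n >= 2)."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * i / (n - 1)) for i in range(n)]

def psola_pitch_shift(signal, marks, factor):
    """Time-domain overlap-add sketch: factor > 1 raises pitch, < 1 lowers it.

    signal: list of float samples
    marks:  increasing sample indices of pitch pulses (at least two,
            assumed to come from a separate pitch detector)
    factor: hypothetical pitch-scaling parameter
    """
    out = [0.0] * len(signal)
    t = float(marks[0])  # position of the current synthesis mark
    while t <= marks[-1]:
        # Reuse the analysis grain whose mark is nearest to the synthesis mark.
        k = min(range(len(marks)), key=lambda i: abs(marks[i] - t))
        # Local pitch period, taken from the neighbouring marks.
        if k + 1 < len(marks):
            period = marks[k + 1] - marks[k]
        else:
            period = marks[k] - marks[k - 1]
        window = hann(2 * period + 1)  # two pitch periods, centred on the mark
        center = int(round(t))
        for j, w in enumerate(window):
            src = marks[k] - period + j
            dst = center - period + j
            if 0 <= src < len(signal) and 0 <= dst < len(out):
                out[dst] += w * signal[src]
        # Closer synthesis marks give a higher pitch; duration is unchanged.
        t += period / factor
    return out
```

With `factor` equal to 1 and evenly spaced marks, the overlapping Hann windows sum to one, so the region between the first and last marks is reproduced unchanged. That makes a convenient sanity check before trying other factors.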

  
 [No title]
Signal generation is implemented according to the phrase control file, which describes the phrase as a sequence of allophone code names with assigned duration, energy and fundamental frequency values.
To transform the base allophones to required prosodic values we use procedures that are close to TD PSOLA technology.
After any PSOLA algorithm is applied, the energy of the resulting acoustic signal changes, and we need to normalize it to some value.
www.dialog-21.ru /Archive/2003/Krivnova.htm   (2470 words)
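
The energy normalization mentioned in this entry could look roughly like the sketch below. The cited paper does not say which energy measure or target value is used, so root-mean-square (RMS) level and the `target_rms` parameter are assumptions made for illustration, and the function names are hypothetical.

```python
import math

def rms(samples):
    """Root-mean-square level of a non-empty list of samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))

def normalize_energy(samples, target_rms):
    """Scale samples so that their RMS level equals target_rms.

    target_rms is an assumed parameter; the source only says the
    energy is normalized "to some value".
    """
    current = rms(samples)
    if current == 0.0:
        return list(samples)  # silence cannot be rescaled
    gain = target_rms / current
    return [s * gain for s in samples]
```

A single gain per segment keeps the waveform shape intact, which suits a method that works directly on the waveform.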

  
 [music-dsp] pitch modification: PSOLA technique   (Site not responding. Last check: 2007-10-17)
Both are aimed at high-quality, independent modification of pitch or time scale.
I saw that a lot of people know the phase vocoder as a more complex technique that gives better results than generic OLA algorithms.
The PSOLA -Pitch Synchronous Overlap-and-Add- (and variations FD-PSOLA -Freq.
shoko.calarts.edu /pipermail/music-dsp/2001-June/010286.html   (300 words)

  
 Methods, Techniques, and Algorithms
The PSOLA (Pitch Synchronous Overlap Add) method was originally developed at France Telecom (CNET).
There are several versions of the PSOLA algorithm and all of them work in essence the same way.
Other variations of PSOLA, Frequency Domain PSOLA (FD-PSOLA) and Linear-Predictive PSOLA (LP-PSOLA), are theoretically more appropriate approaches for pitch-scale modifications because they provide independent control over the spectral envelope of the synthesis signal (Moulines et al.
www.acoustics.hut.fi /~slemmett/dippa/chap5.html   (6161 words)

  
 A Mandarin Text-to-Speech System   (Site not responding. Last check: 2007-10-17)
In TA, statistical model based method is first employed to automatically tag the input text to obtain the word sequence and the associated part-of-speech (POS) sequence.
In PIG, a four-layer recurrent neural network (RNN) is employed to generate some prosodic information including the pitch contour, energy level, initial duration and final duration of syllables as well as the inter-syllable pause duration.
Lastly, in PSOLA, the basic waveform sequence is modified using the prosodic information to generate output synthetic speech.
rocling.iis.sinica.edu.tw /CLCLP/Vol1-1/a3.htm   (248 words)

  
 CEC-Conference: The International Electroacoustic Community Dis   (Site not responding. Last check: 2007-10-17)
PSOLA (Pitch Synchronous Overlap Add) pitch/time shifting and also would
Csound or SuperCollider had a PSOLA objects, that would be work (but they
Praat has PSOLA and works quite nicely, but since it can't import breakpoint
alcor.concordia.ca /~kaustin/cecconference/current/1163.html   (294 words)

  
 Changing Pitch with PSOLA for Voice Conversion   (Site not responding. Last check: 2007-10-17)
Using PSOLA to change the pitch of a speech signal without changing the voice quality.
Gina Upperman, "Changing Pitch with PSOLA for Voice Conversion," Connexions, December 17, 2004, http://cnx.rice.edu/content/m12474/1.3/.
Upperman, G. Changing Pitch with PSOLA for Voice Conversion.
cnx.rice.edu /content/m12474/latest/content_info   (140 words)

  
 Department for phonetics and methods of teaching foreign languages, SPbGU - RUSVOX: Catalogue, vol. 2
It is supplemented with a list of rules for production of theoretically possible but exceptionally rare and theoretically impossible diphones.
Following the requirements of the PSOLA technology, diphones were selected from logatoms pronounced by a professional speaker with so-called "normative Saint Petersburg pronunciation".
The system is implemented in C and is integrated into SIC architecture for use under UNIX.
schools.keldysh.ru /uvk1838/Sciper/volume2/speesynt/rusvox.htm   (246 words)

  
 Electronic Equipment - time stretching for dictation - granular synthesis
PSOLA is in essence a subset of granular synthesis.
High Physics I think that this task is beyond you.
What you are trying to do is a mid-level DSP problem.
www.electronic1.net /detail-2815975.html   (2074 words)

  
 Abstracts   (Site not responding. Last check: 2007-10-17)
The implementation is based on resynthesis using PSOLA algorithm.
The generation program itself is written in C++ and is embedded in the xwaves speech analysis environment.
Different approaches of how to apply the algorithm to the selected units are discussed and details of the implementation are described.
www.ims.uni-stuttgart.de /~moehler/abstracts.html   (612 words)

  
 ICMC00-K   (Site not responding. Last check: 2007-10-17)
It intentionally integrates artistic considerations with research and engineering matters, thus giving a complete picture of a concrete collaboration in the context of the creation of electronic music.
The second stage places PSOLA markers in the harmonic parts by a novel two-step algorithm.
The synthesis algorithm allows various transformations of the analysis sound of a single voice by the introduction of stochastic as well as deterministic variations.
recherche.ircam.fr /equipes/analyse-synthese/peeters/ARTICLES/Schnell_Peeters_2000_ICMC_OperaK.html   (269 words)

  
 Publication list of Perceptual Computing Group (Tetsunori Kobayashi lab.): Publication 4057   (Site not responding. Last check: 2007-10-17)
A low-band spectrum envelope reconstruction method was tested to see if it could improve the sound quality of F0 modified speech with the PSOLA (Pitch Synchronous OverLap Add) method.
In the conventional PSOLA method, the extracted spectrum envelope using a Hanning window with two-pitch-period length had no reliable information in the band of frequencies lower than the original F0.
This problem causes sound degradation of the F0 modified speech when the F0 is shifted downward.
www.pcl.cs.waseda.ac.jp /publications/data/robots/4057.htm   (176 words)

  
 PROSODY 2000 ABSTRACTS
A couple of such tools are presented: delexicalization by scrambling a speech synthesizer, and extraction of the prosodic properties of a speech signal by replacing each pitch period with a standard signal that retains only its amplitude and periodicity.
A text-to-speech (TTS) system for the Polish language is described, based on the concatenation of diphones using the Time Domain PSOLA technique, with appropriate prosody control.
Results of the listening tests of the synthesised speech will be presented, with a comparison of intrinsic prosody, prosody copied from natural speech and the (local and global) prosody model.
www.staff.amu.edu.pl /~fonetyka/abstract.html   (17261 words)

  
 Festival Speech Synthesis System - 21 Diphone synthesizer
Waveform files may be in any form, headered or unheadered, as long as every file is the same type and the format is supported by the speech tools wave reading functions.
These may be standard linear PCM waveform files in the case of PSOLA or LPC coefficients and residual when using the residual LPC synthesizer.
Pitch mark files consist of a simple list of positions in milliseconds (plus places after the point), in order, one per line, for each pitch mark in the file.
www.raimokoski.com /docs/festival-1.4.2/festival_21.html   (1936 words)
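
A pitch mark file of the kind described in this entry (one position in milliseconds per line, in order) could be read as in the sketch below. This is not Festival's own reader: the function names, the strict ordering check and the `sample_rate` parameter are assumptions made for illustration.

```python
def parse_pitch_marks(text):
    """Parse pitch mark positions in milliseconds, one per line.

    Blank lines are skipped. Raises ValueError if a line is not a
    number or if the positions are not strictly increasing.
    """
    marks = [float(line) for line in text.splitlines() if line.strip()]
    for earlier, later in zip(marks, marks[1:]):
        if later <= earlier:
            raise ValueError("pitch marks must be in increasing order")
    return marks

def marks_to_samples(marks_ms, sample_rate):
    """Convert millisecond positions to the nearest sample indices."""
    return [int(round(ms * sample_rate / 1000.0)) for ms in marks_ms]
```

For example, `marks_to_samples(parse_pitch_marks("5.0\n12.5\n"), 16000)` maps the two marks to sample indices 80 and 200, which is the form a time-domain PSOLA routine would typically work with.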

  
 CEC-Conference: The International Electroacoustic Community Dis
In reply to: Andreas Bergsland: "SV: PSOLA with complex envelopes?"
Next in thread: Bret Battey: "Re: SV: PSOLA with complex envelopes?"
Reply: Jean-Baptiste Thiebaut: "Re: SV: PSOLA with complex envelopes?"
alcor.concordia.ca /~kaustin/cecconference/current/1165.html   (348 words)

  
 Lineage 2 Orphus Forums -> Whos The Zerg?   (Site not responding. Last check: 2007-10-17)
Im going to sleep till siege, which is in the next 8 + hours, so dont expect much of an answer here from me till then.
Yet an other day RK/Thor/FU decided to "go xp" in DVC, although they know they been kicked out of it many times...
Lol chin talking about "bad trainers" you should teach Oks and Shortbus' golem some more things cuz they ended up getting all of us(i mean us and your own people) killed on 3rd bridge, after karik corner.
www.l2orphus.com /forum/index.php?showtopic=34420   (1782 words)

  
 Lineage 2 Orphus Forums > Whos The Zerg?   (Site not responding. Last check: 2007-10-17)
Aug 22 2004, 12:09 PM (Psola @ Aug 22 2004, 12:04 PM)
Aug 22 2004, 12:21 PM (Psola @ Aug 22 2004, 12:01 PM)
Aug 22 2004, 12:26 PM (Psola @ Aug 22 2004, 12:16 PM)
www.l2orphus.com /forum/lofiversion/index.php/t34420.html   (4129 words)

  
 Emotion Recognition and Synthesis System on Speech   (Site not responding. Last check: 2007-10-17)
It realizes emotion recognition and synthesis just through easy linear operation using the relation information.
In the system, the pitch contour is expressed by the model proposed by Fujisaki (7 parameters) and the power envelope is approximated by 5 line segments (11 parameters), and PSOLA is applied to synthesize the speech.
The relation information was verified to be significant and from the result of the experiments, the system was able to recognize and synthesize emotional content in speech as subjects did.
csdl2.computer.org /persagen/DLAbsToc.jsp?resourcePath=/dl/proceedings/&toc=comp/proceedings/icmcs/1999/0253/01/0253toc.xml&DOI=10.1109/MMCS.1999.779310   (236 words)

  
 [No title]   (Site not responding. Last check: 2007-10-17)
Speech Signals Stretched by Phase-Vocoder Techniques With/Without Phase Offset Correction and PSOLA
Sound files are in a WAV format (sampling rate 16kHz, Linear coding)
The PSOLA speech files have been graciously provided by Vincent Colotte
www.loria.fr /~jdm/PhaseVocoder   (61 words)


Copyright © 2005-2007 www.factbites.com Usage implies agreement with terms.