Factbites
 Where results make sense
About us   |   Why use us?   |   Reviews   |   PR   |   Contact us  

Topic: Speech compression


Related Topics

In the News (Wed 15 Feb 12)

  
  Speech Coding
Speech coding techniques can be broadly divided into two classes: waveform coding that aims at reproducing the speech waveform as faithfully as possible and vocoders that preserve only the spectral properties of speech in the encoded signal.
Speech coders attempt to minimize the bit rate for transmission or storage of the signal while maintaining required levels of speech quality, communication delay, and complexity of implementation (power consumption).
Speech quality is usually evaluated on a five-point scale, known as the mean-opinion score (MOS) scale, in speech quality testing---an average over a large number of speech data, speakers, and listeners.
cslu.cse.ogi.edu /HLTsurvey/ch10node4.html   (1978 words)

  
 Our site is for informational purposes on areas that where we know you'll learn everything you want to know on audio ...   (Site not responding. Last check: 2007-10-15)
GSM 6.10 is a fixed rate lossy compression algorithm that compresses data at a fixed rate of 13 Kbps.
Objective methods of speech codec performance is normally done by providing quantitative measures on how well the reconstructed voice signal matches the original by looking at the mean square error distortion (MSE), classical signal to noise ratio (SNR), and segmented signal to noise ratio (SSNR) [1].
Our site's content on speech compression so far gives wealth from a top notch topic on compression test with a state of the art resource on gprs gsm that is as good as it gets with information on audio player software.
speech.lselvon.com   (2283 words)

  
 Speech Compression
With current compression techniques (all of which are lossy), it is possible to reduce the rate to 8 kbps with almost no perceptible loss in quality.
The model says that the digital speech signal is the output of a digital filter (called the LPC filter) whose input is either a train of impulses or a white noise sequence.
Original (64000 bps) This is the original speech signal sampled at 8000 samples/second and u-law quantized at 8 bits/sample.
www.data-compression.com /speech.html   (940 words)

  
 DataCompression.info - Speech   (Site not responding. Last check: 2007-10-15)
Compression of human speech is a specialized form of audio compression.
A report on an audio compression algorithm that relies on compression of the waveform with Huffman compression of the residuals.
This is supposed to be a 2.4 Kbps LPC speech codec.
datacompression.info /Speech.shtml   (2722 words)

  
 Q3.1: Speech compression techniques   (Site not responding. Last check: 2007-10-15)
The aim of speech compression is to produce a compact representation of speech sounds such that when reconstructed it is perceived to be close to the original.
The standard reference point is toll quality speech, this is the same as what would be expected over a telephone line, for example, speech coded at 8 kHz using 8 bit ulaw coding and a maximum frequency of about 3.3 kHz.
At the decoder a regular set of pulses for voiced speech or white noise for unvoiced speech is passed through the linear filter and multiplied by the gain to produce the speech.
www.speech.cs.cmu.edu /comp.speech/Section3/Q3.1.html   (449 words)

  
 [No title]   (Site not responding. Last check: 2007-10-15)
In comparison to audio signals, speech signals can be characterized by a rather low analogue bandwidth and by particular model assumptions that may be used during the design of the coding algorithm.
For voiced speech sounds, the periodic character of the speech signal is modeled by a long-term synthesis filter, e.g.
Currently a normalization procedure for a new wideband speech and audio codec at bit rates of 16, 24 and 32 kbit/s is being pursued by the ITU-T study group 16 (Q.20/16).
www.ind.rwth-aachen.de /research/speech_coding.html   (3029 words)

  
 Towards Efficient and Scalable Speech Compression Schemes for Robust Speech RecognitionApplications   (Site not responding. Last check: 2007-10-15)
Compression rates and recognition performance for both a digit and an alphabet database are reported.
In this paper we study speech compression for distributed speech recognition, where speech is first acquired and then transmitted to a remote recognition engine.
The speech utterance was segmented using overlapping Hamming window of length 24 ms, with adjacent windows separated by 12 ms.
biron.usc.edu /~snaveen/icm/icm.html   (2582 words)

  
 Disaggregate Consulting   (Site not responding. Last check: 2007-10-15)
Disaggregate is pleased to annouce a new speech compression technology that offers remarkable fidelity, low-power operation, and the highest compression ratios of any known speech technology.
Today's state-of-the-art speech technologies capture the model of the vocal tract in mathematical form, and use that mathematical model to compress speech, recognize speech, produce speech from text, or perform voice biometrics (e.g., recoognizing the speaker).
Playback is the opposite of compression, with the added advantage that the exact air molecules from the original utterance are available, and the air pressure of the original compression can operate the HMM to reproduce the utterance.
www.disaggregate.com /talks/compression.html   (402 words)

  
 DDJ, December 1994: Digital Speech Compression
Speech signals, whose interesting frequencies go up to 4 kHz, are often sampled at 8 kHz and quantized on either a linear or logarithmic scale.
The linear-predictive short-term filter, which is the first stage of compression and the last during decompression, assumes the role of the vocal and nasal tract.
The decoder (the synthesis part) reconstructs the speech by passing the residual pulse through the long-term prediction filter, and passes the output of that through the short-term predictor.
kbs.cs.tu-berlin.de /~jutta/gsm/degener.htm   (3049 words)

  
 Home Page of Hee Thong How   (Site not responding. Last check: 2007-10-15)
The most recent standards are the G729 8 kbit/s speech codec with its reduced complexity version Annex A and silence compression scheme version Annex B. Most work on speech coding today is based on telephone-bandwidth speech, nominally limited to about 3.2 kHz and sampled at a rate of 8 kHz.
Wideband speech coding is of increasing interest today and is intended for speech or audio signals of 7 kHz, sampled at 16 kHz.
The naturalness of wideband speech is a significant feature for extended telecommunications processes, such as audio teleconferencing and program broadcasting.
www-mobile.ecs.soton.ac.uk /hth97r   (251 words)

  
 Library turns digital signal controllers to speech: News from Microchip Technology
Many design engineers are seeking to add speech compression to their bandwidth- or memory-constrained applications, but have opted for lower compression methodologies to avoid a stiff per-unit royalty for the intellectual property.
The speech encoder samples data at 8kHz, using either the dsPIC DSC's on-chip 12bit analogue-to-digital convertor or a 16bit input obtained from an external codec, via the on-chip codec interface port.
The library permits the compressed speech data to be stored in on-chip memory (Flash, EEPROM or RAM) or to an external Flash memory.
www.electronicstalk.com /news/ari/ari192.html   (697 words)

  
 DataCompression.info - Image Compression   (Site not responding. Last check: 2007-10-15)
Compression that is specifically oriented towards still images is given here.
Compression is one of them, and there is certainly a great deal to learn about compression and media formats at this conference.
By using data compression techniques, it is possible to remove some of the redundant information contained in images, requiring less storage space and less time to transmit.
www.datacompression.info /ImageCompression.shtml   (7441 words)

  
 Lossless audio compression   (Site not responding. Last check: 2007-10-15)
Compression of these 6 second samples will no-doubt produce different ratios then compressing the entire file, due to variations in the sound signal from moment to moment.
Note that the average file size and compression ratio is based on the best achievable after compressing each file in four ways and manually choosing the smallest file size - something which is not likely to be practical for everyday use.
Compressing one section of a test file (03CC.wav) he found that the Windows and Macintosh file sizes were 57.03% and 56.50% respectively.
www.firstpr.com.au /audiocomp/lossless   (7537 words)

  
 Speech Compression
SF6 Player - Plays sound files compressed by the "Sound Squeezer" program and has the ability to change the tone and tempo of sounds and music.
The coder has facilities to change the level of compression that allows you to receive a stream from 3 ?bit/sec (Telephone Quality Speech) up to 300 ?bit/sec (for high-quality music).
Recommended: Pentium III 600 that allows in real time to code 32 channels of telephone quality sound and in case of Stereo Music compression there are two high quality channels.
nest-soft.co.uk /speech_tech/speech_compression   (237 words)

  
 Speech Master v3.0   (Site not responding. Last check: 2007-10-15)
These features make Speech Master very useful tool for anyone who wants to organize their sound data in compressed archives.
That's why it is useful to distribute your sound data in compressed form and everybody could play them back with free VIMAS Speech Master trial version.
We want to note that the quality of the compressed speech is very high and the algorithms was specially developed for the high quality multimedia systems.
www.vimas.com /ve_spm.htm   (356 words)

  
 DataCompression.info - Audio   (Site not responding. Last check: 2007-10-15)
Speech compression can afford to deal with a much more limited range of frequencies and types of sound.
Like other lossless compression schemes the data reduction varies with the source, but it is generally between 25% and 50% for typical popular music and somewhat better than that for classical music and other sources with greater dynamic range.
When listening to recordings of human speech (like audio books, conference recordings,...) it can be useful to have the ability to adjust the playback speed without changing the pitch of the recording.
datacompression.info /Audio.shtml   (8871 words)

  
 The educational encyclopedia, acoustics, speech technology   (Site not responding. Last check: 2007-10-15)
Speech is normally transmitted to a listener's ears or to a microphone through the air
Speech coding demonstrations demonstrates the speech/music quality obtained after one stage of encoding and decoding through the specified speech codec
Speech compression how to reduce bandwith, the aim of speech compression is to produce a compact representation of speech sounds such that when reconstructed it is perceived to be close to the original
users.telenet.be /educypedia/electronics/acousticsspeech.htm   (294 words)

  
 2002-003: Joint Speech Compression and Automatic Speech Recognition for Wireless and Wireline Transmission   (Site not responding. Last check: 2007-10-15)
The proposed work is of importance to the design of systems in which speech acquisition is done using low-power, and possibly mobile, computers, while the more complex speech recognition task is performed at a remote server.
Typically, in compression, the most significant constraint is preserving the perceptual quality of the signal.
This approach becomes relevant when the difficult conditions under which the recognition has to operate require a significant amount of computation, which is beyond what would be feasible in a power-efficient portable device.
www.ucop.edu /research/micro/abstracts/2002/2002_003.html   (253 words)

  
 Speex: Home Page
Finally, Speex is part of the GNU Project and is available under the Xiph.org variant of the BSD license.
Speex is based on CELP and is designed to compress voice at bitrates ranging from 2 to 44 kbps.
Note that Speex has a number of features that aren’t in other codecs such as intensity stereo encoding, integration of multiple sampling rates in the same bitstream, and a VBR mode; see our comparison page for more.
www.speex.org   (537 words)

  
 comp.speech WWW site
This site is mirrored at several other WWW sites around the world (Australia, UK, Japan and USA) and the information is also available in a plain text format.
There are 250 comp.speech WWW pages and they include over 500 hyperlinks to speech technology web sites, ftp servers, mailing lists, and newsgroups.
The comp.speech ftp site is an excellent repository of speech technology information, software and resources.
fife.speech.cs.cmu.edu /comp.speech   (375 words)

  
 The GSM 06.10 lossy speech compression library and its applications
Microsoft's Audio Compression Manager includes a GSM 6.10 CODEC (in addition to those for ADPCM, IMA ADPCM, the DSP Group's TrueSpeech(TM), and a PCM converter).
It compreses speech at 5.6kbps using two 7-bit codebooks for unvoiced speech and one 9-bit codebook for voiced segments.
Rather than pointing to every speech processing gizmo in existence, this subtree explains principles and formats, and gives crucial software and theory references, for three general classes of speech codecs and a the most important standards.
kbs.cs.tu-berlin.de /~jutta/toast.html   (5675 words)

  
 LPC-10 speech coder software
When I modified this software, my intent was not to alter the speech processing algorithms of the original code at all, and I believe I have achieved that goal.
So, I have some knowledge about the details of the code, but I wasn't attempting to learn all of the speech processing algorithms as I was doing it.
If you want to know more about LPC-10's algorithms in particular, there are several research articles mentioned in the file FAQ in the distribution (the first half of this file is about the CELP 4800 bps standard -- the second half has references about LPC-10).
www.arl.wustl.edu /~jaf/lpc   (635 words)

  
 Compandent Inc. - Home
Compandent's world-class expertise in speech coding and savvy DSP expertise have taken the MELPe to the next level, the Compandent MELPe suite (a.k.a MELPe++ or MELPe-Plus-Plus).
Compandent's world-class expertise in speech coding algorithms has led to a new generation of proprietary low rate compression algorithms, The ComPacketer, that lead the state-of-the-art.
Among our areas of expertise are speech and audio processing, signal compression, DSP, telecommunications, algorithms, software, and implementation.
www.compandent.com   (621 words)

  
 G.729/G.729A Speech Compression
G.729 Annex A speech coder was developed for use in multimedia simultaneous voice and data applications like DSVD.
The input/output of this algorithm is 16 bit linear PCM samples that are converted from/to 8 kbps compressed data stream.
Common compressed speech frame stream interface to support systems with multiple speech coders (G.723, G.728, G.726 et al).
www.vocal.com /data_sheets/full/g729.html   (382 words)

  
 Ultradata Selects Sensory Speech Recognition Chip for Road Genie® Hand-Held Navigator
Sensory's RSC-4128 is the latest in the RSC line of speech micros, allowing manufacturers to replace their existing 8-bit microcontrollers with a voice-enabled solution.
Sensory, Inc., (www.sensoryinc.com) based in Santa Clara, CA, is the world leader in embedded speech technologies.
Sensory is a profitable private company offering a complete line of IC and software-only solutions for speech recognition, speech synthesis, speaker verification, music synthesis and more.
www.prweb.com /releases/2005/5/prwebxml239056.php   (602 words)

  
 Speech Compression based on exact modeling and Structured Total Least Norm optimization
Speech Compression based on exact modeling and Structured Total Least Norm optimization
We present a new speech coding algorithm, based on an all-pole model of the vocal tract.
The new algorithm is demonstrated on a speech signal s.
www.esat.kuleuven.ac.be /sista/yearreport97/node44.html   (190 words)

  
 Speech compression   (Site not responding. Last check: 2007-10-15)
These commercial codecs are suitable for use with HawkVoice or stand alone.
They are optimized for the compression of human speech.
You can listen to speech samples created with these codecs along with all the free HawkVoice codecs on the HawkVoice codecs page.
www.hawksoft.com /hawkvoice/commcodecs.shtml   (315 words)

  
 Real Time Speech Compression Software   (Site not responding. Last check: 2007-10-15)
There have been 321 successful downloads of the source code since November 15, 1996.
Compress either sound files from disk or real time (from /dev/dsp)
The compressed version of this file is 53,248 bytes!
www.lightlink.com /mhp/lpp   (349 words)

  
 Data Compression Pointers
Among my writings on data compression, the only one written in English is History of Data Compression in Japan, which is rather outdated.
JPEG-LS JPEG-LS, a proposed JPEG lossless/near-lossless image compression mode, is faster and in many cases compresses tighter than zlib-based PNG.
SPIHT Image Compression (Image Compression with Set Partitioning in Hierarchical Trees)...seems to have crashed.
oku.edu.mie-u.ac.jp /~okumura/compression.html   (1371 words)

Try your search on: Qwika (all wikis)

Factbites
  About us   |   Why use us?   |   Reviews   |   Press   |   Contact us  
Copyright © 2005-2007 www.factbites.com Usage implies agreement with terms.