Speech encoding - CompWisdom
About us  |  Why use us?  |  Press  |  Contact us

 

Topic: Speech encoding


  
 Method and apparatus for speech encoding, speech decoding, and speech post processing - Patent 5651092
The speech analysis means calculates and outputs a value of power of the input speech, which is taken by locating the center of the analysis window at the center of the frame every time, as the power of the frame.
The speech analysis means calculates and outputs a value of power of the input speech as the power of the analysis frame concerned.
The input speech 4 is input into the speech analysis means 6 through the line 101.
http://www.freepatentsonline.com/5651092.html   (7985 words)
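
The frame-power idea in the excerpt above (an analysis window whose center sits at the center of each frame) can be illustrated with a short sketch. This is not the patent's implementation: the rectangular window, the mean-square definition of power, and the names `frame_power`, `frame_len` and `window_len` are all assumptions made for illustration.

```python
def frame_power(samples, frame_index, frame_len, window_len):
    """Mean-square power of a window centered on the given frame.

    Assumptions (not from the patent text): a rectangular window and
    power defined as the average of squared sample values.
    """
    # Center of the frame, in samples.
    center = frame_index * frame_len + frame_len // 2
    # Place the analysis window so its center coincides with the frame center,
    # clipping at the edges of the signal.
    start = max(0, center - window_len // 2)
    end = min(len(samples), center + (window_len - window_len // 2))
    window = samples[start:end]
    if not window:
        return 0.0
    return sum(s * s for s in window) / len(window)


# Two frames of 8 samples each: amplitude 1.0, then amplitude 3.0.
signal = [1.0] * 8 + [3.0] * 8
powers = [frame_power(signal, i, frame_len=8, window_len=4) for i in range(2)]
```

Because the window is re-centered for every frame, each frame's power depends only on samples around its own midpoint, regardless of how the window length compares with the frame length.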

  
 Parametric Encoding of Speech
Speech processing methodologies utilised in hearing prostheses are likely to be similar (if not identical) to algorithms and models utilised in speech synthesis and so it should be useful to examine the intelligibility of various synthesiser configurations in a systematic fashion.
The pooled bark-scaled channel vocoded speech intelligibilities for the 0.75 bark condition are not significantly different from the natural speech condition for all phonetic classes except "consonants" (the larger number of tokens for all consonants makes a difference of 4 percentage points significant for this comparison).
It is obviously desirable in the development of speech processing algorithms for hearing prostheses to utilise optimal models of speech.
http://www.ling.mq.edu.au/rmannell/research/tahaaci90/index.html   (5590 words)

  
 3.1.6 Speech Recognition for Encoding Digital Transmission
Speech recognition scores were computed for each speech segment under each of the 3 conditions.
For example, one might envision scoring speech recognizers and human listeners on identical speech-to-text transcription tasks, and then computing the correlation in performance.
This work was designed to be a first step in exploring the feasibility and applicability of using automated speech recognition technology to model human perception of communication channel quality.
http://www.itl.nist.gov/div898/pubs/ar/ar1999/node11.html   (276 words)

  
 RFC 3557 (rfc3557) - RTP Payload Format for European Telecommunications St
Encoding considerations : This type is defined for transfer via RTP [RFC3550] as described in Sections 3 and 4 of RFC 3557.
The remote device processes the speech, compresses the data, and adds error protection to the bitstream in a manner optimal for speech recognition.
However, the voice codecs typically employed in mobile devices were designed to optimize audible voice quality and not speech recognition accuracy, and using these codecs with speech recognizers can result in poor recognition performance.
http://www.faqs.org/rfcs/rfc3557.html   (2702 words)

  
 Efficient Scalable Encoding for Distributed Speech Recognition - Srinivasamurthy, Ortega, Narayanan (ResearchIndex)
This enables a low complexity client, which does not have the computational and memory resources to host a complex speech recognizer, to make use of distributed resources to provide speech recognition services to the user.
1 A two-stage speech recognition method with an error correcti..
2: distributed speech recognition; front-end feature extraction algorithm; compress..
http://citeseer.ist.psu.edu/570487.html   (609 words)

  
 Speech and Channel Coding
The Class Ib bits (together with the encoded Class Ia bits) are encoded using convolutional encoding.
The exact algorithms used differ for speech and for different data rates.
As a result, the channel encoded bit sequence is now 378+78=456 bits long.
http://www.eecg.toronto.edu/~nazizi/gsm/coding/index.html   (1077 words)
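
The 378+78=456 figure in the excerpt above can be checked with a small bit-budget calculation. The excerpt gives only the final sum. The per-class bit counts, the parity and tail bits, and the rate-1/2 code used below are the commonly cited GSM full-rate speech values and should be read as assumptions, not as content of the linked page.

```python
# Assumed GSM full-rate speech frame layout (20 ms, 260 bits in total).
CLASS_IA_BITS = 50    # most sensitive bits, protected by a parity check
CLASS_IB_BITS = 132   # protected by the convolutional code only
CLASS_II_BITS = 78    # sent without protection
PARITY_BITS = 3       # parity check bits added to Class Ia
TAIL_BITS = 4         # tail bits that flush the convolutional encoder
OUTPUT_BITS_PER_INPUT_BIT = 2  # rate-1/2 convolutional code


def channel_coded_length():
    """Return (convolutionally coded bits, total bits per frame)."""
    protected_input = CLASS_IA_BITS + PARITY_BITS + CLASS_IB_BITS + TAIL_BITS
    coded = protected_input * OUTPUT_BITS_PER_INPUT_BIT
    return coded, coded + CLASS_II_BITS


coded_bits, total_bits = channel_coded_length()
print(coded_bits, total_bits)
```

Under these assumptions, 189 protected input bits become 378 coded bits, and appending the 78 unprotected Class II bits gives the 456-bit frame quoted above.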

  
 RFC3557 RTP Payload Format for ES 201 108 Distributed Speech Recognition Encoding
The remote device processes the speech, compresses the data, and adds error protection to the bitstream in a manner optimal for speech recognition.
Encoding considerations : This type is defined for transfer via RTP [RFC3550] as described in Sections 3 and 4 of RFC 3557.
However, the voice codecs typically employed in mobile devices were designed to optimize audible voice quality and not speech recognition accuracy, and using these codecs with speech recognizers can result in poor recognition performance.
http://leadercomm.com/rfc/rfc3557.htm   (2637 words)

  
 Speech encoding - Wikipedia, the free encyclopedia
The Speex project is an attempt to create a free software speech coder, unencumbered by patent restrictions.
The A-law algorithm and the Mu-law algorithm are used in nearly all land-line long distance telephone communications.
Speech coding is the compression of speech (into a code) for transmission with speech codecs that use audio signal processing and speech processing techniques.
http://www.wikipedia.org/wiki/Speech_coding   (439 words)
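
As a rough illustration of the Mu-law algorithm mentioned in the excerpt above, the sketch below implements the continuous companding curve with mu = 255. It is a minimal sketch only: it omits the 8-bit segmented quantization used in real telephony codecs, and the function names `mu_law_compress` and `mu_law_expand` are hypothetical.

```python
import math

MU = 255.0  # value commonly used for mu-law telephony; assumed here


def mu_law_compress(x, mu=MU):
    """Map a sample in [-1, 1] through the mu-law compression curve."""
    sign = -1.0 if x < 0 else 1.0
    return sign * math.log1p(mu * abs(x)) / math.log1p(mu)


def mu_law_expand(y, mu=MU):
    """Inverse of mu_law_compress for y in [-1, 1]."""
    sign = -1.0 if y < 0 else 1.0
    # expm1(|y| * ln(1 + mu)) equals (1 + mu) ** |y| - 1
    return sign * math.expm1(abs(y) * math.log1p(mu)) / mu


quiet = mu_law_compress(0.01)    # small inputs are boosted strongly
restored = mu_law_expand(quiet)  # expanding recovers the original value
```

The curve spends more of the output range on quiet samples, which is why a companded 8-bit code can serve speech better than 8-bit linear quantization.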

  
 Speech encoding method and speech encoding system - US Patent 6581031
Thus, the encoding of input speech signal is completed.
For example, the method of learning the codebook is described in Linde et al., "An Algorithm for Vector Quantization Design", IEEE Trans.
Multistage low bit-rate CELP speech coder with switching code books depending on degree of pitch periodicity
http://www.patentstorm.us/patents/6581031.html   (6160 words)

  
 Pseudo-random sequencing for speech predictive encoding communications system - United States Patent 3,997,729
The 2:1 compression ratio is achieved by applying to each of the n channels the predictive encoding algorithm called a zero-order predictor, well known in the art, and described above.
The disclosed system would be operative to provide an efficient use of transmission capacity where a small percentage of the input trunks contained digital data.
In this manner, a number of talkers greater than the number of available channels may be serviced by sharing the channels on a talkspurt interpolated basis.
http://xrint.com/patents/us/3997729   (8852 words)
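
The zero-order predictor named in the excerpt above can be sketched in its generic textbook form: predict that each sample equals the last transmitted one, and transmit only when the prediction fails. This is not the patent's system. The function names, the `tolerance` parameter and the (index, value) output format are assumptions for illustration.

```python
def zop_encode(samples, tolerance=0):
    """Keep only samples that the zero-order predictor fails to predict.

    The predictor assumes each sample equals the last transmitted value.
    Returns a list of (index, value) pairs for the unpredicted samples.
    """
    transmitted = []
    held = None
    for index, sample in enumerate(samples):
        if held is None or abs(sample - held) > tolerance:
            transmitted.append((index, sample))
            held = sample
    return transmitted


def zop_decode(transmitted, length):
    """Rebuild the sequence by holding the last received value."""
    updates = dict(transmitted)
    held = None
    output = []
    for index in range(length):
        if index in updates:
            held = updates[index]
        output.append(held)
    return output


original = [5, 5, 5, 7, 7, 2]
sent = zop_encode(original)
rebuilt = zop_decode(sent, len(original))
```

In this example only three of six samples are transmitted. The saving in a real system depends on how often consecutive samples repeat and on the cost of signalling which samples were sent.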

  
 Progress Report No.20 -- Short Report & Work-in-progress 4.
A multimodal speech signal is extremely robust and informative and provides information that perceivers are able to exploit during perceptual analysis.
These results have implications for current theories of speech perception and spoken language processing.
Multimodal Encoding of Speech in Memory: A First Report
http://www.indiana.edu/~srlweb/publication/short204.html   (122 words)

  
 Publications
Ballard, K.J., Barlow, J.A., and Robin, D.A. (2001) The underlying nature of apraxia of speech: A critical evaluation of Varley and Whiteside’s dual route speech encoding hypothesis.
Robin, D.A., Tomblin, B., Kearney, A., and Hug, L.N. (1989) Auditory Temporal Pattern Learning in Children with Speech and Language Impairments.
Emmorey, K. Language, cognition, and the brain: Insights from sign language research.
http://slhs.sdsu.edu/publications.php   (2992 words)

  
 Utterance format affects phonological priming in the picture-word task: implications for models of phonological ...
Jescheniak, J.D., Schriefers, H.J., and Hantsch, A. Utterance format affects phonological priming in the picture-word task: implications for models of phonological encoding in speech production.
Utterance format affects phonological priming in the picture-word task: implications for models of phonological encoding in speech production
http://www.nici.kun.nl/Publications/2003/17223.html   (186 words)

  
 Library turns digital signal controllers to speech: News from Microchip Technology
The evaluation/development version of the dsPIC30F Speech Encoding/Decoding Library (SW300070-EVAL) is US $5.
The dsPICDEM 1.1 general purpose development board (DM300014) can be used to evaluate this library.
The encoder requires 19 MIPS (worst case), 33 Kbytes of program memory and 6.2 Kbytes of RAM.
http://www.electronicstalk.com/news/ari/ari192.html   (726 words)

  
 Interactive Voice Response System - PhoneBrowser
This research is funded by DARPA under the Advanced Speech Encoding Program. See
http://www.caip.rutgers.edu/speech/ase.html   (60 words)

  
 RTP Payload Format for ETSI ES 201 108 Distributed Speech Recognition Encoding
http://xml.resource.org/public/rfc/bibxml3/reference.I-D.ietf-avt-dsr.xml   (12 words)

  
 Layered neural network interfaced with a cochlear model for the study of speech encoding in the auditory system
Keywords: Speech analysis; Neural networks; Mathematical models; Nonlinear filtering; Computer simulation; Audition
http://www1.elsevier.com/gej-ng/31/32/156/30/22/22/abstract.html   (48 words)

  
 Tong Family Blog: Music Archives
I use the Nero 6.0 Ultra Edition to do the encoding or if you use the older Nero 5.5, it is a $20 plug-in.
Even with MP3 or OGG encoding, a book takes 700 MB or so even at 44 kbps.
Rarewares has a tool called Speexdrop which lets you drag and drop a WAV file onto it and then an SPX file is created
http://www.tongfamily.com/guide/music/index.php   (4305 words)

  
 NBL Conference: The Time Course of Phonological Encoding during Speech Production Estimated from Event Related ...
One central question in psycholinguistic research is when the various types of information (conceptual/semantic, syntactic, and phonological) involved in speaking become available during the process of speech planning.
Here we investigated the relative time course of phonological encoding in an implicit picture-naming task in Dutch using event-related brain potentials (ERPs).
The results will be discussed in relation to a theory of speech production.
http://www.let.rug.nl/nbl/program/18.html   (136 words)

  
 A neural correlate of syntactic encoding during speech production
Indefrey, P., Brown, C.M., Hellwig, F., Amunts, K., Herzog, H., Rüdiger, J.S., and Hagoort, P. A neural correlate of syntactic encoding during speech production.
http://www.nici.kun.nl/Publications/2001/14693.html   (58 words)

  
 Hexapedia - Synthesizer
Synthesizers create sounds through direct manipulation of electrical currents (as in analog synthesizers), mathematical manipulation of discrete values using computers (as in software synthesizers), or by a combination of both methods.
The term "speech synthesizer" is also used in electronic speech processing, often in connection with vocoders.
When natural tonal instruments' sounds are analyzed in the frequency domain, the spectra of tonal instruments exhibit amplitude peaks at the harmonics.
http://www.hexafind.com/encyclopedia/Synthesizer   (3032 words)

  
 Advanced Speech Encoding Program Phase 2
PROGRAM OBJECTIVES AND DESCRIPTION: The Defense Advanced Research Projects Agency (DARPA) Advanced Technology Office (ATO) is soliciting proposals under this BAA for Phase 2 of its Advanced Speech Encoding (ASE) Program.
BROAD AGENCY ANNOUNCEMENT (BAA) 04-35 ADVANCED SPEECH ENCODING PROGRAM, CLOSING DATE: 05 JUL 2005, FIRST SELECTIONS: 4:00PM Arlington, VA Local Time, 01 SEP 2004, POC: Dr. Lisa Porter DARPA/ATO, E-MAIL: ase@darpa.mil, WEB: http://www.darpa.mil/ato/solicit/ASE/index.htm.          
The ASE Program Phase 2 will also explore and characterize the nature of subauditory (nonacoustic) speech and its potential utility as an alternative means of communication in acoustically harsh environments.
http://www.darpa.mil/ato/solicit/ASE/index.htm   (451 words)

  
 Press Release on "Speech Quality for Wireless and Wireline Expected to Improve"
A bandwidth of 50 to 7 000 Hz improves the intelligibility and naturalness of speech, adds a feeling of transparent communication and eases speaker recognition.
New Wideband Speech Coding Standard Set by ITU
January 2002 — The ITU has approved a new Standard for high-quality digital wideband speech encoding that will bring significant improvements in terms of interoperability, easier implementation, and improved quality, for wideband voice applications and services across a wide range of communication systems and platforms.
http://www.itu.int/newsroom/press_releases/2002/03.html   (528 words)

  
 Question 8/16 - Encoding of speech signals at bit rates around 4 kbit/s
What algorithm should be specified for the encoding of 3.4 kHz band-limited, telephone-quality speech at transmission rates in the region of 2.4 kbit/s to 6.4 kbit/s?
g) Study of the support of text-telephony (TDD) devices in systems using low bit-rate speech coding.
a) Study and definition of applications and performance for low-rate speech coding.
http://www.itu.int/ITU-T/2001-2004/com16/sg16-q8.html   (407 words)

  
 speech encoding
try this book to get a basic idea about coding: Speech Coding Algorithms Foundation and Evolution of Standar
In short, speech encoding is a way to transmit speech using a lower data rate.
Usually speech encoding makes a model of the speech process and extracts the parameters of the model.
http://www.edaboard.com/ftopic81569.html   (251 words)

  
 Audio data compression
More advanced codecs such as Shorten (SHN) and FLAC use linear prediction to come up with an optimal whitening filter.
Apple Lossless Encoding (also known as Apple Lossless or Apple Lossless Audio Codec)
Lossless audio codecs have no quality issues, so the usability can be estimated by
http://www.brainyencyclopedia.com/encyclopedia/a/au/audio_data_compression.html   (732 words)

  
 [mp3encoder] LAME setting for speech encoding
While there are many presets available for high end encoding, I do not see any for the speech range.
I'd like to encode some speech, which had previously been recorded on tapes.
Google does not produce any useful results either.
http://minnie.tuhs.org/pipermail/mp3encoder/2003-August/005847.html   (144 words)

  
 T.20B When will better quality speech at higher encoding rates be available?
This raises the possibility that even higher encoding rates may arise in the future.
Question Q.10 mentions that there will be a faster DTE rate in the next generation of modems.
http://www.faqs.org/faqs/modems/ZyXEL/FAQ/part3/section-30.html   (223 words)

  
 Periodica Polytechnica Electrical Engineering 1997
When compared with other existing methods, this algorithm is better in terms of robustness and complexity.
ABSTRACT: Line Spectral Frequencies (LSF) provide an alternate parameterization of the analysis and synthesis filters used in Linear Predictive Coding (LPC) of speech.
KEYWORDS: speech coding, low bit rate, Linear Predictive Coding, Line Spectral Frequencies, Karhunen-Loeve transform
http://www.pp.bme.hu/ee/1997_1/ee1997_1_07.html   (104 words)

  
 U.S. Pregrant 20040019480 - Speech encoding device having TFO function and method
At this time, to alleviate the processing burden of the encoder, part of the demultiplexed encoded data, for example, stochastic codebook data, is extracted and supplied to the encoding functional unit.
http://cxp.paterra.com/uspregrant20040019480.html   (196 words)

  
 Rate detection apparatus and method for variable rate speech encoding - US Patent 6480556
A method of detecting the data rate of a received digital signal is provided.
Rate detection apparatus and method for variable rate speech encoding
Method and apparatus for determining the rate of received data in a variable rate communication system
http://www.patentstorm.us/patents/6480556.html   (234 words)

  
 School of Communication at Northwestern University :: Auditory Neuroscience Laboratory :: Visual Influences on Auditory ...
Speech and music encoding is often assumed to be primarily an auditory process.
By studying visual influences on neural responses to speech and music we are able to uncover the mechanisms of multisensory interaction in natural perception.
This line of research focuses on cortical and subcortical responses to audiovisual stimuli in normal and learning-impaired populations and in musicians.
http://www.communication.northwestern.edu/brainvolts/projects/visual   (94 words)

  
 ECS EPrints Service - Noiseless encoding of speech signals
Gharavi, H. and Steele, R. Noiseless encoding of speech signals.
Full text of this item is not available.
http://eprints.ecs.soton.ac.uk/3659   (44 words)

  
 speech pathology - definition of speech pathology in the Medical dictionary - by the Free Online Medical Dictionary, ...
This information should not be considered complete, up to date, and is not intended to be used in place of a visit, consultation, or advice of a legal, medical, or any other professional.
The science concerned with the diagnosis and treatment of functional and organic speech defects and disorders.
http://medical-dictionary.thefreedictionary.com/speech%20pathology   (98 words)

  
 Speech Production: Phonetic Encoding of Real and Non-words
http://wotan.liu.edu/docis/dbl/tsdtsd/2003__281_SPPEOR.htm   (44 words)

  
 COMPARISON OF SPEECH ENCODING STRATEGIES (SPEAK, ACE, CIS) ASN - Archives for Sensology and Neurootology
Keywords: children, cochlear implantation, cochlear implant, speech encoding.
Published on the Archives for Sensology and Neurootology in Science and Practice - ASN
The editors welcome authors to submit articles for publications in the ASN.
http://www.neurootology.org/index/104.html   (102 words)

  
 Digital video services Los Angeles: VHS to DVD, Video hosting, video encoding, streaming services, editing & more
http://www.vchange.com   (177 words)


 Copyright © 2006 CompWisdom.com Usage implies agreement with terms.