|
| |
| | Method and apparatus for speech encoding, speech decoding, and speech post processing - Patent 5651092 |
 | | The speech analysis means calculates and outputs a value of power of the input speech, which is taken by locating the center of the analysis window at the center of the frame every time, as the power of the frame. |  | | The speech analysis means calculates and outputs a value of power of the input speech as a power of analysis frame concerned. |  | | The input speech 4 is input into the speech analysis means 6 through the line 101. |
|
http://www.freepatentsonline.com/5651092.html
(7985 words)
|
|
| |
| | Parametric Encoding of Speech |
 | | Speech processing methodologies utilised in hearing prostheses are likely to be similar (if not identical) to algorithms and models utilised in speech synthesis and so it should be useful to examine the intelligibility of various synthesiser configurations in a systematic fashion. |  | | The pooled bark-scaled channel vocoded speech intelligibilities for the 0.75 bark condition are not significantly different from the natural speech condition for all phonetic classes except "consonants" (larger number of tokens for all consonants so a percentage point difference of 4 is significant for this comparison). |  | | It is obviously desirable in the development of speech processing algorithms for hearing prostheses to utilise optimal models of speech. |
|
http://www.ling.mq.edu.au/rmannell/research/tahaaci90/index.html
(5590 words)
|
|
| |
| | 3.1.6 Speech Recognition for Encoding Digital Transmission |
 | | Speech recognition scores were computed for each speech segment under each of the 3 conditions. |  | | For example, on might envision scoring speech recognizers and human listeners on identical speech-to-text transcription tasks, and then computing the correlation in performance. |  | | This work was designed to be a first step in exploring the feasibility and applicability of using automated speech recognition technology to model human perception of communication channel quality. |
|
http://www.itl.nist.gov/div898/pubs/ar/ar1999/node11.html
(276 words)
|
|
| |
| | RFC 3557 (rfc3557) - RTP Payload Format for European Telecommunications St |
 | | Encoding considerations : This type is defined for transfer via RTP [RFC3550] as described in Sections 3 and 4 of RFC 3557. |  | | The remote device processes the speech, compresses the data, and adds error protection to the bitstream in a manner optimal for speech recognition. |  | | However, the voice codecs typically employed in mobile devices were designed to optimize audible voice quality and not speech recognition accuracy, and using these codecs with speech recognizers can result in poor recognition performance. |
|
http://www.faqs.org/rfcs/rfc3557.html
(2702 words)
|
|
| |
| | Speech and Channel Coding |
 | | The Class Ib bits (together with the encoded Class IA bits) are encoded using convolutional encoding. |  | | The exact algorithms used differ for speech and for different data rates. |  | | As a result, the channel encoded bit sequence is now 378+78=456 bits long. |
|
http://www.eecg.toronto.edu/~nazizi/gsm/coding/index.html
(1077 words)
|
|
| |
| | RFC3557 RTP Payload Format for Es 201 108 Distributed Speech Recognition Encoding |
 | | The remote device processes the speech, compresses the data, and adds error protection to the bitstream in a manner optimal for speech recognition. |  | | Encoding considerations : This type is defined for transfer via RTP [RFC3550] as described in Sections 3 and 4 of RFC 3557. |  | | However, the voice codecs typically employed in mobile devices were designed to optimize audible voice quality and not speech recognition accuracy, and using these codecs with speech recognizers can result in poor recognition performance. |
|
http://leadercomm.com/rfc/rfc3557.htm
(2637 words)
|
|
| |
| | Speech encoding - Wikipedia, the free encyclopedia |
 | | The Speex project is an attempt to create a free software speech coder, unemcumbered by patent restrictions. |  | | The A-law algorithm and the Mu-law algorithm are used in nearly all land-line long distance telephone communications. |  | | Speech coding is the compression of speech (into a code) for transmission with speech codecs that use audio signal processing and speech processing techniques. |
|
http://www.wikipedia.org/wiki/Speech_coding
(439 words)
|
|
| |
| | Speech encoding method and speech encoding system - US Patent 6581031 |
 | | Thus, the encoding of input speech signal is completed. |  | | For example, the method of learning the codebook is described in Linde et al., "An Algorithm for Vector Quantization Design", IEEE Trans. |  | | Multistage low bit-rate CELP speech coder with switching code books depending on degree of pitch periodicity |
|
http://www.patentstorm.us/patents/6581031.html
(6160 words)
|
|
| |
| | Pseudo-random sequencing for speech predictive encoding communications system - United States Patent 3,997,729 |
 | | The 2:1 compression ratio is achieved by applying to each of the n channels the predictive encoding algorithm called a zero-order predictor, well known in the art, and described above. |  | | The disclosed system would be operative to provide an efficient use of trnsmission capacity where a small percentage of the input trunks contaned digital data. |  | | In this manner, a number of talkers greater than the number of available channels may be serviced by sharing the channels on a talkspurt interpolated basis. |
|
http://xrint.com/patents/us/3997729
(8852 words)
|
|
| |
| | Progress Report No.20 -- Short Report & Work-in-progress 4. |
 | | A multimodal speech signal is extremely robust and informative and provides information that perceivers are able to exploit during perceptual analysis. |  | | These results have implications for current theories of speech perception and spoken language processing. |  | | Multimodal Encoding of Speech in Memory: A First Report |
|
http://www.indiana.edu/~srlweb/publication/short204.html
(122 words)
|
|
| |
| | Publications |
 | | Ballard, K.J., Barlow, J.A., and Robin, D.A. (2001) The underlying nature of apraxia of speech: A critical evaluation of Varley and Whiteside’s dual route speech encoding hypothesis. |  | | Robin, D.A., Tomblin, B., Kearney, A., and Hug, L.N. (1989) Auditory Temporal Pattern Learning in Children with Speech and Language Impairments. |  | | Emmorey, K. Language, cognition, and the brain: Insights from sign language research. |
|
http://slhs.sdsu.edu/publications.php
(2992 words)
|
|
| |
| | Utterance format affects phonological priming in the picture-word task: implications for models of phonological ... |
 | | Jescheniak, J.D., Schriefers, H.J., and Hantsch, A. Utterance format affects phonological priming in the picture-word task: implications for models of phonological encoding in speech production. |  | | Utterance format affects phonological priming in the picture-word task: implications for models of phonological encoding in speech production |  | | NICI > Publications > 2003 > Utterance format affects phonological priming in the picture-word task: implications for models of phonological encoding in speech production |
|
http://www.nici.kun.nl/Publications/2003/17223.html
(186 words)
|
|
| |
| | Library turns digital signal controllers to speech: News from Microchip Technology |
 | | The evaluation/development version of the dsPIC30F Speech Encoding/Decoding Library (SW300070-EVAL) is US $5. |  | | The dsPICDEM 1.1 general purpose development board (DM300014) can be used to evaluate this library. |  | | The encoder requires 19MIPS (worst case), 33Kbyte of program memory and 6.2Kbyte of RAM. |
|
http://www.electronicstalk.com/news/ari/ari192.html
(726 words)
|
|
| |
| | Tong Family Blog: Music Archives |
 | | I use the Nero 6.0 Ultra Edition to do the encoding or if you use the older Nero 5.5, it is a $20 plug-in. |  | | Even with MP3 or OGG encoding, a book takes 700MBs or so even at 44Kpbs. |  | | Rarewares has a tool called Speexdrop which lets you drag and drop a WAV file onto it and then an SPX file is created |
|
http://www.tongfamily.com/guide/music/index.php
(4305 words)
|
|
| |
| | NBL Conference: The Time Course of Phonological Encoding during Speech Production Estimated from Event Related ... |
 | | One central question in psycholinguistic research is when the various types of information (conceptual/semantic, syntactic, and phonological) involved in speaking become available during the process of speech planning. |  | | Here we investigated the relative time course of phonological encoding in an implicit picture-naming task in Dutch using event-related brain potentials (ERPs). |  | | The results will be discussed in relation to a theory of speech production. |
|
http://www.let.rug.nl/nbl/program/18.html
(136 words)
|
|
| |
| | A neural correlate of syntactic encoding during speech production |
 | | A neural correlate of syntactic encoding during speech production |  | | NICI > Publications > 2001 > A neural correlate of syntactic encoding during speech production |  | | Indefrey, P., Brown, C.M., Hellwig, F., Amunts, K., Herzog, H., Rüdiger, J.S., and Hagoort, P. A neural correlate of syntactic encoding during speech production. |
|
http://www.nici.kun.nl/Publications/2001/14693.html
(58 words)
|
|
| |
| | Hexapedia - Synthesizer |
 | | Synthesizers create sounds through direct manipulation of electrical currents (as in analog synthesizers), mathematical manipulation of discrete values using computers (as in software synthesizers), or by a combination of both methods. |  | | The term "speech synthesizer" is also used in electronic speech processing, often in connection with vocoders. |  | | When natural tonal instruments' sounds are analyzed in the frequency domain, the spectra of tonal instruments exhibit amplitude peaks at the harmonics. |
|
http://www.hexafind.com/encyclopedia/Synthesizer
(3032 words)
|
|
| |
| | Advanced Speech Encoding Program Phase 2 |
 | | PROGRAM OBJECTIVES AND DESCRIPTION: The Defense Advanced Research Projects Agency (DARPA) Advanced Technology Office (ATO) is soliciting proposals under this BAA for Phase 2 of its Advanced Speech Encoding (ASE) Program. |  | | BROAD AGENCY ANNOUNCEMENT (BAA) 04-35 ADVANCED SPEECH ENCODING PROGRAM, CLOSING DATE: 05 JUL 2005, FIRST SELECTIONS: 4:00PM Arlington, VA Local Time, 01 SEP 2004, POC: Dr. Lisa Porter DARPA/ATO, E-MAIL: ase@darpa.mil, WEB: http://www.darpa.mil/ato/solicit/ASE/index.htm. |  | | The ASE Program Phase 2 will also explore and characterize the nature of subauditory (nonacoustic) speech and its potential utility as an alternative means of communication in acoustically harsh environments. |
|
http://www.darpa.mil/ato/solicit/ASE/index.htm
(451 words)
|
|
| |
| | Press Release on "Speech Quality for Wireless and Wireline Expected to Improve" |
 | | A bandwidth of 50 to 7 000 Hz improves the intelligibility and naturalness of speech, adds a feeling of transparent communication and eases speaker recognition. |  | | New Wideband Speech Coding Standard Set by ITU |  | | January 2002 — The ITU has approved a new Standard for high-quality digital wideband speech encoding that will bring significant improvements in terms of interoperability, easier implementation, and improved quality, for wideband voice applications and services across a wide range of communication systems and platforms. |
|
http://www.itu.int/newsroom/press_releases/2002/03.html
(528 words)
|
|
| |
| | Question 8/16 - Encoding of speech signals at bit rates around 4 kbit/s |
 | | What algorithm should be specified for the encoding of 3.4 kHz band-limited, telephone-quality speech at transmission rates in the region of 2.4 kbit/s to 6.4 kbit/s? |  | | g) Study of the support of text-telephony (TDD) devices in systems using low bit-rate speech coding. |  | | a) Study and definition of applications and performance for low-rate speech coding. |
|
http://www.itu.int/ITU-T/2001-2004/com16/sg16-q8.html
(407 words)
|
|
| |
| | speech encoding |
 | | try this book to get a basic idea about coding: Speech Coding Algorithms Foundation and Evolution of Standar |  | | Shortly speech encoding is a way to transmit speech using lower data rate. |  | | Usually speech encoding makes a model of the speech process and extracts the parameters of the model. |
|
http://www.edaboard.com/ftopic81569.html
(251 words)
|
|
| |
| | Audio data compression |
 | | More advanced codecs such as Shorten (SHN) and FLAC use linear prediction to come up with an optimal whitening filter. |  | | Apple Lossless Encoding (also known as Apple Lossless or Apple Lossless Audio Codec) |  | | Lossless audio codecs have no quality issues, so the usabilty can be estimated by |
|
http://www.brainyencyclopedia.com/encyclopedia/a/au/audio_data_compression.html
(732 words)
|
|
| |
| | [mp3encoder] LAME setting for speech encoding |
 | | While there are many presets available for high end encoding, I do not see any for the speech range. |  | | I'd like to encode some speech, which had perviously recorded on tapes. |  | | Google does not produce any useful results either. |
|
http://minnie.tuhs.org/pipermail/mp3encoder/2003-August/005847.html
(144 words)
|
|
| |
| | T.20B When will better quality speech at higher encoding rates be available? |
 | | This raises the possibility that even higher encoding rates may arise in the future. |  | | T.20B When will better quality speech at higher encoding rates be available? |  | | Question Q.10 mentions that there will be a faster DTE rate in the next generation of modems. |
|
http://www.faqs.org/faqs/modems/ZyXEL/FAQ/part3/section-30.html
(223 words)
|
|
| |
| | Periodica Polytechnica Electrical Engineering 1997/Resource id #2364 |
 | | When compared with other existing methods, this algorithm is better in terms of robustness and complexity. |  | | ABSTRACT: Line Spectral Frequencies (LSF) provide an alternate parameterization of the analysis and synthesis filters used in Linear Predictive Coding (LPC) of speech. |  | | KEYWORDS: speech coding, low bit rate, Linear Predictive Coding, Line Spectral, Frequencies, Karhunen-Loeve transform |
|
http://www.pp.bme.hu/ee/1997_1/ee1997_1_07.html
(104 words)
|
|
| |
| | U.S. Pregrant 20040019480 - Speech encoding device having TFO function and method |
 | | Speech encoding device having TFO function and method |  | | U.S. Pregrant 20040019480 - Speech encoding device having TFO function and method |  | | At this time, to alleviate the processing burden of the encoder, part of the demultiplexed encoded data, for example, stochastic codebook data, is extracted and supplied to the encoding functional unit. |
|
http://cxp.paterra.com/uspregrant20040019480.html
(196 words)
|
|
| |
| | Speech Production: Phonetic Encoding of Real and Non-words |
 | | Speech Production: Phonetic Encoding of Real and Non-words |  | | For further information about this item go to: |  | | Home >> Journals and Conference Proceedings >> Text, Speech and Dialogue (TSD) |
|
http://wotan.liu.edu/docis/dbl/tsdtsd/2003__281_SPPEOR.htm
(44 words)
|
|
|