Daniel Gutiérrez-Galán, Juan P. Domínguez-Morales, Ángel Jiménez-Fernández, Alejandro Linares-Barranco, G. Jiménez,
... provides support to several audio input interfaces (AC’97 audio codec, I2S-ADC and PDM microphones), different processing architectures ( ...
Tópico(s): Analog and Mixed-Signal Circuit Design
2021 - Elsevier BV | Neurocomputing
R. Salami, Rémi Lefebvre, A. Lakaniemi, Kalervo Kontola, Stefan Bruhn, Anas Abu Taleb,
This article presents the architecture, performance, and application scenarios of the AMR-WB+ (extended AMR-WB) audio codec, which provides high quality at exceptionally low rates, and consistent quality over all audio types. This codec was recently selected by 3GPP and DVB to support low-bit-rate audio and audiovisual applications on mobile networks
Tópico(s): Speech Recognition and Synthesis
2006 - Institute of Electrical and Electronics Engineers | IEEE Communications Magazine
Thierry Alpert, Vittorio Baroncini, DaHye Choi, L. Contin, Rob Koenen, Fernando Pereira, Herbert B. Peterson,
A new audio-visual coding standard, MPEG-4, is currently under development. MPEG-4 will address not only compression, but also completely new audio-video coding functionalities related to content-based interactivity and universal access. As part of the MPEG-4 standardization process, in November, 1995 assessments were performed on technologies proposed for incorporation in the standard. These assessments included formal subjective tests, as well as expert panel evaluations. This paper describes ...
Tópico(s): Digital Rights Management and Security
1997 - Elsevier BV | Signal Processing Image Communication
L. Contin, Bernd Edler, D. J. Meares, Peter R. Schreiner,
Abstract During December 1995, subjective tests were carried out by members of the Moving Picture Experts Group (MPEG, ISO/JTC1/SC29/WG11) to select the proposed technology for inclusion in the audio part of the new MPEG-4 standard. The new standard addresses coding for more than just the functionality of data rate compression. Material coded at very low bit-rates is also included. Thus, different testing methodologies were applied, according to ITU-R Rec. BS 1116 for a bit-rate of 64 kbit/s per ...
Tópico(s): Digital Rights Management and Security
1997 - Elsevier BV | Signal Processing Image Communication
Adam B. Kinsman, Nicola Nicolici,
This paper details our experience of developing an MPEG-2 audio/video decoder which operates at main level/main profile, 720 × 480 4:2:0 at 29.97 frames per second, with audio at 16 bits, 48 000 samples per second. The design has been developed with a focus on energy-efficiency.
Tópico(s): Advanced Data Compression Techniques
2009 - Institute of Electrical and Electronics Engineers | IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Jinglei Zhou, Rangding Wang, Chao Jin, Diqun Yan,
... method can effectively detect whether the given WAV audio is original or not, and furthermore, it can identify the type of the codec. The overall hit rate can reach over 97 %.
Tópico(s): Advanced Steganography and Watermarking Techniques
2015 - Springer Science+Business Media | Lecture notes in computer science

L.M. de Silva, Abraham Alcaim,
In this letter, we propose a CELP algorithm in which the complexity of the search procedure in the adaptive codebook is greatly reduced. This is achieved by means of a modified model of the CELP synthesizer, while keeping the usual perceptual weighting of the synthesis error in the analysis procedure. Simulation results show that the proposed algorithm can provide speech quality comparable to the one obtained with the conventional CELP codec. >
Tópico(s): Speech and
1995 - Institute of Electrical and Electronics Engineers | IEEE Signal Processing Letters
An efficient enhanced variable rate codec (EVRC) codebook search method based on a two-stage search is proposed. At the first stage, a coarse codevector is selected by a fast sequential search, and at the second stage, the pulse replacement procedure is run to enhance the performance of selected codevector. Simulations with various speech data show that the proposed method yields voice quality equivalent to that by the standard method with only 23% codebook search load.
Tópico(s): Speech and
2000 - Institute of Electrical and Electronics Engineers | IEEE Signal Processing Letters
The performance obtainable with four-tap wavelet filters for low bit rate audio coding is presented. For the investigation and comparison of the performance of these wavelet filters, a codec model has been designed and implemented based on wavelet packet algorithm and the model of auditory perception.
Tópico(s): Digital Filter Design and Implementation
1996 - Institute of Electrical and Electronics Engineers | IEEE Signal Processing Letters
... greatly affected by changes in the resolution or codec used and image color values. which are used for similarity comparison. The method showed a 97.7% search success rate, given a set of 2,000 video files whose audio-bit-rate had been altered or were purposefully written in a different codec.
Tópico(s): Image Retrieval and Classification Techniques
2009 - Korean Society of Computer Information | Journal of the Korea Society of Computer and Information
Majid Haji Bagheri, Emma Gu, Asif Abdullah Khan, Yanguang Zhang, Gaozhi Xiao, Mohammad Nankali, Peng Peng, Pengcheng Xi, Dayan Ban,
... audio captioning using the EnCodec Combining Neural Audio Codec and Audio‐Text Joint Embedding for Automated Audio Captioning model (EnCLAP) showcases rapid and precise processing capabilities that are suitable for live‐streaming environments. The Bidirectional Encoder representation from the Audio Transformers (BEATs) model also demonstrated exceptional performance, achieving an accuracy of 97.25%. These models were fine‐tuned using the ...
Tópico(s): Conducting polymers and applications
2025 - Wiley | Advanced Sensor Research
Views Icon Views Article contents Figures & tables Video Audio Supplementary Data Peer Review Share Icon Share Twitter Facebook Reddit LinkedIn Tools Icon Tools Reprints and Permissions Cite Icon Cite Search Site Citation Forrest F.‐T. Tzeng; Wear‐toll quality 4.8 kbps speech codec. J. Acoust. Soc. Am. 1 April 1995; 97 (4): 2627. https://doi.org/10.1121/1. ...
Tópico(s): Video Coding and Compression Technologies
1995 - Acoustical Society of America | The Journal of the Acoustical Society of America
Martin Hansen, Birger Kollmeier,
A model is presented which was developed for the prediction of speech transmission quality of (low-bit-rate) speech codecs [Hansen and Kollmeier, ICASSP’97, paper ♯2056 (1997)]. The model is based on a quantitative psychoacoustical preprocessing scheme [Dau et al., J. Acoust. Soc. Am. 99, 3614–3622 (1996)] and was successfully applied to various speech codec test databases. This study presents measurements and modeling results of the detectability of band-specific modulated-noise distortions. Two sentences ...
Tópico(s): Hearing Loss and Rehabilitation
1997 - Acoustical Society of America | The Journal of the Acoustical Society of America
Ashraf A. Kassim, F. K. Fong, Kok Seng Chua, S. Rangananth,
... the video encoder, the video decoder and the audio codec processor. Each functional unit consists of a Texas ...
Tópico(s): Digital Filter Design and Implementation
1997 - Elsevier BV | Microprocessors and Microsystems
... multimedia conferencing, and the typical bandwidth requirements for audio, video, and data, including text, still images, and graphics. — Results of an investigation examining how an end-to-end asynchronous transfer mode (ATM) network architecture can satisfy the stringent requirements of multimedia conferencing. The delay within the network has been calculated to estimate the upper-and lower-delay budgets for the video codec, to ensure that the end-to-end transmission ...
Tópico(s): Peer-to-Peer Network Technologies
1994 - Institute of Electrical and Electronics Engineers | AT&T Technical Journal
Domingo López-Oller, Nadir Benamirouche, Ángel M. Gómez, José L. Pérez-Córdoba,
Voice over IP (VoIP) communications are prone to transmission delays and data losses as they are carried out over packet-switched networks which are unable to guarantee real-time packet delivery. Speech codecs used in these channels strongly rely on Packet Loss Concealment (PLC) algorithms, the performance of which can be compromised as frame losses often occur in bursts. Thus, advanced PLC algorithms for erasure channels have already been proposed in the literature but these frequently focus on the ...
Tópico(s): Advanced Adaptive Filtering Techniques
2018 - Elsevier BV | Speech Communication
Lahcene Merah, Pascal Lorenz, Adda Ali-Pacha, Naïma Hadj-Said,
The enormous progress in communication technology has led to a tremendous need to provide an ideal environment for the transmission, storing, and processing of digital multimedia content, where the audio signal takes the lion's share of it. Audio processing covers many diverse fields, its main aim is presenting sound to human listeners. Recently, digital audio processing became an active research area, it covers everything from theory to practice in relation to transmission, compression, filtering, ...
Tópico(s): Digital Filter Design and Implementation
2021 - | International Journal of Future Computer and Communication
A. Utku Yargicoglu, Hakkı Gökhan İlk,
... inputs. During training, weight values which dissociate the codec types best are investigated. Finally, performances of the trained classifiers are evaluated by using test sets. According to the test results, for a closed group which is formed from nine different output types, polynomial SVM classifiers have identified more than 97% of the samples correctly.
Tópico(s): Chaos-based Image/Signal Encryption
2012 - University of Zulia | Communications - Scientific letters of the University of Zilina
Shih‐Chang Hsia, Cheng Hung Hsiao,
... object can be seriously distorted. The MPEG-4 codec provides both intra- and inter-shape coding. Inter- ... coded video surveillance systems. 5 References 1Coding of audio-visual objects: video, ISO/IECJTC/SC29/WG11, January ...
Tópico(s): Optical measurement and interference techniques
2016 - Institution of Engineering and Technology | IET Image Processing
John G. Bereends, A.P. Hekstra,
PSQM (Perceptual Speech Quality Measure), measuring speech quality objectively, has been standardized by ITU-T as recommendation P.861. PSQM characterizes the perception of the (degraded) output speech signal of the system in comparison to the (ideal) input speech. A perceptual model is used that maps input and output signals onto psychophysical representations using psychophysical equivalents of frequency (Bark) and intensity (compressed Sone). The quality of the device under test is determined with ...
Tópico(s): Advanced Adaptive Filtering Techniques
1999 - Acoustical Society of America | The Journal of the Acoustical Society of America