Limpar
20 resultados

Acesso aberto

Tipo do recurso

Ano de criação

Produção nacional

Revisado por pares

Áreas

Idioma

Editores

Artigo Acesso aberto Revisado por pares

Daniel Gutiérrez-Galán, Juan P. Domínguez-Morales, Ángel Jiménez-Fernández, Alejandro Linares-Barranco, G. Jiménez,

... provides support to several audio input interfaces (AC’97 audio codec, I2S-ADC and PDM microphones), different processing architectures ( ...

Tópico(s): Analog and Mixed-Signal Circuit Design

2021 - Elsevier BV | Neurocomputing

Artigo

R. Salami, Rémi Lefebvre, A. Lakaniemi, Kalervo Kontola, Stefan Bruhn, Anas Abu Taleb,

This article presents the architecture, performance, and application scenarios of the AMR-WB+ (extended AMR-WB) audio codec, which provides high quality at exceptionally low rates, and consistent quality over all audio types. This codec was recently selected by 3GPP and DVB to support low-bit-rate audio and audiovisual applications on mobile networks

Tópico(s): Speech Recognition and Synthesis

2006 - Institute of Electrical and Electronics Engineers | IEEE Communications Magazine

Artigo Acesso aberto Revisado por pares

Thierry Alpert, Vittorio Baroncini, DaHye Choi, L. Contin, Rob Koenen, Fernando Pereira, Herbert B. Peterson,

A new audio-visual coding standard, MPEG-4, is currently under development. MPEG-4 will address not only compression, but also completely new audio-video coding functionalities related to content-based interactivity and universal access. As part of the MPEG-4 standardization process, in November, 1995 assessments were performed on technologies proposed for incorporation in the standard. These assessments included formal subjective tests, as well as expert panel evaluations. This paper describes ...

Tópico(s): Digital Rights Management and Security

1997 - Elsevier BV | Signal Processing Image Communication

Artigo Revisado por pares

L. Contin, Bernd Edler, D. J. Meares, Peter R. Schreiner,

Abstract During December 1995, subjective tests were carried out by members of the Moving Picture Experts Group (MPEG, ISO/JTC1/SC29/WG11) to select the proposed technology for inclusion in the audio part of the new MPEG-4 standard. The new standard addresses coding for more than just the functionality of data rate compression. Material coded at very low bit-rates is also included. Thus, different testing methodologies were applied, according to ITU-R Rec. BS 1116 for a bit-rate of 64 kbit/s per ...

Tópico(s): Digital Rights Management and Security

1997 - Elsevier BV | Signal Processing Image Communication

Artigo Revisado por pares

Adam B. Kinsman, Nicola Nicolici,

This paper details our experience of developing an MPEG-2 audio/video decoder which operates at main level/main profile, 720 × 480 4:2:0 at 29.97 frames per second, with audio at 16 bits, 48 000 samples per second. The design has been developed with a focus on energy-efficiency.

Tópico(s): Advanced Data Compression Techniques

2009 - Institute of Electrical and Electronics Engineers | IEEE Transactions on Very Large Scale Integration (VLSI) Systems

Capítulo de livro Revisado por pares

Jinglei Zhou, Rangding Wang, Chao Jin, Diqun Yan,

... method can effectively detect whether the given WAV audio is original or not, and furthermore, it can identify the type of the codec. The overall hit rate can reach over 97 %.

Tópico(s): Advanced Steganography and Watermarking Techniques

2015 - Springer Science+Business Media | Lecture notes in computer science

Artigo Brasil Produção Nacional Revisado por pares

L.M. de Silva, Abraham Alcaim,

In this letter, we propose a CELP algorithm in which the complexity of the search procedure in the adaptive codebook is greatly reduced. This is achieved by means of a modified model of the CELP synthesizer, while keeping the usual perceptual weighting of the synthesis error in the analysis procedure. Simulation results show that the proposed algorithm can provide speech quality comparable to the one obtained with the conventional CELP codec. >

Tópico(s): Speech and

1995 - Institute of Electrical and Electronics Engineers | IEEE Signal Processing Letters

Artigo Revisado por pares

Hochong Park,

An efficient enhanced variable rate codec (EVRC) codebook search method based on a two-stage search is proposed. At the first stage, a coarse codevector is selected by a fast sequential search, and at the second stage, the pulse replacement procedure is run to enhance the performance of selected codevector. Simulations with various speech data show that the proposed method yields voice quality equivalent to that by the standard method with only 23% codebook search load.

Tópico(s): Speech and

2000 - Institute of Electrical and Electronics Engineers | IEEE Signal Processing Letters

Artigo Revisado por pares

Panos Kudumakis, M. Sandler,

The performance obtainable with four-tap wavelet filters for low bit rate audio coding is presented. For the investigation and comparison of the performance of these wavelet filters, a codec model has been designed and implemented based on wavelet packet algorithm and the model of auditory perception.

Tópico(s): Digital Filter Design and Implementation

1996 - Institute of Electrical and Electronics Engineers | IEEE Signal Processing Letters

Artigo

Myoungbeom Chung, Ilju Ko,

... greatly affected by changes in the resolution or codec used and image color values. which are used for similarity comparison. The method showed a 97.7% search success rate, given a set of 2,000 video files whose audio-bit-rate had been altered or were purposefully written in a different codec.

Tópico(s): Image Retrieval and Classification Techniques

2009 - Korean Society of Computer Information | Journal of the Korea Society of Computer and Information

Artigo Acesso aberto Revisado por pares

Majid Haji Bagheri, Emma Gu, Asif Abdullah Khan, Yanguang Zhang, Gaozhi Xiao, Mohammad Nankali, Peng Peng, Pengcheng Xi, Dayan Ban,

... audio captioning using the EnCodec Combining Neural Audio Codec and Audio‐Text Joint Embedding for Automated Audio Captioning model (EnCLAP) showcases rapid and precise processing capabilities that are suitable for live‐streaming environments. The Bidirectional Encoder representation from the Audio Transformers (BEATs) model also demonstrated exceptional performance, achieving an accuracy of 97.25%. These models were fine‐tuned using the ...

Tópico(s): Conducting polymers and applications

2025 - Wiley | Advanced Sensor Research

Artigo Revisado por pares

F.F. Tzeng,

Views Icon Views Article contents Figures & tables Video Audio Supplementary Data Peer Review Share Icon Share Twitter Facebook Reddit LinkedIn Tools Icon Tools Reprints and Permissions Cite Icon Cite Search Site Citation Forrest F.‐T. Tzeng; Wear‐toll quality 4.8 kbps speech codec. J. Acoust. Soc. Am. 1 April 1995; 97 (4): 2627. https://doi.org/10.1121/1. ...

Tópico(s): Video Coding and Compression Technologies

1995 - Acoustical Society of America | The Journal of the Acoustical Society of America

Artigo Revisado por pares

Martin Hansen, Birger Kollmeier,

A model is presented which was developed for the prediction of speech transmission quality of (low-bit-rate) speech codecs [Hansen and Kollmeier, ICASSP’97, paper ♯2056 (1997)]. The model is based on a quantitative psychoacoustical preprocessing scheme [Dau et al., J. Acoust. Soc. Am. 99, 3614–3622 (1996)] and was successfully applied to various speech codec test databases. This study presents measurements and modeling results of the detectability of band-specific modulated-noise distortions. Two sentences ...

Tópico(s): Hearing Loss and Rehabilitation

1997 - Acoustical Society of America | The Journal of the Acoustical Society of America

Artigo Revisado por pares

Ashraf A. Kassim, F. K. Fong, Kok Seng Chua, S. Rangananth,

... the video encoder, the video decoder and the audio codec processor. Each functional unit consists of a Texas ...

Tópico(s): Digital Filter Design and Implementation

1997 - Elsevier BV | Microprocessors and Microsystems

Artigo

Radhika Ranjan Roy,

... multimedia conferencing, and the typical bandwidth requirements for audio, video, and data, including text, still images, and graphics. — Results of an investigation examining how an end-to-end asynchronous transfer mode (ATM) network architecture can satisfy the stringent requirements of multimedia conferencing. The delay within the network has been calculated to estimate the upper-and lower-delay budgets for the video codec, to ensure that the end-to-end transmission ...

Tópico(s): Peer-to-Peer Network Technologies

1994 - Institute of Electrical and Electronics Engineers | AT&T Technical Journal

Artigo Revisado por pares

Domingo López-Oller, Nadir Benamirouche, Ángel M. Gómez, José L. Pérez-Córdoba,

Voice over IP (VoIP) communications are prone to transmission delays and data losses as they are carried out over packet-switched networks which are unable to guarantee real-time packet delivery. Speech codecs used in these channels strongly rely on Packet Loss Concealment (PLC) algorithms, the performance of which can be compromised as frame losses often occur in bursts. Thus, advanced PLC algorithms for erasure channels have already been proposed in the literature but these frequently focus on the ...

Tópico(s): Advanced Adaptive Filtering Techniques

2018 - Elsevier BV | Speech Communication

Artigo Acesso aberto

Lahcene Merah, Pascal Lorenz, Adda Ali-Pacha, Naïma Hadj-Said,

The enormous progress in communication technology has led to a tremendous need to provide an ideal environment for the transmission, storing, and processing of digital multimedia content, where the audio signal takes the lion's share of it. Audio processing covers many diverse fields, its main aim is presenting sound to human listeners. Recently, digital audio processing became an active research area, it covers everything from theory to practice in relation to transmission, compression, filtering, ...

Tópico(s): Digital Filter Design and Implementation

2021 - | International Journal of Future Computer and Communication

Artigo Acesso aberto

A. Utku Yargicoglu, Hakkı Gökhan İlk,

... inputs. During training, weight values which dissociate the codec types best are investigated. Finally, performances of the trained classifiers are evaluated by using test sets. According to the test results, for a closed group which is formed from nine different output types, polynomial SVM classifiers have identified more than 97% of the samples correctly.

Tópico(s): Chaos-based Image/Signal Encryption

2012 - University of Zulia | Communications - Scientific letters of the University of Zilina

Artigo Revisado por pares

Shih‐Chang Hsia, Cheng Hung Hsiao,

... object can be seriously distorted. The MPEG-4 codec provides both intra- and inter-shape coding. Inter- ... coded video surveillance systems. 5 References 1Coding of audio-visual objects: video, ISO/IECJTC/SC29/WG11, January ...

Tópico(s): Optical measurement and interference techniques

2016 - Institution of Engineering and Technology | IET Image Processing

Artigo Revisado por pares

John G. Bereends, A.P. Hekstra,

PSQM (Perceptual Speech Quality Measure), measuring speech quality objectively, has been standardized by ITU-T as recommendation P.861. PSQM characterizes the perception of the (degraded) output speech signal of the system in comparison to the (ideal) input speech. A perceptual model is used that maps input and output signals onto psychophysical representations using psychophysical equivalents of frequency (Bark) and intensity (compressed Sone). The quality of the device under test is determined with ...

Tópico(s): Advanced Adaptive Filtering Techniques

1999 - Acoustical Society of America | The Journal of the Acoustical Society of America