FMUSER Wireless: Transmit Video and Audio More Easily!
1. Introduction
If H.264 is the de facto ruler of video codecs, then AAC holds the same position among audio codecs. The mainstream audio and video combination is H.264 with AAC, whether for stored (non-real-time) media files or for real-time media streams.
2. AAC History
Advanced Audio Coding (AAC) is a set of lossy audio compression coding tools (newer coding tools in the standard also support lossless coding).
It was designed to replace the earlier MP3 coding standard and to deliver better quality than MP3 at a similar bit rate. This goal has been met, and AAC has been standardized by ISO and IEC as part of MPEG-2 and MPEG-4.
AAC is widely supported across devices and platforms, including YouTube, iPhone, iPod, iPad, Nintendo DSi, Nintendo 3DS, iTunes, DivX Plus Web Player, PlayStation 3, PlayStation Vita, Wii (with the Photo Channel 1.1 update installed), the Sony Walkman MP3 series and later, Android, and BlackBerry.
In 1997, AAC first appeared in MPEG-2 Part 7 (ISO/IEC 13818-7:1997). As with video codec standards, MPEG-2 Part 7 defines profiles for AAC. There are three:
• Low-Complexity profile (AAC-LC / LC-AAC)
• Main profile (AAC Main)
• Scalable Sampling Rate profile (AAC-SSR)
AAC-LC has been part of the standard since this first version, which helps explain why it is the most widely used profile and has the best compatibility.
In 1999, AAC was updated and incorporated into MPEG-4 Part 3 (ISO/IEC 14496-3:1999).
An important change in this update was the introduction of Audio Object Types (AOT) and the definition of profiles in terms of AOTs. At this point there were four profiles:
• Main (which includes most of the MPEG-4 Audio Object Types)
• Scalable (AAC LC, AAC LTP, CELP, HVXC, TwinVQ, Wavetable Synthesis, TTSI)
• Speech (CELP, HVXC, TTSI)
• Low Rate Synthesis (Wavetable Synthesis, TTSI)
In 2000, MPEG-4 Audio Version 2 (ISO/IEC 14496-3:1999/Amd 1:2000) defined a new AOT, low-delay AAC (AAC-LD).
In 2003, High-Efficiency Advanced Audio Coding (HE-AAC, AAC LC with SBR) was standardized as an amendment to ISO/IEC 14496-3:2001.
The HE-AAC v2 Profile (AAC LC with SBR and Parametric Stereo) was standardized afterwards, as an amendment to ISO/IEC 14496-3:2005.
The current standardized version of AAC is ISO/IEC 14496-3:2009.
This standardization history shows that AAC is not a single encoder but a large collection of audio coding tools.
3. AOT
AOT stands for MPEG-4 Audio Object Type. The AOT ID is used for capability negotiation.
Precisely because AAC has so many AOTs, it is hard to tell which variant is meant when someone simply says "AAC".
AAC-LC is the AAC variant whose AOT is 2.
The following table lists the AOT IDs and the corresponding object types.
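As an illustration of how AOT IDs are used for identification, here is a minimal Python sketch of a lookup over a small subset of AOT IDs. The dictionary name `AOT_NAMES` and the helper `describe_aot` are hypothetical names chosen for this example, and only a handful of IDs are included.

```python
# Small, deliberately incomplete subset of MPEG-4 Audio Object Type IDs.
# AOT_NAMES and describe_aot are illustrative names, not part of any library.
AOT_NAMES = {
    1: "AAC Main",
    2: "AAC LC (Low Complexity)",
    3: "AAC SSR (Scalable Sampling Rate)",
    4: "AAC LTP (Long Term Prediction)",
    5: "SBR (Spectral Band Replication)",
    23: "ER AAC LD (Low Delay)",
    29: "PS (Parametric Stereo)",
}


def describe_aot(aot_id: int) -> str:
    """Return a readable name for an AOT ID, or a placeholder if it is not in the subset."""
    return AOT_NAMES.get(aot_id, f"AOT {aot_id} (not in this subset)")


print(describe_aot(2))
print(describe_aot(23))
```

A receiver that only supports AAC-LC could use such a lookup to reject streams whose AOT is not 2.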
4. MPEG-4 Audio Profiles
In MPEG-4 audio coding, a named set of audio capabilities is called an Audio Profile, and each profile is defined in terms of the AOTs it includes.
5. AAC-LC
Because AAC-LC is so popular and important, it is covered separately here even though it was introduced above.
Advanced Audio Coding Low-Complexity (AAC-LC / LC-AAC) originated in MPEG-2 and was designed for digital TV. AAC-LC is used when storage space and computing power are limited. This profile does not use the prediction and gain control tools, and the order of its temporal noise shaping (TNS) filter is relatively low.
AAC-LC makes full use of psychoacoustic principles. It exploits the perceptual irrelevance and the statistical redundancy present in audio signals to minimize the number of bits needed to represent the signal, achieving fast and effective compression while keeping the output signal perceptually close to the original.
The important technical points of AAC-LC are as follows.
Temporal Noise Shaping (TNS): TNS controls the temporal shape of the quantization noise and addresses the mismatch between the masking threshold and the quantization noise. It exploits time-frequency duality: a signal that is steady in the time domain varies sharply across the frequency domain, while a signal that is smooth across the frequency domain may change drastically in the time domain. For transient signals in the time domain, the spectral coefficients can therefore be predictively coded. By running the prediction over the spectral coefficients, the quantizer adapts to the time-domain behavior of the input signal, and the quantization noise is effectively controlled.
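The idea of predicting spectral coefficients can be sketched with a first-order linear predictor that runs across frequency. This is only an illustration under simplifying assumptions: a real TNS filter uses a higher order and quantized coefficients, and the residual is quantized before transmission. All function names here are hypothetical.

```python
def prediction_coefficient(spectrum):
    """First-order predictor coefficient from the lag-0 and lag-1 autocorrelation."""
    r0 = sum(x * x for x in spectrum)
    r1 = sum(spectrum[k] * spectrum[k - 1] for k in range(1, len(spectrum)))
    return r1 / r0 if r0 > 0.0 else 0.0


def tns_analysis(spectrum, a):
    """Encoder side: residual e[k] = x[k] - a * x[k-1], running across frequency."""
    residual = []
    previous = 0.0
    for x in spectrum:
        residual.append(x - a * previous)
        previous = x
    return residual


def tns_synthesis(residual, a):
    """Decoder side: x[k] = e[k] + a * x[k-1] restores the spectrum."""
    spectrum = []
    previous = 0.0
    for e in residual:
        previous = e + a * previous
        spectrum.append(previous)
    return spectrum


# Assumed toy spectrum with strong correlation between neighboring coefficients.
spectrum = [1.0, 0.5, 0.25, 0.125]
a = prediction_coefficient(spectrum)
residual = tns_analysis(spectrum, a)
restored = tns_synthesis(residual, a)
```

For this toy input the residual carries less energy than the original coefficients, which is what makes predictive coding of the spectrum worthwhile.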
Intensity Stereo: A method that uses psychoacoustic principles to improve coding efficiency. Because the human ear is not sensitive to the phase of high-frequency signals, two signals with similar energy and spectrum are perceived as the same. Therefore, when the two channels of a pair are highly correlated, the high-frequency part can be processed so that data is encoded and transmitted in only one channel, without affecting the sound quality reconstructed after decoding.
AAC-LC takes 6 kHz as the starting frequency for intensity stereo processing, and all intensity stereo processing is performed above this frequency. For each subband, the encoder calculates the energy of the left and right channels and their total, then calculates the ratio of the left channel energy to the total energy and converts it into an intensity factor. The left and right spectra in the band are summed and normalized according to this intensity factor, and the right channel data is set to zero, so that only the left channel data needs to be quantized and coded.
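The per-band steps described above can be sketched as follows. The exact normalization is not spelled out in the text, so this sketch assumes that the summed spectrum is scaled to carry the total energy of both channels, and that the decoder splits it back using the square root of the energy ratio. Function names are hypothetical.

```python
import math


def intensity_encode_band(left, right):
    """Return (carrier, factor) for one subband; factor = left energy / total energy."""
    e_left = sum(x * x for x in left)
    e_right = sum(x * x for x in right)
    e_total = e_left + e_right
    if e_total == 0.0:
        return [0.0] * len(left), 0.5
    factor = e_left / e_total
    summed = [l + r for l, r in zip(left, right)]
    e_sum = sum(x * x for x in summed)
    # Assumption: scale the sum so that it carries the total energy of both channels.
    # If the channels cancel exactly, the carrier is simply silent in this sketch.
    scale = math.sqrt(e_total / e_sum) if e_sum > 0.0 else 0.0
    return [x * scale for x in summed], factor


def intensity_decode_band(carrier, factor):
    """Rebuild both channels from the single transmitted carrier and the intensity factor."""
    left = [x * math.sqrt(factor) for x in carrier]
    right = [x * math.sqrt(1.0 - factor) for x in carrier]
    return left, right


# Assumed toy band in which the right channel is a scaled copy of the left.
carrier, factor = intensity_encode_band([2.0, 0.0], [1.0, 0.0])
left_out, right_out = intensity_decode_band(carrier, factor)
```

For fully correlated channels like the toy input, the round trip recovers both channels; for real signals only the energy distribution is preserved, which is the intended trade-off.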
Perceptual Noise Substitution (PNS): PNS identifies spectral components that are noise-like (their power spectral density is roughly uniform) and replaces them with artificial noise. When a frequency band is judged suitable for perceptual noise substitution, only the energy of that band is encoded and transmitted as a parameter, and the spectral values in the subband are not encoded. During decoding, a random vector generator scaled to the transmitted subband energy produces a similar noise signal.
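A minimal sketch of this encode/decode asymmetry, assuming a uniform random generator and exact energy matching (a real decoder follows the procedure defined in the standard; the function names here are hypothetical):

```python
import math
import random


def pns_encode_band(coefficients):
    """Encoder side: transmit only the band energy instead of the spectral values."""
    return sum(c * c for c in coefficients)


def pns_decode_band(energy, width, rng):
    """Decoder side: generate random values and scale them to the transmitted energy."""
    noise = [rng.uniform(-1.0, 1.0) for _ in range(width)]
    e_noise = sum(n * n for n in noise)
    scale = math.sqrt(energy / e_noise) if e_noise > 0.0 else 0.0
    return [n * scale for n in noise]


band = [0.3, -0.2, 0.25, -0.35, 0.1, 0.28]  # assumed noise-like coefficients
energy = pns_encode_band(band)
rebuilt = pns_decode_band(energy, len(band), random.Random(1234))
```

The rebuilt band differs from the original value by value but has the same energy, which is all that is transmitted for a substituted band.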
Middle/Side (M/S) Stereo: A method that uses the correlation between the signals of a channel pair to remove redundancy and reduce the coding bit rate. When the AAC-LC encoder finds that the left and right channel data are strongly correlated, it can encode Middle = (L+R)/2 and Side = (L-R)/2 in place of the left and right channel data. The energy is then concentrated in one channel, while the other channel needs only a few bits, which achieves data compression.
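The M/S transform and its inverse can be sketched directly from the formulas above. The sample values are assumptions chosen so that the two channels are nearly identical.

```python
def ms_encode(left, right):
    """Middle = (L + R) / 2, Side = (L - R) / 2."""
    mid = [(l + r) / 2 for l, r in zip(left, right)]
    side = [(l - r) / 2 for l, r in zip(left, right)]
    return mid, side


def ms_decode(mid, side):
    """Inverse transform: L = Middle + Side, R = Middle - Side."""
    left = [m + s for m, s in zip(mid, side)]
    right = [m - s for m, s in zip(mid, side)]
    return left, right


left = [1.0, 0.5, -0.25]
right = [1.0, 0.25, -0.25]
mid, side = ms_encode(left, right)
left_out, right_out = ms_decode(mid, side)
```

Here the side channel is almost entirely zero, so nearly all of the energy sits in the middle channel and the side channel is cheap to code.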
5. AAC-LD
Having covered AAC-LC, let us turn to AAC-LD.
AAC is a perceptual audio codec that delivers very high subjective sound quality at low bit rates. However, the algorithmic delay of such a codec at low bit rates often exceeds 100 ms, which makes it unsuitable for real-time two-way communication. G.722-based voice codecs are suitable for two-way communication because of their smaller algorithmic delay. However, speech codecs of this kind deliver good subjective quality only for speech signals and are not suitable for more complex audio signals; even at very high bit rates they struggle to achieve good sound quality.
The delay of a typical perceptual audio encoder has the following sources:
Framing delay: the block length required for the block transform;
Filterbank delay: the delay introduced by the analysis-synthesis filter bank;
Look-ahead delay for block switching: the look-ahead needed to detect transients before switching blocks;
Use of bit reservoir: the delay determined by the size of the bit reservoir relative to the average bit rate.
Total delay calculation formula:
Take the following AAC-LC as an example:
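As a rough sketch, the total can be modeled as the sum of the four contributions listed above. This additive model is an assumption, and so are the sample counts in the example (1024 samples each for framing and filterbank overlap, 576 samples of look-ahead, 48 kHz sampling rate); they are illustrative values for an AAC-LC-like configuration, not figures taken from this article.

```python
def total_delay_ms(frame_len, filterbank_len, lookahead_len, sample_rate,
                   reservoir_bits=0, bit_rate=None):
    """Sum of framing, filterbank, look-ahead, and bit reservoir delay in milliseconds.

    The first three terms are given in samples; the bit reservoir term is the time
    needed to transmit reservoir_bits at the average bit_rate (bits per second).
    """
    sample_delay = (frame_len + filterbank_len + lookahead_len) / sample_rate
    reservoir_delay = reservoir_bits / bit_rate if bit_rate else 0.0
    return 1000.0 * (sample_delay + reservoir_delay)


# Assumed illustrative values, without and with a bit reservoir.
without_reservoir = total_delay_ms(1024, 1024, 576, 48000)
with_reservoir = total_delay_ms(1024, 1024, 576, 48000,
                                reservoir_bits=2400, bit_rate=24000)
```

Under these assumptions the transform-related terms contribute about 55 ms, and a reservoir of 2400 bits at 24 kbit/s adds another 100 ms, which shows why the bit reservoir dominates the delay at low bit rates.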
In AAC-LD, the frame length is reduced from the original 1024 to 512 in order to reduce the delay. There is no block switching, which removes the look-ahead delay that block switching requires. To preserve coding quality for transient signals, a window shape switching mechanism is introduced instead. The available window shapes are the usual sine window and a low-overlap window, which overlaps very little with the following window. Together with the TNS tool, this suppresses the pre-echo artifacts produced by transient signals.
MPEG-4 Low Delay Audio Coder (AAC-LD) is derived directly from MPEG-2 AAC and combines the advantages of perceptual audio coding with the low delay required for two-way communication. It guarantees an algorithmic delay of at most 20 ms and good sound quality for signals that include both voice and music. MPEG-4 AAC-LD currently supports sampling rates up to 48 kHz and up to 2 channels (which can be extended to multiple channels).
6. AAC package format
Two commonly used formats serve as examples.
ADIF (Audio Data Interchange Format)
ADIF is the AAC audio packaging format specified by MPEG-2. Its defining feature is that the start of the audio data can be located deterministically. Decoding cannot begin in the middle of the audio data stream and must start at this clearly defined beginning. This format is therefore commonly used for files on disk.
ADTS (Audio Data Transport Stream)
This format is a bit stream with sync words, so decoding can start at any frame position in the stream. It is essentially a streaming format.
The purpose of frame synchronization is to locate the frame header in the bit stream. The frame header sync word in the AAC ADTS format is the 12-bit pattern "1111 1111 1111". The ADTS header consists of two parts: the fixed header information, immediately followed by the variable header information. The data in the fixed header is the same in every frame, while the variable header changes from frame to frame.
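Frame synchronization can be sketched as a scan for the 12-bit sync word. This sketch assumes byte-aligned frames, and the function name is hypothetical. A match is only a candidate, because the same bit pattern can also occur inside the audio payload.

```python
def find_adts_syncwords(data: bytes) -> list:
    """Return byte offsets where the 12-bit pattern 1111 1111 1111 starts."""
    positions = []
    for i in range(len(data) - 1):
        # 0xFF supplies the first 8 bits; the top 4 bits of the next byte supply the rest.
        if data[i] == 0xFF and (data[i + 1] & 0xF0) == 0xF0:
            positions.append(i)
    return positions


sample = bytes([0x00, 0xFF, 0xF1, 0x50, 0x80, 0xFF, 0x0F])
print(find_adts_syncwords(sample))
```

A robust parser would confirm each candidate by reading the frame length from the header and checking that another sync word follows at that distance.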
Fixed header information
Variable header information
Taken together, the two parts form the following table.
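As a sketch of how the two header parts fit together, the following Python function unpacks the first 7 bytes (56 bits) of an ADTS frame. It assumes the commonly documented layout: 28 bits of fixed header (sync word, ID, layer, protection_absent, profile, sampling_frequency_index, private_bit, channel_configuration, original/copy, home) followed by 28 bits of variable header (two copyright bits, 13-bit frame length, 11-bit buffer fullness, 2-bit raw data block count). The function and dictionary key names are hypothetical, and the optional CRC is ignored.

```python
SAMPLE_RATES = [96000, 88200, 64000, 48000, 44100, 32000, 24000,
                22050, 16000, 12000, 11025, 8000, 7350]


def parse_adts_header(data: bytes) -> dict:
    """Unpack the 56-bit ADTS header (fixed part followed by variable part)."""
    if len(data) < 7:
        raise ValueError("an ADTS header needs at least 7 bytes")
    bits = int.from_bytes(data[:7], "big")

    def field(offset, width):
        # offset counts bits from the start of the header, most significant bit first
        return (bits >> (56 - offset - width)) & ((1 << width) - 1)

    if field(0, 12) != 0xFFF:
        raise ValueError("sync word not found")
    sf_index = field(18, 4)
    return {
        # fixed header
        "id": field(12, 1),
        "layer": field(13, 2),
        "protection_absent": field(15, 1),
        "audio_object_type": field(16, 2) + 1,  # the profile field stores AOT minus 1
        "sample_rate": SAMPLE_RATES[sf_index] if sf_index < len(SAMPLE_RATES) else None,
        "channel_configuration": field(23, 3),
        # variable header
        "frame_length": field(30, 13),  # header plus payload, in bytes
        "buffer_fullness": field(43, 11),
        "raw_data_blocks": field(54, 2) + 1,
    }


header = parse_adts_header(bytes([0xFF, 0xF1, 0x50, 0x80, 0x2E, 0x7F, 0xFC]))
```

For this assumed example header, the result describes an AAC-LC frame (AOT 2) at 44100 Hz with 2 channels and a total frame length of 371 bytes.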
7. AAC copyright and patent license
Distributing files and streams in AAC format does not require a patent license or payment, so AAC content is easier to distribute legally than MP3. A patent license is required only from manufacturers and developers of AAC codecs who use them commercially, which is why software containing an AAC codec, such as FFmpeg and FAAC, is released in open source form.
The AAC patent license runs for a five-year term and can be renewed for another five-year term after re-evaluation at expiry.
The AAC license fee is a one-time entry fee of $15,000, with no annual follow-up fee. For small businesses the entry fee can be reduced to $1,000; a small business is defined as one with fewer than 15 employees and annual revenue below one million dollars.
AAC Licensors
• AT&T Corp.
• Dolby Laboratories, Inc.
• Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung, e.V.
• Koninklijke Philips N.V.
• Microsoft Corporation
• NEC Corporation
• NTT DOCOMO, INC.
• Orange SA
• Panasonic Corporation
• Telefonaktiebolaget LM Ericsson
8. Finally
What is AAC-LC, and what is the difference between AAC and AAC-LC?
AAC is the general term for the set of audio coding tools standardized in MPEG-2 and MPEG-4.
AAC-LC is the audio codec with an AOT of 2 within standardized AAC. It is characterized by low computational complexity, a small memory footprint, early standardization, good interoperability, good compatibility, and wide use. Its disadvantage is high algorithmic latency, which makes it poorly suited to real-time audio communication.