FMUSER Wirless Transmit Video And Audio More Easier !
es.fmuser.org
it.fmuser.org
fr.fmuser.org
de.fmuser.org
af.fmuser.org ->Afrikaans
sq.fmuser.org ->Albanian
ar.fmuser.org ->Arabic
hy.fmuser.org ->Armenian
az.fmuser.org ->Azerbaijani
eu.fmuser.org ->Basque
be.fmuser.org ->Belarusian
bg.fmuser.org ->Bulgarian
ca.fmuser.org ->Catalan
zh-CN.fmuser.org ->Chinese (Simplified)
zh-TW.fmuser.org ->Chinese (Traditional)
hr.fmuser.org ->Croatian
cs.fmuser.org ->Czech
da.fmuser.org ->Danish
nl.fmuser.org ->Dutch
et.fmuser.org ->Estonian
tl.fmuser.org ->Filipino
fi.fmuser.org ->Finnish
fr.fmuser.org ->French
gl.fmuser.org ->Galician
ka.fmuser.org ->Georgian
de.fmuser.org ->German
el.fmuser.org ->Greek
ht.fmuser.org ->Haitian Creole
iw.fmuser.org ->Hebrew
hi.fmuser.org ->Hindi
hu.fmuser.org ->Hungarian
is.fmuser.org ->Icelandic
id.fmuser.org ->Indonesian
ga.fmuser.org ->Irish
it.fmuser.org ->Italian
ja.fmuser.org ->Japanese
ko.fmuser.org ->Korean
lv.fmuser.org ->Latvian
lt.fmuser.org ->Lithuanian
mk.fmuser.org ->Macedonian
ms.fmuser.org ->Malay
mt.fmuser.org ->Maltese
no.fmuser.org ->Norwegian
fa.fmuser.org ->Persian
pl.fmuser.org ->Polish
pt.fmuser.org ->Portuguese
ro.fmuser.org ->Romanian
ru.fmuser.org ->Russian
sr.fmuser.org ->Serbian
sk.fmuser.org ->Slovak
sl.fmuser.org ->Slovenian
es.fmuser.org ->Spanish
sw.fmuser.org ->Swahili
sv.fmuser.org ->Swedish
th.fmuser.org ->Thai
tr.fmuser.org ->Turkish
uk.fmuser.org ->Ukrainian
ur.fmuser.org ->Urdu
vi.fmuser.org ->Vietnamese
cy.fmuser.org ->Welsh
yi.fmuser.org ->Yiddish
1、 Video transmission principle
Video is to use the principle of human eye vision retention, through playing a series of pictures, make the human eye feel of movement. The video is transmitted only, and the amount of video is very large, which is unacceptable to the existing network and storage. In order to make video easy to transmit and store, people find that video has a lot of repeated information. If the duplicate information is removed from the transmission end and recovered at the receiving end, the file of video data is greatly reduced, so H.264 video compression standard is available.
The original image data in the video will be compressed in H.264 encoding format, and the audio sampling data will be compressed in AAC coding format. After coding and compressing, video content is really conducive to storage and transmission. However, when watching the playback, the decoding process is also needed accordingly. Therefore, it is obvious that a kind of convention can be understood by both encoder and decoder is required between encoding and decoding. In terms of video image coding and decoding, this Convention is simple:
The encoder encodes multiple images and produces a GOP (group of pictures) in a segment. When playing, the decoder reads a section of GOP for decoding, then reads the picture and then renders the display. GOP (group of pictures) is a series of continuous pictures, which consists of one I frame and several B / P frames. It is the basic unit of video image encoder and decoder access. Its arrangement sequence will be repeated until the end of image. I frame is an internal coding frame (also known as keyframe), P frame is forward prediction frame (forward reference frame), and frame B is bidirectional interpolation frame (bidirectional reference frame). In short, frame I is a complete picture, while P and B record changes relative to I frame. Without I frames, P and B frames cannot be decoded.
In H.264 compression standard, I frame, P frame and B frame are used to represent the transmitted video picture.
Back to the top
2、 Frame I, P, frame B, GOP
Frame I
Frame I: that is, intra coded picture. Frame I represents key frame, and you can understand that the picture of this frame is completely preserved; decoding only needs the data of the frame to complete (because it contains the complete picture). Also known as internal picture, I frame is usually the first frame of each GOP (a video compression technology used by MPEG). After moderate compression, it can be used as a random access reference point and can be regarded as an image. In MPEG coding, some video frame sequences are compressed into I frames; some are compressed into P frames; and some are compressed into B frames. I frame method is an in frame compression method, also known as "key frame" compression method. I frame method is a compression technology based on DCT (discrete cosine transform), which is similar to JPEG compression algorithm. The compression ratio of 1/6 can be achieved by using I frame compression without obvious compression trace.
[I frame features]
1. it is a full frame compression encoding frame. It encodes and transmits the whole frame image information by JPEG compression;
2. the complete image can be reconstructed only by using the data of I frame during decoding;
3. frame I describes the background of the image and the details of the moving body;
4. I frame is not generated by referring to other pictures;
5. I frame is the reference frame of P frame and B frame (its quality directly affects the quality of each frame in the same group);
6. I frame is the basic frame (the first frame) of frame group GOP, and only one I frame is in a group;
7. Motion vector is not required for frame I;
8. the information of I frame is relatively large.
[I frame coding process]
(1) The intra prediction is carried out to determine the intra prediction mode.
(2) The pixel value subtracts the predicted value to get the residual.
(3) The residual is transformed and quantified.
(4) Variable length coding and arithmetic coding.
(5) The image is reconstructed and filtered, and the image is used as reference frame of other frames.
For example, in video conference system, the image sent by the terminal to MCU (or MCU to terminal) does not send a complete picture to the remote end at a time, but only the part that changes after sending the picture based on the previous picture. If the network condition is not good, the terminal will receive the remote end or send to the remote picture with packet loss and the image will be blocked. In this case, if there is no frame mechanism to send a new complete image to the local (or local re send a new complete image to the remote), the flower screen of the output image of the terminal will be displayed The phenomenon of Caton will become more and more serious, which will cause the meeting to fail to be carried out normally.
In the process of video screen playing, if I frame is lost, the P frame behind will also appear as soon as it can not be solved, and the phenomenon of black screen will appear; if P frame is lost, the video screen will show the phenomenon of flower screen and mosaic.
In video conference system, I frame will only occur within the bandwidth limit of the meeting, and will not exceed the meeting bandwidth and take effect. I frame mechanism not only exists in MCU, but also in TV wall server and video server. It is to solve the problem of packet loss in the case of bad network conditions, such as image screen and button, which will affect the normal progress of the meeting.
P frame
P frame: that is, predictive coded picture. P frame represents the difference between this frame and the previous key frame (or P frame). When decoding, the difference defined in this frame needs to be superimposed on the previous cached picture to generate the final picture. (that is, difference frame, P frame has no complete picture data, only data different from the previous frame)
[prediction and reconstruction of P frame]
P frame is a frame of reference I, in which the prediction value and motion vector of "a point" of P frame are found in the I frame, and the prediction difference and motion vector are transmitted together. The predicted value of "point" of P frame is found from the I frame according to the motion vector at the receiver and the difference is added to get the sample value of "point" of P frame, so that the complete P frame can be obtained.
[P frame features]
1. P frame is the encoding frame with 1-2 frames behind I frame;
2. P frame adopts motion compensation method to transmit the difference and motion vector (prediction error) between P frame and I frame;
3. the prediction value and prediction error in I frame must be summed up in decoding before the complete P frame image can be reconstructed;
4. P frame belongs to forward prediction inter frame coding. It only refers to the I frame or P frame closest to it in front;
5. P frame can be the reference frame of P frame after it, or it can be the reference frame of B frame before and after it;
6. because P frame is a reference frame, it may cause the spread of decoding error;
7. because of the difference transmission, the compression of P frame is relatively high.
Frame B
Frame B: that is, bidirectionally predicted picture. Frame B is a two-way difference frame, that is, the difference between the frame and the front and rear frames is recorded in B frame. In other words, to decode B frame, not only the previous cache picture but also the decoded picture is needed. The final picture is obtained by the superposition of the front and rear pictures and the data of the frame. B frame compression rate is high, but CPU will be more tired when decoding.
[prediction and reconstruction of B frame]
Frame B is based on the front I or P frame and the P frame at the back as the reference frame. The prediction value and two motion vectors of "a point" of frame B are found, and the prediction difference and motion vector are taken to transmit. The receiver "finds out (calculates)" predicted value in two reference frames according to motion vector and sums up with difference, and obtains the sample value of "certain point" of frame B, thus obtaining complete B frame. Two way prediction coding between frames by motion prediction
[B frame features]
1. Frame B is predicted by the front I or P frame and the later P frame;
2. The B frame transmits the prediction error and motion vector between it and the I frame or P frame and the P frame at the back;
3. B frame is a bidirectional prediction coding frame;
4. The compression ratio of frame B is the highest, because it only reflects the change of the main body of the motion between C reference frames, and the prediction is more accurate;
5. B frame is not a reference frame, and it will not cause the spread of decoding error
[why B frame is needed]
From the above, we know that the decoding algorithm of I and P is relatively simple and the resource occupation is relatively small. I just need to complete it by myself. P. it only needs the decoder to cache the previous picture. When encountering P, it is better to use the previously cached picture. If the video stream only has I and P, the decoder can read and decode while reading and linear advance. It is very comfortable for us to move forward I don't like it. So why do you want to introduce B frame?
Many movies on the network adopt B frame, because the difference between front and rear frames recorded by B frame can save more space than P frame. However, the file is small, and the decoder is in trouble. In decoding, not only the picture cached before, but also the next I or P picture (that is, pre read and decode) is used. Moreover, frame B can not be simply lost, Because B frame actually contains picture information, if it is simply lost and repeated with the previous picture, it will cause the picture card (in fact, it is lost). In order to save space, the movies on the network often use quite a lot of B frames. The more B frames are used, it will cause more trouble for players who do not support B frame, and the more the picture will be stuck.
GOP (sequence) and IDR
In h264, the image is organized in sequence, and one sequence is a data stream after image coding.
The first image in a sequence is called an IDR image (refresh the image immediately), and the IDR image is an I frame image. H. 264 introduces IDR image to resynchronize decoding. When decoder decodes IDR image, it will immediately empty the reference frame queue, output or discard all decoded data, re search parameter set and start a new sequence. In this way, if there is a major error in the previous sequence, you can get the opportunity to resynchronize here. Images after IDR images will never be decoded using data from previous IDR images.
A sequence is a series of data streams generated after the image coding with little content difference. When the motion changes less, a sequence can be very long, because less movement changes represent the content change of the image picture is very small, so you can make an I frame, and then always P frame and B frame. When the motion changes a lot, one sequence may be shorter, for example, it contains an I frame and 3 or 4 P frames.
In video coding sequence, GOP is group of picture, which refers to the distance between two I frames, and reference refers to the distance between two P frames. A group of pictures is formed between the two I frames, namely GOP (group of picture).
|
Enter email to get a surprise
es.fmuser.org
it.fmuser.org
fr.fmuser.org
de.fmuser.org
af.fmuser.org ->Afrikaans
sq.fmuser.org ->Albanian
ar.fmuser.org ->Arabic
hy.fmuser.org ->Armenian
az.fmuser.org ->Azerbaijani
eu.fmuser.org ->Basque
be.fmuser.org ->Belarusian
bg.fmuser.org ->Bulgarian
ca.fmuser.org ->Catalan
zh-CN.fmuser.org ->Chinese (Simplified)
zh-TW.fmuser.org ->Chinese (Traditional)
hr.fmuser.org ->Croatian
cs.fmuser.org ->Czech
da.fmuser.org ->Danish
nl.fmuser.org ->Dutch
et.fmuser.org ->Estonian
tl.fmuser.org ->Filipino
fi.fmuser.org ->Finnish
fr.fmuser.org ->French
gl.fmuser.org ->Galician
ka.fmuser.org ->Georgian
de.fmuser.org ->German
el.fmuser.org ->Greek
ht.fmuser.org ->Haitian Creole
iw.fmuser.org ->Hebrew
hi.fmuser.org ->Hindi
hu.fmuser.org ->Hungarian
is.fmuser.org ->Icelandic
id.fmuser.org ->Indonesian
ga.fmuser.org ->Irish
it.fmuser.org ->Italian
ja.fmuser.org ->Japanese
ko.fmuser.org ->Korean
lv.fmuser.org ->Latvian
lt.fmuser.org ->Lithuanian
mk.fmuser.org ->Macedonian
ms.fmuser.org ->Malay
mt.fmuser.org ->Maltese
no.fmuser.org ->Norwegian
fa.fmuser.org ->Persian
pl.fmuser.org ->Polish
pt.fmuser.org ->Portuguese
ro.fmuser.org ->Romanian
ru.fmuser.org ->Russian
sr.fmuser.org ->Serbian
sk.fmuser.org ->Slovak
sl.fmuser.org ->Slovenian
es.fmuser.org ->Spanish
sw.fmuser.org ->Swahili
sv.fmuser.org ->Swedish
th.fmuser.org ->Thai
tr.fmuser.org ->Turkish
uk.fmuser.org ->Ukrainian
ur.fmuser.org ->Urdu
vi.fmuser.org ->Vietnamese
cy.fmuser.org ->Welsh
yi.fmuser.org ->Yiddish
FMUSER Wirless Transmit Video And Audio More Easier !
Contact
Address:
No.305 Room HuiLan Building No.273 Huanpu Road Guangzhou China 510620
Categories
Newsletter