"1. Main differences between H.264 and H.265
H.265 still uses a mixed codec, and the codec domain H.264 is basically consistent.
The main difference is:
• Coding block division structure: The recursive structure of Cu (CodingUnit), PU (PREDictionUnit) and TU (TransformUnit).
? Basic details: There are many differences in the internal details of each function block.
? Parallel Tools: Added parallel tool sets such as Tile and WPP to increase coding speed
• Filter: Added the SAO (Sample Adaptive Offset filtering module) after the block filtering
Fig.h.265 frame map
Be
2. Comparison of compression performance
PSNR calculation method
H.265 / HEVC HM-9.0 and H.264 JM-18.4 BD-RATE Comparison:
Allintra Case: 22%
Randomaccess Case: 34%
Lowdelay Case: 37%
Be
3. Each module technology difference summary
Be
4. Block division structure
In H.265, the size of the macroblock is extended from H.264 to 16 × 16 of 64 × 64 to facilitate the compression of high resolution video.
At the same time, a more flexible encoding structure is used to improve coding efficiency.
Includes coding units, predictUnit, and transformunit.
As shown below:
in:
The encoding unit is similar to the concept of macroblock in H.264 / AVC, for encoding the process.
The prediction unit is a basic unit for prediction,
The transform unit is a basic unit for conversion and quantization.
Be
The separation of these three units makes the transformation, prediction, and encoding various processing links,
It is also conducive to the division of each link to more in line with the texture characteristics of the video image.
It is conducive to the completion of their respective functions more optimized.
Be
Be
RQT is an adaptive transform technology that extends and expands the extension and expansion of ABT (AdaptiveBlock-size transform) technology in H.264 / AVC.
For inter-frame encoding, it allows the size of the transform block to adaptive adjustments according to the size of the motion compensation block;
For intra coding, it allows the size of the transform block to adaptive adjustments according to the characteristics of the intra predictive residual.
Big block transformations can provide better energy concentrations with respect to small blocks, and can save more image details after quantification, but on the other hand, there will be more ringing after quantification. effect.
Therefore, according to the characteristics of the current block signal, the adaptive selection variable block size, as shown in the figure below, can be obtained by the excellent energy concentration, the degree retention level, and the rings effect of the image.
Flexible block structure diagram
Be
Be
5. Frame prediction mode
Essentially H.265 is the increase of more predictions on the prediction direction of H.264
H.265: All size Cu blocks, 35 prediction directions in brightness, 5 prediction directions
H.264: 9 directions of brightness 4x4 block, 8x8 blocks in 9 directions, 16x16 blocks 4 orientation, 4 directions
Be
The intra prediction direction of H.264:
Be
The intra prediction direction of H.265:
Be
6. Inter-frame prediction
Essentially H.265 is a number of tap coefficients, changing tap coefficients, and increasing the number of candidates for the motion vector predictive value to achieve the purpose of reducing predictive residuals.
H.265 is the same as H.264 is brightness to 1/4, chromaticity to 1/8 precision, but the interpolation filter tip length and coefficient are different.
H.265 increases the number of motion vector predicted values, while H.264 predicted value only one
Be
H.265 airspace candidate:
H.265 Time Domain Common Position Candle
Be
7. Device filtering
Essentially H.265's deck filtering process is consistent with H.264's deck filtering process, which has made the following most significant changes:
Filter boundary: H.264 minimum to 4x4 boundary filtering; H.265 adapted to the latest Cu, PU and TU division structure filter edge, minimum filtering boundary is 8x8,
• Filter order: The vertical boundary is used in the H264 prior macro block, and the horizontal boundary of the current macroblock; and the vertical boundary of the H.265 first frame, the horizontal boundary of the whole frame
Be
The ALF is within the codecline, after DEBLOCK and SAO,
Used to restore the reconstructed image to achieve a wifter difference (MSE) between the reconstruction image and the original image.
The factor of the ALF is in frame-level computing and transmission, and ALF can be applied to the entire frame.
ALF can also be performed for partial regions based on block or quadtree.
If it is an ALF based on the partial area, additional information indicating the area information must also be passed.
Be
8. Sample AdaptiveOffset filtering point adaptiveoffset
SAO (Sample Adaptive Offset filtering is actually classified by the rebuilding pixels after the deblock filtering, and compensates for each classification pixel, and the classification template is divided into BO (BAND Offset) and EO (Edge Offset).
Be
BO Category:
Be
EO classification module:
SAO is in the codeca, after DEBLOCK, by the classification of the reconstructed image, add an offset to each type of image pixel value, reducing the purpose of reducing distortion, thereby increasing the compression ratio, reducing the code stream.
After SAO, the average can reduce the code stream of 2% to 6%, while the performance consumption of the encoder and decoder increases by about 2%.
Be
9. TILE
TILE: Split the image into a rectangular area.
Its main purpose is to enhance parallel processing performance.
Each Tile region is equivalent to a sub-image, which can be coded in units of LCU blocks.
A Tile block is a basic parallel unit, each Tile is a subcode stream.
Be
10. WPP
WPP: Full called Wavefront Parallel Process, basic encoding units based on LCU behavior.
Taking a line of LCU blocks as a basic parallel unit, each line of LCU is a sub-code stream
Be
11. Dependentslice
Dependent Slice: This technology can be understood as a data division of the original Slice NALU to make it suitable for more flexible packaging methods.
The schematic of Slice and Dependent Slice is as follows
12. Other related technologies
TRANSFORM_SKIP mode: Transform_skip_flag, this mode is not transformed, but it is quantified, and this mode has a better effect on text desktop video.
• Internal bit depth increase: in order to ensure intermediate prediction, transformation, and internal bits during the quantization process to achieve better compression performance
1. Main differences between H.264 and H.265
H.265 still uses a mixed codec, and the codec domain H.264 is basically consistent.
The main difference is:
• Coding block division structure: The recursive structure of Cu (CodingUnit), PU (PREDictionUnit) and TU (TransformUnit).
? Basic details: There are many differences in the internal details of each function block.
? Parallel Tools: Added parallel tool sets such as Tile and WPP to increase coding speed
• Filter: Added the SAO (Sample Adaptive Offset filtering module) after the block filtering
Fig.h.265 frame map
Be
2. Comparison of compression performance
PSNR calculation method
H.265 / HEVC HM-9.0 and H.264 JM-18.4 BD-RATE Comparison:
Allintra Case: 22%
Randomaccess Case: 34%
Lowdelay Case: 37%
Be
3. Each module technology difference summary
Be
4. Block division structure
In H.265, the size of the macroblock is extended from H.264 to 16 × 16 of 64 × 64 to facilitate the compression of high resolution video.
At the same time, a more flexible encoding structure is used to improve coding efficiency.
Includes coding units, predictUnit, and transformunit.
As shown below:
in:
The encoding unit is similar to the concept of macroblock in H.264 / AVC, for encoding the process.
The prediction unit is a basic unit for prediction,
The transform unit is a basic unit for conversion and quantization.
Be
The separation of these three units makes the transformation, prediction, and encoding various processing links,
It is also conducive to the division of each link to more in line with the texture characteristics of the video image.
It is conducive to the completion of their respective functions more optimized.
Be
Be
RQT is an adaptive transform technology that extends and expands the extension and expansion of ABT (AdaptiveBlock-size transform) technology in H.264 / AVC.
For inter-frame encoding, it allows the size of the transform block to adaptive adjustments according to the size of the motion compensation block;
For intra coding, it allows the size of the transform block to adaptive adjustments according to the characteristics of the intra predictive residual.
Big block transformations can provide better energy concentrations with respect to small blocks, and can save more image details after quantification, but on the other hand, there will be more ringing after quantification. effect.
Therefore, according to the characteristics of the current block signal, the adaptive selection variable block size, as shown in the figure below, can be obtained by the excellent energy concentration, the degree retention level, and the rings effect of the image.
Flexible block structure diagram
Be
Be
5. Frame prediction mode
Essentially H.265 is the increase of more predictions on the prediction direction of H.264
H.265: All size Cu blocks, 35 prediction directions in brightness, 5 prediction directions
H.264: 9 directions of brightness 4x4 block, 8x8 blocks in 9 directions, 16x16 blocks 4 orientation, 4 directions
Be
The intra prediction direction of H.264:
Be
The intra prediction direction of H.265:
Be
6. Inter-frame prediction
Essentially H.265 is a number of tap coefficients, changing tap coefficients, and increasing the number of candidates for the motion vector predictive value to achieve the purpose of reducing predictive residuals.
H.265 is the same as H.264 is brightness to 1/4, chromaticity to 1/8 precision, but the interpolation filter tip length and coefficient are different.
H.265 increases the number of motion vector predicted values, while H.264 predicted value only one
Be
H.265 airspace candidate:
H.265 Time Domain Common Position Candle
Be
7. Device filtering
Essentially H.265's deck filtering process is consistent with H.264's deck filtering process, which has made the following most significant changes:
Filter boundary: H.264 minimum to 4x4 boundary filtering; H.265 adapted to the latest Cu, PU and TU division structure filter edge, minimum filtering boundary is 8x8,
• Filter order: The vertical boundary is used in the H264 prior macro block, and the horizontal boundary of the current macroblock; and the vertical boundary of the H.265 first frame, the horizontal boundary of the whole frame
Be
The ALF is within the codecline, after DEBLOCK and SAO,
Used to restore the reconstructed image to achieve a wifter difference (MSE) between the reconstruction image and the original image.
The factor of the ALF is in frame-level computing and transmission, and ALF can be applied to the entire frame.
ALF can also be performed for partial regions based on block or quadtree.
If it is an ALF based on the partial area, additional information indicating the area information must also be passed.
Be
8. Sample AdaptiveOffset filtering point adaptiveoffset
SAO (Sample Adaptive Offset filtering is actually classified by the rebuilding pixels after the deblock filtering, and compensates for each classification pixel, and the classification template is divided into BO (BAND Offset) and EO (Edge Offset).
Be
BO Category:
Be
EO classification module:
SAO is in the codeca, after DEBLOCK, by the classification of the reconstructed image, add an offset to each type of image pixel value, reducing the purpose of reducing distortion, thereby increasing the compression ratio, reducing the code stream.
After SAO, the average can reduce the code stream of 2% to 6%, while the performance consumption of the encoder and decoder increases by about 2%.
Be
9. TILE
TILE: Split the image into a rectangular area.
Its main purpose is to enhance parallelismProcessability.
Each tile region corresponds to a sub-image independently to the codec block units LCU.
Tile a basic block units in parallel, each sub-stream of a Tile
10. WPP
WPP: full name wavefront parallel process, to conduct basic coding unit LCU.
LCU one line substantially parallel block units, each row of a sub-stream LCU
11. Dependentslice
Dependent Slice: This technology can be understood as a data division of the original Slice NALU to make it suitable for more flexible packaging methods.
The schematic of Slice and Dependent Slice is as follows
12. Other related technologies
? Transform_skip mode: transform_skip_flag, the model does not transform, but to quantify, the model has a good effect on the text desktop video
? Internal bit depth increased: in order to ensure inter-prediction, transform and quantization bit depth inside the process in order to achieve better compression performance "
Our other product: