isovideo Demeler: Realtime GPU accerated HD motion-compensated deinterlacing/inverse telecine technology

Abstract

The HEVC and AVC compression efficiency of interlace content coded with interlace modes is informally compared against the same content deinterlaced and coded as progressive sequences. For example, each raw source 1080i 60 fields/sec interlace sequence is coded as a level 4.1 bitstream, while a copy of the source is also deinterlaced to 1080p 60 frames/sec and then coded as level 4.2 progressive bitstream. Each deinterlaced progressive-coded sequence has the same temporal sample rate, but twice the lines and hence twice the pixel rate of the original 1080i interlace source sequence. On the test set selected of content known to be challenging for deinterlacers, the bitstreams produced by encoding with HEVC HM 8.0 in Main Profile RA fixed-Qp on deinterlaced progressive frame sequences average ~17% smaller according to the bdrate() macro than bitstreams on the same content coded as interlaced field sequences. In a separate experiment comparing the effect of deinterlacing before AVC encoding vs. deinterlacing after decoding, bdrate() suggests that better results are yielded by the former path of deinterlacing prior to encoding: AVC tests performed with x264 default MBAFF mode (also fixed-QP “rate control”), the deinterlaced progressive coded bitstream average 18% smaller than their MBAFF bitstream counterparts.

Introduction

The unobserved fields in between each known field in the interlace source sequence is interpolated by a 2500+ operation per pixel prediction engine in the Demeler deinterlacer that blends motion compensated pixels from the adjacent known fields of opposite parity with the current field. The quality of the deinterlacer is judged by observers to fall into the “high” professional category of commercial products.

As illustrated in the accompanying PDF slides to this document, the first test compares HEVC coding of deinterlaced frame sequences against interlaced field sequences on the same content. For simulation speed reasons, AVC is applied in the second test which compares the effect of deinterlacing prior to encoding vs. deinterlacing after decoding.

Results

As summarized in Table 1 below, the average size of the HEVC coded deinterlaced frame sequences average 17% smaller than HEVC fields sequence bitstreams. The list of sequences [1] below are the subset in the study that were captured from interlaced cameras or computer generated to challenge deinterlacers (BBC_sequence3, ants).

Table 1 shows the results of deinterlaced frame sequences vs. interlaced field sequences with HM 8.0 RA common configurations [2].

Table 1: HEVC deinterlaced frame sequence vs. interlaced field sequence

Sequence	BD Rate
BBC_sequence3	-8.21%
ants	-35.74%
basketball	9.79%
cactus_and_comb	-38.91%
canoe	12.88%
flower_garden (480i)	-28.42%
flower_garden (576i)	-37.10%
harpist	-8.70%
mobile_and_calendar	-20.02%
pageant	-21.87%
rugby	14.83%
schumacher	31.74%
susie (480i)	-41.64%
susie (576i)	-31.32%
tennis (480i)	-33.20%
table_tennis (576i)	-30.70%
Average	-16.66%

As shown in Table 2, a different test set (from Table 1) of progressive source material is first interlaced (by filtering and scaling each source frame into a simulated field using Meler), and then encoded with x264’s default mode [3] settings including MBAFF.

Table 2: pre-deinterlacing BD rate reductions over post-deinterlaced

Sequence	Y BDRATE
Portland_Front_St	-20.75%
Portland_Max	-10.43%
basketball	-27.47%
beach_waterfall	-29.71%
bike_jumping2	-26.10%
cat	-27.69%
cheerleaders	0.75%
controlled_burn	22.47%
crowd_run	-16.43%
ducks_take_off	12.79%
ice_skating	-2.95%
mobcal_ter	-35.01%
octopus_tree	-40.21%
old_town_cross	-35.36%
park_joy	-11.91%
riverplace	-29.45%
train_pan_zoom	-26.44%
Average	-17.70%

Conclusions

Though overall pleasing to the viewer, interpolation of missing fields in deinterlacing introduces small amount of artificial spurious decision noise and jaggy edges at the pixel level that trigger HEVC HM to code more coefficients and smaller blocks than it would otherwise for native progressive sources. Suppression of such artifacts either by pre-processing or various encoder optimizations such as coefficient thresholding/decimation will likely improve deinterlace-before-encoding results. The sequences which show gains from deinterlacing prior to encoding are likely due to the number of motion-compensated interpolation steps tallying over 2500 operations per output pixel that provide smooth continuity for spatial and temporal video coding tools. Preliminary results not included here also suggest that adaptive quantization is a more appropriate mode for AVC and HEVC deinterlaced coding.

Deinterlaced Raw Sequences for Independent Compression Tests

In order allow independent confirmation of our compression test results, isovideo is making the Demeler-deinterlaced output available for some well-known interlaced-captured sources. The links below provide access to files that were deinterlaced and then lossless-compressed using x264, in the matroska contrainer format. Lossless-compressed files are still very large, and must be downloaded before viewing. They can be converted using ffmpeg/avconv, or viewed with the freely available vlc and mplayer (with versions available for all major operating systems). Some HD sources (*pip_isovideo.mkv) were originally progressive, so they were interlaced before being deinterlaced.

Note: Since this paper was originally published in October 2012, the old test video results are no longer available for download.

References

[1] Interlaced source sequences:https://media.xiph.org/vqeg/TestSeqences.
[2] F. Bossen, “HM 8 Common Test Conditions and Software Reference Configurations”, JCTVC-J1100, July 2012.
[3] x264 reference software:http://x264.nl/.

Follow:

AVC and HEVC Coding Efficiency of Deinterlaced Sequences October 2012

Table 2: pre-deinterlacing BD rate reductions over post-deinterlaced

AVC and HEVC Coding Efficiency of Deinterlaced Sequences
October 2012