An estimation-theoretic framework for spatially scalable video coding.

IEEE Transactions on Image Processing(2014)

引用 5|浏览44
暂无评分
摘要
This paper focuses on prediction optimality in spatially scalable video coding. It draws inspiration from an estimation-theoretic prediction framework for quality (SNR) scalability earlier developed by our group, which achieved optimality by fully accounting for relevant information from the current base layer (e.g., quantization intervals) and the enhancement layer, to efficiently calculate the conditional expectation that forms the optimal predictor. It was central to that approach that all layers reconstruct approximations to the same original transform coefficient. In spatial scalability, however, the layers encode different resolution versions of the signal. To approach optimality in enhancement layer prediction, this paper departs from existing spatially scalable codecs that employ pixel domain resampling to perform interlayer prediction. Instead, it incorporates a transform domain resampling technique that ensures that the base layer quantization intervals are accessible and usable at the enhancement layer despite their differing signal resolutions, which in conjunction with prior enhancement layer information, enable optimal prediction. A delayed prediction approach that complements this framework for spatial scalable video coding is then provided to further exploit future base layer frames for additional enhancement layer coding performance gains. Finally, a low-complexity variant of the proposed estimation-theoretic prediction approach is also devised, which approximates the conditional expectation by switching between three predictors depending on a simple condition involving information from both layers, and which retains significant performance gains. Simulations provide experimental evidence that the proposed approaches substantially outperform the standard scalable video codec and other leading competitors.
更多
查看译文
关键词
base layer quantization intervals accessibility,quality snr scalability,layer approximations reconstruction,estimation-theoretic prediction,image resolution,transform coefficient,quantisation (signal),additional enhancement layer coding performance gains,delayed prediction,estimation-theoretic prediction framework,estimation theory,optimal prediction,scalable video coding,delayed prediction approach,transform domain resampling,image sampling,spatially scalable video coding,video coding,layer frames,spatially scalable codecs,low-complexity variant,prediction theory,current base layer,transform domain resampling technique,spatial scalability,transforms,signal resolutions,image enhancement,enhancement layer prediction,scalable video codec,video codecs
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要