INTRODUCTION

Welcome to SIAT Video Team (SVT), comprised of members from High Performance Computing Center (HPCC), Shenzhen Institute of Advanced Technology (SIAT), Chinese Academy of Sciences (CAS). We have long been engaged in the field of multimedia communications and visual signal processing for 2D/3D, VR/AR videos, including video coding, visual signal pre/post-processing, and computational visual perception. We are also pursuing challenging problems in the innovation areas, such as VR/AR, AI etc.
Join Us      

RECENT PUBLICATIONS
Deep Learning Based Just Noticeable Difference and Perceptual Quality Prediction Models for Compressed Video

IEEE Transactions on Circuits and Systems for Video Technology (IEEE T-SCVT) 2021 .

Yun Zhang,Huanhua Liu, You Yang, Xiaoping Fan, Sam Kwong, C. C. Jay Kuo Full-Text

Full-Text     
Highly Efficient Multiview Depth Coding Based on Histogram Projection and Allowable Depth Distortion

IEEE Transactions on Image Processing (IEEE T-IP), 2021 .

Yun Zhang*, Linwei Zhu, Raouf Hamzaoi, Sam Kowng, Yo-Sung Ho

Full-Text     
Projection Invariant Feature and Visual Saliency-Based Stereoscopic Omnidirectional Image Quality Assessment     

IEEE Transactions on Broadcasting (IEEE T-BC), 2021 .

Xuemei Zhou, Yun Zhang*, Na Li, Xu Wang, Yang Zhou and Yo-Sung Ho

Full-Text     
Learning-based Satisfied User Ratio Prediction for Symmetrically and Asymmetrically Compressed Stereoscopic Images     

IEEE Transactions on Multimedia (IEEE T-MM), 2021 .

Chunling Fan, Yun Zhang*, Raouf Hamzaoui, Qingshan Jiang, Djemei Ziou

Full-Text     
Deep Learning-Based Chroma Prediction for Intra Versatile Video Coding     

IEEE Transactions on Circuits and Systems for Video Technology (IEEE T-CSVT), 2020

Linwei Zhu, Yun Zhang*, Shiqi Wang, Sam Kwong, Xin Jin, and Yu Qiao

Full-Text     
Machine Learning Based Video Coding Optimizations: A Survey     

Information Sciences (INS), Elsevier, 2020.

Yun Zhang, Sam Kwong*, Shiqi Wang

Full-Text     
Deep Learning Based Picture-Wise Just Noticeable Distortion Prediction Model for Image Compression     

IEEE Transactions on Image Processing (IEEE T-IP). 2020 .

Huanhua Liu, Yun Zhang*, Huan Zhang, Chunling Fan, Sam Kwong, C.C. Jay Kuo, and Xiaoping Fan

Full-Text     
Sparse Representation based Video Quality Assessment for Synthesized 3D Videos     

IEEE Transactions on Image Processing (IEEE T-IP) . 2020 .

Yun Zhang*, Huan Zhang, Mei Yu, Sam Kwong, and Yo-Sung Ho

Full-Text     
Generative Adversarial Network Based Intra Prediction for Video Coding     

IEEE Transactions on Multimedia (IEEE T-MM) . 2020 .

Linwei Zhu, Sam Kwong, Yun Zhang, Shiqi Wang, and Xu Wang

Full-Text     
DATABASE
SIAT Synthesized Video Quality Database Project Page     

We develop a synthesized video quality database which includes ten different MVD sequences and 140 synthesized videos in 1024×768 and 1920×1088 resolution. For each sequence, 14 different texture/depth quantization combinations were used to generate the texture/depth view pairs with compression distortion. A total of 56 subjects participated in the experiment. Each synthesized sequence was rated by 40 subjects using single stimulus paradigm with continuous score. The Difference Mean Opinion Scores (DMOS) are provided.

SIAT Depth Quality Database      Project Page     

We develop a stereoscopic video depth quality database which includes ten different stereoscopic sequences and 160 distorted stereo videos in 1920×1080 resolution. The ten sequences are from the Nantes-Madrid-3D-Stereoscopic-V1 (NAMA3DS1) database. There are four categories of impairments in the NAMA3DS1 database which are H.264 coding, JPEG2000 coding, down-sampling and sharpening. However, only symmetric distortions are considered in NAMA3DS1 database. Since both symmetric and asymmetric distortions are necessary to study, we generate additional stereoscopic videos with asymmetric distortions. There are 90 symmetrically distorted video pairs and 70 asymmetrically distorted video pairs. 30 subjects (24 male, 6 female) participated in the symmetric distortion experiment and 24 subjects (19 male, 5 female) participated in the asymmetric distortion experiment.

Picture-level JND Database (Symmetric & Asymmetric)      Project Page     

We study the Picture-level Just Noticeable Difference (PJND) of symmetrically and asymmetrically compressed stereoscopic images for JPEG2000 and H.265 intra coding. We conduct interactive subjective quality assessment tests to determine the PJND point using both a pristine image and a distorted image as a reference. We generate two PJND-based stereo image datasets, including Shenzhen Institutes of Advanced Technology-picture-level Just noticeable difference-based Symmetric Stereo Image dataset (SIAT-JSSI) and Shenzhen Institutes of Advanced Technology-picture-level Just noticeable difference-based Asymmetric Stereo Image dataset (SIAT-JASI). Each dataset includes ten source images, respectively. The PJNDPRI and PJNDDRI are provided. PJNDPRI reveals the minimum distortion against a pristine image. PJNDDRI reveals the minimum distortion against a distorted image.

VOD SYSTEM
Ultra-High Definition 3D Video Live System Project Page     

Ultra-high Definition 3D video live and on-demand system aims to solve the issues in processing, storage and transmission, and quality evaluation fields, providing a realistic and immersive viewing experience. This system can be widely applied in film and television production, digital games, remote control, cultural relic protection, military simulation, and other applications.

VR/360° Video Projection Conversion Software Project Page     

Projection conversion is one of the essential procedures in the virtual reality video/panoramic video technology. The projection format of 360-degree panoramic video will determine the quality and the compression efficiency of panoramic video. The selection of projection format in different application scenarios can effectively reduce the video transmission bandwidth and storage and provide customers with high-quality virtual reality experience.

VR Video Live System Project Page     

The immersive virtual reality video live system allows customers to watch 4K Ultra High Videos on demand and for live broadcast, and can provide 360-degree high-quality, realistic and interactive visual experience.

JND Predition Software Project Page     

The distortion perceptron is a deep learning technology software platform developed based on the PW-JND (Picture Wise Just Noticeable Difference) prediction model. This technology is used in VR image and video compression with serious bandwidth storage bottlenecks, and maximizes compression efficiency in the case of JND property by the human eye.

Note: All resources shall not be used for commercial purposes, if you have any questions, please contact us: (yun.zhang@siat.ac.cn)

NEWS

May
2021

Congratulations to Jinyong Pi, Xinju Wu and Xuemei Zhou for successfully passing their Master thesis defense. More     

Apr
2021

Prof. Yun Zhang’s paper “Deep Learning Based Just Noticeable Difference and Perceptual Quality Prediction Models for Compressed Video” was accepted by IEEE Transactions on Circuits and Systems for Video Technology. More     

Apr
2021

Book "Three Dimensional Video Processing" authored by Prof. Gangyi Jiang, Prof. Yun Zhang, Prof. Mei Yu, and Prof. Zongju Peng has been published by SCIENCE PRESS, 2020. More     

Mar
2021

Congratulations that our project titled "High-efficiency Computational Theory and Method of Video Coding" has been awarded with Natural Science Award by the Minister of Education (MOE), China. This project was completed by Tongji University, CityU, NUIST and SIAT, CAS. More     

Jan
2021

Congratulations to Prof. Yun Zhang for the organization of special Issue "Advances in Deep-Learning-Based Sensing, Imaging, and Video Processing". More     

Dec
2020

Congratulations to Xuemei Zhou on her paper “Projection Invariant Feature and Visual Saliency Based Stereoscopic Omnidirectional Image Quality Assessment” accepted by IEEE Transactions... More     

Dec
2020

Congratulations to Xinju Wu on her Audio Video coding Standard (AVS) input documents M6064 "Response to N2591" and M6065 "Subjective quality evaluation dataset of point clouds for human vision tasks" adopted by Quality Assessment Group of AVS. More     

Dec
2020

Congratulations to Ph. D. Linwei Zhu on his paper “Circular Intra Prediction for 360 Degree Video Coding” accepted by Journal of Visual Communication and Image Representation. More     

Dec
2020

Congratulations to Ph. D. Linwei Zhu and Jinyong Pi on their paper accepted for oral presentation at 2020 IEEE International Conference on Visual Communications and Image Processing (VCIP). More     

Dec
2020

Congratulations to Ph. D. Na Li for promotion to be Associate Professor at SIAT CAS, Shenzhen, China. More     

Nov
2020

Congratulations to Huan Zhang for successfully defending her Ph. D. thesis. More     

Nov
2020

Congratulations to Prof. Yun Zhang for being invited to give a report entitled "Machine Learning based Video Coding Optimizations" More     

Oct
2020

Congratulations to Ph. D. Linwei Zhu on his paper “Deep Learning-Based Chroma Prediction for Intra Versatile Video Coding” accepted by IEEE Transactions on Circuits and Systems for Video Technology (TCSVT). More     

Oct
2020

Congratulations to Prof. Yun Zhang on his paper “Highly Efficient Multiview Depth Coding Based on Histogram Projection and Allowable Depth Distortion” accepted by IEEE Transactions on Image Processing (TIP). More     

Aug
2020

Congratulations to Jinyong Pi on his paper “Content-aware Hybrid Equi-angular Cubemap Projection for Omnidirectional Video Coding” accepted by 2020 IEEE International Conference on Visual Communications and Image Processing (VCIP). More     

Aug
2020

Congratulations to Ph. D. Linwei Zhu on his paper “Sparse Representation-Based Intra Prediction for Lossless/Near Lossless Video Coding” accepted by 2020 IEEE International Conference on Visual Communications and Image Processing (VCIP). More     

May
2020

Congratulations to Xiaoyan Liu for successfully defending her Master thesis. More