Parallax range estimation in S3D video
Introduction
The parallax in S3D video is defined as the offset of a pixel in the right view relative to its position in the left view. This offset is commonly denoted as a percentage of screen width. If a pixel’s position in the right view is to the right of its position in the left view (positive parallax), we perceive the point further than the screen:
At zero parallax the pixel’s depth corresponds to the screen level. Pixels with negative parallax are viewed closer than the screen:
To get a point infinitely far away, we should make the parallax equal to the distance between the eyes. If the parallax is higher than that distance, the point becomes “father than infinite”, which is impossible in real life and can cause pain to the viewer:
We can easily observe that a 3D movie will have huge parallaxes when watched on a cinema screen because the screen width is big in comparison with the pupillary distance. If we watch the same movie on a mobile 3D-capable monitor, the screen width and hence parallaxes would be small (that is why mobile screens have “small depth”):
Estimating the range of parallaxes can show us how well the movie will be perceived.
Proposed method
We propose a method for estimating parallax distribution. The method’s outline is as follows:
- A stereo matching algorithm is used to determine disparity between left and right views;
- A histogram of lengths of matching vectors is computed;
- The histogram is cut off at the ends to eliminate outliers.
The resulting histogram shows the range of parallaxes in the movie.
Experiments
We estimated parallaxes in 105 movies. The following diagram depicts average extreme negative and positive disparities for each movie relative to its release date:
The “Avatar” movie (2009) was selected as a baseline in parallax range comparison, with parallaxes from -0.5 percent of screen width to 1.0, which is the most comfortable range. Some recent movies are still far from the ideal parallax distribution.
Each column represents a movie. Bottom red bars mean almost flat scenes, while top red bars mean excessively large parallaxes.
-
MSU Benchmark Collection
- Super-Resolution for Video Compression Benchmark
- Video Colorization Benchmark
- Defenses for Image Quality Metrics Benchmark
- Learning-Based Image Compression Benchmark
- Super-Resolution Quality Metrics Benchmark
- Video Saliency Prediction Benchmark
- Metrics Robustness Benchmark
- Video Upscalers Benchmark
- Video Deblurring Benchmark
- Video Frame Interpolation Benchmark
- HDR Video Reconstruction Benchmark
- No-Reference Video Quality Metrics Benchmark
- Full-Reference Video Quality Metrics Benchmark
- Video Alignment and Retrieval Benchmark
- Mobile Video Codecs Benchmark
- Video Super-Resolution Benchmark
- Shot Boundary Detection Benchmark
- The VideoMatting Project
- Video Completion
- Codecs Comparisons & Optimization
- VQMT
- MSU Datasets Collection
- Metrics Research
- Video Quality Measurement Tool 3D
- Video Filters
- Other Projects