Stereo and Depth-from-Defocus Dataset 1: Patio

Spanish patio: The original POV-Ray scene

This is a remake of an old scene, tried as a way to recover inspiration. The architecture uses various isosurfaces, and the rest is mostly CSG and some height_fields. To generate the palm trees I used the TOMTREE include file (required if you want to render this scene), with the help of POVTree (mirror), a java front-end for TOMTREE.

Stereo and Depth-from-Defocus dataset

New! A second dataset is available, based on another POV-Ray scene by Jaime Piqueres: office.

Images

Here are JPEG-encoded previews for all the images in the dataset. The dataset includes better quality images, encoded as 16-bits 1152x864 PNG. Click on low-resolution images to get a high-resolution preview.

Note that there are small low-frequency illumination differences between the left and right images, which are due to radiosity computation (if someone knows how to share the radiosity computations between several views, please tell me).

All-in focus

Focus on the background tree, large and small depth-of-focus

Focus on the fountain, large and small depth-of-focus

Depth, disparity and occlusion map ground truth

Stereo and focal blur parameters

The interocular is 8 (see variable half_interocular in the pov files), and the convergence (or zero-disparity) distance (see variable convergence_dist in the pov files) is 320.

The *_near images are focused on the fountain (disparity=10.1768), the *_far images are focused on the tree in the back (disparity=14.7377), and the *_all images are all-in-focus.

The images with a small depth-of-focus were rendered with an aperture setting of 10, which means that the iris is a disc with diameter 40 (5 times the interocular) in world units.

The images with a larger depth-of-focus were rendered with an aperture setting of 5, which means that the iris is a disc with diameter 20 (2.5 times the interocular) in world units.

Reference

If you use this dataset, please cite the original publication where it appeared:

F. Devernay, S. Pujades and V. Ch.A.V., ``Focus mismatch detection in stereoscopic content'', Stereoscopic Displays & Applications XXIII, Woods, Holliman, Favalora (eds.), Proc. SPIE 8288, paper 8288-12, 2012 (HAL, presentation video).

We are also preparing a journal paper which extends the original paper with new interesting results, and hope it will be published in 2013. Please check out my publication list to see if it's already there.

Download

patio-groundtruth.zip (21.1Mb): Ground truth data (disparity in TIFF format) and VLPov metadata (depth in binary format and camera parameters in text format, which can be read by VLPovUtils to produce more ground truth data)
patio-images-all.zip (10.4Mb): All-in-focus images (PNG16 and EXR formats).
patio-images-near.zip (20.0Mb): Images focused on the fountain (PNG16).
patio-images-far.zip (20.3Mb): Images focused on the background tree (PNG16).
patio-focalblur.zip (7.4Kb): POV-Ray and MegaPOV files for re-rendering the images (see below). Also requires patio.zip, lightsys4c.zip, tomtree.zip).

Re-rendering the dataset

Here are the instructions if you want to re-render the images from the original POV-Ray files. This way, you can adjust the camera parameters to your needs.

POV-Ray files

The files can be rendered using MegaPOV 1.2.1 with the annotation patch, and POV-Ray 3.7 RC6 (POV-Ray 3.6 generates visible artifacts between the left and right views). On Mac OS X, MegaPOV has to be compiled with GCC 4.2 (not LLVM-GCC or clang), or the rendering will be incorrect (I have instructions to install GCC 4.2 on recent Mac OS X).

download patio-focalblur.zip
unpack it:
- unzip -a patio-focalblur.zip
set the current directory to patio-focalblur:
- cd patio-focalblur
download patio.zip from http://www.ignorancia.org/en/index.php?page=Patio (mirror) :
- curl -O http://www.ignorancia.org/uploads/zips/patio.zip
- unzip -a patio.zip
apply patch found in patio-focalblur.zip:
- patch -p0 < patio.patch
download tomtree from http://www.aust-manufaktur.de/austv2x.html (mirror):
- curl -O http://www.aust-manufaktur.de/tomtree.zip
- mkdir tomtree
- (cd tomtree; unzip -a ../tomtree.zip)
download lightsys from http://www.ignorancia.org/en/index.php?page=Lightsys (mirror):
- curl -O http://www.ignorancia.org/uploads/zips/lightsys4c.zip
- unzip -a lightsys4c.zip

Rendering

If no radiosity is applied, the rendering will be completely wrong: the shadowed aread (under the arcades) will be very dark.

In order to apply radiosity:

Change in patio/patio.pov the line

#declare usar_rad=0;

#declare usar_rad=2;

then, run one rendering. This will save a radiosity file "patio.rad" in the current directory.

For subsequent renderings from the same point of view (it is a good idea to render the left and right views in separate directories), change the line to

#declare usar_rad=1;

This will load the radiosity file and thus save computations.

All-in-focus view are rendered without focal blur, but with antialiasing (thus the options +A0.0 +J0.0 for the patio_stereo*_all.png images, which mean to perform 3x3 antialiasing on every pixel). The +FN16 option means to render 16-bits PNG images.

Left view:

Set usar_rad=0 in patio.pov
mkdir patio-left; cd patio-left
megapov +Q0 -UV +w1152 +h864 +L.. +L../patio +L../patio/maps +L../LightsysIV +L../tomtree +K0.0 +Ipatio_stereo_megapov.pov +Opatio_stereo1_megapov.png
Set usar_rad=2 in patio.pov
povray +HR +RF"patio_stereo1.rad" +RFO +WT8 +FN16 -UV +w1152 +h864 +A +L.. +L../patio +L../patio/maps +L../LightsysIV +L../tomtree +K0.0 +Ipatio_stereo.pov +Opatio_stereo1.png
Set usar_rad=1 in patio.pov
povray +HR +RF"patio_stereo1.rad" +RFI +WT8 +FN16 -UV +w1152 +h864 +AM2 +A0.1 -J +R4 +AG1.0 +L.. +L../patio +L../patio/maps +L../LightsysIV +L../tomtree +K0.0 +Ipatio_stereo.pov +Opatio_stereo1_all.png
povray +HR +RF"patio_stereo1.rad" +RFI +WT8 +FN16 -UV +w1152 +h864 +L.. +L../patio +L../patio/maps +L../LightsysIV +L../tomtree +K0.0 +Ipatio_stereo_far.pov +Opatio_stereo1_far.png
povray +HR +RF"patio_stereo1.rad" +RFI +WT8 +FN16 -UV +w1152 +h864 +L.. +L../patio +L../patio/maps +L../LightsysIV +L../tomtree +K0.0 +Ipatio_stereo_near.pov +Opatio_stereo1_near.png
povray +HR +RF"patio_stereo1.rad" +RFI +WT8 +FN16 -UV +w1152 +h864 +L.. +L../patio +L../patio/maps +L../LightsysIV +L../tomtree +K0.0 +Ipatio_stereo_near2.pov +Opatio_stereo1_near2.png
povray +HR +RF"patio_stereo1.rad" +RFI +WT8 +FN16 -UV +w1152 +h864 +L.. +L../patio +L../patio/maps +L../LightsysIV +L../tomtree +K0.0 +Ipatio_stereo_far2.pov +Opatio_stereo1_far2.png

Right view (+K0.0 is changed to +K1.0):

Set usar_rad=0 in patio.pov
mkdir patio-right; cd patio-right
megapov +Q0 -UV +w1152 +h864 +L.. +L../patio +L../patio/maps +L../LightsysIV +L../tomtree +K1.0 +Ipatio_stereo_megapov.pov +Opatio_stereo2_megapov.png
Set usar_rad=2 in patio.pov
povray +HR +RF"patio_stereo2.rad" +RFO +WT8 +FN16 -UV +w1152 +h864 +A +L.. +L../patio +L../patio/maps +L../LightsysIV +L../tomtree +K1.0 +Ipatio_stereo.pov +Opatio_stereo2.png
Set usar_rad=1 in patio.pov
povray +HR +RF"patio_stereo2.rad" +RFI +WT8 +FN16 -UV +w1152 +h864 +AM2 +A0.1 -J +R4 +AG1.0 +L.. +L../patio +L../patio/maps +L../LightsysIV +L../tomtree +K1.0 +Ipatio_stereo.pov +Opatio_stereo2_all.png
povray +HR +RF"patio_stereo2.rad" +RFI +WT8 +FN16 -UV +w1152 +h864 +L.. +L../patio +L../patio/maps +L../LightsysIV +L../tomtree +K1.0 +Ipatio_stereo_far.pov +Opatio_stereo2_far.png
povray +HR +RF"patio_stereo2.rad" +RFI +WT8 +FN16 -UV +w1152 +h864 +L.. +L../patio +L../patio/maps +L../LightsysIV +L../tomtree +K1.0 +Ipatio_stereo_near.pov +Opatio_stereo2_near.png
povray +HR +RF"patio_stereo2.rad" +RFI +WT8 +FN16 -UV +w1152 +h864 +L.. +L../patio +L../patio/maps +L../LightsysIV +L../tomtree +K1.0 +Ipatio_stereo_near2.pov +Opatio_stereo2_near2.png
povray +HR +RF"patio_stereo2.rad" +RFI +WT8 +FN16 -UV +w1152 +h864 +L.. +L../patio +L../patio/maps +L../LightsysIV +L../tomtree +K1.0 +Ipatio_stereo_far2.pov +Opatio_stereo2_far2.png

Ground-truth disparity and occlusion maps

The ground truth disparity map can be obtained using the VLPovUtils utilities. Here are the commands to produce this data, once the patio_stereo1_all and patio_stereo2_all scene have been rendered.

The first command computes the motion field and occlusion maps. The results are stored in patio-left/patio_stereo1_all.patio_stereo2_all.mx.tif, patio-left/patio_stereo1_all.patio_stereo2_all.occ.tif, patio-right/patio_stereo2_all.patio_stereo1_all.mx.tif, patio-right/patio_stereo2_all.patio_stereo1_all.my.tif, patio-right/patio_stereo2_all.patio_stereo1_all.occ.tif. The disparity maps are in the *.mx.tif files (stored as 32-bit floating point TIFF images) and the occlusion maps in the *.occ.tif files (stored as 8-bit grayscale TIFF files, converting them to PNG saves a lot of disk space).

vlpov_motionfield2 patio-left/patio_stereo1_megapov patio-right/patio_stereo2_megapov
convert patio-left/patio_stereo1_megapov.patio_stereo2_megapov.occ.tif patio-left/patio_stereo12.occ.png
mv patio-left/patio_stereo1_megapov.patio_stereo2_megapov.mx.tif patio-left/patio_stereo12.mx.tif
mv patio-left/patio_stereo1_megapov.depth patio-left/patio_stereo1.depth
mv patio-left/patio_stereo1_megapov.txt patio-left/patio_stereo1.txt
rm patio-left/patio_stereo1_megapov.patio_stereo2_megapov.occ.tif patio-left/patio_stereo1_megapov.patio_stereo2_megapov.my.tif
convert patio-right/patio_stereo2_megapov.patio_stereo1_megapov.occ.tif patio-right/patio_stereo21.occ.png
mmv patio-right/patio_stereo2_megapov.patio_stereo1_megapov.mx.tif patio-right/patio_stereo21.mx.tif
mmv patio-right/patio_stereo2_megapov.depth patio-right/patio_stereo2.depth
mmv patio-right/patio_stereo2_megapov.txt patio-right/patio_stereo2.txt
rm patio-right/patio_stereo2_megapov.patio_stereo1_megapov.occ.tif patio-right/patio_stereo2_megapov.patio_stereo1_megapov.my.tif

The stereo disparity corresponding to the two focus distances can be obtained using the vlpov_project utility. The focal points are (-270,-20,270) [far] and (0,-6,0) [near] in POV-Ray coordinates. Note the Z coordinate of the 3D points has to be reversed, due to the left-handedness of POV-Ray's reference frame (see the code of function vlpov_get_cam in vlpovutils). The pixel coordinates of each points are in the first two columns, and the disparities (14.7 and 10.2) are in the fourth column of the output (last column, the vertical disparity, is almost zero):

(echo -270 -20 -270; echo 0 -6 0) | vlpov_project patio-left/patio_stereo1_megapov patio-right/patio_stereo2_megapov
396.881 615.647 927.554 14.7377 -3.05172e-05 // far
555.852 699.549 584.269 10.1768 -4.30894e-05 // near