# PTV file format specification PTV is the file format used by VideoStitch. It uses full [JSON](http://www.json.org/) syntax. Projects are simply defined by populating the root object with a few named variables. ## Objects VideoStitch will interpret the following objects. An object is of course free to have other members in addition to these. In the tables below, mandatory members are shown in bold. If not set, optional members take a default value specified in the "Default value" column. ### Root VideoStitch will interpret the following members of the root object:

Member	Type	Default value
lib_version	string	-	Version of our core library used to save the project file.
first_frame	int	-	First frame of the project.
last_frame	int	-	Last frame of the project. Set to -1 to try to autodetect the last frame. All readers may not support autodetection.
pano	object	-	Panorama object (see below).
audio_pipe	object	-	Audio pipeline object (see below).
output	object	-	Output object (see below).
merger	object	-	Merger object (see below).
flow	object	-	Flow object (see below).
warper	object	-	Warper object (see below).
buffer_frames	int	2	The number of frames to buffer. If set to > 0, the writer will buffer that many frames. This can improve GPU utilization. It will usually not be interesting to buffer more than a few frames. Memory usage (CPU RAM only) will go up linearly with buffer_frames.

### Pano This specifies the geometry and photometry options of the panorama.

Member	Type	Default value
width	int	-	The output width, in pixels.
height	int	-	The output height, in pixels.
length	int	-	The length of an edge of the cubemap, in pixels.
hfov	double	-	The horizontal field of view, in degrees.
proj	string	-	The projection. See possible values below.
spherescale	string	-	The stitching sphere scale.
precomputed_coordinate_buffer	bool	false	Whether to precompute the coordinate buffer for looking up pixel value. If yes, the program will use more GPU memory and run slightly faster.
precomputed_coordinate_shrink_factor	bool	false	If precomputed_coordinate_buffer is true, the precomputed coordinate buffer will be downsampled by this factor to run faster.
inputs	array[object]	-	The list of input objects (see below).
overlays	array[object]	-	The list of overlay objects (see below).
wrap	boolean	true	If wrap is true, hfov is 360.0, and the projection supports it, then the panorama will wrap seamlessly across 360 border.
ev	double	0.0	Exposure value correction.
global_yaw	object	null	The panorama's global yaw. Can depend on time. Default is no global yaw.
global_pitch	object	null	The panorama's global pitch. Default is no global pitch.
global_roll	object	null	The panorama's global roll. Default is no global roll.
rig	object	array[object]	Rig definition for calibration presets. See the algorithm documentation for full format.
cameras	object	array[object]	Cameras definition for calibration presets. See the algorithm documentation for full format.

Possible values for (Pano) proj are: * "rectilinear" * "equirectangular" * "ff_fisheye" * "stereographic" * "cubemap" ### Input This specifies the geometry and photometry options for each input.

Member	Type	Default value
width	int	-	The input width, in pixels.
height	int	-	The input height, in pixels.
hfov	double	-	The horizontal field of view, in degrees (old syntax, replaced by "horizontalFocal" in geometries).
yaw	double	-	The yaw, in degrees (old syntax, now in geometries).
pitch	double	-	The pitch, in degrees (old syntax, now in geometries).
roll	double	-	The roll, in degrees (old syntax, now in geometries).
reader_config (previously filename)	string or object	-	The reader configuration. Usually a filename, but can be more elaborated to enable advanced features. See the Readers section below.
proj	string	-	The projection. Possible values
geometries	curve object	-	The camera geometry curves.
crop_left	int	0	Crop that many pixels from the left of the input.
crop_right	int	=width	Crop that many pixels from the right of the input.
crop_top	int	0	Crop that many pixels from the top of the input.
crop_bottom	int	=height	Crop that many pixels from the bottom of the input.
viewpoint_model	string	"hugin"	Viewpoint model: "hugin" or "ptgui".
translation_x	double	0.0	Viewpoint translation along the X axis (old syntax, now in geometries).
translation_y	double	0.0	Viewpoint translation along the Y axis (old syntax, now in geometries).
translation_z	double	0.0	Viewpoint translation along the Z axis (old syntax, now in geometries).
viewpoint_pan	double	0.0	Viewpoint pan (if viewpoint_model=="ptgui").
viewpoint_tilt	double	0.0	Viewpoint tilt (if viewpoint_model=="ptgui").
dist_center_x	double	0.0	Horizontal shift (old syntax, replaced by "center_x" in geometries).
dist_center_y	double	0.0	Vertical shift (old syntax, replaced by "center_y" in geometries).
response	string	"emor"	Camera response model. One of "emor", "gamma", "linear", "inverse_emor" or "curve".
gamma	double	1.0	If response is "gamma", the gamma response parameter.
emor_a	double	0.0	If response is "emor" or "inverse_emor", the first emor response parameter.
emor_b	double	0.0	If response is "emor" or "inverse_emor", the second emor response parameter.
emor_c	double	0.0	If response is "emor" or "inverse_emor", the third emor response parameter.
emor_d	double	0.0	If response is "emor" or "inverse_emor", the fourth emor response parameter.
emor_e	double	0.0	If response is "emor" or "inverse_emor", the fifth emor response parameter.
response_curve	[int]	[]	If response is "curve", a list of 1024 integers describing the camera response curve by value.
ev	double	0.0	Exposure value correction.
red_corr	double	1.0	Red white balance multiplier.
blue_corr	double	1.0	Blue white balance multiplier.
lens_dist_a	double	0.0	Lens distortion parameter (degree 0) (old syntax, replaced by "distort_a" in geometries).
lens_dist_b	double	0.0	Lens distortion parameter (degree 1) (old syntax, replaced by "distort_b" in geometries).
lens_dist_c	double	0.0	Lens distortion parameter (degree 2) (old syntax, replaced by "distort_c" in geometries).
vign_a	double	1.0	Vigneting parameter (degree 0).
vign_b	double	0.0	Vigneting parameter (degree 1).
vign_c	double	0.0	Vigneting parameter (degree 2).
vign_d	double	0.0	Vigneting parameter (degree 3).
vign_x	double	0.0	Vigneting center along x axis, relative to image center.
vign_y	double	0.0	Vigneting center along y axis, relative to image center.
frame_offset	int	0	Offset of this input relative to the origin of time, in frames.
preprocessors	list	[]	A list of processors to run before mapping. See below for a list of available preprocessors.
mask_data	string	""	An inline, base64 encoded 2-color colormapped png file, the size of the input. Red pixels are masked out.
no_delete_masked_pixels	bool	false	If true, masked pixels will just have alpha 0. If false, they will also influence how stitching seams are computed. To get smooth blending around masked areas, always disable no_delete_masked_pixels.

Possible values for (Input) proj are: * "rectilinear" * "circular_fisheye" * "ff_fisheye" * "circular\_fisheye_opt" * "ff\_fisheye_opt" * "equirectangular" ### Geometries The "geometries" member inside an Input object is a curve of temporally varying camera geometry parameters.

horizontalFocal	double	1000.0	The horizontal focal parameter, allowing to transform meters into pixels on the sensor plane.
verticalFocal	double	-	The vertical focal parameter, allowing to transform meters into pixels on the sensor plane. If no value is provided, it is considered equal to the horizontalFocal.
center_x	double	0.0	Principal point / center of distortion horizontal shift in pixels w.r.t. the sensor center.
center_y	double	0.0	Principal point / center of distortion vertical shift in pixels w.r.t. the sensor center.
distort_a	double	0.0	Lens radial distortion parameter (degree 0).
distort_b	double	0.0	Lens radial distortion parameter (degree 1).
distort_c	double	0.0	Lens radial distortion parameter (degree 2).
distort_p1	double	0.0	First lens tangential distortion parameter.
distort_p2	double	0.0	Second lens tangential distortion parameter.
distort_s1	double	0.0	First lens thin-prism distortion parameter.
distort_s2	double	0.0	Second lens thin-prism distortion parameter.
distort_s3	double	0.0	Third lens thin-prism distortion parameter.
distort_s4	double	0.0	Fourth lens thin-prism distortion parameter.
distort_tau1	double	0.0	First lens Scheimpflug distortion angle parameter, in radians.
distort_tau2	double	0.0	Second lens Scheimpflug distortion angle parameter, in radians.
yaw	double	0.0	Camera yaw, in degrees.
pitch	double	0.0	Camera pitch, in degrees.
roll	double	0.0	Camera roll, in degrees.
translation_x	double	0.0	Camera translation along the X axis, in meters.
translation_y	double	0.0	Camera translation along the Y axis, in meters.
translation_z	double	0.0	Camera translation along the Z axis, in meters.

### Overlay This specifies the geometry and photometry options for each input. . . . . . .

Member	Type	Default value
reader_config (previously filename)	string or object	-	The reader configuration. Usually a filename, but can be more elaborated to enable advanced features. See the Readers section below.
width	int	-	The overlay input width, in pixels.
height	int	-	The overlay input height, in pixels.
frame_offset	int	0	Offset of this overlay input relative to the origin of time, in frames.
globalOrientationApplied	[boolean]	[]	Boolean to apply the stitcher orientation to the overlay input.
scaleCurve	[curve object]	[]	The overlay object size scale curve, value should be in the interval [0.0, 1.0].
alphaCurve	[curve object]	[]	The overlay object alpha blending curve, value should be in the interval [0.0, 1.0].
transXCurve	[curve object]	[]	The overlay object position X translation curve, in meters.
transYCurve	[curve object]	[]	The overlay object position Y translation curve, in meters.
transZCurve	[curve object]	[]	The overlay object position Z translation curve, in meters.
rotationCurve	[curve object]	[]	The overlay object position Yaw, Pitch, Roll orientation curve.

### Mergers Mergers specify how to blend remapped images.

Member	Type	Default value
type	string	laplacian	Type of blending. One of the types below.

#### Mergers for end users The mergers in this category are available in the products for stitching. ##### "gradient" merger The gradient merger simply blends images using weights taken from a mask. A feather parameter is used to control the level of smoothness of the generated mask. Larger feathers have smoother transition but will make calibration errors more apparent, while smaller feathers make the distorted hard egdes visible. The special value 100 will make the transition as smooth as possible while never overflowing the overlay zone.

Member	Type	Default value
mask_merger	int	0	The algorithm used to generate the merger mask. Voronoi mask is used by default.
feather	int	100	The feather parameter [0..100] is used to control the level of smoothness for the generated mask. A low value will produce mask with hard edge while a high value will generate smooth mask.

##### "laplacian" merger The laplacian merge blends across multiple spacial frequency bands to hide calibration errors and preserve high frequency details while providing a smooth perceptual blending. Low frequency signals will always be blended over large area. This property is not sensitive to the feature parameter. However, blending of high frequency signals (strong edge) is very sensitive to the feature parameter. It's slower than gradient merging but of much higher quality.

Member	Type	Default value
base_size	int	64	The size of the base level in the laplacian pyramid. The lower this number, the smoother the output (up to a point).
levels	int	5	[DEPRECATED, please use base_size] The number of levels in the laplacian pyramid.
gaussian_radius	int	5	The radius of the low pass gaussian filter used to build the laplacian pyramid.
filter_passes	int	1	The number of passes for gaussian filter computation. 3 makes up for a 97% accuracy. 1 is enough for plausible blending.
mask_merger	int	0	The algorithm used to generate the merger mask. Voronoi mask is used by default. Currently, box filter of radius 3 is used to construct the gaussian pyramid of the generated mask.
feather	int	100	The feather parameter [0..100] is used to control the level of smoothness for the generated mask. A low value will produce mask with hard edge while a high value will generate smooth mask.

#### Debug mergers The mergers in this category are for various debugging purposes, and are usually not available through an app interface, unless debug functionality is enabled. ##### "stack" merger Inputs are not merged, but simply stacked on top of each other. ##### "noblendv1" merger For each input, the warped pixel is tranformed to grayscale. The first input which maps to a given panorama pixel will be stored in the R component. The second input which maps to the same panorama pixel will be stored in the G component. The alpha bit is set to 1 if and only if two inputs map to a panorama pixel. This is used to make computations on overlap regions in the panorama output space. ##### "array" merger The array merger is not a merger per se, and will ignore the actual content of the inputs. Instead, it enables visualizing the overlap between inputs (camera array). ##### "checkerboard" merger In overlapping areas, the checkerboard merger alternates between contributing inputs in a checkerboard pattern. With that it's possible to compare the differences in the inputs indepedent of the stitching line and without any pseudo-color. The "feather" parameter can be used to set the size of the checker squares, in pixels. ##### "diff" merger The diff merger shows how well inputs coincide in overlapping zones. The output will be as usual outside of overlapping zones. In overlapping zones, green indicates a perfect match between inputs, and the error gets bigger as the output moves towards more red. ##### "exposure_diff" merger The exposure diff merger shows how well the exposure of inputs matches in overlapping zones. The output will be black outside of overlapping zones, signifying no error. In overlapping zones, every pixel contains the absolute difference of the value (per RGB channel) between the twofirst and the second input. ##### "inputidv2" merger The inputid merger shows the overlap zones of each of the inputs. Each pixel of the ouput is set if the corresponding input contributes to this pixel. ### Flow Flows specify which optical flow algorithm used for the flow-based blending.

Member	Type	Default value
type	string	no	Type of optical flow. One of the types below.
leftOffset	int	no	The left-offset of the current frame use to stabilize flow temporally.
rightOffset	int	no	The right-offset of the current frame use to stabilize flow temporally.

#### "no" flow: Disable flow-based blending. #### "simple" flow: Use SimpleFlow. ### Warper Warper specify which warp method used for the flow-based blending.

Member	Type	Default value
type	string	no	Type of image warper. One of the types below.

#### "no" warper: Disable flow-based blending. No warper is used. #### "linearflow" flow: Use linear warper. Along the image boundary, lookup pixels from the computed optical flow. As moving futher away from the boundary, use offset from the original mapping. Smooth transition is used to compute lookup coordinate from using pure optical flow to the pure original mapping.

Member	Type	Default value
maxTransitionDistance	int	100	Transition distance from the border (using optical flow) to using the original mapping.

### Audio Pipeline This specifies the audio pipeline configuration. In a nutshell, it specifies how audio inputs are created from the readers. On these audio inputs simple audio processors can be applied. And with these audio inputs you can create audio mixes.

Member	Type	Default value
sampling_rate	int	44100	Sampling rate of the audio pipeline in Hz. Available values are: 44100 and 48000.
block_size	int	512	Block size in samples. The audio is processed by block of size block size. Value of power of 2 should be preferred.
audio_selected	string	-	Name of the audio mix selected.
audio_inputs	array[object]	-	The list of audio input objects (see below).
audio_mixes	array[object]	-	The list of audio mix objects (see below).
audio_processors	array[object]	-	The list of audio processor objects (see below).

#### Audio Input An audio input is created from several audio sources.

Member	Type	Default value
name	string	-	Name of the audio input. Arbitrary name.
sources	array[object]	-	The list of audio source objects (see below).

##### Audio Source Defines which reader and which channel has to be used to create the corresponding audio input.

Member	Type	Default value
reader_id	int	-	Index of the reader to select. Warning see below.
channel	int	-	Index of the channel in the reader. -1 means select all the channels of the corresponding reader.

Warning: the reader_id selected has to correspond to a reader with an audio stream. It is your responsibility to check it. If you set a wrong reader_id, the audio input won't be created. #### Audio Mix An audio mix offers the ability to combine several audio inputs. By default one audio mix per audio input is created with the same name.

Member	Type	Default value
name	string	-	Name of the audio mix. Arbitrary name.
inputs	array[string]	-	The list of audio input names. This list should use the name of the defined audio inputs.

#### Audio Processor Defines the processing chain for each audio input.

Member	Type	Default value
name	string	-	Name of the audio processor to be applied.
params	array[objects]	-	The list of parameter objects. The parameters differs according to the type of audio processor (see below).

##### Delay processor This processor delays one audio input.

Member	Type	Default value
input	string	-	Defines on which input the delay processor will be applied.
delay	float	-	Value of the delay in seconds to apply on this input.

##### Gain processor Defines a gain on a specific audio input.

Member	Type	Default value
input	string	-	Defines on which input the delay processor will be applied.
gain	float	0	Value in dB of the gain to apply. Range[-100, 20].
mute	boolean	false	Boolean to mute this input.
reverse_polarity	boolean	false	Boolean to reverse the polarity of the signal.

### Output The output object specifies the format and destination for the stitching output.

Member	Type	Default value
type	string	-	The output type. See below for supported types.

The following outputs are supported by default. Each of these have specific options. #### "null" Passing null will discard output writing. You have to specify "filename" with a dummy value. #### "profiling" An output that will measure the time between stitched frames arriving. Can be used on the command line to determine the peak stitching performance of a hardware setup in a realistic environment, independent of I/O operations. When the stitching is done, it logs the mean and median frame rate, as well as the variance to the error log. You have to specify "filename" with a dummy value. "mp4", "mov", "rtmp" as primary and secondary types and "flv" as a secondary type only. #### Video output

Member	Type	Default value
filename	string	-	Output filename, without extension.
video_codec	string	"h264"	The video codec. Available values are: "mpeg2", "mpeg4" (MPEG4 part2, not h264/AVC), "h264", "mjpeg" (Motion JPEG)
fps	double	25	The framerate. Available values range from 1 to 1000.
bitrate	int	15000000	The bitrate in bits per second. Available values range from 500000 to 110000000.
gop	int	25	Group Of Pictures: set the interval between random-access pictures. Available values range between 1 frame to 10 times the fps value (i.e. 10 seconds).
b_frames	int	2	Specifies the number of B frames in an IP(B)* GOP pattern. Available values ranges from 0 to 5. Ignored with the "mjpeg" video codec.
pass	int	1	The number of pass for video encoding. The higher the better quality. Available values range from 1 to 2.
bitrate_mode	string	"VBR"	The rate-control mode. Available values are "VBR" (Variable Bit Rate) and "CBR" (Constant Bit Rate).
max_video_file_chunk	int	-	When this value in bytes is set, the video writer will split the video file into multiple ones if the file size reaches this limit. Due to buffered frames, headers and trailers, a safety margin is taken, the video files chunks will be approximately 5% below this limit.
max_moov_size	int	-	Reserves at most this space for the moov atom at the beginning of the file instead of placing the moov atom at the end. The file writer will try to reserve dedicated space at the beginning of the files to fill the moov atom. If the estimated space needed is larger than max_moov_size the moov atom will be added at the end of the file and the file will be post processed. Disabled if max_video_file_chunk is not set.
min_moov_size	int	-	Reserves at least this space for the moov atom at the beginning of the file. Disabled if max_video_file_chunk is not set.
extra_params	string	""	Some custom coma-separated parameters to be directly pushed to the libav encoder. Example: "preset=superfast,profile=baseline"

#### Audio output

Member	Type	Default Value
audio_codec	string	"aac"	The audio codec. Available values are: "aac" and "mp3". Note: "mp3" supports only a "stereo" layout.
sample_format	string	"fltp"	The sample format. Available values are: "s8", "s16", "s32", "flt", "dbl", "s8p", "s16p", "s32p", "fltp" and "dblp"
sampling_rate	int	48000	The sampling rate in Hz. Available values are: 44100, 48000
channel_layout	string	"stereo"	The channel layout of the output (see below).
audio_bitrate	int	128	The audio bitrate in kilo bits per second (kbps). Available values are: 64, 128 and 192 kbps

##### "channel_layout" The only supported and tested value is "stereo" for any audio codec and any output type. Please refer to each plugin documentation to see the supported configurations. #### "jpg" Jpeg output. One image per frame ("numbered", see section below).

Member	Type	Default value
filename	string	-	Output filename prefix.
quality	int	90	JPEG quality. Between 1 and 100.

#### "png" PNG output. One image per frame ("numbered", see section below).

Member	Type	Default value
filename	string	-	Output filename prefix.
alpha	bool	false	Writes RGBA channels when set, otherwise RGB.

#### "depth-png" PNG depth output. Depth encoded in millimeters at 16 bits per pixel. One image per frame ("numbered", see section below).

Member	Type	Default value
filename	string	-	Output filename prefix.

#### "tiff" TIFF output. One image per frame ("numbered", see section below).

Member	Type	Default value
filename	string	-	Output filename prefix.
compression	string	"none"	TIFF compression. One of "none", "lzw", "packbits", "jpeg", "deflate".

#### "ppm" PPM output. One image per frame ("numbered", see section below).

Member	Type	Default value
filename	string	-	Output filename prefix.

#### "pam" PAM (PNM + alpha) output. One image per frame ("numbered", see section below).

Member	Type	Default value
filename	string	-	Output filename prefix.

#### "raw" Raw output (raw unencapsulated RGBA data). One image per frame ("numbered", see section below).

Member	Type	Default value
filename	string	-	Output filename prefix.

#### "yuv420p" Planar output. Three images per frame, one per color plane.

Member	Type	Default value
filename	string	-	Output filename prefix.

##### "numbered" writers Numbered writers write the number of the stitched frame between the basename and the extension. It is advised for file based output, such as JPG, PNG, etc. Here are the optional parameters.

Member	Type	Default value
numbered_digits	int	1	Number of digits: set 1 for no leading zeros, set 0 to ignore the numbering, any other positive value to get a zero-prefixed number

### Readers How to read each input is specified using the reader_config member of the input. If you are reading from videos or image sequences you will most certainly only specify a filename. If a relative name is given, it starts at the directory where the ptv file is. However, some configurations need to specify complex configurations. In that case, the configuration is an object with a *type* field. The following types are recognized: #### "procedural" readers Procedural readers are used to automatically generate synthetic content, usually for testing. Most procedural readers generate their input directly in device memory using the GPU, and are therefore extremely efficient. They can be used to assess performance. The exact procedural reader to use is specified using the "name" field of the config. Then, each reader has specific options. ##### "color" Fills the input with a single color specified in the 'color' field. The following config fills the input with solid red: { "type": "procedural", "name": "color", "color": "ff0000" } ##### "checker" Fills the input with a color checkerboard of a given size. The background is filled with color1. The checker squares will be painted with a mix between color2 and color3, depending on the coordinates. The following config fills the input with a red-and-white solid checker of size 32 pixels: { "type": "procedural", "name": "checker", "color1": "ff0000", "color2": "ffffff", "color3": "ffffff" "size": 32 } The following config fills the input with a color gradient checker { "type": "procedural", "name": "checker", "color1": "000000", "color2": "eeeeee", "color3": "333333" "size": 32 } ##### "grid" Fills the input with a wireframe grid of a given size. The following config fills the input with a red-on-transparent grid of size 32 pixels and line width 3 pixels: { "type": "procedural", "name": "grid", "color": "ff0000", "bg_color": "00000000", "size": 32, "line_width": 3 } ##### "expr" Writes the result of evaluating an expression. The expression can be any integer expression using numerical constants and the following variables: *cFrame* (the current stitcher frame), *rFrame* (The current reader reader frame; this can be different from the stitcher frame if there are temporal offsets), *inputId* (the id of the input, 0 to num_input - 1) The following config writes the current stitcher frame in red: { "type": "procedural", "name": "expr", "color": "ff0000", "bg_color": "00000000", "value": "cFrame" } ##### "movingChecker" A host-side input that creates a 32x32 black/white checkerboard pattern that moves 2 pixels horizontally and 4 pixels vertically with every frame. As the pattern is changing, it can be used to identify problems such as synchronization issues between readers or incorrectly written buffers / tearing. The reader does not accept any arguments. { "type": "procedural", "name": "movingChecker" } ##### "profiling" A host-side input that repeats the same frame over and over. The frame is filled with random data in the YV12 color space. Crude simulation of a perfect (0-CPU) host-side video decoder. The main difference to the other procedural readers is that the frame is created in host space in YV12, so it has to be copied to the GPU and be unpacked. { "type": "procedural", "name": "profiling" } #### "shared" readers Some cameras multiplex their output in a single image. In that case, all inputs must share a common *delegate* input to read the multiplexed stream, and each input reads a portion of the resulting image. The portion to read is specified by its *offset* with the top-left corner of the delegate. For example, the configuration below declares two *shared* inputs that read from the same delegate "test". The delegate reads a 1280 x 1024 video ("sphericam.mp4") that multiplexes four 640 x 512 streams. The first input reads the top-left portion (offset is (0,0)), while the second input reads the bottom-left one (offset is (0,512)). "inputs" : [ { "width" : 640, "height" : 512, "reader_config" : { "type" : "shared", "shared_id" : "test", "delegate" : { "expected_width" : 1280, "expected_height" : 1024, "reader_config" : "sphericam.mp4" }, "offset_x" : 0, "offset_y" : 0 } (...) }, { "width" : 640, "height" : 512, "reader_config" : { "type" : "shared", "shared_id" : "test", "delegate" : { "expected_width" : 1280, "expected_height" : 1024, "reader_config" : "sphericam.mp4" }, "offset_x" : 0, "offset_y" : 512 } (...) }, ] #### Merger Mask and Blending Order It is sometimes better to have an optimized mask per input and a blending order for the inputs. These values can be set manually by users or optimized automatically using our algorithm. For example, the configuration below show the output pano width ("width") and height ("height") as 2792x1396. These values should match with the current rig setting for the "merger_mask" to be considered as "valid". The "enable" field shows whether the mask is being used. The "interpolationEnabled" shows whether interpolation between different key masks is enabled. The key masks are stored in "masks" described next. The "masks" stores all the computed seam at different frames. For a certain mask, "frameId" stores the frame index, the total number of inputs ("input_index_count") at the computation time. The "input_index_data" field contains the list of masks, one per input, stored as encoded polyline in the input space. "input_indices" stores the index of the input in "input_index_data". Finally, the "masks_order" shows the blending order. "merger_mask" : { "width" : 2792, "height" : 1396, "enable" : true, "interpolationEnabled" : true, "masks" : [ { "input_index_count" : 5, "frameId" : 0, "input_index_data" : [ "", "", "", "", "" ], "input_indices" : [ 0, 1, 2, 3, 4 ] } (...) ], "masks_order" : [ 1, 3, 2, 0 ] } ### Calibration Control Points The calibration algorithm returns the list of control points in the "Pano" tree. "calibration_control_points" : { "matched_control_points" : [ { "frame_number" : 1301, "input_index0" : 0, "x0" : 1275.88, "y0" : 254.024, "input_index1" : 2, "x1" : 1030.24, "y1" : 1394.07, "score" : 0.362205 }, { "frame_number" : 1301, "input_index0" : 0, "x0" : 1289.4, "y0" : 136.963, "input_index1" : 2, "x1" : 1032.32, "y1" : 1311.55, "score" : 0.555556 }, { "frame_number" : 1301, "input_index0" : 0, "x0" : 1473.11, "y0" : 368.84, "input_index1" : 2, "x1" : 1264.67, "y1" : 1406.95, "score" : 0.587302 }, (...) { "frame_number" : 9208, "input_index0" : 1, "x0" : 1408.33, "y0" : 52.2616, "input_index1" : 3, "x1" : 1097.8, "y1" : 1227.7, "score" : 0.552239 } ] } ### Preprocessors It is sometimes desirable to preprocess the inputs to overlay information or modify the image before mapping. To do that, you can use one or several optional preprocessors on each input. The processors are executed in the order they are specified. Each preprocessor is identified by a *type* (and can take option). Here is a list of available types: ##### "tint" Transforms the input by mapping its luminosity onto a single hue given by *color*. Alpha is ignored. "preprocessors" : {"type" : "tint", "color" : "ff3300"} ##### "expr" Overlays the result of evaluating an expression on the input. Options are similar to the *expr* reader. ##### "grid" Overlays a grid on the input. Options are similar to the *grid* reader.