Re: [hugin-ptx] Latest Hugin model, transform specification?

Roger Broadie Wed, 01 Oct 2014 07:49:08 -0700

On Tue, 30 Sep 2014 at 12:36:22 +0100 Paul Womack wrote


----------------

I am using Hugin 2014.0.0.51ff237f209e.

Is there a document describing what the optimisation parameters

Yaw/Pitch/Roll/TrX/TrY/tRz/Plane yaw/Plane pitch

are, and what the units are?

I am (once again) trying to get essentially arbitrary photographs
(who's only shared properties are overlapping subjects) to line up.

I'm trying to understanding why I'm failing, and wether
success is even possible.

 BugBear

----------------

Paul, is this essentially a follow-up to your earlier question aboutphotographing old documents in record offices in sections and thenstitching them? If so, it certainly can be done, with the main problemscoming from the nature of the subject and the environment, includinglow-contrast faint originals, poor lighting and the need to unfold theoriginals or unroll them in sections.

I've been doing it for some years for the same purpose as you. Up tonow I have used the old technique of photographing the original as faras possible at a constant height and straight downwards (assuming theoriginal is on a horizontal table) and then treating the camera as beingat a great distance by defining a very small fov for the individualimages. I then optimise the lens parameters individually for thedifferent images (except one fov must be frozen as an anchor), whichcompensates to some extent for the inevitable departures in the camerapositioning from the ideal. Of course, this method is a kludge andcertainly does not meet your requirement that the photos should be‘arbitrary’, which I take to mean the camera is not constrained inposition, direction or focal length. I think that may well be possiblein general for flat originals by using Hugin’s translation capability,but personally I think it is prudent to give the optimiser a little morehelp when taking the photographs.

I entirely agree that what is needed is a model that allows you tounderstand what you are doing and what changes might help if it does notwork. I use the 2013 version, which I think is the latest available forWindows, and that has the parameters


Yaw, Pitch, Roll, X, Y and Z

Your 2014 version appears to add two further parameters, ‘Plane Yaw’ and‘Plane Pitch’. As I know nothing about those, I’d like to start withthe 2013 set.

It is explained in the Help files in the section ‘Stitching aphoto-mosaic’, which is also to be found here:http://wiki.panotools.org/index.php?title=Stitching_a_photo-mosaic&oldid=13518.I see Thomas Modes as already mentioned it, but it is rather brief (itcalls itself a stub) and does raise some questions. I have had topuzzle over it a bit, so it may be worth expanding on the way Iinterpret it; if others think what I have to say is wrong, perhaps theywill tell us. If this piece is longer than is normal on this group, itis because there seems so little information about planar photomosaics,as opposed to linear photomosaics, i.e. single rows of images, that alonger account may benefit others, or at any rate engender comment and abetter understanding for me as well as others.

As explained in the help file, X, Y and Z are the coordinates of theindividual camera positions in a coordinate system with its origin atthe centre of the panosphere, the imaginary sphere on the surface ofwhich a normal central-viewpoint panorama is assembled duringoptimisation. On the face of it, the panosphere is not needed for aplanar photomosaic, but I guess it is there first to latch onto theexisting code for assembling and outputting the completed panorama andsecondly because it is actually needed when the translation is used itsoriginal purpose of allowing a nadir to be stitched in (not something Ihave any experience of).

So how about the scale? I think the diagram in the help files mustcatch the situation in the middle of optimisation - after all, in whatis said to be a linear photomosaic the individual images do notcompletely overlap on the image plane. Outboard of the image plane andparallel to it must be a plane (call it the subject plane) thatrepresents the original document in the scale of the drawing. Then, inthe optimisation the two must be brought into coincidence. One can lookon that as inflating the panosphere, thereby pushing the image plane outuntil it merges with the subject plane, or, alternatively, since theradius of the panosphere is fixed at 1, as shrinking the scale of thecombination of the subject plane and the individual camera positionsuntil the subject plane is brought into coincidence with the imageplane. Formally, I think one can say that the unit distance of thescale in which the coordinate distances for X, Y and Z are expressed isthe radius of the panosphere.

Of course a coordinate system also needs axis directions. As far as Ican see, in the 2013 version, the Z axis is perpendicular to the imageplane, intersecting it where the panosphere touches it, and that is thedirection with respect to which yaw and pitch are measured (in degrees,as Thomas has confirmed). One can expect yaw to be measured bydeflection along the X axis and pitch by deflection along the Y axis.

However, there are problems with the directions of these axes. The textand the diagram seem not to agree about the direction of Z, the textstating that the panosphere touches the image plane at (0,0,1) and thediagram implying it does so at (0,0,-1). In fact I think it must be thelatter. All my examples seem to me to show that Z points, in the senseof becoming more positive, downwards as seen in the diagram, i.e. in theopposite direction to the arrow-head in the diagram. That correspondsto the direction from the subject towards the camera. If that is right,it would be helpful to those struggling with the explanation to make itclear. Further, although X increases from left to right, as one mightexpect, Y increases downwards, which is not what I, at least, wouldexpect. I did wonder if the directions were chosen with the right-handscrew rule in mind, but in that case, with that configuration of X andY, surely Z should go into the paper. The direction of Z is arbitrary,provided it is applied consistently, but it is important, because ithelps interpret the values thrown up by the optimiser when things go wrong.

One of the unexpected features of the model when applied to planarmosaics is that since there is no idealised camera taking asingle-viewpoint panorama to be placed at the centre of the panospherethere is nothing to anchor the centre of the panosphere: in the diagramits position seems completely arbitrary, provided it does not actuallyfall in the subject plane. But it has to be specified in some way, toact as the origin for the camera positions. The simplest course is toput one of the camera positions at (0,0,0) and anchor it (i.e. make itnon-optimisable). That is equivalent to making the centre of thepanosphere coincide with one of the actual camera positions. Thecomplete panorama is then the one seen from that position and the fieldof view, for a planar mosaic of reasonable size, will stretch almost to180°. I did wonder if it would improve matters to move the panospherecentre further away from the subject, to get a smaller overall field ofview. Since the distance from this position to the image is 1bydefinition, that must be achieved by shrinking the system consisting ofthe subject plane and the individual camera positions, which is done byputting the anchor camera position at say (0,0,-0.5). Correspondingly,putting the anchor at (0,0,1) will bring the panosphere centre closer tothe subject and increase the total field of view. In fact, I can'tdetect any significant difference in the final quality between thesedifferent positions. Taking Z=0 for the anchor seems as good a course asany.

The procedure I have found to work generally is to take a set of photosthat cover the subject in a roughly regular array, with the camerapointing generally downwards and held roughly at the same height and inthe same position. That can be supplemented with photos taken at anangle, e.g. to avoid the camera casting a shadow on the subject, orzoomed to capture areas of interest better. If necessary the basic setcan be used to establish the panorama and the supplementary images addedlater.

In optimising I first set the output format at rectilinear (becausethat’s easy to forget). Then I return all the parameters in theoptimiser to 0 and render everything non-optimisable except the X and Yvalues for all positions except one, which thereby anchors thepanosphere centre. Keep the lens (assuming there has been no zooming)non-optimisable for the moment at either the EXIF value or your owncalibration. Generate the control points and try a first optimisation.What you want here is for the images to coalesce into a rough block:any that are missing or are badly out of place probably show that thereare missing or erroneous control points. Investigate, add any controlpoints needed and remove any that are obviously bogus. Re-optimise andwith any luck you will get a sensible-looking block of images. Then youcan successively and cumulatively add the other non-anchor Z values,then the roll and finally the yaw and pitch to the optimisation. By nowthe optimisation should be pretty good and you can try an optimisationof the lens parameters.

There is a particular problem if you have more than one lens. Theoptimiser does seem a bit unstable if you try to optimise two differentlenses simultaneously, because if you repeatedly press the Optimisebutton one of the fovs is prone to change each time. In fact what ishappening is that as the fov goes in one direction the corresponding Zvalues go in the other so that the final size of the constituent imagestays the same. Perhaps with more than one lens you need to optimiseeach on its own.

Z values can be particularly troublesome, especially if included in theinitial optimisation. If all the images are taken from roughly the sameheight and the centre of the panosphere is made to coincide with onecamera position (i.e. it has Z=0) all the other Zs should be close to 0.If any are close to, or even worse more negative than, -1, theoptimiser has failed. Sometimes simply resetting them to 0 andreoptimising is all that is needed, though there may well becontrol-point problems that need sorting out.

And what about the direction of the axis with respect to which yaw andpitch are measured? If yaw and pitch are anchored at 0 for one cameraposition, it will be better if that position is one for which the camerawas pointed directly down, although it is always possible to move thepanorama in the Preview window until it looks right. But it is best todefine horizontal and vertical lines if any are available. Often a maphas a surrounding border which is ideal for the purpose and Hugin seemsto respond particularly well to these controls. Of course all the yaw,pitch and roll parameters need to be optimisable for this approach towork properly.

As to the added parameters Plane Yaw and Plane Pitch, I cannot at allsay what they are meant to do. They look very like the VP Pan and VPTilt of PTGui Pro, and there seems remarkably little information abouthow they work. But if Hugin has introduced them they must presumablyhave a substantial purpose. Possibly they are intended to define the Zaxis as pointing perpendicularly to the plane of the nadir while theaxis of panorama as a whole, assuming it to be a central panorama,points differently. For a planar photomosaic that consideration does notapply and it may be that the plane pitch and tilt parameters can be leftat 0. On the other hand, PTGui Pro does a good job on planarphotomosaics (at least as good as Hugin in the interior of the stitch,though possibly with less control on the shape of the whole) and it doesnot seem possible to suppress its practice of allowing both Yaw and VPPan to vary, as well as Pitch and VP Tilt. So it would be worth seeingif allowing Yaw and Plane Yaw and Pitch and Plane Pitch all to varyunder optimisation improves the result in Hugin.


Good luck

Roger Broadie










--
A list of frequently asked questions is available at: 
http://wiki.panotools.org/Hugin_FAQ

---You received this message because you are subscribed to the Google Groups "hugin and other free panoramic software" group.

To unsubscribe from this group and stop receiving emails from it, send an email 
to hugin-ptx+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/hugin-ptx/542BF55E.4040604%40ogea.freeserve.co.uk.
For more options, visit https://groups.google.com/d/optout.

Re: [hugin-ptx] Latest Hugin model, transform specification?

Reply via email to