User:Prof McCarthy/Rotation matrix

From Wikipedia, the free encyclopedia

Alibi and Alias Transformations

Birkhoff and Mac Lane[1] provide a useful discussion of transformations which uses the terms alibi and alias to describe the results of the transformations. They say:

Equation (13) was interpreted above as a transformation of points (vectors), which carried each point X = (xb · · · , x") into a new point Y having coordinates (y., · · · , y") in the same coordinate system. We could equally well have interpreted equation (13) as a change of coordinates. We call the first interpretation an alibi (the point is moved elsewhere) and the second an alias (the point is renamed).

George Francis[2], University of Illinois, elaborates on the idea of alias and alibi transformations to explain the choices taken by Silicon Graphics in the development of their Iris geometric engine. He says,

The geometers at Silicon Graphics, the company that builds the Iris, do not consider an affine transformation as something that transforms the objects to be displayed. They consider the affine transformation to act on the coordinate system. There is no harm in this attitude, it comes from the realm of movie maker and landscape artist. It is not adapted to the attitude of the industrial designer nor to the mathematician. The two attitudes are referred to as the “alias” and “alibi” approach to coordinate changes. To change coordinates means to give a point a false name. To change its position is to give it an alibi as to its whereabouts.

1997, George K. Francis, Affine Geometry Lesson, Mathematics Department and NCSA, University of Illinois, Urbana, IL, 61801

Equation (13) was interpreted above as a transformation of points (vectors), which carried each point X = (xb · · · , x") into a new point Y having coordinates (y., · · · , y") in the same coordinate system. We could equally well have interpreted equation (13) as a change of coordinates. We call the first interpretation an alibi (the point is moved elsewhere) and the second an alias (the point is renamed).

1977, G. Birkhoff and S. Mac Lane, A Survey of Modern Algebra: 4th Ed., MacMillan Publishing Co, New York, NY, 500pp.

  1. ^ G. Birkhoff and S. Mac Lane, 1977, A Survey of Modern Algebra: 4th Ed., MacMillan Publishing Co, New York, NY, 500pp.
  2. ^ George K. Francis, 1997, Affine Geometry Lesson, Mathematics Department and NCSA, University of Illinois, Urbana, IL, 61801

As far as i can tell this article on the rotation matrix was created in 2004 and by 2005, the signs on the off-diagonal terms had already been changed to try to find a consistent definition. The current talk page goes back to 2006 with many editors citing references for the location of the minus sign on both sides of the diagonal. In the past, I have been reluctant to wade into this discussion, but the recent flare up has moved me to offer the following.

Maybe it will help to begin by saying both views are correct, which sounds strange, but hopefully allows us to focus on what are actually two different ways of viewing a rotation: (i) a transformation from coordinates in one frame to coordinates in another frame, and (ii) a transformation between two sets of basis vectors defining coordinates in the same frame. The two formulations are closely related an perhaps it is no surprise that they result in matrices that are inverses of each other, which for rotation matrices means the transpose of each other and differ only by the location of the minus sign above and below the diagonal.

(i) Consider the first case, where a reference frame M is rotated counter-clockwise by the angle θ relative to a reference frame F. A vector in M has the coordinate x=xi + yj, where i=(1,0) and j=(0,1) are the natural basis vectors along the coordinate axes of M, so x,=(x, y) in this reference frame. Now consider the coordinates X=(X, Y) of the same point but now measured in F. This is easily done by considering the vectors er=(cosθ, sinθ) and et=(-sinθ, cosθ) that are the images of i and j of M, but now measured in the frame F, so X=xer+yer, or

This version of the rotation matrix has the vectors er=(cosθ, sinθ) and et=(-sinθ, cosθ) as column vectors, which means the minus sign is located on the upper right sine term.

(ii) Now consider the second case, where the vectors are measured in the same frame F, so i and j are the natural basis vectors along the x and y axis of F, and we have the unit vectors er=(cosθ, sinθ) and et=(-sinθ, cosθ) measured in F that define a pair of orthogonal unit vectors rotated by the angle θ in the counter clockwise direction. Now, the transformation from coordinates relative to the basis vectors i and j of F to the new basis vectors er=(cosθ, sinθ) and et=(-sinθ, cosθ) in F, is easily defined by the computing i=cosθer-sinθet j=sinθer+cosθet. This means a vector X=Xi+Yj in F is transformed to a vector in the rotated basis, by the transformation,

Now the columns of the rotation matrix are the coordinates of the natural basis vectors i and j as measured in the new basis er and et. This places the minus sign on the sine term below the diagonal.

Hopefully, it is clear that the two ways to define a rotation matrix are inverses of each other, and they differ by the location of the minus sign. This same situation applies to rotations about each of the coordinate axes in three dimensions. The first case results in rotation matrices that have the minus sign in the upper diagonal for Rx and Rz and the lower diagonal for Rz, while the second case yields the transpose of these matrices, with the minus sign in the lower diagonal for Rx and Rz and in the upper diagonal for Ry. I hope this is helpful. Prof McCarthy (talk) 04:52, 19 April 2013 (UTC)