FOR BEST VIEWING PLEASE USE MICROSOFT INTERNET EXPLORER (6.0) OR MOZILLA NETSCAPE (7.2).
FIREFOX DOES NOT DISPLAY THE MATRIX PARENTHESES CORRECTLY.

 


A MATRIX-BASED PROOF OF THE QUATERNION REPRESENTATION THEOREM
FOR FOUR-DIMENSIONAL ROTATIONS

Johan Ernest Mebius

Delft University of Technology, Faculty of Electrical Engineering, Mathematics and Computer Science
P.O.Box 5031, NL - 2600 GA Delft, The Netherlands
Phone +31.15.2783072, Fax +31.15.2786632

September 2004
 

Notations

Bold-face symbols: Geometrical entities
Normal-face symbols: Their algebraic representations
A: Van Elfrinkhof matrix (defined in Paragraph 4.2; Eq.2): Arbitrary 4D rotation matrix
M: Associate of matrix A (defined in Paragraph 4.2)
ML, MR: Matrices representing left- and right-multiplication by a unit quaternion, respectively
(introduced in Paragraph 4.1)
L = a + bi + cj + dk: Unit quaternion appearing as a leftmost factor in quaternion multiplication
P = u + xi + yj + zk: Arbitrary 4D point represented as a quaternion
R = p + qi + rj + sk: Unit quaternion appearing as a rightmost factor in quaternion multiplication
S, SL, SR: Similarity transformation matrix and its left- and right-isoclinic components (introduced in Section 5)
(Similarity transformations restricted to rotations of coordinate system)

1  Introduction

The classical quaternion representation theorem for rotations in 4D Euclidean space states that an arbitrary 4D rotation matrix is the product of a matrix representing left-multiplication by a unit quaternion and a matrix representing right-multiplication by a unit quaternion. This decomposition is unique up to sign of the pair of component matrices.
In this paper this theorem is proved as a result in the theory of matrices by pure matrix means. As a matrix-theoretic theorem it has no a priori geometrical meaning; for this one has to study the behaviour of its matrix formulation under a predetermined class of similarity transformations.
The proof in this paper is not the first one in the existing literature; presumably BOUMAN ([BOUM 1932]) was the first person to publish a satisfactory proof of the representation theorem.

The application of quaternions to geometry is a well-known classical subject, but in the opinion of the author it is also still a fertile research area. Think only of the use of low-dimensional geometry and topology in certain areas of theoretical physics. Or think of the hidden 4D rotational symmetry of the COULOMB field.
The author's desire to make computer software for 3D and 4D geometry led to the purely matrix- based proof of the quaternion representation theorem for 4D rotations presented in this paper.

2  Theorem and outline of proof

Theorem: Each 4D rotation matrix can be decomposed in two ways into a matrix representing left-multiplication by a unit quaternion and a matrix representing right-multiplication by a unit quaternion. These decompositions differ only in the signs of the component matrices.
 

Outline of proof: Let ML, MR be matrices representing left- and right-multiplication by a unit quaternion, respectively. Then their product A = MLMR is a 4D rotation matrix.
Matrix ML is determined by four reals a, b, c, d satisfying the relation a2 + b2 + c2 + d2 = 1.
Likewise, matrix MR is determined by four reals p, q, r, s satisfying the relation p2 + q2 + r2 + s2 = 1. The 16 products ap, aq, ar, as, ..., dp, dq, dr, ds are arranged into a matrix M, which has rank 1 and is easily expressed in the elements of A. Let us in this paper denote it as the associate matrix of A. Conversely, given an arbitrary 4D rotation matrix A, one calculates its associate matrix M in the hope that it is a matrix of products ap, aq, ar, as, ..., dp, dq, dr, ds which are not all zero. This hope is vindicated by proving that M has rank 1 whenever A is a 4D rotation matrix.
The proof is completed by observing that the sum of the squares of the elements of M is unity, and concluding that two pairs of quadruples of reals a, b, c, d; p, q, r, s exist satisfying
a2 + b2 + c2 + d2 = 1,  p2 + q2 + r2 + s2 = 1 and differing only in sign.

3  Quaternions in 4D Euclidean geometry

In this paper the 4D Euclidean space is provided with a Cartesian coordinate system OUXYZ. Points are represented as column vectors (u, x, y, z)T (in this paper denoted as the R4 representation) or as quaternions u + xi + yj + zk. In this connection the coordinate system is also denoted as O1IJK.
Both of these representations are based on an arbitrary choice of coordinate system.
To obtain a possible geometrical meaning of a matrix-algebraic result, one has to prove that it is invariant under coordinate system transformations (similarity transformations), which are in this paper restricted to rotations because we are interested in Euclidean properties, not in more general affine or projective properties.
In the sequel we have to do with geometrical objects as well as their representations with respect to specific coordinate systems. For easy reading object and representation are denoted by the same symbol; the object itself in bold face, its algebraic representations in normal face, with decorations added as needed.

4  Proof of the representation theorem for 4D rotations

Refer to Section 2 above for the statement of the theorem and the outline of its proof.

4.1  Isoclinic 4D rotations

Let P be an arbitrary 4D point, represented as a quaternion P = u + xi + yj + zk.
Let L = a + bi + cj + dk and R = p + qi + rj + sk be unit quaternions
(a2 + b2 + c2 + d2 = 1, p2 + q2 + r2 + s2 = 1).
Consider the left- and right-multiplication mappings ML: P => LP and MR: P => PR.
In the R4 representation ML and MR are linear mappings with matrices

ML

æ
ç
ç
ç
ç
è

a

-b

-c

-d

b

a

-d

c

c

d

a

-b

d

-c

b

a

ö
÷
÷
÷
÷
ø

,    MR

æ
ç
ç
ç
ç
è

p

-q

-r

-s

q

p

s

-r

r

-s

p

q

s

r

-q

p

ö
÷
÷
÷
÷
ø

.

...(1)

One easily proves that both ML and MR are orientation-preserving isometries of R4 with the origin of coordinates O as a fixed point, i. e. rotations of R4 about O.
Both ML and MR have the property of rotating all half-lines originating from O through the same angle (arccos a for ML and arccos p for MR); such rotations are denoted as isoclinic.
There exists however a subtle difference between ML and MR, which is best illustrated by a specific example:

Let ML and MR be left- and right-multiplication by the quaternion Qa= cosa + isina, respectively.
Then ML acts in both coordinate planes 1I and JK as a rotation through the angle a, while MR acts in the 1I plane as a rotation through a and in the JK plane as a rotation through -a.

The two kinds of isoclinic rotations are conveniently distinguished as  left- and right-isoclinic.

Conversely, an isoclinic 4D rotation about O different from the non-rotation I and from the central reversion -I is represented by either a left-multiplication or a right-multiplication by a unique unit quaternion and so is either a left- or a right-isoclinic rotation. This theorem is not used in the sequel, but it allows us to speak of "left- or right-isoclinic" instead of the cumbersome "represented by a unit quaternion left- or right-multiplication".
This theorem is presumably due to ROBERT S. BALL; in [BALL 1889] the author does not mention it explicitly as a theorem, but nevertheless gives a proof. However, BALL's proof covers only the case in which the off-diagonal elements of the given matrix are all nonzero. A complete proof is given in [MEBI 1994].

In Section 5 we prove that the left- and right-isocliny properties and the rotation angle are independent of the choice of coordinate system, and therefore correspond to intrinsic geometrical properties.

4.2  General 4D rotations

Let us study the composition of a left-isoclinic and a right-isoclinic rotation.

First apply ML, then MR to a 4D point P: one obtains (LP)R in quaternion representation. First apply MR, then ML: one obtains L(PR). Quaternion multiplication is associative. Therefore left- and right-isoclinic rotations are commutative, and we have MLMRP = MRMLP  (R4 representation) = LPR (quaternion representation).
In the R4 representation the mapping P => LPR has the matrix
 

MLMR

æ
ç
ç
ç
ç
è

ap-bq-cr-ds

-aq-bp+cs-dr

-ar-bs-cp+dq

-as+br-cq-dp

bp+aq-dr+cs

-bq+ap+ds+cr

-br+as-dp-cq

-bs-ar-dq+cp

cp+dq+ar-bs

-cq+dp-as-br

-cr+ds+ap+bq

-cs-dr+aq-bp

dp-cq+br+as

-dq-cp-bs+ar

-dr-cs+bp-aq

-ds+cr+bq+ap

ö
÷
÷
÷
÷
ø

.

...(2)

VAN ELFRINKHOF ([ELFR 1897]) was apparently the first person to treat the relation between quaternion multiplications and 4D rotations in this algebraic way. For this reason a matrix of the form of Eq. 2 with a2 + b2 + c2 + d2 = 1 and p2 + q2 + r2 + s2 = 1 is in this paper denoted as a Van Elfrinkhof matrix.

A Van Elfrinkhof matrix
 

A = 

æ
ç
ç
ç
ç
ç
è

a00

a01

a02

a03

a10

a11

a12

a13

a20

a21

a22

a23

a30

a31

a32

a33

ö
÷
÷
÷
÷
÷
ø


...(3)

is readily interpreted as a set of 16 linear equations in the 16 unknowns ap, aq, ar,  as, ..., dp, dq, dr, ds.
With slightly more work than just an inspection of the plus and minus signs in Eq. 2  one obtains these unknowns, arranged as a matrix, which is in turn written as a dyadic product:
 

M = 

æ
ç
ç
ç
ç
è

ap 

aq 

ar 

as 

bp 

bq 

br 

bs 

cp 

cq 

cr 

cs 

dp 

dq 

dr 

ds 

ö
÷
÷
÷
÷
ø


æ
ç
ç
ç
ç
è

a

b

c

d

ö
÷
÷
÷
÷
ø


æ
ç
è


p  q  r  s

ö
÷
ø


...(4)


 

 = 1/4 * 

æ
ç
ç
ç
ç
ç
è

a00+a11+a22+a33

+a10-a01-a32+a23

+a20+a31-a02-a13

+a30-a21+a12-a03

a10-a01+a32-a23

-a00-a11+a22+a33

+a30-a21-a12+a03

-a20-a31-a02-a13

a20-a31-a02+a13

-a30-a21-a12-a03

-a00+a11-a22+a33

+a10+a01-a32-a23

a30+a21-a12-a03

+a20-a31+a02-a13

-a10-a01-a32-a23

-a00+a11+a22-a33

ö
÷
÷
÷
÷
÷
ø

.

...(5)

Matrix M contains at least one nonzero element and consists of mutually proportional columns; therefore it has rank 1.
The sum of the squares of its elements is (a2 + b2 + c2 + d2)(p2 + q2 + r2 + s2) = 1.
In this paper M is denoted as the associate matrix of the matrix A.

4.3  The general rotation as a product of isoclinic rotations

In this paragraph we prove that any 4D rotation matrix can be decomposed into a left- and a right-isoclinic rotation matrix in two ways, differing only in sign. If we can find unit quaternions L = a + bi + cj + dk and R = p + qi + rj + sk such that the given 4D rotation matrix can be written as a Van Elfrinkhof matrix, then we are done.

Let A (Eq. 3) be an arbitrary 4D rotation matrix, and let M, calculated according to Eq. 5, be its associate matrix.
Easy and somewhat tedious calculations show that the sum of the squares of the elements of M is unity, and that all 2-by-2 minors of M are zero. In these calculations one uses the orthogonality of A and the equality of complementary 2-by-2 minors of  4D rotation matrices.
The equality of complementary 2-by-2 minors follows from the general theorem stating that complementary minors of an orthogonal matrix B of arbitrary order are equal or opposite according to Det B being equal to +1 or -1. ([CESA 1904], Art. 71,  p. 54)

It follows that M has rank 1, and there exist reals a,  b,  c,  d;  p,  q,  r,  s satisfying a2 + b2 + c2 + d2 = 1,  p2 + q2 + r2 + s2 = 1 such that M actually obeys Eq. 4.

This yields two pairs of unit quaternions L = a + bi + cj + dk; R = p + qi + rj + sk, differing only in sign. Finally, Eq. 1 gives the left- and right-isoclinic components, and Eq. 2 the original 4D rotation matrix as a Van Elfrinkhof matrix.

Remark: It is sufficient to prove that the second, third and fourth columns of M are proportional to its first column. So only nine 2-by-2 minors need to be calculated, not all 36 of them. The Appendix shows one of these calculations.

5  Independence of choice of coordinate system

In this section we prove that the properties of left- and right-isocliny and the rotation angle of an isoclinic matrix are invariant under arbitrary rotations of coordinate system. This section is in part a refresher on displacement transformations, similarity transformations, and their interplay.
Let A be a 4D rotation with centre O. Let A be its matrix referred to a Cartesian coordinate system OUXYZ. Let OU'X'Y'Z' a be second Cartesian coordinate system, originating from the first one by a rotation S about O. Rotation A is a displacement transformation; rotation S is a similarity transformation. Let S be the matrix of S referred to OUXYZ and let SL, SR be its left- and right-isoclinic components. It does not matter which one of the two possible decompositions is taken. Furthermore, let p be an arbitrary 4D point, p = (u, x, y, z)T, p' = (u', x', y', z')T its coordinates referred to OUXYZ, OU'X'Y'Z', respectively. Then p' = S-1p expresses the new coordinates of p in the old ones, and p = Sp' the old coordinates in the new ones.

Let q = Ap be point p rotated by A. How does similarity transformation S affect rotation matrix A? To find this out, begin at new coordinates p', transform to old coordinates p = Sp', apply rotation matrix A to obtain q = Ap = ASp', and finally transform back to new coordinates, ending at q' = A'p' = S-1ASp'. We conclude that the matrix A' of A referred to OU'X'Y'Z' equals A' = S-1AS.

Now we come to prove the intrinsic geometrical nature of the isocliny properties of 4D rotations. Consider the general rotational similarity transformation of the general 4D rotation:
 

 

A' = S-1 A S = SR-1 SL-1 . AL AR . SL SR = SL-1 AL SL . SR-1 AR SR ,  

 ...(6)

where the commutativity of left- and right-isoclinic matrices was applied to sweep all left-isoclinic components to the one side while retaining their order, and the right-isoclinic components to the other side while retaining their order too.
Applying Eq. 6 to a left-isoclinic matrix AL (set AR =I) one obtains the transformation formula
 

 

AL' = S-1 AL S = SL-1 AL SL ,

 

which shows that AL' , being the product of left-isoclinic factors, is left-isoclinic. One may attribute the property of left-isocliny to the rotation AL represented by matrix AL and all its transforms. This establishes the geometrical nature of this property. In other words, one may speak of left-isoclinic rotations in their own right, irrespective of particular matrix representations.
Likewise, for a right-isoclinic matrix one obtains
 

 

AR' = S-1 AR S = SR-1 AR SR ,

 

which shows that AR' is right-isoclinic. Here too the property of right- isocliny may be attributed to the rotation AR represented by AR and all its transforms.
In any rotation matrix of any order the main diagonal elements are the cosines of the angles through which the coordinate axes are rotated. Therefore, in an isoclinic matrix of either kind the main diagonal elements are equal. Their sum, the trace of the matrix, is invariant under similarity transformations. It follows that the rotation angle of an isoclinic matrix is also an invariant. So we may speak of the rotation angle of an isoclinic rotation, irrespective of particular matrix representations.

References

[BALL 1889]  ROBERT S. BALL: Theoretische Mechanik starrer Systeme. Herausgeber: HARRY GRAVELIUS. Berlin: Georg Reimer, 1889
 
[BOUM 1932]  J. BOUMAN: Over quaternionen en hunne toepassing in de meetkunde der vierdimensionale ruimte. Nieuw Archief voor Wiskunde, reeks 2, deel XVII, stukken 3 en 4, 1932, p. 240-266
 
[CESA 1904]  E. CESÀRO: Elementares Lehrbuch der algebraischen Analysis und der Infinitesimalrechnung. Leipzig: B. G. Teubner, 1904
 
[ELFR 1897]  L. VAN ELFRINKHOF: Eene eigenschap van de orthogonale substitutie van de vierde orde.
Handelingen van het zesde Nederlandsch Natuur- en Geneeskundig Congres, Delft, 1897, p. 237-240
 
[MEBI 1994]  JOHAN E. MEBIUS: Applications of quaternions to dynamical simulation, computer graphics and biomechanics. Ph.D. Thesis Delft University of Technology, Delft, 1994
 

Acknowledgement

The author wishes to thank TH. H. M. SMITS for his useful comments and suggestions.

Appendix

The sum of the squares of the elements of M in Eq. 5 is found to be
 
4(a002+a012+a022+a032+a102+a112+a122+a132+a202+a212+a222+a232+a302+a312+a322+a332)/16,  

which equals unity because of the orthogonality of A.

All 2-by-2 minors of M are readily shown to be zero. For instance,
 

16(m00m11 - m10m01)

= (a00+a11+a22+a33)(-a00-a11+a22+a33)-(a10-a01+a32-a23)(a10-a01-a32+a23)
= (a22 + a33)2 - (a00 + a11)2 - (a10 - a01)2 + (a32 - a23)2
= a222 + a332 + a322 + a232 - a002 - a112 - a102 - a012
+ 2(a22a33 - a32a23 - a00a11 + a10a01);

...(7)

the bilinear terms cancel because in the rotation matrix A the complementary minors a22a33 - a32a23 and
a00a11 - a10a01 are equal.

As regards the quadratic terms, we have

(a222 + a332 + a322 + a232) + (a022 + a122 + a032 + a132) = 2,
(a002 + a112 + a102 + a012) + (a022 + a122 + a032 + a132) = 2,


therefore the quadratic terms in Eq. 7 cancel too.



Valid HTML 4.01!