|
A MATRIX-BASED PROOF OF THE
QUATERNION REPRESENTATION
THEOREM
FOR FOUR-DIMENSIONAL ROTATIONS
Johan Ernest Mebius
Delft University of Technology,
Faculty
of Electrical Engineering, Mathematics and Computer Science
P.O.Box 5031, NL - 2600 GA Delft,
The
Netherlands
Phone +31.15.2783072, Fax
+31.15.2786632
September 2004
|
Notations
| Bold-face symbols: |
Geometrical entities |
| Normal-face symbols: |
Their algebraic representations |
| A: |
Van Elfrinkhof
matrix (defined in
Paragraph 4.2; Eq.2): Arbitrary
4D rotation
matrix |
| M: |
Associate of matrix A (defined in Paragraph 4.2)
|
| ML, MR: |
Matrices representing left- and
right-multiplication
by a unit quaternion, respectively
(introduced in Paragraph 4.1)
|
| L = a + bi + cj + dk: |
Unit quaternion appearing as a
leftmost
factor in quaternion multiplication |
| P = u + xi + yj + zk: |
Arbitrary 4D point represented
as a quaternion |
| R = p + qi + rj + sk: |
Unit quaternion appearing as a
rightmost
factor in quaternion multiplication |
| S, SL,
SR: |
Similarity transformation matrix
and its
left- and right-isoclinic components (introduced in
Section 5)
(Similarity transformations
restricted
to rotations of coordinate system) |
1 Introduction
The classical quaternion representation
theorem
for rotations in 4D Euclidean space states that an arbitrary 4D
rotation
matrix is the product of a matrix representing left-multiplication by a
unit quaternion and a matrix representing right-multiplication by a
unit
quaternion. This decomposition is unique up to sign of the pair of
component
matrices.
In this paper this theorem is proved as
a result in the theory of matrices by pure matrix means. As a
matrix-theoretic
theorem it has no a priori geometrical meaning; for this one has to
study
the behaviour of its matrix formulation under a predetermined class of
similarity transformations.
The proof in this paper is not the first
one in the existing literature; presumably BOUMAN
([BOUM 1932]) was the first person to publish a satisfactory proof
of the representation theorem.
The application of quaternions to
geometry
is a well-known classical subject, but in the opinion of the author it
is also still a fertile research area. Think only of the use of
low-dimensional
geometry and topology in certain areas of theoretical physics. Or think
of the hidden 4D rotational symmetry of the COULOMB
field.
The author's desire to make computer
software
for 3D and 4D geometry led to the purely matrix- based proof of the
quaternion
representation theorem for 4D rotations presented in this paper.
2 Theorem
and
outline of proof
Theorem: Each 4D rotation matrix
can
be decomposed in two ways into a matrix representing
left-multiplication
by a unit quaternion and a matrix representing right-multiplication by
a unit quaternion. These decompositions differ only in the signs of the
component matrices.
Outline of proof: Let ML,
MR be matrices representing left- and right-multiplication
by
a unit quaternion, respectively. Then their product A = MLMR
is a 4D rotation matrix.
Matrix ML is determined by
four reals a, b, c, d satisfying the relation a2 + b2 + c2 + d2 = 1.
Likewise, matrix MR is
determined
by four reals p, q, r, s satisfying the relation p2 + q2 + r2 + s2 = 1. The 16 products ap, aq, ar, as, ...,
dp, dq, dr, ds are arranged into a matrix M, which has rank 1 and is
easily
expressed in the elements of A. Let us in this paper denote it as the
associate
matrix of A. Conversely, given an arbitrary 4D rotation matrix A, one
calculates
its associate matrix M in the hope that it is a matrix of products ap,
aq, ar, as, ..., dp, dq, dr, ds which are not all zero. This hope is
vindicated
by proving that M has rank 1 whenever A is a 4D rotation matrix.
The proof is completed by observing that
the sum of the squares of the elements of M is unity, and concluding
that
two pairs of quadruples of reals a, b, c, d; p, q, r, s exist satisfying
a2 + b2 + c2 +
d2 = 1, p2 + q2 + r2 + s2 = 1 and differing only in sign.
3 Quaternions in 4D Euclidean
geometry
In this paper the 4D Euclidean space is
provided
with a Cartesian coordinate system OUXYZ. Points are represented as
column
vectors (u, x, y, z)T (in this paper denoted as the R4
representation) or as quaternions u + xi + yj + zk. In this connection
the coordinate system is also denoted as O1IJK.
Both of these representations are based
on an arbitrary choice of coordinate system.
To obtain a possible geometrical meaning
of a matrix-algebraic result, one has to prove that it is invariant
under
coordinate system transformations (similarity transformations), which
are
in this paper restricted to rotations because we are interested in
Euclidean
properties, not in more general affine or projective properties.
In the sequel we have to do with
geometrical
objects as well as their representations with respect to specific
coordinate
systems. For easy reading object and representation are denoted by the
same symbol; the object itself in bold face, its algebraic
representations
in normal face, with decorations added as needed.
4 Proof of the representation
theorem
for 4D rotations
Refer to Section 2 above
for the statement of the theorem and the outline of its proof.
4.1
Isoclinic 4D rotations
Let P be an arbitrary 4D point,
represented
as a quaternion P = u + xi + yj + zk.
Let L = a + bi + cj + dk and R = p + qi
+ rj + sk be unit quaternions
(a2 + b2 + c2 + d2 = 1, p2 + q2 + r2 + s2 = 1).
Consider the left- and
right-multiplication
mappings ML: P => LP and MR: P => PR.
In the R4
representation ML and MR are linear mappings with
matrices
|
ML = |
æ
ç
ç
ç
ç
è |
|
ö
÷
÷
÷
÷
ø |
, MR
= |
æ
ç
ç
ç
ç
è |
|
ö
÷
÷
÷
÷
ø |
. |
|
...(1) |
One easily proves that both ML
and MR are orientation-preserving isometries of R4
with the origin of coordinates O as a fixed point, i. e. rotations of R4
about O.
Both ML and MR have
the property of rotating all half-lines originating from O through the
same angle (arccos a for ML and arccos p for MR);
such rotations are denoted as isoclinic.
There exists however a subtle difference
between ML and MR, which is best illustrated by a
specific example:
Let ML and MR be
left- and right-multiplication by the quaternion Qa=
cosa +
isina,
respectively.
Then ML acts in both coordinate
planes 1I and JK as a rotation through the angle
a,
while MR acts in the 1I plane as a rotation through
a
and in the JK plane as a rotation through -a.
The two kinds of isoclinic rotations
are
conveniently distinguished as left- and right-isoclinic.
Conversely, an isoclinic 4D rotation
about
O different from the non-rotation I and from the central reversion -I
is
represented by either a left-multiplication or a right-multiplication
by
a unique unit quaternion and so is either a left- or a right-isoclinic
rotation. This theorem is not used in the sequel, but it allows us to
speak
of "left- or right-isoclinic" instead of the cumbersome "represented by
a unit quaternion left- or right-multiplication".
This theorem is presumably due to ROBERT
S. BALL; in [BALL 1889] the
author does not mention it explicitly as a theorem, but nevertheless
gives
a proof. However, BALL's proof covers only the
case
in which the off-diagonal elements of the given matrix are all nonzero.
A complete proof is given in [MEBI 1994].
In Section 5 we
prove that the left- and right-isocliny properties and the rotation
angle
are independent of the choice of coordinate system, and therefore
correspond
to intrinsic geometrical properties.
4.2 General 4D rotations
Let us study the composition of a
left-isoclinic
and a right-isoclinic rotation.
First apply ML, then MR
to a 4D point P: one obtains (LP)R in quaternion representation. First
apply MR, then ML: one obtains L(PR). Quaternion
multiplication is associative. Therefore left- and right-isoclinic
rotations
are commutative, and we have MLMRP = MRMLP
(R4 representation) = LPR (quaternion representation).
In the R4
representation the mapping P => LPR has the matrix
|
MLMR
= |
æ
ç
ç
ç
ç
è |
|
ö
÷
÷
÷
÷
ø |
. |
|
...(2) |
VAN ELFRINKHOF
([ELFR 1897]) was apparently the first person to treat the
relation
between quaternion multiplications and 4D rotations in this algebraic
way.
For this reason a matrix of the form of Eq. 2 with a2 + b2 + c2 + d2 = 1 and p2 + q2 + r2 + s2 = 1 is in this paper denoted
as a Van
Elfrinkhof
matrix.
A Van Elfrinkhof
matrix
|
A = |
æ
ç
ç
ç
ç
ç
è |
|
ö
÷
÷
÷
÷
÷
ø |
|
|
...(3) |
is readily interpreted as a set of 16
linear
equations in the 16 unknowns ap, aq, ar, as, ..., dp, dq, dr, ds.
With slightly more
work than just an inspection of the plus and minus signs in
Eq. 2 one obtains these unknowns, arranged as a matrix,
which is in turn written as a dyadic product:
|
M = |
æ
ç
ç
ç
ç
è |
|
ö
÷
÷
÷
÷
ø |
|
= |
æ
ç
ç
ç
ç
è |
|
ö
÷
÷
÷
÷
ø |
|
æ
ç
è |
p q r s
|
ö
÷
ø |
|
|
...(4) |
|
= 1/4 * |
æ
ç
ç
ç
ç
ç
è |
|
ö
÷
÷
÷
÷
÷
ø |
. |
|
...(5) |
Matrix M contains at least one nonzero
element and consists of mutually proportional columns; therefore it has
rank 1.
The sum of the squares of its elements
is (a2 + b2 + c2 + d2)(p2 + q2 + r2 + s2) = 1.
In this paper M is
denoted as the
associate
matrix of the matrix A.
4.3 The general rotation as a
product
of isoclinic rotations
In this paragraph we prove that any 4D
rotation
matrix can be decomposed into a left- and a right-isoclinic rotation
matrix
in two ways, differing only in sign. If we can find unit quaternions L
= a + bi + cj + dk and R = p + qi + rj + sk such that the given 4D
rotation
matrix can be written as a Van Elfrinkhof matrix, then we are done.
Let A (Eq. 3) be an
arbitrary
4D rotation matrix, and let M, calculated according to
Eq. 5, be its associate matrix.
Easy and somewhat tedious calculations
show that the sum of the squares of the elements of M is unity, and
that
all 2-by-2 minors of M are zero. In these calculations one uses the
orthogonality
of A and the equality of complementary 2-by-2 minors of 4D
rotation
matrices.
The equality of complementary 2-by-2
minors
follows from the general theorem stating that complementary minors of
an
orthogonal matrix B of arbitrary order are equal or opposite according
to Det B being equal to +1 or -1. ([CESA 1904],
Art.
71, p. 54)
It follows that M has rank 1, and there
exist reals a, b, c, d; p, q,
r,
s satisfying a2 + b2 + c2 + d2 = 1, p2 + q2 + r2 + s2 = 1 such that M actually obeys Eq. 4.
This yields two pairs of unit
quaternions
L = a + bi + cj + dk; R = p + qi + rj + sk, differing only in sign.
Finally,
Eq. 1 gives the left- and right-isoclinic components, and
Eq. 2 the original 4D rotation matrix as a Van Elfrinkhof matrix.
Remark: It is sufficient to
prove
that the second, third and fourth columns of M are proportional to its
first column. So only nine 2-by-2 minors need to be calculated, not all
36 of them. The Appendix shows one of these calculations.
5
Independence
of choice of coordinate system
In this section we prove that the
properties
of left- and right-isocliny and the rotation angle of an isoclinic
matrix
are invariant under arbitrary rotations of coordinate system. This
section
is in part a refresher on displacement transformations, similarity
transformations,
and their interplay.
Let A be a 4D rotation with centre
O. Let A be its matrix referred to a Cartesian coordinate system OUXYZ.
Let OU'X'Y'Z' a be second Cartesian coordinate system, originating from
the first one by a rotation
S about O. Rotation
A is a displacement
transformation; rotation
S is a similarity transformation. Let S
be the matrix of S referred to OUXYZ and let SL, SR
be its left- and right-isoclinic components. It does not matter which
one
of the two possible decompositions is taken. Furthermore, let p
be an arbitrary 4D point, p = (u, x, y, z)T, p' = (u', x',
y',
z')T its coordinates referred to OUXYZ, OU'X'Y'Z',
respectively.
Then p' = S-1p expresses the new coordinates of p in
the old ones, and p = Sp' the old coordinates in the new ones.
Let q = Ap be point p
rotated by A. How does similarity transformation
S affect
rotation matrix A? To find this out, begin at new coordinates p',
transform
to old coordinates p = Sp', apply rotation matrix A to obtain q = Ap =
ASp', and finally transform back to new coordinates, ending at q' =
A'p'
= S-1ASp'. We conclude that the matrix A' of A
referred
to OU'X'Y'Z' equals A' = S-1AS.
Now we come to prove
the intrinsic geometrical nature of the isocliny properties of 4D
rotations.
Consider the general rotational similarity transformation of the
general
4D rotation:
|
|
A' = S-1 A S
= SR-1
SL-1 . AL AR . SL
SR = SL-1 AL SL .
SR-1 AR SR , |
...(6) |
where the commutativity of left- and
right-isoclinic
matrices was applied to sweep all left-isoclinic components to the one
side while retaining their order, and the right-isoclinic components to
the other side while retaining their order too.
Applying Eq. 6 to a
left-isoclinic matrix AL (set AR =I) one obtains
the transformation formula
|
|
AL'
= S-1 AL S
= SL-1 AL SL , |
|
which shows that AL' , being
the product of left-isoclinic factors, is left-isoclinic. One may
attribute
the property of left-isocliny to the rotation AL represented
by matrix AL and all its transforms. This establishes the
geometrical
nature of this property. In other words, one may speak of
left-isoclinic
rotations in their own right, irrespective of particular matrix
representations.
Likewise, for a right-isoclinic matrix
one obtains
|
|
AR'
= S-1 AR S
= SR-1 AR SR , |
|
which shows that AR' is
right-isoclinic.
Here too the property of right- isocliny may be attributed to the
rotation
AR
represented by AR and all its transforms.
In any rotation matrix of any order the
main diagonal elements are the cosines of the angles through which the
coordinate axes are rotated. Therefore, in an isoclinic matrix of
either
kind the main diagonal elements are equal. Their sum, the trace of the
matrix, is invariant under similarity transformations. It follows that
the rotation angle of an isoclinic matrix is also an invariant. So we
may
speak of the rotation angle of an isoclinic rotation, irrespective of
particular
matrix representations.
References
| [BALL
1889] |
ROBERT S.
BALL: Theoretische
Mechanik starrer Systeme. Herausgeber: HARRY
GRAVELIUS.
Berlin: Georg Reimer, 1889 |
| [BOUM 1932] |
J. BOUMAN:
Over quaternionen
en hunne toepassing in de meetkunde der vierdimensionale ruimte. Nieuw
Archief voor Wiskunde, reeks 2, deel XVII, stukken 3 en 4, 1932,
p.
240-266 |
| [CESA 1904] |
E. CESÀRO:
Elementares
Lehrbuch der algebraischen Analysis und der Infinitesimalrechnung.
Leipzig: B. G. Teubner, 1904 |
| [ELFR 1897] |
L. VAN ELFRINKHOF:
Eene eigenschap van de orthogonale substitutie van de vierde orde.
Handelingen van het zesde Nederlandsch Natuur- en Geneeskundig Congres, Delft, 1897, p. 237-240 |
| [MEBI 1994] |
JOHAN E. MEBIUS: Applications
of quaternions to dynamical simulation, computer graphics and
biomechanics.
Ph.D. Thesis Delft University of Technology, Delft, 1994 |
Acknowledgement
The author wishes to thank TH.
H. M. SMITS for his useful comments and
suggestions.
Appendix
The sum of the squares of the elements of
M in Eq. 5 is found to be
| 4(a002+a012+a022+a032+a102+a112+a122+a132+a202+a212+a222+a232+a302+a312+a322+a332)/16, |
|
which equals unity because of the
orthogonality
of A.
All 2-by-2 minors of
M are readily shown to be zero. For instance,
| 16(m00m11 - m10m01)
= (a00+a11+a22+a33)(-a00-a11+a22+a33)-(a10-a01+a32-a23)(a10-a01-a32+a23)
= (a22 + a33)2
-
(a00 + a11)2 - (a10 - a01)2
+ (a32 - a23)2
= a222 + a332
+
a322 + a232 - a002
-
a112 - a102 - a012
+ 2(a22a33 - a32a23 -
a00a11 + a10a01);
|
...(7) |
the bilinear terms cancel because in
the
rotation matrix A the complementary minors a22a33 - a32a23 and
a00a11 - a10a01
are equal.
As regards the quadratic terms, we have
|
(a222 + a332
+ a322 + a232) + (a022
+ a122 + a032 + a132)
= 2,
(a002 + a112
+ a102 + a012) + (a022
+ a122 + a032 + a132)
= 2,
|
|
therefore the quadratic terms in
Eq. 7 cancel too.
|