Download Report

COMP11120 (2013–14)
Mathematical Techniques for Computer Science (Vectors and Matrices)
These notes accompany the part of the course given by Mark Kambites.
Why Do I Need Vectors and Matrices?
Computers process information primarily in the form of numbers, but many applications involve spatial
information. For example, a computer controlling a robotic arm needs to keep track of where it is and work
out how to move it around. A 3d computer game needs to work out how objects moving in space should be
displayed on a (2d) screen.
To make these tasks possible, the spatial information must be given a numerical representation so it can
be processed. Vectors and matrices are the mathematical concepts which do this: they give a way to encode
spatial information as numbers. An understanding of them is essential for computation involving spatial
information.
• Vectors give a numerical way to describe, record and process spatial information: for example, position, velocity or acceleration.
• Matrices give a numerical way to describe, record and process information about transformations of
space: for example, rotation, reflection, projection (e.g. projecting a 3-dimensional model onto a
2-dimensional screen).
Within mathematics, vectors and matrices have many other applications and are important for many other
reasons. But for computer science the ones described above are probably the most important, and they will
be the focus of this course.
Practicalities
Lecturer. Dr. Mark Kambites (email [email protected]).
Notes and Lectures. Notes will be provided on Mark’s webpage, which is accessible via Moodle. The
notes form the definitive content of this part of the course. The lectures will explain the material more
informally.
Exercises. Examples sheets will be provided on Mark’s webpage; solutions to these will be made available
when you have had a chance to attempt the questions yourself. There are also a few additional exercises in
the notes, to which formal solutions will not be made available. Those marked “Exercise*” aim to give
a greater degree of insight into the underlying mathematics; they are not directly examinable and can in
principle be ignored, but if you can master them it will help with the rest of the material.
Office Hour. My office is room 2.144 in the Alan Turing Building (across the square from Kilburn). My
office hour is generally 09:30-10:30 on Thursday mornings during termtime; any exceptions will be
announced in lectures. If you can’t make it to the office hour but need a personal meeting then please email
for an appointment.
Coursework Tests. This part of the course will have two take-home coursework tests. If all goes as
planned, the first will be made available on 27th March and due in on 2nd April. The second will be made
available on 3rd April and due in on 30th April.
Examination. Two of the four questions on the examination will examine this part of the course.
Feedback. Please let me know how you are finding the course, and especially if you think there are any
problems with it. Feel free to use the Moodle forum, speak to me after a lecture, email me, come along in
my office hour, or even slip an anonymous note under my office door!
Books. This part of the course is self-contained and full notes will be supplied, so you should not need to
refer to any books. But if you would like an alternative viewpoint, any of the countless introductory texts
on vectors and matrices should be suitable.
1
1
Revision of Trigonometry
Trigonometry tells us about the relationship between the side lengths and angles of triangles. This is more
generally useful than you might think, because many other shapes can be broken up into triangles. Here we
shall be mostly interested in right-angled triangles.
Recall that in a right-angled triangle,
opposite
y
=
r
hypotenuse
x
adjacent
cos θ = =
r
hypotenuse
y
opposite
tan θ = =
x
adjacent
2
2
r = x + y2
sin θ =
r
y
θ
x
Example. Find x in the following triangle
Solution. Here, cos θ =
3.5
x
so
r
x = r cos θ = 3.5 cos 35◦ = 3.5 × 0.819 = 2.87.
35◦
x
Example. Find θ in the following triangle
Solution. Here, tan θ =
1.1
1.1
y
=
= 0.688. So
x
1.6
θ = tan−1 0.688 = 34.5◦ .
θ
1.6
Note. If tan θ = p, then θ = tan−1 p. Similarly, if sin θ = q, then θ = sin−1 q, or if cos θ = r, then
θ = cos−1 r.
Relationships. There are various properties of and relationships between the trigonometric functions.
Some of the most useful are:
cos(−θ) = cos θ,
sin(−θ) = − sin θ,
tan θ =
sin θ
,
cos θ
− cos θ = cos(180◦ − θ),
sin θ = cos(θ − 90◦ ).
Calculations. In general, you’ll need a calculator (or a look-up table) to compute trigonometric functions
and their inverses. But there are a few standard values of sin θ and cos θ that you should know by heart:
√
1
1
3
◦
◦
◦
◦
, sin 90◦ = 1.
sin 0 = 0, sin 30 = , sin 45 = √ , sin 60 =
2
2
2
◦
◦
cos 0 = 1, cos 30 =
√
1
1
3
, cos 45◦ = √ , cos 60◦ = , cos 90◦ = 0.
2
2
2
Note that these are exact values, and so are better than the rounded approximations you would get from
a calculator (or a computer doing floating point arithmetic).
Exercise. Use these values and the relationships above to work out exact values for: (i) tan 60◦ , (ii)
cos(−30◦ ), (iii) sin 315◦ and (iv) cos−1 (−1/2).
2
2
Coordinate Systems
Y
Recall that coordinates are used to describe the position of a point. In two dimensions (e.g. on a computer
screen), the position of a point can be determined by
its x-coordinate (the distance “along”) and its ycoordinate (the distance “up”). These are measured
from a fixed point, called the origin O.
y
O
x
X
Y
x negative
x and y
y positive
both positive
x and y
x positive
both negative
y negative
Points in the each of the four “quadrants” of the plane
can be distinguished by the signs of their x- and ycoordinates (whether they are positive or negative):
X
Y
Here are 5 points in two-dimensional space:
• Point A has coordinates (3, 6) as it is
3 units to the right of the origin and
6 units up from the origin.
C
A
5
• Point B has coordinates (7, 2) as it is
7 units to the right of the origin and
2 units up from the origin.
B
• Point C has coordinates (−4, 6) as it
is 4 units to the left of the origin and
6 units up from the origin.
-5
X
5
D
• Point D has coordinates (−4, −3) as
it is 4 units to the left of the origin
and 3 units down from the origin.
-5
E
• Point E has coordinates (4, −7) as it
is 4 units to the right of the origin and
7 units down from the origin.
Z
In three dimensions, three coordinates are required
to give the position of a point. Conventionally, the
x and z axes are drawn in the plane of the paper/screen/board while the y axis is pointing into the
board, perpendicular to the other two, as shown on
the right.
Y
X
3
Z
Of course the axes would look a bit
different if viewed from different angles. For example, they could look as
shown on the left.
Y
X
This is not an issue, but there is an important distinction between right-handed and left-handed sets of
axes. Both of the above two are right-handed: you should find that you can arrange your right-hand so
the thumb points in the direction of the x-axis, the forefinger in the direction of the y-axis and the middle
finger in the direction of the z-axis. But (unless perhaps you are double-jointed!) you won’t be able to do
this with your left hand. Swapping the y- and z-axes would give a left-handed system.
To avoid confusion, it is conventional to use right-handed axes unless there is a good reason to do
otherwise (which won’t happen in this course).
3
Introduction to Vectors
A vector is an entity which gives the location of one point relative to another. It is also used to describe a
physical quantity with an associated direction, such as “2 km north-east”, or “5ms−1 downwards”.
a
a
a
a
a
a
a
Intuitively, a vector can be visualised as an “arrow”.
The position of the arrow is unimportant — what matters is the length (or magnitude) of the arrow and its
direction. Two vectors are the same (termed equal)
if they have both the same magnitude and the same
direction. Here, all the vectors marked a are the same
(even though they are drawn in different places). The
vector marked b is different.
b
Notation. To distinguish vectors from ordinary numbers (or scalars) we usually underline their names,
and in print we also use bold type: for example a or v. The magnitude of a vector v is written |v|.
Parallel Vectors. Two vectors in the same direction are called
parallel. The three vectors on the right are all parallel to each
other.
3.1
The Zero Vector
There is one special vector which is difficult to draw as an arrow. It is called the zero vector and denoted
0. It has magnitude 0 and does not have a direction at all. (Intuitively, this is because if you haven’t
travelled any distance, it doesn’t make sense to ask in which direction you went!).
Note that the zero vector 0 is not the same thing as the zero scalar 0.
4
3.2
Adding Arrows
Vectors can be added by the triangle rule. To work out the vector sum a + b, place the tail of vector b at
the head of the vector a. The vector a + b is the directed arrow from the tail of a to the head of b:
Vector addition is commutative,
that is, a + b = b + a. This is
illustrated in the picture on the
left.
b
a
c
In addition, vector addition is
associative:
a + (b + c) = (a + b) + c
b
a+b+c
a
The zero vector 0 acts as an identity for vector addition:
0 + a = a = a + 0.
3.3
Scaling Vectors
Any vector can be multiplied (or scaled) by a scalar (number). Scaling by a positive number λ gives a
vector in the same direction as the original one, but with magnitude multiplied by λ. Scaling by a negative
number −λ gives a vector pointing in the opposite direction, with magnitude multiplied by λ.
−2a
3a
a
The Zero Vector. Scaling any vector by the scalar 0, or scaling the zero vector 0 by any scalar, gives 0:
0a = 0 = λ0.
Negation. We write −a as a shorthand for (−1)a, that is, the arrow with the same magnitude as a and
the opposite direction.
5
3.4
Subtracting Arrows
Now that we have addition and scaling we can also
define subtraction:
a − b = a + (−b) = a + (−1)b.
a−b
Intuitively, if a and b are drawn with their tails at the
same point, then a − b goes from the head of b to the
head of a.
3.5
b
a
Vectors in Component Form
Z
Arrows give a useful way to visualise vectors, but
they do not give us the numerical representation we
wanted for spatial data. For this, we consider the three
vectors i, j and k of magnitude (length) 1 in the directions of the three coordinate axes:
k
Y
j
i
X
Every vector can be expressed as a sum of scalings of these vectors. Specifically, the vector which goes a
units in the x-direction, b units in the y-direction and c units in the z-direction can be written as ai + bj+ ck.
Z
2i + 1j + 4k
Example. The vector going 2 units to the right, 1
unit back (into the board/paper) and 4 units up, can
be written as 2i + 1j + 4k or 2i + j + 4k.
4
Y
2
1
X
Y
In two dimensions, only the vectors i and j are needed.
Any vector can be expressed as a sum of multiples of
i and j.
2i + 3j
3
X
2
6
3.6
Displacement and Position Vectors
If A and B are points, the displacement vector from
A to B is the arrow which fits with its tail at A and
−
−
→
its head at B. It is usually written as AB.
In component form, if A = (ax , ay , az ) and B =
(bx , by , bz ) then
B
−
−
→
AB
−
−
→
AB = (bx − ax )i + (by − ay )j + (bz − az )k.
A
Example. The displacement vector from (2, 4) to (5, 2) is given by (5 − 2)i + (2 − 4)j, that is, 3i − 2j. The
vector from (1, 1, 2) to (5, 7, 3) is given by 4i + 6j + k.
We can represent a point P as a vector, by
considering the displacement vector from
the origin to P . The position vector of P
is the arrow which fits with its tail at the
origin and head at P .
In component form, the position vector of
the point (x, y, z) is the just vector xi +
yj+ zk. For example, the position vector of
(2, 4, 3) is the vector 2i + 4j + 3k. Similarly
in two dimensions: the position vector of
the point (2, 6) is the vector 2i + 6j.
3.7
Vector Operations in Component Form
The vector operations we saw earlier all have simple descriptions in component form:
Addition. To add together two vectors in component form, add together the relevant components. For
example:
(2i − 4j + 5k) + (2i + 6j − 1k) = 4i + 2j + 4k
Subtraction. To subtract one vector from another, subtract the relevant components. For example:
(2i − 4j + 5k) − (2i + 6j − 1k) = 0i − 10j + 6k = −10j + 6k
Scaling. To scale a vector (multiply it by a scalar), we multiply each component of the vector by that
scalar. For example
5 × (2i + 5j − 1k) = 10i + 25j − 5k
Notice that we are multiplying a vector by a scalar to get a vector.
Magnitude. How do we recover the magnitude (length) of a vector from its component form? The
magnitude of a = ax i + ay j + az k is given by
q
|a| = a2x + a2y + a2z
For example, the magnitude of −2i + 6j + 3k is given by
p
√
(−2)2 + 62 + 32 = 49 = 7.
7
3.8
Unit Vectors
A vector with magnitude 1 is called a unit vector. There is exactly one unit vector in every possible
direction.
q
q
4
4 2
3 2
16
3
9
+ 25
= 1.
− 5 + 5 = 25
Example. The vector − 5 i+ 5 j is a unit vector as its magnitude is given by
The vector 12 i + 13 j + 41 k is not a unit vector as its magnitude is
s 2 2 r
61
1 2
1
1
6= 1
+
+
=
2
3
4
144
Finding Unit Vectors. Given a non-zero vector, we can find a unit vector parallel to it by scaling it to
have the right length. Specifically, we multiply by the scalar 1/m where m is the magnitude of the vector.
√
Example. To find a unit vector parallel to a = i + 8j + 4k, note that |a| = 12 + 82 + 42 = 9 so a unit
vector parallel to a is
1
1
8
4
× (i + 8j + 4k) = i + j + k.
9
9
9
9
Exercise. Check that this is a unit vector!
3.9
Further Problems
In the figure above, composed of a series of parallelograms, all points refer to the corners of parallelograms
except for G which is at the centre of a parallelogram.
−−→
−−→ −→ −−→ −−→ −−→ −−
→
−−
→
• If vector AB = p and AD = q, find, in terms of p and q, vectors BC, AC, DC, HF , EC, F G and
−−→
DG.
• Give the position vector of the point (−2, 4, 1).
• Give the displacement vector from (2, 5, 4) to (7, 1, 2)
• Find the magnitude of the vector a = 3i − 4j + 12k
• Which of the following is a unit vector ?
1.
2.
1
3i
1
3i
+ 23 j + 32 k
+ 13 j + 32 k
For the one which is not, find a unit vector parallel to it.
• For a = 2i − 3j + 7k, b = −i + 2j + 1k and λ = 4, find
1. a + b
2. a − b
3. a + λb
• For a = 3i − 4j + 4k, b = 2i + 7j − 3k and c = 2i + 6j + 5k, show that (a + b) + c = a + (b + c).
8
4
4.1
The Scalar and Vector Products
Scalar Product of Arrows
The scalar product (also called the dot product)
is an operation which combines two vectors to give
a scalar (a number). In terms of arrows, the scalar
product is defined by
a · b = |a| |b| cos θ
where θ is the angle between the directions of vectors
a and b.
Thus, the scalar product is the magnitude of the first vector, times the magnitude of the second vector,
times the cosine of the angle between the two vectors. Notice that the angle is always measured by putting
the tails of the two vectors together, and measuring the angle in between.
Example. Find the scalar product of the two vectors a = 4i and b = i + j.
Solution.
b
45◦
a
√
√
√
The magnitude of vector a is 42 + 02 = 4. The magnitude of vector b is 12 + 12 = 2. The angle
between a and b is 45◦ . So, the scalar product a · b is given by
a · b = |a| |b| cos θ
√
= 4 × 2 × cos 45◦
√
1
= 4× 2× √
2
= 4.
Thus, a · b = 4.
The Zero Vector. The definition of a · b given above does not make sense if a = 0 or b = 0 (or both),
since 0 has no direction. By convention, the scalar product of the zero vector with anything is defined to
be 0 (the zero scalar):
a · 0 = 0 = 0 · a.
4.2
Sign of the Scalar Product
Since magnitudes are always positive, the value of the angle θ between two vectors determines whether their
scalar product is positive, negative or zero:
9
Acute Angles. If θ is an acute angle (between 0 and 90◦ ) the scalar product is positive. For example,
Here the scalar product is
3 × 2 × cos 25◦ = 6 × 0.91 = 5.4
which is positive.
Obtuse Angles. If θ is an obtuse angle (between 90◦ and 180◦ ) the scalar product is negative. For example,
Here the scalar product is 3 × 2 × cos 130◦ = 6 × (−0.64) = −3.9 which is negative.
Right Angles. If θ is exactly 90◦ then the scalar product is 0. Conversely, if the scalar product of two
non-zero vectors is zero then we can immediately deduce that they are perpendicular:
Here the scalar product is
3 × 2 × cos 90◦ = 6 × 0 = 0.
4.3
The Scalar Product in Component Form
How do we calculate the scalar product of two vectors from their component forms? This turns out to be
easy!
Two dimensions. If a = ax i + ay j and b = bx i + by j then
a · b = ax bx + ay by .
10
Three dimensions. If a = ax i + ay j + az k and b = bx i + by j + bz k then
a · b = ax bx + ay by + az bz .
Remark/Exercise. You can check that these formulae work even if a = 0 or b = 0 (see Section 4.1).
Example. Find the scalar product of the two vectors a = 4i and b = i + j.
Solution. Here ax = 4, ay = 0, bx = 1 and by = 1. The scalar product is 4 × 1 + 0 × 1 = 4. (Notice that
this is the same example we worked out at the start of the section, and of course we get the same answer.
But this time we got there without having to work out the angle between the vectors.)
Example. Find the scalar product of the two vectors a = 4i − 3j + 2k and b = i + j + 6k.
Solution. Here ax = 4, ay = −3, az = 2, bx = 1, by = 1 and bz = 6. The scalar product is 4 × 1 + (−3) ×
1 + 2 × 6 = 13.
4.4
Calculating Angles Between Vectors
The two different ways to calculate the scalar product can be combined to give a way of calculating the
angle between two vectors. We know that
a · b = |a| |b| cos θ.
Assuming neither vector is 0 we can divide through by |a| |b| to get:
cos θ =
a·b
|a| |b|
If we know the component forms of a and b, then we know how to compute a · b, |a| and |b|, so we can
compute cos θ and hence θ itself.
Example. Find the angle between the two vectors a = 2i − j + 3k and b = i + 6j − 2k.
Solution. Here
a · b = 2 × 1 + (−1) × 6 + 3 × (−2) = −10,
p
√
|a| = 22 + (−1)2 + 32 = 14, and
p
√
|b| = 12 + 62 + (−2)2 = 41.
Thus
−10
cos θ = √ √ = −0.417
14 41
and
θ = 115◦ .
4.5
Algebraic Properties of Scalar Products
The following laws apply to scalar products.
a·b=b·a
(“commutativity”)
a · (λb) = λ(a · b) = (λa) · b
(“compatibility with scaling”)
a · (b + c) = a · b + a · c
(“distributivity over addition”)
a · (b − c) = a · b − a · c
(“distributivity over subtraction”)
Exercise*. Prove these.
11
4.6
Calculating Vector Components in a Given Direction
If a and b are non-zero vectors, then it is possible to express a as the sum of two vectors, one of which is
parallel to b, and the other of which is perpendicular to b. Specifically, we can write
a = aparallel + aperp
where
aparallel =
is a vector parallel to b, and
a·b
b
|b|2
aperp = a − aparallel = a −
is a vector perpendicular to b.
a·b
b
|b|2
Exercise*. Prove that aparallel really is parallel to b, that aperp really is perpendicular to b, and that their
sum really does equal a. (Hint: you will need the definition scalar product in terms of angle, and some
elementary trigonometry. It might also help to consider a unit vector parallel to b).
Example. Express the vector a = 7i + 4j as the sum of a vector parallel to b = 2i + 3j and a vector
perpendicular to b.
√
√
Solution. Here a · b = 2 × 7 + 4 × 3 = 26 and |b| = 22 + 32 = 13.
26
aparallel = √
(b) = 2(2i + 3j) = 4i + 6j and
( 13)2
aperp = a − aparallel = 7i + 4j − (4i + 6j) = 3i − 2j
so that
7i + 4j = (4i + 6j) + (3i − 2j).
4.7
Further Exercises
1. Find the scalar products between the following pairs of vectors:
2. Find the scalar product (i + 6j − 2k) · (−2i − j + 4k).
3. Find the angle between the two vectors a = 2i + 3j and b = −i + 6j.
4.8
The Vector Product
We have seen that the scalar product gives a way to multiply two vectors and get a scalar. In three
dimensions, there is another way to multiply two vectors which gives a vector. The vector product (also
called the cross product) of a and b is written a × b. Provided a 6= 0 and b 6= 0 it has the following
properties:
1. It is a vector.
12
2. It has magnitude |a| |b| sin θ where θ is the angle between a and b
3. Its direction is perpendicular (or normal) to both a and b (and hence also to any plane containing
a and b — more on this later).
4. a, b and a × b form a right-handed system. (As before with axes, this means that you can arrange
your right-hand so that the thumb points in the direction of a, the forefinger in the direction of b and
the middle finger in the direction of a × b.
If a = 0 or b = 0 (or both) then a × b = 0.
4.9
The Vector Product in Component Form
As with the scalar product, it is straightforward to calculate the vector product of two vectors in component
form. If a = ax i + ay j + az k and b = bx i + by j + bz k then
a × b = (ay bz − az by )i + (az bx − ax bz )j + (ax by − ay bx )k.
Example. If a = −2i + 3j + 4k and b = 5i − j − 6k, find the vector product a × b.
Solution. Here ax = −2, ay = 3, az = 4, bx = 5, by = −1, bz = −6. So:
a × b = (ay bz − az by )i + (az bx − ax bz )j + (ax by − ay bx )k
= [3(−6) − 4(−1)]i + [4 × 5 − (−2)(−6)]j + [(−2)(−1) − 3 × 5]k
= −14i + 8j − 13k.
4.10
Further Exercises
• Using both the a · b = |a| |b| cos θ formula and the component formula, evaluate a · b where a = 5i
and b = −2i − 2j.
• For c = 2i − 5j − 2k and d = −i + 4j − 3k, evaluate c · d.
• For p = 5i + 2j, q = 7i + 1j and r = −2i + 5j which of the pairs of vectors make (i) acute angles, (ii)
right angles or (iii) obtuse angles?
• Find the angle between 2i − 3j and 4i + j − k.
• For a = 2i − j, b = 3i + 2j and c = 5i − 4j, verify the identity a · (b + c) = a · b + a · c.
• Express a = 13i+11j+7k as the sum of a vector parallel to b = 1i+2j+4k and a vector perpendicular
to b.
5
Introduction to Matrices
A matrix is a rectangular array of numbers, for example:
A=
2 5
3 8
or B =
1 6 3
2 1 9


1
 2 

or C = 
 3 .
4
Rows and Columns. A matrix has clearly defined numbers of rows and and columns. A matrix with p
rows and q columns is called a p × q matrix. Of the above examples:
• A is a 2 × 2 matrix (2 rows and 2 columns – we say that it is square);
• B is a 2 × 3 matrix (2 rows and 3 columns);
13
• C is a 4 × 1 matrix (4 rows and 1 column).
Matrix Entries. If a matrix is called A, we write Aij to denote the entry in the ith row and jth column
of A. For example, in the matrix A above, the entries are A11 = 2, A12 = 5, A21 = 3 and A22 = 8.
Equality of Matrices. Two matrices are equal if
• they have the same number of rows and
• they have the same number of columns and
• corresponding entries are equal.
Example. Find the circumstances under which
2 b
3 9
and
a 5
3 d
are equal.
Solution. Both have 2 rows and 2 columns so the potential for equality exists. For the matrices to be equal
it is necessary (and sufficient) for corresponding entries to be equal, that is: 2 = a, b = 5, 3 = 3 and 9 = d.
So the matrices are equal if a = 2, b = 5, d = 9.
 
2
2 b
Example. Find the circumstances under which
and  c  are equal.
3 9
d
Solution. The first matrix has 2 rows and 2 columns; the second matrix has 3 rows and 1 column. The
two will never be equal regardless of the entries of b, c and d. There are thus no circumstances under which
the matrices are equal.
5.1
Addition of Matrices
Two matrices can be added if they have the same number of rows (as each other) and the same number of
columns (as each other). If so, to get each entry of A + B just add the corresponding entries of A and B.
Formally,
(A + B)ij = Aij + Bij .




2 1
7 2
Example. Can A =  6 1  and B =  1 4  be added? If so, what is the sum?
7 1
3 9
Solution. A and B are both 3 × 2 matrices so they can be added. The sum is


 
9 3
2+7 1+2
A + B =  6 + 1 1 + 4  =  7 5 .
10 10
7+3 1+9
5.2
Scaling Matrices
Any matrix can be multiplied by any scalar, to give another matrix of the same size. To do this, just
multiply each entry of the matrix by the scalar. Formally,
(λA)ij = λ(Aij ).
Example. Find 7
−1 4
.
3 2
Solution. Each entry is multiplied by the scalar 7, so
−1 4
7 × (−1) 7 × 4
−7 28
7
=
=
.
3 2
7×3
7×2
21 14
Notation. Just as for vectors, we write −A as a shorthand for (−1)A.
14
5.3
Subtraction of Matrices
Two matrices of the same size (the same number of rows and same number of columns) can be subtracted.
Again, the operation is performed by subtracting corresponding components. Formally, (A−B)ij = Aij −Bij .




2
7
−1 3
Example. Find A − B if A =  7 −1  and B =  0 4 
−1 2
2 0
Solution. A has 3 rows and 2 columns. B has 3 rows and 2 columns. As the numbers of rows are equal
and the numbers of columns are equal, the subtraction A − B is meaningful and we have

 

2 − (−1) 7 − 3
3
4
A−B =  7−0
−1 − 4  =  7 −5 
−1 − 2
2−0
−3 2
Remark. Just as for vectors, we always have
A − B = A + (−1)B.
Exercise*. Prove this.
5.4
Multiplication of Matrices
Two matrices A and B can, under certain circumstances, be multiplied together to give a product AB.
The condition for this to be possible is that the number of columns in A should equal the number of rows
in B.
If A has m rows and n columns, and B has n rows and p columns, then AB can be formed and will have
m rows and p columns.
A
×
B
=
AB
m× n
n× p
m× p
The entry in the ith row and the jth column of AB depends on the entries in the ith row of A and the
entries in the jth column of B. Formally,
(AB)ij = Ai1 B1j + Ai2 B2j + Ai3 B3j + . . . + Ain Bnj =
n
X
Aik Bkj .
k=1
The way entries of the factors combine to form entries of the product can be visualised as

 


..
..
B1j
.
.

 


B
2j
 Ai1 Ai2 Ai3 . . . Ain  


(AB)
ij

 


...  =  ... ...
...

  . . . . . . B3j
..
..





..
.
.

 


.
..
..
.
.
Bnj
i − th row
j − th column
Element in i − th row
and j − th column
follows:







Example. Consider the matrices

Verify that AB exists, and find it.

2 1
5
A =  −3 7  and B =
.
−2
1 5
Solution. A has 3 rows and 2 columns; B has 2 rows and 1 column. As the number of columns in A is the
same as the number of rows in B, AB exists. AB will have 3 rows and 1 column. The three entries of AB
are computed as follows:
15
• The top entry of AB will be computed from top row of A and the single column of B: it is 2 × 5 +
1 × (−2) = 8.
• The middle entry of AB will be computed from the middle row of A and the single column of B: it is
−3 × 5 + 7 × (−2) = −29.
• The bottom entry of AB will be computed from the bottom row of A and the single column of B: it
is 1 × 5 + 5 × (−2) = −5.


8
So AB =  −29 .
−5
Example. Consider the matrices
C=
2 1
−5 3
and D =
4 −2
3 7
.
Verify that CD exists and find it.
Solution. Both C and D have 2 rows and 2 columns. As the number of columns in C equals the number
of rows in D, CD exists. CD will also have 2 rows and 2 columns.
CD =
2 1
−5 3
4 −2
3 7
2×4+1×3
2 × (−2) + 1 × 7
=
−5 × 4 + 3 × 3 (−5) × (−2) + 3 × 7
11
3
=
−11 31
Example. Consider the matrices

2
6
 1
2


E =  −3
7
 1
2
−2
0
−4
8
2
2
7
3
−5
9
4
4
Then EF will be a 5 × 3 matrix of the form

Find the element y.


EF = 




7


 −1
 and F = 

 2

2
.
.
.
y
.
.
.
.
.
.
.
.
.
.
.
8
4
−3
2

−1
1 
.
2 
−2



.


Solution. y is (EF )42 , that is, the element in the 4th row and the 2nd column. The calculation will
8
4
involve the 4th row of E, which is 1 2 2 4 , and the 2nd column of F , which is
. y will equal
−3
2
1 × 8 + 2 × 4 + 2 × (−3) + 4 × 2 = 18.
Tip. The only way to master matrix operations, and especially multiplication, is to practice them! Try
making up lots of examples of matrices and multiplying them. Get a friend to do the same ones and check
you get the same answer.
16
5.5
Further Exercises
1. Find the number of rows and columns in A =
2 2
2. Find
+
−1 3
1 5
2
3. Find
0 3
−3
5.6
6 −1
2 2
6
,
−
2 4
−1 3
2
1 8
2
and 1 5
2 7
−3
2 3 6
. What is A12 ?
−1 4 7
−1
2 2
and 5
.
4
−1 3
1 8
.
2 7
Vectors as Matrices
A matrix with only one column is called a column matrix. A matrix with only one row is called a row
matrix. It is sometimes helpful to write vectors as column matrices1 . For example, the vector ai + bj + ck
 
a

b . Because of this, column matrices are sometimes also
can be considered as the 3 × 1 column matrix
c
called column vectors.
Regarding a vector as a matrix allows us to multiply vectors together with matrices. Specifically, if v
is an n-dimensional vector and A is a matrix with n columns, then we can compute the product Av. This
will be a column matrix. Most interesting is the case where A is an n × n square matrix, in which case Av
will be another n × 1 matrix, that is, another n-dimensional vector.
Example. Form the product Ax where
A=
2 7
1 5
and x = −1i + 4j.
Solution. Expressed as a column matrix,
x = −1i + 4j =
−1
4
so the product is just the matrix product
2 7
−1
2 × (−1) + 7 × 4
26
Ax =
=
=
= 26i + 19j.
1 5
4
1 × (−1) + 5 × 4
19
Exercise. Form the product By where

2
 −1
B=
 0
−1
5.7
6
2
4
3



3
1
1 
 and y =  2  .
4 
−1
0
Identities and Inverses
Identity Matrices. A square matrix with 1’s on the main diagonal (the diagonal from top left to bottom
right) and 0’s everywhere else is called an identity matrix. The n × n identity matrix is denoted In . For
example:


1 0 0
1 0
I2 =
while I3 =  0 1 0  .
0 1
0 0 1
It has the property that AIn = A and/or In A = A whenever A is the right size for these products to be
defined.
1
In some contexts it is also useful to consider a vectors as a row matrices, but we shall not need to do this in this course.
17
Inverses. Let A be an n × n square matrix. An inverse of A is an n × n matrix A−1 such that
A A−1 = A−1 A = In .
Not every matrix has an inverse. Matrices without an inverse are called singular.
Exercise. What is the inverse of the identity matrix In ?
Exercise*. Prove that AIn = A whenever the product is defined.
5.8
Transpose and Symmetry
The transpose of a p × q matrix is the q × p matrix obtained by swapping over the rows and columns. The
transpose of a matrix A is written AT . For example:
T


1
2
6
 7 
  = 1 7 2 4 while if A =  −1 3  then AT = 2 −1 1 .
 2 
6 3 1
1 1
4

A (necessarily square!) matrix which is equal to its transpose is called symmetric. For example:


−1 4 3
2 −1
or  4 8 0  .
−1 4
3 0 7
Notice the “symmetry” of the matrices about the main diagonal. Another way of saying the matrix A is
symmetric, is that Aij = Aji for all i and j.
A (necessarily square!) matrix which is equal to minus its transpose is called skew-symmetric. For
example,


0 4 2
 −4 0 −1 
−2 1 0
is skew-symmetric as its transpose is


0 −4 −2
 4 0
1 .
2 −1 0
Another way of saying the matrix A is skew-symmetric is that Aij = −Aji for all i and j. (In particular
setting i = j this means that Aii = −Aii for all i, which means the entries on the main diagonal must all be
0!)
The Zero Matrix. A matrix is called a zero matrix if all of its entries are 0. For example:
 
0
 0 
0
0
  or
.
 0 
0 0
0
Exercise. Find an example of a 3 × 3 matrix which is both symmetric and skew-symmetric? How many
such matrices are there?
5.9
Further Exercises
2 3
−1 4
• Find the product CD where C =
and D =
.
7 −1
2 −2
1 2 7
2 3
exist? If so, find it.
• Does the product
6 2 3
5 6
18

4
• Does the product 1 4 5 2  2  exist? If so, find it.
1

• Consider the following matrices:


0 5 −2
4 2
4 6 7
1 3


,
and
,
−5 0 6
.
2 7
6 7 2
6 7
2 0 −6
For each one, decided whether it is (i) square, (ii) symmetric and/or (iii) skew-symmetric.
5.10
Commuting and Non-Commuting Matrices
It is easy exercise to find two matrices A and B such that AB and BA are both defined, but AB 6= BA.
We say that matrix multiplication is not commutative.
Example. Consider the matrices


2 −1
−1 4 0
A =  2 −1  and B =
.
2 3 6
1 4
In this case the products AB and BA can be both be formed, but AB is a 3 × 3 matrix while BA is a 2 × 2
matrix. So AB cannot possibly be equal to BA.
Exercise. Find two 2 × 2 matrices C and D such that CD 6= DC.
Sometimes, however, pairs of matrices do commute.
Example. Find values of b and d such that the matrices
E=
2 6
1 7
and F =
3 b
1 d
commute.
Solution. We have
EF =
FE =
2 6
1 7
3 b
1 d
3 b
1 d
2 6
1 7
=
=
12 2b + 6d
10 b + 7d
and
6 + b 18 + 7b
2 + d 6 + 7d
.
For these to be equal we need each of the entries to be equal, that is:
12 = 6 + b, 10 = 2 + d, 2b + 6d = 18 + 7b, and b + 7d = 6 + 7d.
It is easy to see that these equations are all satisfied by taking b = 6 and d = 8, so these are the required
values. In other words, if
3 6
2 6
and F =
E=
1 8
1 7
then we will have EF = F E.
Terminology Warning. Note that because multiplication is not commutative, phrases like “multiply A
by B” can be ambiguous, and should not be used!
19
5.11
Laws of Matrix Multiplication
While matrix multiplication is not commutative, it does satisfy various other important laws. Here are some
you should know:
(AB)C = A(BC)
(“associativity”)
A(B + C) = AB + AC
(“left distributivity”)
(B + C)A = BA + CA
(“right distributivity”)
T
T
T
(AB) = B A
Exercise. Consider the matrices
1 2
2 6
−2 1
A=
, B=
and C =
.
3 4
1 7
2 3
Check that (AB)C = A(BC).
5.12
Further Exercises
−2 1
1. If A =
and B =
, show that AB 6= BA.
2 4
45752
462467
2757527
2. If C =
and D =
, show that the statement CD = DC cannot be
464647 14641477
5277528
true.
−1 −1
−1 1
2 1
show that A(B + C) = AB + AC
and C =
,B=
3. If A =
4
1
3 7
1 2
5.13
7 1
1 4
Homogeneous Coordinates


x
The column matrix  y  in three dimensions can be thought of as representing either a vector between
z
two points, or the position vector of a particular point. Sometimes it is important to distinguish carefully
between these cases, which we can do by adding a fourth component. This will be equal to ‘1’ for a point
and equal to ‘0’ for a vector. Thus
 
x
 y 

• 
 z  represents the point (x, y, z), while
1
 
x
 y 

• 
 z  represents the vector xi + yj + zk.
0
Coordinates of this form are called homogeneous coordinates. We shall see later that they are extremely
useful when considering geometric transformations.
Example. Represent the point (2, 5) and the vector i − 3j using homogeneous coordinates.
Solution. The point and the vector are 2-dimensional and so the homogeneous coordinates will have three
components. Now the point (2, 5) and the vector i − 3j will be represented as



1
2
 5  and  −3 
1
0

20
respectively.
Note. Subtracting the homogeneous coordinates of two points yields the vector from one point to the other,
while adding a vector to an existing point yields a new point displaced from the original point by the given
vector.
−−→
Example. Given the points A = (2, 5, 4), B = (2, −1, 3) and the vector BC = 2i + 4j + k, use homogeneous
−−
→
coordinates to find vector AB and point C.
Solution. In homogeneous coordinates we have
 



2
2
 5 
 −1 
−−→ 




A=
 4  , B =  3  and BC = 
1
1
−−
→
The vector AB is given by

 
2
2


−−
→ 
−1   5
AB = 
 3 − 4
1
1
which is the vector −6j − k. The point C is given by

 
2
2
 −1   4

 
 3 + 1
1
0
which is the point (4, 3, 4).
5.14



2
4 
.
1 
0


0
  −6 
=

  −1 
0


4
  3 
= 
  4 
1
Further Exercises
• For A =
1 5
2 −1
and B =
2 10
, find x and y such that AB = BA.
x y
• When A has 2 rows and 3 columns, B has 3 rows and 5 columns, C has 5 rows and 4 columns and
D has 4 rows and one column, find the dimensions of AB, BC, CD, ABC, BCD and ABCD. Show
that all steps in finding ABCD are meaningful, that is, the final and intermediate products exist.

 
 
2
2 3 4
2
2 3 4  
• Find the products  −1 5 6   4  and
4 .
−1 5 6
−2 2 3
5
5
• Using homogeneous coordinates (and traditionally), find the vector from (2, 6) to (4, −1).
• In four dimensions, express the following in homogeneous coordinates.
Points : (1, 2, 3, 4), (0, 0, 5, 0), (−2, −6, 2, 5)
Vectors : 2x − 3y + 7z − 3w, 7x + y + z − 4w.
Note, for this example w may be taken as the unit vector in the 4th dimension.
21