Algebra of Matrices and Systems of Linear Equations

ALFONSO SORRENTINO

[Read also §§ 1.1-1.7, 2.2, 2.4, 4.1-4.4]

1. Algebra of Matrices

Definition. Let m, n be two positive integers. An m by n real matrix is an ordered
set of mn elements $a_{ij} \in \mathbb{R}$, placed on m rows and n columns:

$$A = \begin{pmatrix} a_{11} & a_{12} & \cdots & a_{1n} \\ a_{21} & a_{22} & \cdots & a_{2n} \\ \vdots & \vdots & & \vdots \\ a_{m1} & a_{m2} & \cdots & a_{mn} \end{pmatrix}.$$

We will denote by $M_{m,n}(\mathbb{R})$ the set of m by n real matrices. Obviously, the set
$M_{1,1}(\mathbb{R})$ can be identified with $\mathbb{R}$ (the set of real numbers).

NOTATION:

Let $A \in M_{m,n}(\mathbb{R})$. To simplify the notation, sometimes we will abbreviate $A = (a_{ij})_{i=1,\dots,m;\, j=1,\dots,n}$, or simply $A = (a_{ij})$. The element $a_{ij}$ is placed on the i-th row and j-th column; it will also be
denoted by $[A]_{ij}$. Moreover, we will denote by $A_{(i)} = (a_{i1}, \dots, a_{in})$ the i-th row of
A and by $A^{(j)} = (a_{1j}, \dots, a_{mj})^T$ the j-th column of A. Therefore:

$$A = \begin{pmatrix} A_{(1)} \\ \vdots \\ A_{(m)} \end{pmatrix} = \begin{pmatrix} A^{(1)} & \cdots & A^{(n)} \end{pmatrix}.$$
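
For instance, if

$$A = \begin{pmatrix} 1 & 0 & 2 \\ -1 & 3 & 5 \end{pmatrix} \in M_{2,3}(\mathbb{R}),$$

then $a_{23} = 5$, the second row is $A_{(2)} = (-1, 3, 5)$ and the second column is $A^{(2)} = \begin{pmatrix} 0 \\ 3 \end{pmatrix}$.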

Remark. For any integer n ≥ 1, we can consider the cartesian product $\mathbb{R}^n$. Obviously,
there exists a bijection between $\mathbb{R}^n$ and $M_{1,n}(\mathbb{R})$ (or $M_{n,1}(\mathbb{R})$).

Definition. Let $A = (a_{ij}) \in M_{m,n}(\mathbb{R})$. The transpose of A is a matrix $B = (b_{ij}) \in M_{n,m}(\mathbb{R})$
such that:

$$b_{ij} = a_{ji} \qquad \text{for any } i = 1, \dots, n \text{ and } j = 1, \dots, m.$$

The matrix B will be denoted by $A^T$. In a few words, the i-th row of A is simply the
i-th column of $A^T$:

$$(A^T)^{(i)} = (A_{(i)})^T \qquad \text{and} \qquad (A^T)_{(j)} = (A^{(j)})^T.$$

Moreover, $(A^T)^T = A$.
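
For instance,

$$A = \begin{pmatrix} 1 & 2 & 3 \\ 4 & 5 & 6 \end{pmatrix} \in M_{2,3}(\mathbb{R}) \qquad \Longrightarrow \qquad A^T = \begin{pmatrix} 1 & 4 \\ 2 & 5 \\ 3 & 6 \end{pmatrix} \in M_{3,2}(\mathbb{R}).$$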

We will see that the space of matrices can be endowed with the structure of a vector
space. Let us start by defining two operations on the set of matrices $M_{m,n}(\mathbb{R})$:
addition and scalar multiplication.

• Addition:

$$+ : M_{m,n}(\mathbb{R}) \times M_{m,n}(\mathbb{R}) \longrightarrow M_{m,n}(\mathbb{R}), \qquad (A, B) \longmapsto A + B,$$

where A + B is such that: $[A + B]_{ij} = a_{ij} + b_{ij}$.

• Scalar multiplication:

$$\cdot : \mathbb{R} \times M_{m,n}(\mathbb{R}) \longrightarrow M_{m,n}(\mathbb{R}), \qquad (c, A) \longmapsto cA,$$

where cA is such that: $[cA]_{ij} = c\, a_{ij}$.
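
For instance, in $M_{2,2}(\mathbb{R})$:

$$\begin{pmatrix} 1 & 2 \\ 3 & 4 \end{pmatrix} + \begin{pmatrix} 0 & 1 \\ -1 & 2 \end{pmatrix} = \begin{pmatrix} 1 & 3 \\ 2 & 6 \end{pmatrix} \qquad \text{and} \qquad 3 \begin{pmatrix} 1 & 2 \\ 3 & 4 \end{pmatrix} = \begin{pmatrix} 3 & 6 \\ 9 & 12 \end{pmatrix}.$$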

Proposition 1. $(M_{m,n}(\mathbb{R}), +, \cdot)$ is a vector space. Moreover, its dimension is mn.

Proof. First of all, let us observe that the zero vector is the zero matrix 0 [i.e., the
matrix with all entries equal to zero: $0_{ij} = 0$]; the opposite element (with respect
to the addition) of a matrix A will be the matrix −A, such that $[-A]_{ij} = -a_{ij}$.
We leave it to the reader, as an exercise, to verify that all the axioms that
define a vector space hold in this setting.

Let us compute its dimension. It suffices to determine a basis. Let us consider the
following mn matrices $E^{(h,k)} \in M_{m,n}(\mathbb{R})$ (for h = 1, . . . , m and k = 1, . . . , n), such that:

$$[E^{(h,k)}]_{ij} = \begin{cases} 1 & \text{if } (i,j) = (h,k) \\ 0 & \text{otherwise} \end{cases}$$

[in other words, the only non-zero element of $E^{(h,k)}$ is the one on the h-th row and
k-th column]. It is easy to verify that for any $A \in M_{m,n}(\mathbb{R})$, we have:

$$A = \sum_{h=1}^{m} \sum_{k=1}^{n} a_{hk}\, E^{(h,k)},$$

therefore this set of mn matrices is a spanning set of $M_{m,n}(\mathbb{R})$. Let us verify that
they are also linearly independent. In fact, if

$$\sum_{h=1}^{m} \sum_{k=1}^{n} c_{hk}\, E^{(h,k)} = 0,$$

then, comparing the (i, j)-th entries of the two sides, we get $c_{ij} = 0$ for all i and j.
This shows that $\{E^{(h,k)}\}_{h,k}$ is a basis. □
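
For instance, for m = n = 2 this basis consists of the four matrices

$$E^{(1,1)} = \begin{pmatrix} 1 & 0 \\ 0 & 0 \end{pmatrix}, \quad E^{(1,2)} = \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}, \quad E^{(2,1)} = \begin{pmatrix} 0 & 0 \\ 1 & 0 \end{pmatrix}, \quad E^{(2,2)} = \begin{pmatrix} 0 & 0 \\ 0 & 1 \end{pmatrix},$$

and indeed $\begin{pmatrix} a & b \\ c & d \end{pmatrix} = a E^{(1,1)} + b E^{(1,2)} + c E^{(2,1)} + d E^{(2,2)}$, so $\dim M_{2,2}(\mathbb{R}) = 4$.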

Definition. Let $A = (a_{ij}) \in M_{m,n}(\mathbb{R})$.

• A is a square matrix (of order n) if m = n; the n-tuple $(a_{11}, a_{22}, \dots, a_{nn})$ is called the
diagonal of A. The set $M_{n,n}(\mathbb{R})$ will be simply denoted by $M_n(\mathbb{R})$.
• A square matrix $A \in M_n(\mathbb{R})$ is said to be upper triangular [resp. lower
triangular] if $a_{ij} = 0$ for all i > j [resp. if $a_{ij} = 0$ for all i < j].
• A square matrix is called diagonal if it is both upper and lower
triangular [i.e., the only non-zero elements are on the diagonal: $a_{ij} = 0$
for all i ≠ j].
• A diagonal matrix is called scalar if $a_{11} = a_{22} = \dots = a_{nn}$.
• The scalar matrix with $a_{ii} = 1$ for all i = 1, . . . , n is called the unit matrix,
and will be denoted by $I_n$.
• A square matrix is symmetric if $A = A^T$ [therefore, $a_{ij} = a_{ji}$]
and it is skew-symmetric if $A = -A^T$ [therefore, $a_{ij} = -a_{ji}$].
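
For instance, in $M_2(\mathbb{R})$ the matrix $\begin{pmatrix} 1 & 2 \\ 0 & 3 \end{pmatrix}$ is upper triangular, $\begin{pmatrix} 5 & 0 \\ 0 & -2 \end{pmatrix}$ is diagonal, $3 I_2 = \begin{pmatrix} 3 & 0 \\ 0 & 3 \end{pmatrix}$ is scalar, $\begin{pmatrix} 1 & 2 \\ 2 & 0 \end{pmatrix}$ is symmetric and $\begin{pmatrix} 0 & 2 \\ -2 & 0 \end{pmatrix}$ is skew-symmetric.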

Now we come to an important question: how do we multiply two matrices?
The first step is defining the product between a row vector and a column vector.

Definition. Let $A = (a_1 \ \cdots \ a_n) \in M_{1,n}(\mathbb{R})$ and $B = (b_1 \ \cdots \ b_n)^T \in M_{n,1}(\mathbb{R})$. We
define the multiplication between A and B by:

$$AB := \sum_{k=1}^{n} a_k b_k \in \mathbb{R}.$$

More generally, if $A \in M_{m,n}(\mathbb{R})$ and $B \in M_{n,p}(\mathbb{R})$, we define the matrix product $AB \in M_{m,p}(\mathbb{R})$:

$$[AB]_{ij} := A_{(i)} B^{(j)} = \sum_{k=1}^{n} a_{ik} b_{kj}, \qquad \text{for all } i = 1, \dots, m \text{ and } j = 1, \dots, p.$$
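
For instance,

$$\begin{pmatrix} 1 & 2 \\ 3 & 4 \end{pmatrix} \begin{pmatrix} 0 & 1 \\ 1 & 1 \end{pmatrix} = \begin{pmatrix} 1 \cdot 0 + 2 \cdot 1 & 1 \cdot 1 + 2 \cdot 1 \\ 3 \cdot 0 + 4 \cdot 1 & 3 \cdot 1 + 4 \cdot 1 \end{pmatrix} = \begin{pmatrix} 2 & 3 \\ 4 & 7 \end{pmatrix}.$$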

Remark. Observe that this product makes sense only if the number of columns of
A is the same as the number of rows of B. Obviously, the product is always defined
when the two matrices A and B are square and have the same order.

Let us see some properties of these operations (the proofs of which are left as an
exercise).

Proposition 2. [Exercise]
i) The matrix multiplication is associative; namely:

(AB)C = A(BC)

for any $A \in M_{m,n}(\mathbb{R})$, $B \in M_{n,p}(\mathbb{R})$ and $C \in M_{p,q}(\mathbb{R})$.

ii) The following properties hold:

$(A + B)C = AC + BC$, for any $A, B \in M_{m,n}(\mathbb{R})$ and $C \in M_{n,p}(\mathbb{R})$;
$A(B + C) = AB + AC$, for any $A \in M_{m,n}(\mathbb{R})$ and $B, C \in M_{n,p}(\mathbb{R})$;
$I_m A = A I_n = A$, for any $A \in M_{m,n}(\mathbb{R})$;
$(cA)B = c(AB)$, for any $c \in \mathbb{R}$ and $A \in M_{m,n}(\mathbb{R})$, $B \in M_{n,p}(\mathbb{R})$;
$(A + B)^T = A^T + B^T$, for any $A, B \in M_{m,n}(\mathbb{R})$;
$(AB)^T = B^T A^T$, for any $A \in M_{m,n}(\mathbb{R})$ and $B \in M_{n,p}(\mathbb{R})$.
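
Observe, however, that matrix multiplication is in general not commutative: with the matrices of the previous example,

$$\begin{pmatrix} 0 & 1 \\ 1 & 1 \end{pmatrix} \begin{pmatrix} 1 & 2 \\ 3 & 4 \end{pmatrix} = \begin{pmatrix} 3 & 4 \\ 4 & 6 \end{pmatrix} \neq \begin{pmatrix} 2 & 3 \\ 4 & 7 \end{pmatrix} = \begin{pmatrix} 1 & 2 \\ 3 & 4 \end{pmatrix} \begin{pmatrix} 0 & 1 \\ 1 & 1 \end{pmatrix}.$$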

Remark.
Given a square matrix $A \in M_n(\mathbb{R})$, it is not true in general that there
exists a matrix $B \in M_n(\mathbb{R})$ such that $AB = BA = I_n$. In case it does, we say that
A is invertible and we denote its inverse by $A^{-1}$.

Let us consider in $M_n(\mathbb{R})$ the subset of invertible matrices:

$$GL_n(\mathbb{R}) := \{ A \in M_n(\mathbb{R}) : \text{there exists } B \in M_n(\mathbb{R}) \text{ such that } AB = BA = I_n \}.$$

This set is called the general linear group of order n.
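
For instance, $\begin{pmatrix} 1 & 2 \\ 1 & 3 \end{pmatrix} \in GL_2(\mathbb{R})$, since

$$\begin{pmatrix} 1 & 2 \\ 1 & 3 \end{pmatrix} \begin{pmatrix} 3 & -2 \\ -1 & 1 \end{pmatrix} = \begin{pmatrix} 3 & -2 \\ -1 & 1 \end{pmatrix} \begin{pmatrix} 1 & 2 \\ 1 & 3 \end{pmatrix} = I_2,$$

while $\begin{pmatrix} 1 & 0 \\ 0 & 0 \end{pmatrix} \notin GL_2(\mathbb{R})$: its second row is zero, hence the second row of any product $\begin{pmatrix} 1 & 0 \\ 0 & 0 \end{pmatrix} B$ is zero as well, and can never equal the second row of $I_2$.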
We leave it as an exercise to the reader to verify that the following properties hold.

Proposition 3. [Exercise]
i) For any $A, B \in GL_n(\mathbb{R})$, we have: $(AB)^{-1} = B^{-1} A^{-1}$.
ii) For any $A \in GL_n(\mathbb{R})$, we have: $(A^{-1})^{-1} = A$ and $(A^T)^{-1} = (A^{-1})^T$.
iii) For any $A \in GL_n(\mathbb{R})$ and $c \in \mathbb{R}$, with c ≠ 0, we have: $(cA)^{-1} = \frac{1}{c} A^{-1}$. In
particular, $(-A)^{-1} = -A^{-1}$.

An important subset of $GL_n(\mathbb{R})$, that we will use later on, is the set of orthogonal
matrices.

Definition. A matrix $A \in M_n(\mathbb{R})$ is called orthogonal if:

$$A A^T = A^T A = I_n.$$

The set of the orthogonal matrices of order n is denoted by $O_n(\mathbb{R})$. Moreover,
from what we observed above (i.e., $A^{-1} = A^T$), it follows that $O_n(\mathbb{R}) \subseteq GL_n(\mathbb{R})$ (i.e.,
orthogonal matrices are invertible).
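
For instance, $A = \frac{1}{\sqrt{2}} \begin{pmatrix} 1 & -1 \\ 1 & 1 \end{pmatrix} \in O_2(\mathbb{R})$: a direct computation gives

$$A A^T = \frac{1}{2} \begin{pmatrix} 1 & -1 \\ 1 & 1 \end{pmatrix} \begin{pmatrix} 1 & 1 \\ -1 & 1 \end{pmatrix} = \frac{1}{2} \begin{pmatrix} 2 & 0 \\ 0 & 2 \end{pmatrix} = I_2,$$

and similarly $A^T A = I_2$.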

To conclude this section, let us work out a simple exercise that will provide us with
a characterization of $O_2(\mathbb{R})$.

Example. The set $O_2(\mathbb{R})$ consists of all matrices of the form:

$$R_\alpha = \begin{pmatrix} \cos\alpha & -\sin\alpha \\ \sin\alpha & \cos\alpha \end{pmatrix} \qquad \text{and} \qquad S_\alpha = \begin{pmatrix} \cos\alpha & \sin\alpha \\ \sin\alpha & -\cos\alpha \end{pmatrix},$$

for all α ∈ R.

Proof.
One can easily verify (with a direct computation) that:

$$R_\alpha R_\alpha^T = R_\alpha^T R_\alpha = I_2 \qquad \text{and} \qquad S_\alpha S_\alpha^T = S_\alpha^T S_\alpha = I_2;$$

therefore, these matrices are orthogonal. We need to show that all orthogonal
matrices are of this form.

Consider a matrix $A = \begin{pmatrix} a & b \\ c & d \end{pmatrix} \in O_2(\mathbb{R})$. Let us show that there exists α ∈ R,
such that $A = R_\alpha$ or $A = S_\alpha$. By Definition, $A A^T = A^T A = I_2$, namely:

$$a^2 + b^2 = 1, \quad a^2 + c^2 = 1, \quad c^2 + d^2 = 1, \quad b^2 + d^2 = 1, \quad ac + bd = 0, \quad ab + cd = 0.$$

From the first two equations, it follows that $b^2 = c^2$. There are two cases:

i) b = c or ii) b = −c.

i) In this case, plugging b = c into the other equations, one gets that $(a + d)c = 0$
and therefore:

i') c = 0 or i'') a + d = 0.

In the case i'): b = c = 0 and $a^2 = d^2 = 1$;

namely, A is one of the following matrices:

$$\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}, \qquad \begin{pmatrix} -1 & 0 \\ 0 & -1 \end{pmatrix}, \qquad \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}, \qquad \begin{pmatrix} -1 & 0 \\ 0 & 1 \end{pmatrix}$$

[observe that these are, respectively, $R_0$, $R_\pi$, $S_0$ and $S_\pi$].

In the case i''): d = −a and b = c; therefore,

$$A = \begin{pmatrix} a & b \\ b & -a \end{pmatrix} = S_\alpha,$$

with α ∈ [0, 2π) such that $(a, b) = (\cos\alpha, \sin\alpha)$ (this
is possible since $a^2 + b^2 = 1$).

ii) Proceeding as above, we get that (a − d)c = 0 and therefore there are two
possibilities:

ii') c = 0 or ii'') a − d = 0 .

In the case ii'), we obtain again the four matrices found in case i').

In the case ii''): d = a and b = −c; therefore,

$$A = \begin{pmatrix} a & -c \\ c & a \end{pmatrix} = R_\alpha,$$

with α ∈ [0, 2π) such that $(a, c) = (\cos\alpha, \sin\alpha)$ (this
is possible since $a^2 + c^2 = 1$). □
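
For instance, α = π/2 yields $R_{\pi/2} = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}$ and $S_{\pi/2} = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}$, both of which are easily checked to be orthogonal. Geometrically, $R_\alpha$ acts on $\mathbb{R}^2$ as the rotation by the angle α, while $S_\alpha$ acts as a reflection.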

 
