Abstract vector spaces

The vectors in Rn compose a vector space, because they can be added to each other and multiplied by scalars according to certain rules. (In fact, Rn is a real vector space, because the scalars are real numbers. Later we shall treat complex vector spaces.)

In general, a vector space is any structure that behaves like Rn.

Formal definition

A scalar is a real number. A (real) vector space consists of five things: a set of vectors; a particular vector 0, called the zero-vector; for each vector v, a negative -v; an assignment of a sum u + v to each pair of vectors u and v; and an assignment of a product rv to each scalar r and vector v. Moreover, the assignments of sums and products must satisfy the following rules (where bold-face letters are vectors, and italic letters are scalars, that is, real numbers):

Addition rules

For all vectors u, v, and w:

u + v = v + u ;
(u + v) + w = u + (v + w) ;
v + 0 = v ;
v + (-v) = 0 .

Scalar-multiplication rules

For all scalars r and s, and all vectors u and v:

r(u + v) = ru + rv ;
(r + s)v = rv + sv ;
(rs)v = r(sv) .

`Rule of unity'

1v = v .

This completes the definition of a real vector space. I have distinguished the `rule of unity' (my term) to emphasize that it does not follow from the other rules.

Consequences of the definition

You can prove the following from the vector-space rules: the zero-vector is unique; each vector has a unique negative; 0v = 0 for every vector v; r0 = 0 for every scalar r; and (-1)v = -v for every vector v. (For instance, 0v = (0 + 0)v = 0v + 0v; adding the negative of 0v to both sides gives 0v = 0.)

Examples

Every vector space has at least one vector, namely the zero-vector. Possibly this is the only vector in the space, in which case the space is trivial.

The set R of real numbers, with its operations of addition and multiplication, is a vector space.

With the usual operations of addition and scalar multiplication, the m×n matrices compose a vector space, which we could call Rm×n.
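As a quick numerical sketch (mine; the notes do not mention any software, and NumPy is just a convenient choice), 2×3 matrices add and scale entrywise, exactly like vectors:

```python
import numpy as np

# Two 2x3 matrices, treated as vectors in R^(2x3)
A = np.array([[1., 2., 3.],
              [4., 5., 6.]])
B = np.array([[0., 1., 0.],
              [1., 0., 1.]])
r = 2.5

# Entrywise addition and scalar multiplication obey the vector-space rules,
# for example r(A + B) = rA + rB:
print(np.allclose(r * (A + B), r * A + r * B))   # True
```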

If n is a nonnegative integer, then a polynomial of degree n is a sum

a0 + a1x + a2x2 + ... + anxn ,

where x is a variable, the ai are scalars, and an is not zero. In particular, a scalar a is a polynomial of degree 0, if a is not zero; if a = 0, then a is the zero-polynomial, whose degree is less than zero. We can observe the following: the sum of two polynomials of degree at most n again has degree at most n, and so does any scalar multiple of such a polynomial; hence, with the usual operations, the polynomials of degree at most n compose a vector space.

The set of polynomials of degree at most n can be denoted Pn.
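A sketch of my own (not from the notes): a polynomial in Pn can be stored as its coefficient vector (a0, a1, ..., an), and the vector-space operations on Pn then become entrywise operations on those coefficients.

```python
import numpy as np

# p(x) = 1 + 2x + 3x^2 and q(x) = 4 - x^2, stored as coefficient vectors
# (constant term first), both viewed as elements of P2.
p = np.array([1., 2., 3.])
q = np.array([4., 0., -1.])

# Addition and scalar multiplication in P2 act entrywise on coefficients:
print(p + q)    # [5. 2. 2.]   -> 5 + 2x + 2x^2
print(3 * p)    # [3. 6. 9.]   -> 3 + 6x + 9x^2
```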

The set of positive real numbers is a vector space, provided that addition of vectors is understood to be multiplication, and scalar-multiplication by a real number r is understood to be exponentiation by r.
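A small numerical check of a few of the rules for this unusual space (my sketch, using NumPy): vector `addition' of positive reals u and v means the product uv, and the product of a scalar r with v means v raised to the power r.

```python
import numpy as np

u, v = 2.0, 5.0      # "vectors": positive real numbers
r, s = 3.0, -0.5     # scalars: ordinary real numbers

add = lambda x, y: x * y      # vector addition is multiplication
smul = lambda c, x: x ** c    # scalar multiplication is exponentiation

# r(u "+" v) = (ru) "+" (rv), i.e. (uv)^r = u^r * v^r
print(np.isclose(smul(r, add(u, v)), add(smul(r, u), smul(r, v))))   # True
# (r + s)v = (rv) "+" (sv), i.e. v^(r+s) = v^r * v^s
print(np.isclose(smul(r + s, v), add(smul(r, v), smul(s, v))))       # True
# rule of unity: 1v = v, i.e. v^1 = v
print(np.isclose(smul(1.0, v), v))                                   # True
```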

Suppose A is an n×n matrix such that A2 = A. We can define the product of a vector x in Rn by a scalar r to be rAx. With this new product, and the usual addition, Rn satisfies all rules for vector spaces, except the `rule of unity' (unless A is the identity).
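A concrete instance (my own choice of A, not the notes'): the projection A = [[1, 0], [0, 0]] satisfies A2 = A but is not the identity, and under the modified product the vector 1x = Ax differs from x.

```python
import numpy as np

A = np.array([[1., 0.],
              [0., 0.]])          # a projection: A @ A == A, but A is not I
print(np.allclose(A @ A, A))      # True

def smul(r, x):
    """Modified scalar product: the product of r and x is defined as r A x."""
    return r * (A @ x)

x = np.array([3., 4.])
print(smul(1.0, x))               # [3. 0.]  -- not equal to x: unity fails
# The other scalar-multiplication rules still hold, e.g. (rs)x = r(sx):
r, s = 2.0, 5.0
print(np.allclose(smul(r * s, x), smul(r, smul(s, x))))   # True
```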

Subspaces

Suppose V is a vector space, and W is a nonempty subset of V that is closed under the operations of V: whenever u and v are in W and r is a scalar, both u + v and rv are in W. Then W is by definition a subspace of V. In particular, W is a vector space. (Note that 0 is in W.)

Any vector space has the following two subspaces: itself, and the trivial subspace {0}. (Possibly these are identical.)

If A is an m×n matrix, then the set of solutions x to the homogeneous linear system

Ax = 0

is a subspace of Rn, called the nullspace of A.
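One way to compute a basis for the nullspace numerically (a sketch of mine, using NumPy's singular value decomposition; the notes do not prescribe a method): the right singular vectors belonging to zero singular values span the solutions of Ax = 0.

```python
import numpy as np

A = np.array([[1., 2., 3.],
              [2., 4., 6.]])      # rank 1, so the nullspace has dimension 2

U, s, Vt = np.linalg.svd(A)
rank = int(np.sum(s > 1e-10))
null_basis = Vt[rank:].T          # columns span the nullspace of A

print(np.allclose(A @ null_basis, 0))    # True: each column solves Ax = 0
# The nullspace is closed under addition and scalar multiplication:
x, y = null_basis[:, 0], null_basis[:, 1]
print(np.allclose(A @ (x + y), 0), np.allclose(A @ (7.0 * x), 0))   # True True
```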

Linear combinations and spanning sets

Suppose v1, v2,..., vn are vectors in a vector space V. A linear combination of these vectors is a vector of the form

x1v1 + x2v2 + ... + xnvn ,

where the coefficients xi are scalars. Note that any such vector is in V. If all of the coefficients xi are zero, then the linear combination can be called trivial. The set of all linear combinations of the vectors vi is a subspace W of V called the span of the vi and denoted

span{v1, v2,..., vn} .

We also say that W is spanned by the vectors vi, and that the set {v1, v2,..., vn} is a spanning set for W.

Spanning sets, in general, are not unique, and may be redundant. For example, Rn is spanned by the set

{e1, e2, ..., en},

where ei has 1 in row i and 0 everywhere else. But if v is any other vector in Rn, then Rn is also spanned by the set {e1, e2, ..., en, v}.

To check whether a vector b in Rm is a linear combination of vectors a1, a2,..., an, write the vectors ai as the columns of an m×n matrix A, and set up the equation

Ax = b .

The product Ax is a linear combination of the columns of A, so the equation is consistent if and only if b is a linear combination of the columns of A.
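One way to carry out this check numerically (my sketch; the rank comparison is a standard alternative to row reduction): Ax = b is consistent exactly when appending b as an extra column does not raise the rank of A.

```python
import numpy as np

a1 = np.array([1., 0., 1.])
a2 = np.array([0., 1., 1.])
b  = np.array([2., 3., 5.])       # b = 2 a1 + 3 a2, so it lies in the span

A = np.column_stack([a1, a2])
consistent = np.linalg.matrix_rank(A) == np.linalg.matrix_rank(np.column_stack([A, b]))
print(consistent)                 # True: b is a linear combination of a1 and a2

x, *_ = np.linalg.lstsq(A, b, rcond=None)   # recover the coefficients
print(x)                          # approximately [2. 3.]
```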

Linearly independent sets

A set {v1, v2,..., vr} of vectors from a vector space is called linearly independent if one of the following equivalent conditions holds: the only linear combination of the vi that equals the zero-vector is the trivial one; or, none of the vi is a linear combination of the others. A linearly independent set has no redundancy, in this sense: if a linearly independent set spans a certain vector space, and if you take a vector away from the set, then the remaining vectors no longer span the space. In particular, a linearly independent set cannot contain the zero-vector.

To check for linear independence in a set of n vectors in Rm, write the vectors as the columns of an m×n matrix. The nullspace of the matrix is the trivial vector space if and only if the columns are linearly independent.
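A sketch of this test with NumPy (mine, not the notes'): the nullspace is trivial exactly when the rank of the matrix equals the number of columns.

```python
import numpy as np

# Three vectors in R^3, written as the columns of a 3x3 matrix.
V = np.column_stack([[1., 0., 0.],
                     [1., 1., 0.],
                     [1., 1., 1.]])
print(np.linalg.matrix_rank(V) == V.shape[1])   # True: columns are independent

# Replace the last column by a linear combination of the first two:
W = V.copy()
W[:, 2] = 2 * W[:, 0] - W[:, 1]
print(np.linalg.matrix_rank(W) == W.shape[1])   # False: columns are dependent
```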

Note a consequence of the last fact: no set of n vectors in Rm is linearly independent if m < n, because an m×n matrix with more columns than rows must have a nontrivial nullspace. Such sets may or may not be independent if m is not less than n.

Suppose A is an m×n matrix, and B is an n×r matrix. Then the columns of the product AB are linear combinations of the columns of A. Knowing whether the columns of A are independent tells you nothing about whether the columns of AB are independent, and conversely.
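A brief numerical illustration (mine): column j of AB equals A applied to column j of B, so it is a linear combination of the columns of A with coefficients taken from column j of B.

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((4, 3))
B = rng.standard_normal((3, 2))

AB = A @ B
# Column j of AB is A applied to column j of B:
print(np.allclose(AB[:, 0], A @ B[:, 0]))   # True
print(np.allclose(AB[:, 1], A @ B[:, 1]))   # True
```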

Bases

[The word bases is the plural of basis, which is originally Greek.] Linear independence is an intrinsic property of a subset of a vector space. If a linearly independent set also spans the vector space it is in, then it is called a basis of that space.

Any linearly independent set is the basis of something, namely its span.

The standard basis of Rn is the set {e1, e2, ..., en} (where ei is as defined above; it is also column i of the n×n identity matrix). An arbitrary subset of Rn is a basis (of Rn) if and only if it has n elements, which are the columns of an invertible matrix.
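A sketch of this criterion (mine, not the notes'): write the candidate vectors as the columns of a square matrix and test invertibility, for instance by checking the rank.

```python
import numpy as np

candidates = np.column_stack([[1., 1., 0.],
                              [0., 1., 1.],
                              [1., 0., 1.]])   # three vectors in R^3

n = candidates.shape[0]
is_basis = candidates.shape[1] == n and np.linalg.matrix_rank(candidates) == n
print(is_basis)   # True: the matrix is invertible, so the columns form a basis of R^3
```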

Suppose B is a finite basis {v1, v2, ..., vn} of a vector space V. Then for any vector v in V, there is a unique vector x in Rn such that

v = x1v1 + x2v2 + ... + xnvn ;

the vector x is called the B-coordinate vector of v, and denoted by

(v)B .

The function from V to Rn that takes a vector to its B-coordinate vector is a linear transformation. One can use it to show that every basis of V has size n; this justifies calling n the dimension of V.
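When V is Rn itself and the basis vectors are the columns of an invertible matrix P, the coordinate vector can be found by solving a linear system (a sketch of mine): (v)B is the solution x of Px = v.

```python
import numpy as np

# Basis B of R^2: v1 = (1, 1) and v2 = (1, -1), stored as the columns of P.
P = np.column_stack([[1., 1.],
                     [1., -1.]])
v = np.array([3., 1.])

x = np.linalg.solve(P, v)     # the B-coordinate vector (v)_B
print(x)                      # [2. 1.], since v = 2*v1 + 1*v2
print(np.allclose(P @ x, v))  # True
```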
