This article is from the Relativity and FTL Travel FAQ, by Jason W. Hinson jason@physicsguy.com with numerous contributions by others.
In this section I will introduce just a few basic ideas which will give
the reader a feeling for what tensors are. This is simply meant to provide a
minimum amount of information to those who do not know about tensors.
Basically, a tensor is a geometrical entity which is identified by its
various components. To give a solid example, I note that a vector is a type
of tensor. In an x-y coordinate system, a vector has one component which
points in the x direction (its x component) and another component which
points in the y direction (its y component). If you consider a vector
defined in three dimensional space, then it will also have a z component as
well. Similarly a tensor in general is defined in a particular space which
has some number of dimensions. The number of dimensions of the space is also
called the number of dimensions of the tensor. Note that vectors have a
component for each individual (one) dimension, and they are called tensors
of rank 1. For other tensors, you have to use two of the dimensions in order
to specify one component of the tensor. In x-y space, such a tensor would
have an xx component, an xy component, a yx component, and a yy component.
In three-space, it would also have components for xz, zx, yz, zy, and zz.
Since you have to specify two of the dimensions for each component of such a
tensor, it is called a tensor of rank 2. Similarly, you can have third rank
tensors (which have components for xxx, xxy, ...), fourth rank tensors, and
so on.
So that you aren't confused, I want to explicitly note that the
dimensionality of a tensor (the number of dimensions of the space in which
the tensor is defined) is independent of the rank of the tensor (the amount
of those dimensions that have to be used to specify each component of the
tensor). In any dimensional space, we can have a tensor of rank 0 (just a
number by itself, because it is not associated in any way with any of the
dimensions), a tensor of rank 1 (like a vector--it has a component for every
one dimension you can specify), a tensor of rank 2 (it has a component for
every pair of dimensions you can specify), etc.
Now we look at a very important property of tensors. In fact, it is the
property which really defines whether a set of components make up a tensor.
This property involves the question of how the tensor's components change
when you change the coordinate system you are using for the space in which
the tensor is defined. So, let's consider an example in two dimensional
space where you go from some coordinate system (call the coordinates x and
y) to some other coordinate system (call these coordinates x' and y'). There
will be some sort of relationship between the two systems. For example, say
we start at some point in this space such that our coordinates are (x,y) and
(x',y') (depending on which coordinate system you are using). Now, say we
move an "infinitesimal distance" in x (using the first coordinate system).
Call that distance dx. When we do so, we may have changed our x' position
(using the second coordinate system) by some infinitesimal amount, dx'.
Also, we may have changed our y' position by some amount dy'. We can use
these concepts of infinitesimal changes to define some relationships between
the two systems. We can answer the question "how does x' change when x
changes at this point" by noting the ratio, dx'/dx. Similarly we can write
dx/dx' to denote how much x changes with changes in x' at some point, and
dy'/dx denotes how y' changes with changes in x.
Please understand that these are not simply ratios of definite numbers.
For example, dx'/dx is not necessarily the inverse of dx/dx' because dx in
one expression is NOT the same as dx in the other. The first expression uses
dx in the following context: "If I hold y constant and change x by an amount
dx, x' and y' might change by amounts dx' and dy'. Take the amount that x'
changes (dx') and divide it by the amount I changed x (dx)." The second
expression uses dx in the following context: "If I hold y' constant and
change x' by an amount dx', x and y might change by amounts dx and dy. Take
the amount that x changes (dx) and divide it by the amount I changed x'
(dx')." You can see that the dx in the former context does not have to be
the same amount as dx in the latter. So, when I write dx'/dx or dx/dx' or
dy/dx' etc, you must understand that the form of these ratios (what's on top
and what's on bottom) defines how they are produced, and they are not just
ratios of definite numbers. (Those who know something of calculus will
obviously recognize these terms as simple partial derivatives, but
anyway....)
Now, all together there are four of these ratios which denote how the
x' and y' coordinates change with changes in x and y:
dx'/dx, dx'/dy, dy'/dx, and dy'/dy.
Similarly, there are four more to denote how x and y change with changes in
x' and y':
dx/dx', dx/dy', dy/dx', and dy/dy'.
In general the values of these ratios can depend on where you are on a
manifold, so each ratio is generally a function of x and y (or x' and y', if
you like).
Now, we have these ratios which help us relate one coordinate system to
another. If we have a tensor defined in this space, then we must be able to
use those ratios to find out how the tensor's components themselves change
when we go from considering them in one coordinate system to considering
them in the other. Let's consider a tensor of rank 1 (a vector) in a two
dimensional space. Let the vector, call it V, have an x component (V_x) and
a y component (V_y). Then, the rules for finding the x' and y' components of
the vector at some point are the following:
(Eq 5:4)
V_x' = dx'/dx V_x + dx'/dy V_y
and
V_y' = dy'/dx V_x + dy'/dy V_y.
That is the way in which this type of first rank tensor must transform
from one coordinate system to another. Note that we can write both equations
in Equation 5:4 by using the following:
(Eq 5:5)
V_a = SUM(b = x,y) [da'/db V_b]
In that expression, "a" can be either x or y (so we actually have two
equations--those in Equation 5:4). Also, the right side of the equation is a
summation where the first term in the summation is found by letting b = x,
and the second term is found by letting b = y. Further, we could make this
expression more general by noting that it will be true for a space with
higher dimensions when we let "a" be any one of those dimensions and let the
sum with b extend over all the dimensions.
The fact that the physical components of a vector do actually transform
this way is what makes the vector a tensor. However, we should note that not
all types of vectors transform this way.
To show this is so, first we will consider a function which has a value
at every point in x-y space. Call the function f(x,y). Such a function is a
0 rank tensor, because at any point in the space, it has some single,
numerical value (it does not have components for x and y like a vector
does--you can't ask "what's its value in the x direction", or "what's its
value in the y direction", because it has only a single number at any
point). Note that if we change to another coordinate system, the value of f
at some physical point in the space will not change. Because it has no x or
y component, it is invariant when you change coordinate systems, as are all
0 rank tensors. This is the way all 0 rank tensors must transform when you
change coordinate systems--they must be invariant.
Now, back to the point that there are other types of vectors which do
not transform as discussed earlier. Let's take the function we were just
discussing, f(x,y), at some point and ask "how does it change with small
changes in x?" If the function changes by an amount df when we move to
another x location a distance dx away, then we can write the expression
df/dx to tell how f changes with x. We can do the same in y and have the
expression df/dy. Then we could define a vector (call it G) which has an x
component (G_x) equal to df/dx at every point in x and y, while it has a y
component (G_y) equal to df/dy at every point. Now, what if we do this same
procedure in the x'-y' coordinate system. First, we need to convert f into a
function f'. We do this such that if a point in our space has coordinates
(x,y) in one coordinate system while the same physical point has coordiantes
(x',y') in the other coordinate system, then we want f(x,y) = f'(x',y').
That way f' is the proper representation of f in the primed coordinate
system. Now we again find a vector, G, and we will end up with the x' and y'
components of the G vector such that G_x' = df'/dx' and G_y' = df'/dy'.
We now want to figure out how to transform G from one frame to another.
First, we will look at G_x' = df'/dx' which says that G_x' comes from
knowing how f' changes with respect to x' (i.e. df'/dx'). To transform this
component of G, we must know how to find df'/dx' using G_x and G_y. This
means we will be using information about how f changes with respect to x and
y (i.e., using df/dx and df/dy). We will also need to use information about
how x and y change with respect to x'. Without taking the time to fully
explain the calculus involved, perhaps the following equation will not be
too surprising:
(Eq 5:6) df' df' dx df' dy -- = -- * -- + -- * -- dx' dx dx' dy dx'
(Eq 5:7)
df' df dx df dy
-- = -- * -- + -- * --
dx' dx dx' dy dx'
dx dy
= G_x* -- + G_y* --
dx' dx'
(Eq 5:8) G_x' = dx/dx' G_x + dy/dx' G_y and G_y' = dx/dy' G_x + dy/dy' G_y
(Eq 5:9) G_a' = SUM(b = x, y) [db/da' G_b]
x y
V and V (or V^{x} and V^{y}),
G and G (or G_x and G_y) x y
(Eq 5:10)
a' da' b
V = SUM(b = x,y) -- V
db
(Eq 5:11)
db
G = SUM(b = x,y) -- G
a' da' b
(Eq 5:12)
a'b' da' db' xx da' db' xy da' db' yx da' db' yy
U = -- * -- U + -- * -- U + -- * -- U + -- * -- U
dx dx dx dy dy dx dy dy
[ da' db' ce ]
= SUM(c & e vary over all dimensions) [ -- * -- U ]
[ dc de ]
(Eq 5:13)
[ dc de ]
U = SUM(c,e) [ -- * -- U ]
a'b' [ da' db' ce ]
(Eq 5:14) a' [ da' de c ] U = SUM(c,e) [ -- * -- U ] b' [ dc db' e ]
 
Continue to: