12 The Gradient
This set of notes dives deeper into the geometry of the first derivative. As a review, we remind ourselves of the notation for the gradient here.
Definition 12.1 (The Gradient) The gradient of a function
The symbol
This notation is convenient here as
12.1 Directional Derivatives
We have seen that
Definition 12.2 (Directional Derivative) The derivative of
Computing this seems difficult. But we can use the Fundamental Strategy of Calculus to save the day! We linearized the surface at a point using the differential, which gave the approximation
That is, the directional derivative is just a linear combination of the two basic slopes we already know!
Theorem 12.1 (Directional Derivative) If
All of this carries over to three or higher dimensions: if
Any time we have a collection of sums of products of terms we should think, ‘Is this a dot product?’ And in this case it is! If we factor the above equations into a dot product we see the directional derivative is related directly to the gradient.
Theorem 12.2 (Directional Derivatives and the Gradient)
12.2 Geometry of the Gradient
Since we know the interpretation of dot products in terms of angles, we can use the directional derivative formula above to help us understand the direction the gradient points in.
If a vector
This actually tells us alot!
Theorem 12.3
- The gradient points in the direction of maximal directional derivative.
- Its magnitude is the directional derivative in that direction
- In the orthogonal direction to the gradient, the directional derivative is zero: the function is not changing!
The last of these facts is so useful on its own, that it gets it’s own theorem box:
Theorem 12.4 The gradient vector is orthogonal to the level sets of a function, and points in the direction of increase.
This is very helpful for understanding a function from its gradient, as it lets us convert between level set understanding and gradient understandings!
12.2.0.1 The Gradient and Level Sets
When level sets are close to each other, that means the function is steeply increasing or decreasing, so the gradient is long. When level sets are far apart, that means the function is only slowly changing, so the gradient is short. Thus, there’s an inverse relationship between the length of the gradient and the density of level sets.
12.2.1 Tangent Planes to Level Sets
Because the gradient is a normal vector to level sets we can use the gradient to derive the equation for a tangent plane to a surface! We previously wrote it down of for functions,
But this was just in analogy with the tangent line case. Now, we wish to derive it from our original description of planes, in terms of their normal vectors: if
Describes the plane because it says
is the normal vector to our plane, we can directly write down the equation for the normal at
and, after computing the dot product we can see it’s the same equation we already know!
But knowing the normal vector also allows us to compute other geometric quantities of interest: such as the normal line: the parametric line which intersects a level set orthogonally.
This is also immediate: as if we know a point
Example 12.1 Compute the tangent plane and the normal line to
First, we re-arrange so that the surface equation is written as a level set:
This vector and the original point
12.3 Videos
12.3.1 Calculus Blue
12.3.2 Khan Academy:
Directional derivatives and slope:
Why the gradient is the direction of steepest ascent:
The gradient and Contour Maps: