We can integrate the chain rule with other differentiation rules to handle functions composed of multiple functions. When differentiating compositions that involve three or more functions, it’s often necessary to apply the chain rule multiple times. This allows us to systematically derive the derivative without memorizing complex formulas, as the application of the chain rule can be repeated as needed.
In a general case, consider a function k(x)=h(f(g(x))).
First, apply the chain rule to find:
k′(x)=ddx(h(f(g(x))))=h′(f(g(x)))⋅ddx(f(g(x)))
Applying the chain rule once again gives us:
k′(x)=h′(f(g(x)))f′(g(x))g′(x)
chain tule for a composition of three functions
For all values of x for which the function is differentiable, if
k(x)=h(f(g(x))),
then
k′(x)=h′(f(g(x)))f′(g(x))g′(x)
In other words, we are applying the chain rule twice.
Notice that the derivative of the composition of three functions has three parts. (Similarly, the derivative of the composition of four functions has four parts, and so on.) Also, remember, we always work from the outside in, taking one derivative at a time.
Find the derivative of k(x)=cos4(7x2+1)
First, rewrite k(x) as
k(x)=(cos(7x2+1))4
Then apply the chain rule several times.
k′(x)=4(cos(7x2+1))3(ddxcos(7x2+1))Apply the chain rule.=4(cos(7x2+1))3(−sin(7x2+1))(ddx(7x2+1))Apply the chain rule.=4(cos(7x2+1))3(−sin(7x2+1))(14x)Apply the chain rule.=−56xsin(7x2+1)cos3(7x2+1)Simplify.
Don’t forget that cosn(t) is a commonly used shorthand notation for (cos(t))n. When we write it without the shorthand notation, we can clearly see why the chain rule is necessary in these situations.
A particle moves along a coordinate axis. Its position at time t is given by s(t)=sin(2t)+cos(3t). What is the velocity of the particle at time t=π6?
To find v(t), the velocity of the particle at time t, we must differentiate s(t). Thus,
v(t)=s′(t)=2cos(2t)−3sin(3t).
Substituting t=π6 into v(t), we obtain v(π6)=−2.
Watch the following video to see the worked solution to this example.
For closed captioning, open the video on its original page by clicking the Youtube logo in the lower right-hand corner of the video display. In YouTube, the video will begin at the same starting point as this clip, but will continue playing until the very end.
At this point, we present a very informal proof of the chain rule.
Proof
For simplicity’s sake we ignore certain issues: For example, we assume that g(x)≠g(a) for x≠a in some open interval containing a. We begin by applying the limit definition of the derivative to the function h(x) to obtain h′(a):
h′(a)=limx→af(g(x))−f(g(a))x−a
Rewriting, we obtain
h′(a)=limx→af(g(x))−f(g(a))g(x)−g(a)⋅g(x)−g(a)x−a
Although it is clear that
limx→ag(x)−g(a)x−a=g′(a),
it is not obvious that
limx→af(g(x))−f(g(a))g(x)−g(a)=f′(g(a))
To see that this is true, first recall that since g is differentiable at a,g is also continuous at a. Thus,
limx→ag(x)=g(a).
Next, make the substitution y=g(x) and b=g(a) and use change of variables in the limit to obtain
Let h(x)=f(g(x)). If g(1)=4,g′(1)=3, and f′(4)=7, find h′(1).
Use the chain rule, then substitute.
h′(1)=f′(g(1))g′(1)Apply the chain rule.=f′(4)⋅3Substituteg(1)=4andg′(1)=3.=7⋅3Substitutef′(4)=7.=21Simplify.
Watch the following video to see the worked solution to this example.
For closed captioning, open the video on its original page by clicking the Youtube logo in the lower right-hand corner of the video display. In YouTube, the video will begin at the same starting point as this clip, but will continue playing until the very end.