Ad
related to: chain rule example problemsstudy.com has been visited by 100K+ users in the past month
Search results
Results From The WOW.Com Content Network
In this situation, the chain rule represents the fact that the derivative of f ∘ g is the composite of the derivative of f and the derivative of g. This theorem is an immediate consequence of the higher dimensional chain rule given above, and it has exactly the same formula. The chain rule is also valid for Fréchet derivatives in Banach spaces.
In probability theory, the chain rule [1] (also called the general product rule [2] [3]) describes how to calculate the probability of the intersection of, not necessarily independent, events or the joint distribution of random variables respectively, using conditional probabilities.
The chain rule can be used to find whether they are getting closer or further apart. For example, one can consider the kinematics problem where one vehicle is heading West toward an intersection at 80 miles per hour while another is heading North away from the intersection at 60 miles per hour.
The chain rule has a particularly elegant statement in terms of total derivatives. It says that, for two functions f {\displaystyle f} and g {\displaystyle g} , the total derivative of the composite function f ∘ g {\displaystyle f\circ g} at a {\displaystyle a} satisfies
A very simple example of a useful variable change can be seen in the problem of finding the roots of the sixth-degree polynomial: x 6 − 9 x 3 + 8 = 0. {\displaystyle x^{6}-9x^{3}+8=0.} Sixth-degree polynomial equations are generally impossible to solve in terms of radicals (see Abel–Ruffini theorem ).
Suppose a function f(x, y, z) = 0, where x, y, and z are functions of each other. Write the total differentials of the variables = + = + Substitute dy into dx = [() + ()] + By using the chain rule one can show the coefficient of dx on the right hand side is equal to one, thus the coefficient of dz must be zero () + = Subtracting the second term and multiplying by its inverse gives the triple ...
This can be derived using the chain rule for derivatives: = and dividing both sides by to give the equation above. In general all of these derivatives — dy / dt , dx / dt , and dy / dx — are themselves functions of t and so can be written more explicitly as, for example, d y d x ( t ) {\displaystyle {\frac {dy}{dx}}(t)} .
Reverse accumulation traverses the chain rule from outside to inside, or in the case of the computational graph in Figure 3, from top to bottom. The example function is scalar-valued, and thus there is only one seed for the derivative computation, and only one sweep of the computational graph is needed to calculate the (two-component) gradient.