Inverse function rule

In calculus, the inverse function rule is a formula that expresses the derivative of the inverse of a bijective and differentiable function $f$ in terms of the derivative of $f$ . More precisely, if the inverse of $f$ is denoted as $f^{-1}$ , where $f^{-1}(y)=x$ if and only if $f(x)=y$ , then the inverse function rule is, in Lagrange's notation,

\left[f^{-1}\right]'(a)={\frac {1}{f'\left(f^{-1}(a)\right)}}

.

Rule:

{\color {CornflowerBlue}{f'}}(x)={\frac {1}{{\color {Salmon}{(f^{-1})'}}({\color {Blue}{f}}(x))}}

Example for arbitrary

x_{0}\approx 5.8

:

{\color {CornflowerBlue}{f'}}(x_{0})={\frac {1}{4}}

{\color {Salmon}{(f^{-1})'}}({\color {Blue}{f}}(x_{0}))=4~

This formula holds in general whenever $f$ is continuous and injective on an interval $I$ , with $f$ being differentiable at $f^{-1}(a)$ ( $\in I$ ) and where $f'(f^{-1}(a))\neq 0$ . The same formula is also equivalent to the expression

{\mathcal {D}}\left[f^{-1}\right]={\frac {1}{({\mathcal {D}}f)\circ \left(f^{-1}\right)}},

where ${\mathcal {D}}$ denotes the unary derivative operator (on the space of functions) and $\circ$ denotes function composition.

Geometrically, a function and inverse function have graphs that are reflections, in the line $y=x$ . This reflection operation turns the gradient of any line into its reciprocal.[1]

Assuming that $f$ has an inverse in a neighbourhood of $x$ and that its derivative at that point is non-zero, its inverse is guaranteed to be differentiable at $x$ and have a derivative given by the above formula.

The inverse function rule may also be expressed in Leibniz's notation. As that notation suggests,

{\frac {dx}{dy}}\,\cdot \,{\frac {dy}{dx}}=1.

This relation is obtained by differentiating the equation $f^{-1}(y)=x$ in terms of $x$ and applying the chain rule, yielding that:

{\frac {dx}{dy}}\,\cdot \,{\frac {dy}{dx}}={\frac {dx}{dx}}

considering that the derivative of $x$ with respect to $x$ is 1.

Derivation

Let $f$ be an invertible (bijective) function, let $x$ be in the domain of $f$ , and let $y$ be in the codomain of $f$ .

This also means that $y$ is in the domain of $f^{-1}$ , and that $x$ is in the codomain of $f^{-1}$ .

(Sidenote: since f is a bijective function, $y$ being in the codomain of the function, $f$ , it means that $y$ is in the range of the function, $f$ .)

Since $f$ is an invertible function, we know that: $f(f^{-1}(y))=y$ and $f^{-1}(f(x))=x$

Which equation should be used for the derivation? Technically speaking, since $f$ and $f^{-1}$ are inverses, either one will work, and you can do some careful manipulation of swapping the instances of $f$ and $f^{-1}$ , however, that may cause confusion. So, which equation should be used?

Taking a look at both equations, it looks like, in order to do anything useful, we would have to differentiate both sides of the chosen equation. Due to the fact that there are nested functions, the chain rule will also be in use. We want our final formula to give the derivative of the inverse function, with respect to the input to the inverse function ( $y$ ), rather than with respect to another function. So, thinking about how the chain rule works, we know that we will have to multiply by the derivative of the "inside". If we use the first equation ( $f(f^{-1}(y))=y$ ), the "inside" function will be $f^{-1}(y)$ , and according to the chain rule, we would get $(f^{-1})^{\prime }(y)$ not nested within any other function. That being said, let's start.

${\begin{aligned}&&f(f^{-1}(y))&=y\\{\text{Differentiating both sides with respect to }}y&&\\&\implies &{\dfrac {\mathrm {d} }{\mathrm {d} y}}f(f^{-1}(y))&={\dfrac {\mathrm {d} }{\mathrm {d} y}}y\\{\text{Using the chain rule on the left side}}&&&\\&\iff &{\dfrac {\mathrm {d} \left(f(f^{-1}(y))\right)}{\mathrm {d} \left(f^{-1}(y)\right)}}{\dfrac {\mathrm {d} \left(f^{-1}(y)\right)}{\mathrm {d} y}}&={\dfrac {\mathrm {d} y}{\mathrm {d} y}}\\&\iff &{\dfrac {\mathrm {d} f(f^{-1}(y))}{\mathrm {d} f^{-1}(y)}}{\dfrac {\mathrm {d} f^{-1}(y)}{\mathrm {d} y}}&={\dfrac {\mathrm {d} y}{\mathrm {d} y}}\\&\iff &f^{\prime }\left(f^{-1}(y))\right)(f^{-1})^{\prime }(y)&=1\\{\text{Dividing both sides by }}f^{\prime }\left(f^{-1}(y))\right)&&&\\&\implies &(f^{-1})^{\prime }(y)&={\frac {1}{f^{\prime }\left(f^{-1}(y))\right)}}\end{aligned}}$

Great! Rather than using $a$ as the variable, we can rewrite this equation using $a$ as the input for $f^{-1}$ , and we get the following:[2]

$(f^{-1})^{\prime }(a)={\frac {1}{f^{\prime }\left(f^{-1}(a))\right)}}$

Examples

$y=x^{2}$ (for positive $x$ ) has inverse $x={\sqrt {y}}$ .

{\frac {dy}{dx}}=2x{\mbox{ }}{\mbox{ }}{\mbox{ }}{\mbox{ }};{\mbox{ }}{\mbox{ }}{\mbox{ }}{\mbox{ }}{\frac {dx}{dy}}={\frac {1}{2{\sqrt {y}}}}={\frac {1}{2x}}

{\frac {dy}{dx}}\,\cdot \,{\frac {dx}{dy}}=2x\cdot {\frac {1}{2x}}=1.

At $x=0$ , however, there is a problem: the graph of the square root function becomes vertical, corresponding to a horizontal tangent for the square function.

$y=e^{x}$ (for real $x$ ) has inverse $x=\ln {y}$ (for positive $y$ )

{\frac {dy}{dx}}=e^{x}{\mbox{ }}{\mbox{ }}{\mbox{ }}{\mbox{ }};{\mbox{ }}{\mbox{ }}{\mbox{ }}{\mbox{ }}{\frac {dx}{dy}}={\frac {1}{y}}

{\frac {dy}{dx}}\,\cdot \,{\frac {dx}{dy}}=e^{x}\cdot {\frac {1}{y}}={\frac {e^{x}}{e^{x}}}=1

Additional properties

Integrating this relationship gives

{f^{-1}}(x)=\int {\frac {1}{f'({f^{-1}}(x))}}\,{dx}+C.

This is only useful if the integral exists. In particular we need

f'(x)

to be non-zero across the range of integration.

It follows that a function that has a continuous derivative has an inverse in a neighbourhood of every point where the derivative is non-zero. This need not be true if the derivative is not continuous.

Another very interesting and useful property is the following:

\int f^{-1}(x)\,{dx}=xf^{-1}(x)-F(f^{-1}(x))+C

Where

F

denotes the antiderivative of

f

.

The inverse of the derivative of f(x) is also of interest, as it is used in showing the convexity of the Legendre transform.

Let $z=f'(x)$ then we have, assuming $f''(x)\neq 0$ :

{\frac {d(f')^{-1}(z)}{dz}}={\frac {1}{f''(x)}}

This can be shown using the previous notation $y=f(x)$ . Then we have:

f'(x)={\frac {dy}{dx}}={\frac {dy}{dz}}{\frac {dz}{dx}}={\frac {dy}{dz}}f''(x)\Rightarrow {\frac {dy}{dz}}={\frac {f'(x)}{f''(x)}}

Therefore:

${\frac {d(f')^{-1}(z)}{dz}}={\frac {dx}{dz}}={\frac {dy}{dz}}{\frac {dx}{dy}}={\frac {f'(x)}{f''(x)}}{\frac {1}{f'(x)}}={\frac {1}{f''(x)}}$

By induction, we can generalize this result for any integer $n\geq 1$ , with $z=f^{(n)}(x)$ , the nth derivative of f(x), and $y=f^{(n-1)}(x)$ , assuming $f^{(i)}(x)\neq 0{\text{ for }}0<i\leq n+1$ :

${\frac {d(f^{(n)})^{-1}(z)}{dz}}={\frac {1}{f^{(n+1)}(x)}}$

Higher derivatives

The chain rule given above is obtained by differentiating the identity $f^{-1}(f(x))=x$ with respect to $x$ . One can continue the same process for higher derivatives. Differentiating the identity twice with respect to $x$ , one obtains

{\frac {d^{2}y}{dx^{2}}}\,\cdot \,{\frac {dx}{dy}}+{\frac {d}{dx}}\left({\frac {dx}{dy}}\right)\,\cdot \,\left({\frac {dy}{dx}}\right)=0,

that is simplified further by the chain rule as

{\frac {d^{2}y}{dx^{2}}}\,\cdot \,{\frac {dx}{dy}}+{\frac {d^{2}x}{dy^{2}}}\,\cdot \,\left({\frac {dy}{dx}}\right)^{2}=0.

Replacing the first derivative, using the identity obtained earlier, we get

{\frac {d^{2}y}{dx^{2}}}=-{\frac {d^{2}x}{dy^{2}}}\,\cdot \,\left({\frac {dy}{dx}}\right)^{3}.

Similarly for the third derivative:

{\frac {d^{3}y}{dx^{3}}}=-{\frac {d^{3}x}{dy^{3}}}\,\cdot \,\left({\frac {dy}{dx}}\right)^{4}-3{\frac {d^{2}x}{dy^{2}}}\,\cdot \,{\frac {d^{2}y}{dx^{2}}}\,\cdot \,\left({\frac {dy}{dx}}\right)^{2}

or using the formula for the second derivative,

{\frac {d^{3}y}{dx^{3}}}=-{\frac {d^{3}x}{dy^{3}}}\,\cdot \,\left({\frac {dy}{dx}}\right)^{4}+3\left({\frac {d^{2}x}{dy^{2}}}\right)^{2}\,\cdot \,\left({\frac {dy}{dx}}\right)^{5}

These formulas are generalized by the Faà di Bruno's formula.

These formulas can also be written using Lagrange's notation. If $f$ and $g$ are inverses, then

g''(x)={\frac {-f''(g(x))}{[f'(g(x))]^{3}}}

Example

$y=e^{x}$ has the inverse $x=\ln y$ . Using the formula for the second derivative of the inverse function,

{\frac {dy}{dx}}={\frac {d^{2}y}{dx^{2}}}=e^{x}=y{\mbox{ }}{\mbox{ }}{\mbox{ }}{\mbox{ }};{\mbox{ }}{\mbox{ }}{\mbox{ }}{\mbox{ }}\left({\frac {dy}{dx}}\right)^{3}=y^{3};

so that

{\frac {d^{2}x}{dy^{2}}}\,\cdot \,y^{3}+y=0{\mbox{ }}{\mbox{ }}{\mbox{ }}{\mbox{ }};{\mbox{ }}{\mbox{ }}{\mbox{ }}{\mbox{ }}{\frac {d^{2}x}{dy^{2}}}=-{\frac {1}{y^{2}}}

,

which agrees with the direct calculation.

References

"Derivatives of Inverse Functions". oregonstate.edu. Archived from the original on 2021-04-10. Retrieved 2019-07-26.
"Derivatives of inverse functions". Khan Academy. Retrieved 23 April 2022.

Marsden, Jerrold E.; Weinstein, Alan (1981). "Chapter 8: Inverse Functions and the Chain Rule". Calculus unlimited (PDF). Menlo Park, Calif.: Benjamin/Cummings Pub. Co. ISBN 0-8053-6932-5.

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.

[1] "Derivatives of Inverse Functions". oregonstate.edu. Archived from the original on 2021-04-10. Retrieved 2019-07-26.

[2] "Derivatives of inverse functions". Khan Academy. Retrieved 23 April 2022.

Calculus
Precalculus	Binomial theorem Concave function Continuous function Factorial Finite difference Free variables and bound variables Graph of a function Linear function Radian Rolle's theorem Secant Slope Tangent
Limits	Indeterminate form Limit of a function One-sided limit Limit of a sequence Order of approximation (ε, δ)-definition of limit
Differential calculus	Derivative Differential Differential equation Differential operator Mean value theorem Notation Leibniz's notation Newton's notation Rules of differentiation linearity Power Sum Chain L'Hôpital's Product General Leibniz's rule Quotient Other techniques Implicit differentiation Inverse functions and differentiation Logarithmic derivative Related rates Stationary points First derivative test Second derivative test Extreme value theorem Maxima and minima Further applications Newton's method Taylor's theorem
Integral calculus	Antiderivative Arc length Basic properties Constant of integration Fundamental theorem of calculus Differentiating under the integral sign Integration by parts Integration by substitution trigonometric Euler Weierstrass Partial fractions in integration Quadratic integral Trapezoidal rule Volumes Washer method Shell method
Vector calculus	Derivatives Curl Directional derivative Divergence Gradient Laplacian Basic theorems Line integrals Green's Stokes' Gauss'
Multivariable calculus	Divergence theorem Geometric Hessian matrix Jacobian matrix and determinant Lagrange multiplier Line integral Matrix Multiple integral Partial derivative Surface integral Volume integral Advanced topics Differential forms Exterior derivative Generalized Stokes' theorem Tensor calculus
Sequences and series	Arithmetico–geometric sequence Types of series Alternating Binomial Fourier Geometric Harmonic Infinite Power Maclaurin Taylor Telescoping Tests of convergence Abel's Alternating series Cauchy condensation Direct comparison Dirichlet's Integral Limit comparison Ratio Root Term
Special functions and numbers	Bernoulli numbers e (mathematical constant) Exponential function Natural logarithm Stirling's approximation
History of calculus	Adequality Brook Taylor Colin Maclaurin Generality of algebra Gottfried Wilhelm Leibniz Infinitesimal Infinitesimal calculus Isaac Newton Fluxion Law of Continuity Leonhard Euler Method of Fluxions The Method of Mechanical Theorems
Lists	Differentiation rules List of integrals of exponential functions List of integrals of hyperbolic functions List of integrals of inverse hyperbolic functions List of integrals of inverse trigonometric functions List of integrals of irrational functions List of integrals of logarithmic functions List of integrals of rational functions List of integrals of trigonometric functions Secant Secant cubed List of limits Lists of integrals
Miscellaneous topics	Differential geometry curvature of curves of surfaces Euler–Maclaurin formula Gabriel's horn Integration Bee Proof that 22/7 exceeds π Regiomontanus' angle maximization problem Steinmetz solid