Integrals: The Fundamental Theorem of Calculus
The area under a curve, however the curve may go as long as it doesn't double-back (which would create a multi-map, breaking the single-valued definition of function) then it's good for figuring the area between it and the abscissa. To be demonstrated is that such an area is continuous with the continuous abscissa even if the function is finitely discontinuous on the interval, which is less of a constraint on the integrand than the restriction on the derivative's argument, being continuity.
Definitions
Riemann Integrability
Integration has an additional requirement upon being a function type of map and required of it in order for it to be integrated, which is that its partitioned fill be bounded by two step functions (one above and one below the function) with the deviation from an infinitisemial partitioning being reducible with the partition size (where partition $\text{size}\equiv\frac{1}{\text{number})}$).
The Riemann Integral
The area between a function and the $y=0$ line ($x$-axis) is called the integral of that function, denoted here with the integrand on the LHS and integral with evaluation bar and limits on the RHS:
We will see that the operation of integration is the antiderivative of the integrand. The antiderivative is the inverse operation of the derivative over an interval, but without the interval start point the antiderivative adds an unknown constant, and is called the indefinite form of the integral. The constant of integration in the indefinite integral represents any fixed global value added to the antiderivative of $f$ and as cancelled by the difference of them in a definite integral on its interval, with their combined difference shown in the indefinite form in \eqref{derivative-inverse}:
If we define a new function, labelled capital-$F$, as the area between the curve, $y=f(t)$, and the $t$-axis, then we can say immediately about the integral of a constant function, as seen in Figure 1 (below), that the integral of a constant, $c$, with respect to anything, here it is $t$, depends linearly on the variable endpoint, labelled $x$:
The variable $t$ as parameter, uses the infinitesimal symbol $dt$, and also known as delta-$t$, $\delta t$.
The Integral Function
The area, $F$, is a function of the interval, for some interval, $[a, x]$, that is: $F=F(a, x, f)$.
Verify that for \eqref{integral-constant} the integral value attains zero for equal endppoints, $x=a$, as it always must for any integrand.
Next, using the inductive method here, and building up an interval from zero ($x\gt a$, in \eqref{integral-constant}),
The change in $F$ for a given change $b$ is $\Delta F = F( a, x + b, f) - F(a, x, f)$. So, to use the Greek letter Delta (for the root of difference diaphora) to symbolize the difference of the quantity:
Continuity of Integral
At eq. \eqref{delta-F} is the formula for an area which is a rectangle of width $b$ and height $f(x+b)$, and as long as the function is continuous then the difference of $f(x+b)$ and $f(x)$, is less than $\epsilon$ for the given step size (partition size), $b$, then this construction entails integrals. Integrals apply to step functions too, which include those discontinuous functions which step for a single point (zero width) or curves which have a finite set of such exorbitant points along the interval.
Thus a simply integrable function yields a continuous integral in $x$, or alternatively in $a$ (with some sign (negativity) business). Since for any given partition size, the integral over any partition segment of an integrable function is that which is bounded between step functions of relative extrema of the function within the partition. And a constant (boundary) function integrates as a linear function of the endpoint, which is continuous.
A derivative is only defined if the function is smooth, so the fact that an integral is always continuous satisfies that aspect of the FTC, eq. \eqref{derivative-inverse}.
So, the change in the area per change in interval is proportional to that change, ($\Delta x = b$), and the value of $f$ at $x=x+b$. Below, the area between the low-point and zero is constant and left out.
The area between function $f$ and $y=0$ (the line with zero slope and zero as $y$-intercept) over the bounded yet imprecisely defined interval, $[a, x]$, is the filled region.
The sum of rectangle areas, with more rectangles over the given interval, is more accurate, but it's returns are diminishing as the size of the neglected part contributing to the change in $F$ per change in $x$ is $ b \ll 1$. The error reduced by doubling the partition granularity is
$$ \frac{\Delta F(a, x, f)}{\Delta x} = f(x) $$
The difference in area acquired by extending the interval boundary by $b$ is simply the height of the function at $(x+b)$ times the width, the differential parameter $b$. We can say the quantity of such area is, in general, a function of the following parameters.
- $f(x)$, the curve making the top boundary,
- $a$, the line going from the point $(a, 0)$ to point $(a, f(a))$, as a section of the entire vertical line $(a, y)$, making the left boundary,
- $x$, defining the right boundary.
Where the fourth bounding line, the $x$-axis, is constant. So the area which is bounded between $f(x)$ and the $x$-axis, over the interval, $[a, x]$, can be solved by the differential equation, $D_x F = fb$, which is identical to the derivative formula for $f$, except now the derivative is $f(x)$ (compare to $D_x f(x)$ derivative formula). We can write, with a change of variable $x\to x$ while keeping $a$ constant, we write:
$$ \lim\limits_{b\ll 1}\frac{F(a, x+b)-F(a, x)}{b} $$
$$ =D_{x} F(a, x) = f(x) $$
So for arbitrarily small $b=\Delta x$, the ratio becomes the derivative of $F$ at $x$, and is equal to $f(x)$. This immediately tells us that the area function, $F$, is the antiderivative of the function $f$.
The area which could be thought of as a (finite) series of the function in subject as kernel to a summation operating from the interval lower bound in subject to the intervals upper bound all terms multiplied by some differential parameter $k$, as the uniform width of the rectangle heights.
The $k$-width rectangles add up to some area equivalent to the area between the function $f(x)$ and the $x$-axis (the abscissa), for the given interval $[a, b]$. So, the change in that total area as a function of the upper bound of the interval $b$ is the height of the the curve at $b$ so $F(x+b) - F(x) = b f(x) $
For historical placement of this notion, the antiderivative of $x^{-1}$ was solved by Gottfried Leibniz in 1676, first publisher of the formalism of calculus [1].
The example polynomial used above, $f$, is a truncated sine series, of four times the argument, to sixth degree in the Taylor series. And the relationship between the integral and the derivative of a function is known as the Fundamental Theorem of Calculus.
Polynomial Integrand
For the example curve, depicted in the above figures, the integral of the polynomial is evaluated term by term using the antiderivative of the monomial.
$$ \int_{a}^{x} \left(4x - \frac{32}{3}x^3 + \frac{128}{15}x^5\right)dx $$
The integral of a sum of functions (in this case, a trinomial) is the sum of the integrals of those functions. This follows from the linearity of the derivative: since the derivative of a sum is the sum of the derivatives, the antiderivative shares this property.
$$ D_x\sum_i f_i(x)=\sum_i D_x f_i(x) $$
Since $D_x (f_i + f_{i+1}) = D_x f_i + D_x f_{i+1}$:
$$ \frac{d}{dx}(f_i + f_{i+1}) = \lim_{dx\to 0}\frac{f_i(x+dx) + f_{i+1}(x+dx)-f_i(x) - f_{i+1}(x)}{dx} $$
$$ = \lim_{dx\to 0}\frac{d f_i }{dx}+\frac{ d f_{i+1}}{dx} $$
So, the integral of the function figured is:
Since the derivative of eq. \eqref{integral_f} is f, we verify the antiderivative is correct.
Geometric Series Approximation
Integral of $r^x, \forall\, 0 \lt r\lt 1$
The integral of $r^x$ is the continuous summation, or Riemann Integral, of the exponential with a proper fraction, $r$ not $e$, as base.
With $r^x=e^{\ln (r) x}$, so we have:
This of course should be compared to the series for a range of proper fractions, which has some similarity to the approximation of the Harmonic Numbers to the integral of the Hyperbola in that the result is asymptotically accurate.
| $r$ | $\frac{1}{1-r}$ | $\frac{-1}{\ln (r)}$ | $\Delta$ |
|---|---|---|---|
| $0.25$ | $\frac{4}{3}$ | $0.72$ | $0.61$ |
| $\frac{1}{3}$ | $1.5$ | $0.91$ | $0.59$ |
| $0.5$ | $2$ | $1.44$ | $0.56$ |
| $0.8$ | $5$ | $4.48$ | $0.52$ |
| $0.9$ | $10$ | $9.49$ | $0.51$ |
| $0.9999$ | $10,000$ | $9,999.50$ | $0.50$ |
The approximation is more accurate the closer $r$ is to one, since the deviation is almost a fixed amount of undershot for a given series' value. Compare this to the Harmonic Numbers approximation, Euler Constant, which undershoots the Harmonic Numbers series for the same reason $-1/\ln(r)$ undershoots $1/(1-r)$, which is the continuous approximation is continuously decreasing while the corresponding series is constant for each unit of abscissa.
Let's look at the set of proper fractions as domain interval for the two functions, to compare the Geometric Series with our continuous function approximation graphically.
Integration By Parts
Integrals are everywhere in physics, and over and over again the method employed is Integration By Parts. The solution to an integral can be solved by breaking the integrand into two factor terms, when possible, using the following method.
When the derivative of a product function, $F'=(UV)'$, f-prime equals u-v-all-primed, is integrated you get the function plus a constant.
The Fundamental Theorem of Calculus, eq. \eqref{FTC}, is all we need for the the I.B.P. theorem.
We can look at the two terms of a product function's derivative and use the identity created by applying the Product Rule to the product function. The article section on it can be reviewed here: Product Rule .
The way the Integration By Parts formula, eq. \eqref{IBP}, is used is by decomposing an integral in question into one of the two additive integrals, which then makes the job of integration simpler because the decomposition separated the integration into a formula of two smaller integrals, with one factor always working best as the derivative factor (primed factor).
Integrating Our Simple Function
The antiderivative of a function is generally not obvious, and our example function, that of the normalized one-half power binomial, $f(x)=\sqrt{1+x^2}$, is no exception.
$$ \begin{equation}\label{simple_integral} F = \int \sqrt{1+x^2} dx \end{equation} $$
Since the function has an inner-function, $x^2$, you can't just use the Power Rule on the outer function, $\sqrt{x}$, because the Chain Rule requires us to account for the derivative of the inner function, $1+x^2$. Next, I.B.P., eq. \eqref{IBP}, can't simply be applied in the current form because $f$ is not a function of a product, it's a single composite function. To solve for $F$, the integral of $f$, eq. \eqref{simple_integral}, we must change its form with a change of variable.
Change of Variable
Since we have a composite function in the integrand, we must use $u$-substitution.
The letter u chosen is reasonably thought of as terse for Unkown intermediate function.
$u$-substitution is the process of transforming the domain, and consequently the form of the integrand in the new domain's variable is different—and amenable to the by parts integration method described above.
This change of variable is called a
Point Transformation (as opposed to function transformations like the Fourier).
The tangent function is a valid substitution for $x$ because:
- their ranges are identical ($-\infty < x < \infty$),
- tangent is continuous and differentiable on the principal interval $(-\pi/2, \pi/2)$,
- it is one-to-one (invertible) on this interval.
This invertibility ensures a unique mapping, or bijection, between the variable $x$ and the angle $\theta$. If two functions map separate domains to the same range via continuous bijections, they are compatible substitutes.
Changing the variable has consequences for the integration beyond the composite function. Recall that the width of the infinitesimal slice of the area under the curve in Figure 3 (above) is some width in space $x$, but $dx$ transforms as its own function into the new slice transformation, $g(\theta)d\theta$, determined by the derivative of the point transformation, $dx/d\theta$.
- [1] Scriba, Christoph J. (1963). The inverse method of tangents: A dialogue between Leibniz and Newton. Archive for History of Exact Sciences 2 (2):113-137.