Riemann integral

If you are having difficulty understanding this article, you might wish to learn more about algebra, functions, and mathematical limits.

In a branch of mathematics known as real analysis, the Riemann integral is a simple way of viewing the integral of a function on an interval as the area under the curve.

Table of contents

1 Overview
2 Step functions
3 Lower and upper sums
4 Results about the Riemann integral

Overview

Figure 1: Illustration of Riemann integration as a converging sequence

The numbers that appear in the upper righthand corner of the animation above give the sums of the areas of the grey rectangles. As the number of rectangles increases this sum converges to the Riemann integral for the curve shown.

Figure 2

Let f(x) be a real-valued function of the interval [a,b], so that for all x, f(x)≥0 (f is non-negative.) Further let S=S_f:={(x,y)|0≤y≤f(x)} (see Figure 2) be the region of the plane under the function f(x) and above the interval [a,b]. We are interested in measuring the area of S if that is possible, and we denote this area by ∫_a^bf(x)dx. In case several variables appear in f, the dx will serve to specify the variable of integration. If the variable of integration and interval of integration are understood, the notation can be simplified to ∫f.

Once we have succeeded in evaluating the integral of f for certain f which are non-negative, we can extend the integral to functions which may take negative values by linearity. Some functions have no clear Riemann integral, but especially, the interactions of limits and the Riemann integral are difficult to study. An improvement is to use the Lebesgue integral which both succeeds at integrating a broader variety of functions, as well as better describing the interactions of limits and integrals.

Historically, Riemann designed this theory first and gave some evidence for the fundamental theorem of calculus. The theory of Lebesgue integration arrived much later, when the weaknesses of the Riemann integral were better understood.

The basic idea of the Riemann integral is to use very simple and unambiguous approximations for the area of S. We find an approximate area which we are certain is less than the area of S, and we find an approximate area which we are certain is more than the area of S. If these approximations can be made arbitrarily close to one another, then we can assign an area to S.

Because of the geometric nature of the Riemann integral, it allows us to formulate many problems of nature as a problem of integration. It also provides some hints for methods of numerical integration, for evaluating definite integrals on computers to an acceptable degree of precision. However, for exact calculations for given formulae, the Riemann integral does not suggest a suitable approach.

For certain functions, the theory of antiderivatives provides exact results for definite integrals. While the Riemann integration theory justifies taking limits and provides a geometric point of view, the antiderivative theory of integration gives tools for integrating certain formulae precisely.

The fact that the seemingly disparate theories of Riemann integration and antiderivatives are essentially talking about the same subject is contained in the fundamental theorem of calculus.

Step functions

Let E be any subset of [a,b]. Let X_E(x) be the function which is 1 if x is in E and 0 if x is not in E. X_E is called the indicating function of E, or the characteristic function of E.

These functions are our starting point, and we should agree that

∫z X_[c,d](x)dx=z(d-c)

for any interval [c,d] in [a,b] and any constant z≥0. Indeed, in this case, the area under the curve is a rectangle with base [c,d] and height z.

Likewise, some geometric experimentation with such functions suggests that if f₁, f₂, ..., f_n are n indicating functions of intervals, are scalars, then the area under

should be

A function of this form is called a linear combination of indicating functions, or more simply a step function. We note now that we've decided what the integral of step functions ought to be.

We will take a shortcut now by stating that the preceding formula will be used even if some (or all) of the coefficients a_j are negative.

A crucial difference between the Riemann integral and the Lebesgue integral is that the step functions of the Lebesgue integral are linear combinations of indicating functions of sets which are not necessarely intervals. Of course, work is then required to calculate the integral of this larger class of step functions. Also note that the Lebesgue integral does not use upper sums, and that non-negative functions are dealt with first, before extending to functions which may take negative values.

Lower and upper sums

From the geometry of the problem, we impose that if f(x)≤g(x) for all x in [a,b] then we really ought to have

∫f≤∫g

simply by seeing that the area S_f is a subset of S_g (at least in the case of non-negative functions, this is clear.) We call this requirement monotonicity.

Given the integral of step functions and the monotonicity requirement, we can get a first stab at integrating arbitrary non-negative functions. Let f(x) be a real-valued function of [a,b] and let l(x) be a step function such that l(x)≤f(x) for all x. Furthermore, let u(x) be a step function such that u(x)≥f(x) for all x. If we are to assign a value to ∫f consistent with the monotonicity requirement, then we need that

∫l≤∫f≤∫u

The integral ∫l is then called a lower sum for f and the integral ∫u is then called an upper sum for f. The preceding inequality must hold for all lower and upper sums of f, so we can deduce another inequality:

sup_l∫l≤∫f≤inf_u∫u

where sup_l∫l is the smallest upper bound for all lower sums, and inf_u∫u is the largest lower bound for all upper sums (see supremum and infimum.) The number sup_l∫l is sometimes called the lower sum; likewise, the number inf_u∫u is the upper sum.

If the supremum and infimum are equal, then there is only one choice left for ∫f. It may not happen that the supremum is larger than the infimum (this is by our construction, as the reader may check.) However, it may happen that the supremum is less, and not equal to, the infimum. For instance, the reader may check that, for the indicating function

X_Q

where Q is the set of rational numbers in [a,b], a<b, the lower sum is 0 and the upper sum is b-a>0.

The collection of functions whose lower sum and upper sum are equal and finite is the set of Riemann integrable functions. By contrast, functions that have differing upper and lower sums are said to be non-Riemann integrable. In the context of this article, we will say integrable or non-integrable with the understanding that we are speaking of Riemann integrability.

One also checks that a step function's integral is equal to is lower and upper sums.

Results about the Riemann integral

Lemma 1: Let [a,b] be an interval. The map I:f→∫f which maps f to its integral from a to b is a linear map. That is, for any integrable functions f and g, and any real number a, I(af+g)=aI(f)+I(g).

This can be shown from first principles, from the construction of the Riemann integral.

Theorem 2: Any real-valued continuous function of the interval [a,b] is integrable.

The proof relies on the fact that any continuous function of an interval is necessarely uniformly continuous.

Corollary 3: If f is continuous everywhere in [a,b] except perhaps for finitely many points of discontinuity, and f is bounded, then f is integrable.

The boundedness requirement can not be dropped.

Theorem 4: If f_k is a sequence of integrable functions over [a,b], and if f_k converge uniformly to a function f, then f is integrable, and the integrals ∫f_k converge to ∫f.

Corollary 5: Let C(a,b) be the Banach space of continuous functions over [a,b] with the uniform norm. Then I:f→∫f is continuous. Together with Lemma 1, this says that the integral is a continuous functional of C(a,b).

The hypotheses of theorem 4 (uniform convergence on a fixed, bounded interval) are very strong. A primary failing of the Riemann integral is the difficulty we face when attempting to relax these hypotheses. In fact, the numerical sequence ∫f_k will converge to the number ∫f a lot more often than is suggested by the theorem, but it is very difficult to prove so in this setting. The correct way of getting a stronger theorem is to use the Lebesgue integral.

Another problem with the Riemann integral is that it does not extend to unbounded intervals very succesfully. If we wish to integrate a function f from -∞ to +∞, we can naively calculate

lim_n→+∞ ∫_-nⁿf(x)dx

However, certain properties (such as translation invariance, the fact that the Riemann integral does not change if we translate the integrand f) are lost. In fact, Theorem 4 becomes false for such an integral, and it becomes very difficult to use limits in conjunctions with integrals. Such an integral is called an improper integral, for it is not deemed to be a Riemann integral, strictly speaking. Again, the Lebesgue integral alleviates these difficulties.