Harel G. (1998) Greek Versus Modern Mathematical Thought and the Role of Aristotelian Causality in the Mathematics of the Renaissance: Sources for Understanding Epistemological Obstacles in College Students' Conception of Proof

Plenary talk at the International Linear Algebra Society Conference. University of Wisconsin, Madison.
Now published as Harel, G. (1999). Students' understanding of proofs: a historical analysis and implications for the teaching of geometry and linear algebra, Linear Algebra and Its Applications, 302-303, 601-613.

© Guershon Harel

   

The process of observing and analyzing students' behaviors is interesting and complex but also unstable. It is unstable because it involves countless variables, many of which are uncontrollable. Despite this, what we learn from this process is useful, even essential, in designing and implementing mathematics curricula for both students and teachers. This paper is about students' behaviors in relation to justification and proof. Some of these behaviors are assumed to be due to narrow instruction in school; others seem to be unavoidable, in the sense that they are of a human developmental nature. Analyzed from a cognitive and historical perspectives of mathematical development, these students' understandings of proof can be classified into three categories:

Category 1 : In this category, students' understandings of proof (viewed in relation to those of their instructors) seem to parallel the Greek conception of mathematics (viewed in relation to that of modern days).

Category 2 : In this category, students' understandings of proof are reminiscent of the 16-17th Century conception of mathematics.

Category 3 : In this category, students' limited understandings of proof seem to be largely a result of narrow instruction in the elementary and secondary schools.

I will focus on the first two categories; the third category is addressed in Harel and Sowder (1998). The findings about students' understandings of proof, part of which is reported here, were obtained from a sequence of six teaching experiments with a total of 169 students (mathematics majors and engineering majors). The data were collected from classroom observations in the form of field notes and retrospective notes, clinical interviews, homeworks, and written tests and quizzes. Some of the data came from videotaped classroom sessions and sixty-to-ninety-minute clinical interviews with students (for more detailed accounts, see Harel and Sowder, 1998).
   While the data from these experiments confirm that students have difficulty with proof, the historical analysis I describe here only points to a possible parallelism between these difficulties and historical phenomena in the development of mathematics. My research did not intend to establish and has not established such a parallelism. The historical analysis has, however, provided me with insights as to the possible conceptual basis for students' mathematical behavior.

Category 1: Students' Understanding of Mathematics in Relation to Greek Versus Modern Mathematical Thought

The axiomatic conception of proof is that the student understands, at least in principle, that a mathematical justification must have started originally from undefined terms and axioms. There really are three different levels of the axiomatic conception, which historically were developed in three consecutive periods: Intuitive axiomatic conception, structural conception, and axiomatizing conception.

Intuitive Axiomatic.

The view of what constitutes an acceptable mathematical proof has had many turning points. Babylonian mathematics is considered proof-free, because it does not deal with general statements, deduction, or explanations; rather, it prescribes specific solutions to specific problems. The axiomatic method&emdash;that is, the notion of deductive proof from some accepted principles&emdash;was conceived by the Greeks. However, it is important to note that the Greeks had one single type of mental objects in mind, namely, objects that are idealizations of physical reality, such as a line, plane, triangle, etc. Accordingly, with the intuitive axiomatic conception, the student is able to handle only axioms that correspond to her/his intuition. For example, the statement "One and only one line goes through two points" is understood only in the context of personal geometric intuition. Here the objects, which are derived from an idealization of the physical reality, determine the set of axioms. Similarly, the statement, "For any a and b in F, a+b=b+a" is understood only in the context of experience with real numbers.
   In modern mathematics, on the other hand, the objects are determined by a set of axioms. This new view was a revolutionary way of thinking in the development of mathematics and may shed light on some of the difficulties I observed with students.

Structural.

To explain the second level, let me point to a critical distinction between Euclid's Elements and Hilbert's Grundlagen. While the Elements is restricted to a single interpretation -- namely that its content is a presumed description of human spatial realization -- the Grundlagen is open to different possible realizations, such as the Euclidean space, the surface of a half-sphere, ordered pairs and triples of real numbers, etc. -- including the interpretation that the axioms are meaningless formulas. In other words, the Grundlagen characterizes a STRUCTURE that fits different models. This obviously is not unique to geometry. In algebra, a group or a vector space is defined to be any system of objects satisfying certain axioms that specify the structure under consideration. Accordingly, the structural conception is the understanding that definitions and theorems represent situations from different realizations that share a common structure determined by a permanent set of axioms. In this conception, the axioms that define the structure are permanent, and one studies the structure itself, not just the axiom system. So, for example, one studies real analysis on the basis of the axioms of a complete ordered field, or one studies the theory of vector spaces on the basis of the vector space axioms, etc.

Axiomatizing.

Our data suggest that the structural conception is a cognitive prerequisite to the axiomatizing proof conception -- a conception by which a person is able to investigate the implications of varying a set of axioms, or to understand the idea of axiomatizing a certain field.
   One of the questions I addressed in these studies was this one: Do undergraduate mathematics majors possess the axiomatic conception at any level? For example, do students understand that axioms in geometry require no specific interpretation? In particular, can students consider their own intuitive space (i.e., the Euclidean space) as a specific system that may or may not satisfy the structure at hand?

A Sample of Results

Textbooks in axiomatic geometry often begin with finite geometries as a preparation for non-Euclidean geometries. As you see in Figure 1 and Table 1, a compilation of 31 students' reactions to the task in Figure 1, the idea that geometric properties are not supposed to evoke spatial imagery is virtually absent from students' conception. We should remember that this idea is a relatively new concept in mathematics; it was born at the turn of this century with the publication of Hilbert's Grundlagen. Poincare, in his review of the Grundlagen, saw a need to point to this seemingly self-evident feature of the Grundlagen. To our students this was a very difficult idea.

   
Problem
: Consider a model where the points are A, B, C, D, E, F, G and the lines are {A,B,F}, {A,C,E}, {A,D,G},{B,C,D}, {B,E,G}, {C,F,G}, {D,E,F}. Do the properties below hold?

1. Only one line goes through two points.
2. Given a line and a point not on it, there is a line containing the given point and is parallel to the given line.

Students' responses include statements like:

• "These are consecutive points."
• "The line continues in each direction."
• "It is a segment with three points."
• "A line with only 3 points? Impossible."
• "The line has to be all the points contained between those points."
• "There has to be an infinite number of points between the points."
   

Figure 1

N
%

Contextual (Imposition of extraneous notions, e.g.,
"betweeness", "distance", "collinearity"

17

55

Axiomatic

5

16

Undecided

9

29

Total  

31

100

Table 1

When students are unable to be detached from a specific context, whether it is the context of intuitive Euclidean space in geometry or the context of Rn in linear algebra, we call that conception, contextual. And so, with the contextual conception, general statements are interpreted (and proved) in terms of a specific context.
   Judging from the historical development of geometry, I conjecture that the contextual conception is developmentally inevitable, and it should have been developed in the secondary school. What is surprising is its robust influence on students in an advanced stage in their mathematical education. We must appreciate, however, the non-triviality of the structural conception.
   Here is another example from history that helps us appreciate this state of intellectual development. The modern notion of "number" was born when symbols representing no specific reality were treated as objects that can be operated upon by certain rules. These objects are defined not by what they represent but by an a priori set of rules. Not all mathematicians of the 17th century shared this new way of thinking; some raised serious doubts about its intelligibility and viability. How is it possible to reason about symbols without a concrete referent and especially without a geometrical referent, as in the case of imaginary numbers and negative numbers? How is it possible, asked Arnauld, a 17th century mathematician, to subtract a greater quantity from a smaller one, where the mental image of "quantity" is nothing else but a physical amount or a spatial capacity? Moreover, how is it possible to understand such a statement as (-1)/1=1/(-1), where the quantity 1 is larger than the quantity &endash;1, and therefore, the division of 1 by &endash;1 must be smaller than the division of &endash;1 by 1?
   Let me conclude this category with two items: Examples of students' responses that demonstrate the absence of the structural conception from our students' reasoning (Figure 2), and a quantitative comparison between the two conceptions in one of the teaching experiments (Table 2).

Q:  How do you describe the abstract algebra course you have taken recently?

S1: Well, … lot of [it] is just weird things.
    You end up learning things like 1+1 is not 2 anymore….

Q:  Explain the argument made by the statement: (-1)x = -x

S2:  There is no argument. It is obvious that negative one times x is negative x .

    

Figure 2

N
%

Contextual (General vector space statements are interpreted and proved in the context of Rn.)

33

66

Structural

10

20

Undecided

7

14

Total  

50

100

Table 2

 

Students' inability to deal with any geometric structure but the one corresponding to their spatial imageries is reminiscent of the Greek's view of mathematics. In Greek science, concepts are formed in continual dependence on, and interpreted from the point of view of, their "natural" foundations, and their scientific meaning is abstracted from "natural," pre-scientific experience. In modern science, on the other hand, what is intended by the concept is not an object of immediate insight, but an object whose scientific meaning can be determined only by its connection to other concepts, by the total edifice to which it belongs, and by its function within this edifice. To illustrate the phenomenon of how students are constrained by their physical imageries, consider the following example. When linear algebra instructors present to the students a problem such as "Given W is a subspace of Rn, find the projection of c onto W" along with the sketch in Figure 3, they do not mean the sketch to be literal but symbolic. It turns out that such a sketch is not conceived as a REPRESENTATION of the abstract setting, but as the ACTUAL OBJECT of inquiry.

Figure 3

 

Another illustration of students' appeal to their immediate physical reality rather than to the actual definition comes from their difficulties with the concepts of span, dependence, and independence. For example, the sketches in Figure 4, when presented to illustrate the definition,

span {x1a1+…+ xkak | x1, …xk are real numbers},

are not conceived as a special cases of the concept of span, but the ACTUAL OBJECTS of inquiry.

Figure 4

Category 2: Students' Understanding of Mathematics in Relation to the Role of Aristotelian Causality in the Mathematics of the Renaissance

We tend to associate misconception and missing conceptions only with mathematically weak students. But in fact, all students, the weak and the able, in their desire to understand and make sense of the mathematical concepts we intend to teach them, encounter difficulty, and demonstrate as a result behaviors that in many cases are difficult to explain. Figures 5 and 6 present examples of proofs to which certain students&emdash;always among those who seek to understand&emdash;respond in a manner that has perplexed me. The students seem to understand each step in the proof, but not the proof as a whole: They remain unconvinced that the proof indeed proves the theorem's assertion. What is the conceptual base for their responses? What is the question these students are really asking? While further research is needed to answer these questions, in what follows I will offer a conjecture.

Problem: Prove that any three vectors in R2 are linearly dependent.

Proof (given by the instructor)

Let u, v, w be three column vectors in R2, and consider the system AX=0, where A=[u v w]. The system AX=0 has at least one free variable; therefore, it has a non-zero solution, x=[x1 x2 x3]. Since x1u+x2v+x3w=0, and x1, x2, x3 are not all zero, one of the columns of A must be a linear combination of the others. Hence the columns of A are linearly dependent.

Some students respond by saying something to the effect, what if the system weren't homogeneous? Your answer is dependent on the fact that the system is homogeneous. If the system weren't homogeneous, you wouldn't be able to prove that the vectors are dependent.

Figure 5

Theorem:

Let T be a square matrix, and let v1, v2 , … vk be eigenvalues of T that correspond to eigenvalues r1, r2 , …, rk. If r1, r2, …, rk are all distinct, then v1, v2, … vk are linearly independent.

Proof:

Assume a1v1+ a2v2+ … + akvk=0. We should show that aI=0 for each i.
Let pi(x)=(x- r1) (x- r2)… (x- ri-1)(x- ri+1) … (x- rk)/(ri- r1)( ri- r2)… (ri-ri-1) (ri-ri+1) … (ri- rk) for .

By the Spectral Mapping Theorem,

0=pi(T)0
 = pi(T)(a1v1+ a2v2+ … + akvk)
 = a1pi(T)v1+ a2pi(T)v2 + … + akpi(T)vk
 = a1pi(r1)v1+ a2pi(r2)v2 + … + akpi(rk)vk
 = ai

Hence, ai=0 for each i.

Some students respond in a similar manner to that in Figure 5 by saying something to the effect what if you chose different polynomials? v1, v2 , … vk are independent because you chose these polynomials. Maybe if you have chosen polynomial other than these, you wouldn't be able to prove the theorem.

Figure 6

 

Before we suggest what might be hidden in these to-us-strange responses, let me turn again to history:

"We do not think we understand something until we have grasped the why of it. … To grasp the why of a thing is to grasp its primary cause." asserts Aristotle in Posterior Analytics.

Mathematics is not a perfect science, argued 16-17th century philosophers, because an "implication" is not just a logical consequence; it must also demonstrate the cause of the conclusion. Some mathematicians (e.g., Barozzi, 16-17th century) argued that some parts of mathematics are more scientific (causal) than others; but that proof by contradiction is not a causal proof, and therefore it should be eliminated from mathematics. Others (e.g., Barrow, 16-17th century) argued that all mathematics proofs are causal including proof by contradiction.

To illustrate the nature of this debate, consider the Euclid's Proposition I.32 and its proof:

The sum of the three interior angles of a triangle is equal to 1800.

Proof:

Construct CE parallel to AB (Figure 7). Then the alternate angles BAC and ACE are congruent and the corresponding angles ABC and ECD are congruent. Hence, m(ABC)+ m(BAC)+ m(ACB)= m(ECD)+ m(ACE)+ m(ACB)=1800.

 

Figure 7

 

What is the cause of the property that is proved here, asked these philosophers? The proof appeals to two facts about the auxiliary segment CE and the external angle ACD. But these facts, they argued, cannot be the true cause of the property. For the property holds whether or not the segment CE is produced and the angle ACD considered.
   The scientificness of mathematics was denied primarily based on the pervasive use of proofs by contradiction, which, in the eyes of many philosophers, did not qualify as causal proofs. When a statement "A implies B" is proved by showing how NOT B leads logically to an absurdity, we do not learn anything about the causality relationship between A and B. Nor, these philosophers continued to argue, do we gain any insight into how the result was obtained.
   Another interesting argument against the scientificness of mathematics was this one: If mathematical proof were scientific (i.e., causal), then proofs for "A if and only if B" statements entail that A is the cause of B and B is the cause of A. Hence, A is the cause of itself, which is absurdity, because nothing can be its own cause.
   Since the basis of proof by exhaustion is proof by contradiction, it too was unsatisfactory to many mathematicians of the sixteenth and seventeenth centuries. They felt that the ancients, who broadly used proof by exhaustion to avoid explicit use of infinity, failed to convey their methods of discovery.
   Were these issues of marginal concern to the mathematics of the sixteen and seventeen centuries, or had they significantly affected it? To what extent did the practice of mathematics in the sixteen and seventeen centuries reflects global epistemological positions that can be traced back to Aristotle's specifications for perfect science? These are important questions, if we are to draw a parallel between the individual's epistemology of mathematics and that of the community.
   Mancosu (1996) argues that the practice of Cavalieri, Guldin, Descartes, and Wallis, and other important mathematicians reflects a deep concern with these issues. He shows, for example, how two of the major works of the 1600s--the work by Cavalieri on indivisibles and that by Guldin, his rival, on centers of gravity--aimed at developing mathematics by means of direct proofs. These two mathematicians, argued Mancosu, explicitly avoided proofs by contradiction in order to conform to the Aristotelian position on what constitutes perfect science--a position Aristotle articulated in his Posterior Analytics.
   Descartes, whose work represents the most important event in seventeenth-century mathematics, was heavily influenced by these developments. Descartes appealed to a priori proofs against proofs by contradiction because they show how the result is obtained and why it holds, and they are causal and ostensive.
   Before we go back to the students' responses in Figures 5 and 6, let me present one more observation. A group of eight inservice teachers were presented with two proofs of Proposition I.32: the proof just presented and the following proof which was originally offered by a preservice secondary teacher (Amy) taking a course in college geometry (see Harel and Sowder, 1998):

Amy demonstrated to the whole class how she imagines the theorem, "The sum of the measures of the interior angles in a triangle is 1800." Amy said something to the effect that she imagines the two sides AB and AC of a triangle ABC being rotated in opposite directions through the vertices B and C, respectively, until their angles with the segment BC are 900 (Figure 8a, b). This action transforms the triangle ABC into the figure A'BC'', where A'B and A''C are perpendicular to the segment BC. To recreate the original triangle, the segments A'B and A''C are tilted toward each other until the points A' and A'' merge back into the point A (Figure 8c). Amy indicated that in doing so she "lost two pieces" from the 900 angles B and C (i.e., angles A'BA and A''CA) but at the same time "gained these pieces back" in creating the angle A. This can be better seen if we draw AO perpendicular to BC: angles A'BA and A''CA are congruent to angles BAO and OAC, respectively (Figure 8d).
 

Figure 8

 

All eight teachers preferred Amy's proof to the standard Euclid's proof, saying that it shows why the sum of the angles in a triangle is 1800. They indicated that through Amy's proof they could see how the construction of the triangle "made" the sum of the angles 1800. For these teachers, we suggest, Amy's proof was a causal proof--an enlightening proof that gives not just mere evidence for the truth of the theorem but the cause of the theorem's assertion.

The history of the development of the concept of proof, as we have briefly reviewed here, shows that our current understanding of proof was born out of an intellectual struggle during the Renaissance about the nature of proof--a struggle in which Aristotelian causality played a significant role. It is possible, we conjecture, that the development of students' conception of proof include some of the obstacles encountered by the mathematics community through history. I suggest that Aristotelian causality is one of these obstacles. Causality is more likely to be observed with able students, who seek to understand phenomena in depth, than with weak students, who usually are satisfied with whatever the teacher presents. It is possible, for example, that the students' responses in Figures 5 and 6 are a manifestation of the causality phenomenon. The students who responded to the proof in Figure 5 by saying "What if the system weren't homogeneous?" had interpreted the homogeneous system AX=0 to be the cause for the independency of the vectors u, v, and w, and so they desired to understand the exact causality relationship. Similarly, the students who responded to the proof in Figure 6 by saying "What if you took different polynomials" sought to understand the cause-effect relationship between the Lagrange polynomials, pi(x) (defined in Figure 6), and the theorem's assertion about the independency of the eigenvectors.

Implications

The observation regarding causality is still in its infancy. Systematic studies on the existence of this phenomenon with students must be conducted before any conclusions can be drawn. On the other hand, our data strongly support the observation regarding students' difficulties extending their proof schemes--from the contextual proof scheme to the structural proof scheme and axiomatizing proof scheme--thus, my conclusions will be based on this observation alone.

1. I have doubts about the wisdom of starting off college geometry courses with finite geometries. Our data suggest that a more promising approach would be to begin with axiomatic Euclidean geometry, which corresponds to and extends students' physical reality.

2. For high school geometry, we believe it is imperative to reinstate synthetic Euclidean geometry in high school mathematics curricula, whose place has recently been significantly eroded. Synthetic Euclidean geometry in 2 and 3 dimensions can serve as an intermediate stage to an advanced level of mathematical thinking, particularly, to the structural conception of proof. An obvious justification for this assertion is that Synthetic Euclidean geometry is a concrete context in which students can learn the concept of a deductive system--a concept college students largely lack. I will not discuss this justification here but focus instead on a less obvious, but critically important justification.
   We have seen earlier how students are unable to be detached from their spatial imageries in dealing with abstract algebraic structure--a phenomenon called contextual conception. One manifestation of this conception is that students do not understand the role of symbolic figures (geometric illustrations) in algebraic structures such as Rn. Synthetic Euclidean geometry is an excellent context to teach students to view symbolic figures as representations of "reality," not the "reality" itself. For example, in proving the statement in Figure 9a by proof-by-contradiction, one must first assume that lines x and y intersect (Figure 9b). This usually causes a discomfort to many students because the figure conflicts with their image of a straight line, and they begin to realize that the figure is a representation of a hypothetical spatial situation. Similarly, Figure 9c is a source of difficulties to many students because the properties the student is asked to prove are not directly seen in the figure but needs to be visualized. Again, here too the students begin to learn that the figure is just a representation of her or his reality, not the reality itself. 

 

(9a)  
(9b)
  

Given x=y, prove l||m

(9c)

NJ, NI, and NQ are not in one plain. Let NJ=NI, JQ=IQ, and NQ is perpendicular to JQ. Prove that at least one of the segments is perpendicular to one of the planes.

Figure 9

It is interesting to note that according to our data, the ability to deal with 3 dimensional Euclidean geometry correlates with the ability to think axiomatically (as in finite geometries).

3. My conclusions for the teaching of linear algebra might, I am afraid, be controversial. Instructors and textbook writers have good intention when they start with geometric motivations in introducing new concepts. We all intend to use geometry to provide students with geometric insight. But, it is my conclusion, these geometric illustrations do not achieve their purpose. On the contrary, they hinder students learning the true concepts of linear algebra.
   The sequence in which we present material to students and the way we introduce new concepts are critical learning factors. When geometry is introduced before the concept has been formed, the students view the geometry as the raw material to be studied, they remain, as a result, in the restricted world of geometric vectors, and do not move up to the general case.
   As we should be careful not to move students up hastily from Rn to more general vector spaces, we should be as careful how to introduce special cases of Rn, namely the geometry of directed segments. The student must stand on solid ground as to the world he or she is studying. In elementary linear algebra, there should be one world--Rn--at least during the early period of the course.
   Having said this, I want to emphasize and reemphasize that I am not advocating the elimination of geometry from linear algebra. On the contrary, geometry can be a very powerful tool in solidifying concepts the students have formed or begun to form. I am only raising questions about its usefulness in motivating new linear algebra concepts.
   There are at least three reasons that make geometric motivations very popular among textbooks and instructors of linear algebra.
   The first reason is that the instructor sees how the geometric situation is isomorphic to the algebraic one and so he or she believes that the geometric concept can be a corridor to the more abstract algebraic concept. The problem with this preconception is that the student does not share this important insight.
   The second reason is that the geometric concepts are relatively easy to understand, and so it is only natural to use them as startup ideas. For example, the concept of linear independence is commonly introduced via collinear directed segments. The problem with examples such as this is that they are very simple and easily understood by every student, whereby the students form an extremely powerful concept image that it is hard for many to relinquish.
   The third reason is that the instructor well understands that the geometric illustrations are only introductory ideas to the abstract concepts that are yet to come. He or she knows that the true labor is to abstract the concepts from the geometric ideas and has developed the patience to do so. Unfortunately, a regular student does not have this basic metamathematical understanding.

Bibliography

Harel G., Sowder L. (1998) Students' proof schemes. Research on Collegiate Mathematics Education, Vol. III, E. Dubinsky, A. Schoenfeld, & J. Kaput (Eds.), American Mathematical Society.
Mancosu P. (1996) Philosophy of mathematical practice in the 17th century. New York: Oxford University Press.