Part I explored the structure and strategy of neoclassical economics, that is – or so I argued – of equilibrium theory and its applications, while Part II addressed the problems of theory assessment in economics and showed how these can be clarified by the conclusions of Part I. The structure and strategy of economic theorizing to which economists are devoted rather than the views of theory assessment they accept are what is philosophically distinctive about economics. The problems that have been addressed are central to the practice of economists, to epistemological problems of interest to philosophers of social science, and to the attitudes of philosophers, economists, politicians, and others toward the discipline of economics and the conclusions it defends.
Although I have emphasized the vision of economics as a separate science as the key to its methodological peculiarities, this book has defended many detailed theses, and I shall summarize the most important of these in Chapter 16. Much of this book can be regarded as a defense of puzzling features of mainstream economics. For it maintains that the “obvious falsehoods” upon which economic theory depends can be regarded as qualified truths and can be justifiably accepted and employed in some contexts. Along the way, this book criticizes both general philosophical views that would condemn such theorizing, and specific accounts of economic methodology such as those defended by Hutchison, Samuelson, Machlup, and Friedman.
My remarks concerning the structure and strategy of economic theorizing have been more critical, though I found good reasons even for what I have taken to be mistakes. Given the distinction between models and theories in Chapter 6, one can appreciate the conceptual explorations that are so prominent in theoretical economics, free of misplaced concerns with empirical testing. The account of the global theory structure of microeconomics in Chapter 7, as a “separate science” in Mill’s sense, helps explain why equilibrium theory dominates the tool boxes of economists and partially justifies that dominance. But I argued that the commitment of many economists to equilibrium theory is exaggerated and seems, especially in the case of normative economics, to prevent economists from coming to terms with important problems. The insistence that a single unified theory span a separate economic realm can be justified only if it leads to theoretical success.
Before pulling together the threads of the argument in Chapter 16, I need to address remaining defenses of equilibrium theory as well as a challenge to my whole project. Chapter 15 is according devoted to these tasks and to an examination of specific methodological implications of my position.
I have argued that the insistence that economic phenomena be treated by a single unified theory – equilibrium theory in particular – has no general justification. Whether equilibrium theory is the best way to proceed is an empirical question, and there is little reason to reject other approaches because they cannot be integrated into a unified theory of an economic realm. But in reaching this, my principal critical conclusion, I have not yet addressed two powerful arguments that may appear to establish the permanent hegemony of equilibrium theory. Even if these arguments can be answered, this book has said little about the upshot of this philosophizing for the practice of economics. Finally, even if equilibrium theory has no methodologically privileged position, one might reasonably question whether an outsider can challenge the methodological practice of an established discipline such as economics. Is not such a challenge futile and arrogant? What possible authority do I have to preach to economists?
15.1 The Hegemony of Equilibrium Theory
Suppose a theorist were to offer a maverick explanation of some economic phenomenon
, which employed ad hoc behavioral generalizations that violate the strictures of Section 7.5. In assessing this explanation compared to an explanation that applies equilibrium theory, there are three possible cases: (1) there is a competing explanation of the phenomenon
in terms of equilibrium theory and permissible additions; (2) equilibrium theory does not appear to have any relevance to
at all; and (3)
is anomalous from the perspective of equilibrium theory, which apparently implies that the phenomenon ought not to occur.
If (1) the phenomenon were also explicable in terms of equilibrium theory and permissible additions, then one might argue that the standard microeconomic explanation is necessarily preferable to the deviant explanation unless the deviant explanation is much better supported by the data or is grounded in a competing research program that promises to match mainstream economics in scope and heuristic power. For the standard explanation employs a systematic theory and links
to an extensive range of other phenomena. Since unification is one goal of scientific explanation (Michael Friedman Reference Friedman1974; Kitcher Reference Kitcher1981), the microeconomic explanation is, ceteris paribus, the better explanation. Furthermore, as Max Weber (Reference Weber, Shils and Finch1949), Fritz Machlup (Reference Machlup1969), and many of the Austrian economists emphasize, one constraint, or, at the very least, desideratum, on the explanations of social phenomena is that they show such phenomena to be the consequences of intelligible or understandable human action (§A.9). Microeconomic explanations do this in a particularly powerful way, since they are extensions of the standard folk-psychological account of human action in terms of beliefs and desires (see §A.9). It is hard to see how a genuine alternative to equilibrium theory could possess these explanatory virtues to nearly the same degree that equilibrium theory does. The alternative might be better confirmed – in better accordance with the facts – and so economists might be driven from their allegiance to equilibrium theory. But, given the difficulties involved in testing and confirmation in economics, this is an unlikely prospect.
In case 2, where equilibrium theory is not relevant at all, one might question whether the phenomenon
truly belongs to the domain of economics. Since economics is defined by its causal factors, it could not be irrelevant to any genuinely “economic” phenomenon, and it is hard to see how equilibrium theory could fail to be relevant to any phenomenon in its standard domain. Again, this thought is strengthened by the tradition of Weber and the Austrians, who emphasize that the social phenomena of interest to us are those upon which human action and deliberation bear.
The most interesting case is the last. Suppose one has an explanation of some economic phenomenon,
, which employs a theory such as Tversky, Slovic, and Kahneman’s “compatibility hypothesis” (§14.5), where
would apparently be ruled out by equilibrium theory and standardly accepted auxiliary assumptions. Such phenomena constitute an argument for revising equilibrium theory or limiting its scope. One can always defend equilibrium theory by ad hoc attributions of particular tastes, information failures, and the like, but significant success by some alternative theory in explaining behavior that is largely inexplicable in terms of equilibrium theory presents a major challenge (like that posed – in the opinion of some economists – by Keynes’ theory). Yet most economists neither seek alternative theories nor believe that they can be found.
Russell and Thaler list three reasons why economists have been critical of anomalous experimental findings and are consequently unwilling to consider alternative theories that explain those findings:Footnote 1 (1) “[i]n the real world, people will learn,” (2) “[e]conomists are interested in aggregate behavior and these individual errors will wash out when we aggregate,” and (3) “[m]arkets will eliminate the errors” (1985, p. 1074; see also Gul and Pesandorfer 2008). The second response is a nonstarter, when, as in the case of preference reversal, the errors are systematic, and I have no more to say about it. The first and third responses, on the other hand, call for careful discussion.
15.1.1 People Will Learn
As discussed in Chapter 14, there is evidence, at least with respect to preference reversal, that people do not learn easily. But it is hard to believe that people will not learn at all, and economists have in fact found a way to get experimental subjects to avoid preference reversals. Chu and Chu (Reference Chu and Chu1990) ask experimental subjects to state their preferences with respect to a single pair of a
-bet and a $-bet (such as [($4, −$1), 35/36] and [($16, −$1.50), 11/36]) and to state prices for the two bets. Those who showed preference reversals were then “educated” by the following sequence of exchanges. They had to purchase the $-bet from the experimenter for the price stated, make the exchange of the $-bet for the
-bet, which they claimed to prefer, and then to sell the
-bet back to the experimenter for the price they had stated. After the round was complete, they were, of course, poorer by the difference between the amounts they said they would pay for the $-bet and the amount they said they would accept for the
-bet. Experimental subjects could then revise their preferences or their pricing and the game was repeated. Few continued to be reversers after two rounds, and after having been educated to stop reversing their preferences with respect to one pair, individuals also avoided reversing their preferences with respect to other pairs.Footnote 2 So people do indeed learn. In some environments, irrational behavior is inherently unstable.
Chu and Chu are careful not to leap to conclusions concerning how effectively actual markets educate traders, and the weakness in the general claim that learning makes irrational behavior unimportant is precisely at this point. For markets do not always underline our mistakes so clearly. (If they did, then subjects should have learned not to show preference reversals before the experiments began.) Chu and Chu’s results and the general argument that people will learn provide some hope that phenomena such as preference reversals will not be important factors in economic life. But such hope does not justify dismissing experimental findings of irrationality as obviously insignificant.Footnote 3
15.1.2 Arbitrage Arguments
The third ground Russell and Thaler (Reference Russell and Thaler1985) mention, “[m]arkets will eliminate the errors,” might sound like a restatement of the first, since markets can eliminate errors by facilitating learning. Russell and Thaler are, however, making a different argument, which has been influential in economics. Arguments of this formFootnote 4 purport to show that disequilibria are impossible, except temporarily or as unimportant curiosities. They also apparently provide good reason to dismiss most evidence gathered from surveys, experiments, or field observation as irrelevant.
Disequilibria are anomalous from the perspective of equilibrium theory. Suppose that some significant disequilibrium phenomenon
apparently obtains. For example, suppose that women workers with the same abilities as male workers are paid a lower wage.Footnote 5 This disequilibrium apparently implies a failure of maximization. Demand for the cheaper but not less able female labor should increase, and demand for more expensive male labor decrease until equal wages are paid to equally productive workers. Field studies suggest a variety of explanations for wage disparities between men and women in terms of entrenched expectations, prejudices, and customs.
The “arbitrage argument” rules out the phenomenon altogether. Regardless of what the data might appear to show, disequilibrium cannot persist.Footnote 6 Either there are constraints on knowledge, action, or preference that perpetuate the apparent disequilibrium and make it an equilibrium after all, or the data are misleading. Except in the case of a nearly universal “taste for discrimination,” or legally mandated wage differences, etc., the minority of firms that hire without regard to sex will seize profit opportunities presented by less expensive but equally able women workers. Those firms will earn higher profits. The other firms that continue to hire more expensive male workers will find themselves financially pressed, and ultimately the discriminating firms will either change their hiring practices or they will be driven out of business. The market ensures that equal wages are paid to equally qualified workers. All it takes is competition and a few firms who are only concerned with the bottom line.
So equilibrium theorists can cheerfully concede that people are driven by all sorts of motives and that people may be irrational in countless ways. But the market mechanism, coupled with the rationality of a few individuals and firms, implies that phenomena drastically inconsistent with equilibrium theory are either transitory or require nearly universal divergence between actual motivation and that postulated by equilibrium theory. Surveys, experiments, and field reports may provide material for interesting anecdotes, but competition and the pursuit of profits guarantee that equally productive women will not be paid less than men. Few economists would dismiss such arguments.Footnote 7 Most would say that, while competitive markets may not guarantee that discrimination will be eliminated, they forcefully move society in the right direction.
This form of argument, which I have elsewhere dubbed an “arbitrage argument” (Hausman Reference Hausman1989a), is used in other fields too. In an essay mainly focusing on biology, Elliott Sober calls such arguments “equilibrium explanations” (1983). They seem to be enormously powerful, because they require so little knowledge. To make an arbitrage argument, one need not pay attention to the actual causal mechanisms that supposedly eliminate any disequilibrium outcomes. All that is required is that a minority act on the perceived advantage and that the environment be competitive enough to permit them to thrive at the expense of others.
But, if one does not attend to the motivation, the institutional facts, and the actual mechanisms that supposedly ensure outcomes explicable by equilibrium theory, then one can easily go astray. It may not be the case in real circumstances that hiring equally able but less expensive women workers will lower costs, and even if hiring women does lower costs, it may not be the case that firms that hire women will thrive.Footnote 8 Only attention to the messy facts from which arbitrage arguments abstract will enable one to know whether these assumptions are true and whether the relevant equilibrium must obtain. One cannot concede that the world is a messy place with many factors influencing choices and their outcomes without recognizing that actions that appear to lower costs from the armchair may not actually do so. Similarly, one cannot concede that there are many relevant causal factors and still cling to the a priori conviction that the existence of competitive markets guarantees that the factors encompassed in equilibrium theory dominate all others. There is good evidence supporting the claim that women with apparently equivalent credentials are paid less than men (Bergmann Reference Bergmann1989), but there are complications. Goldin (Reference Goldin2014) argues that the differences between the incomes of men and women is due to the preferences of firms for employees who are willing to work longer and more flexible hours. Since women are less willing to work those hours, they are, from the perspective of these firms, less productive. One can then conclude that the disparities in wages reflect neither a disequilibrium nor discrimination against women. But even if employers are not sacrificing profits by refusing to hire less expensive and equally productive women, the structure of jobs themselves may discriminate against women in virtue of requiring such long and variable work schedules. Similarly, if male employees do not work well with women, employers who prefer male employees purely because it is profitable to do so are arguably discriminating by virtue of allowing the discriminatory preferences of male employees to govern the company’s hiring.Footnote 9
The arbitrage argument does not defend equilibrium theory from all empirical doubts, but it does pose interesting questions. Since there is a great deal to be said for equilibrium theory, and since the weaker presumptions that drive arbitrage arguments are plausible, phenomena that appear to be incompatible with equilibrium theory are puzzling and demand explanation. Sometimes apparent conflicts can be explained away, but sometimes one uncovers factors that significantly influence economic phenomena, from which equilibrium theory abstracts. These can always be treated as mere disturbances, but they may be susceptible to systematic theorizing. As argued in Chapters 13 and 14, there should be no methodological rule against economists studying these other factors.
Although there might possibly be an a priori case for the irreplaceability of a folk-psychological, belief-desire account of human action in terms of the aims and interests social scientists have in inquiring about social phenomena (§A.9), neither arbitrage nor learning constitute a blanket a priori defense of utility theory, and there is no a priori case to be made for the other constituents of equilibrium theory.Footnote 10 To what extent economics ought to be a unified separate science as opposed to a collection of specific theories of narrower scope cannot be decided by methodological reflection. It is an empirical question. Given the limited predictive power of equilibrium theory, there should be no presumption that alternative theories can be dismissed on general methodological grounds.
15.2 How to Do Economics
Suppose that the conclusions of the previous chapters concerning confirmation and theory appraisal in economics are correct: economists employ an uncontroversial method of theory assessment. Unfortunately, owing to poor data (relative to the state of economic knowledge), little can be learned about which theories are better confirmed. Given the initial credibility of the basic behavioral postulates of economics, it is rational to remain committed to them in the face of apparent disconfirmations. The consequence of such a defense is to leave economists unable to learn very much from typical economic data.
If this account is correct, economists desperately need better data in order to advance their theoretical knowledge, and in the first edition, I called for a serious redirection of effort among economists, which is in fact taking place (although not because I said it should). Such data gathering must not be divorced from specific objectives and problems, both practical and, especially, theoretical. Economists must be prepared to consider alternative kinds of theorizing, or else there is little point to such data gathering. Ways of getting better data include:
1. A major commitment of resources to take advantage of experimental opportunities. Experimental economics has blossomed over the last half-century. This is a positive development and should be encouraged.Footnote 11
2. A general willingness to make use of observational data of all kinds. Much can be done to refine the techniques of field reports and social experimentation.Footnote 12 Many economists have overcome the animus economists have traditionally felt toward the laborious process of gathering data and the precarious process of interpreting them.
3. More active engagement by economists in data gathering in order to appreciate better the limitations of particular data sets. Although econometricians have recognized and emphasized this point, the structure of rewards within the economics profession for too long made involvement with data gathering costly to individual researchers. That is now changing, with Nobel prizes going to experimenters, behavioral economists, and development economists whose main activities consist of field experiments.
4. Further work on improved statistical techniques for the analysis of data. Lacking expertise in econometrics, I have no clear conception of the limitations of current methods and am uncertain about how much can be hoped of further developments.
Of course, even improved data will not help theoretical knowledge in economics grow if economists are unwilling to entertain alternatives because of methodological commitments to a single style of theorizing. If unfamiliar explanations can command empirical support, they should be pursued.
I defend three changes in theoretical economics, which are already well under way:
1. More work should be devoted to exploring alternatives to elements of mainstream economics that have already been proposed, such as Machina’s proposal to surrender transitivity and the independence principle, Loomis’ and Sugden’s regret theory, McClennen’s notion of resolute choice, Simon’s theory of procedural, bounded rationality, and, most obviously, the many proposals of behavioral economists such as Kahneman and Tversky’s prospect theory.
2. More serious attention should be paid to the work of other social scientists. Whether economists have anything to learn from the results of any particular psychological or sociological research is an empirical question. The only legitimate reason to dismiss all work of other social scientists as of little interest would be if the separate science of economics were a smashing success. In the case of preference reversals, we can see that economists are paying serious attention to both the findings and theories of psychologists.
3. Quite different styles of theorizing, such as that exemplified by the institutionalists should be encouraged, and empirical work studying problems faced by particular firms or groups of employees should be taken seriously by economic theorists.
What do these generalities mean in detail? (It is here that I am most painfully aware of the limitations of my knowledge.) The best I can do is to consider specific examples, some of which exemplify these methodological recommendations and some of which do not, and to call attention to their methodological virtues and vices.
But, before doing so, two central qualifications are required:
Economics is a diverse enterprise, and there is no reason why it should become less diverse. Economists face many different tasks and questions. My focus is only on theoretical economics.
There is absolutely no reason why all economists should employ the same styles and strategies of theorizing. If my methodological recommendations were to find general favor among economists, work devoted to the further elaboration of a separate science of economics – that is, to the articulation and application of equilibrium theory – should still be done. There is no alternative that is so obviously superior that it would justify everyone abandoning mainstream economics. What is wrong with economic theorizing is not what economists are doing, but what they are not doing and what they refuse to do.
15.3 Cautionary and Encouraging Examples
15.3.1 Samuelson’s Overlapping-Generations Model Revisited
Recall Samuelson’s essay, “An Exact Consumption-Loan Model.” Samuelson’s stated purpose is to determine what effects the desire to save for retirement has on the rate of interest. He is also interested in the mathematical and conceptual problems raised by the infinity of time periods, but these interests were supposedly subordinated to answering his empirical question.
However, as we saw, Samuelson abandons his main question and focuses on conceptual and implicitly normative issues. There is nothing wrong with conceptual and normative inquiry, and Samuelson’s essay is a fertile and intriguing contribution, which raises fascinating questions about the influence today of expectations concerning tomorrow, about the notion of optimality in a dynamic context, and about the possible interplay between market and “social contract” in solving dynamic intergenerational problems. What bothers me is the view of what such models can accomplish that is implicit in Samuelson’s essay and, more dramatically, in Wallace’s account of fiat money. These models do not answer empirical questions and it is hard to see how they could do so. Samuelson’s and Wallace’s overoptimism and Wallace’s theoretical purism can be explained by attributing to them a vision of equilibrium theory as the core of a separate science of economics (see Hahn 1982, pp. 6–7).
In the discipline as a whole, it is probably a good thing that there are some theorists with dogmatic commitments. Without them, individuals would not be willing to explore such complicated models. There is a methodological failing here only when such theorizing gains a hegemony over the profession and attracts all the prestige and all the most talented students.
15.3.2 Preference Reversals
The phenomenon of preference reversals suggests that economists have come to recognize that there is something to be learned from experimentation and psychological theory. There are significant mainstream economists who do not subscribe to a dogmatic deductive method that prevents them from investigating anomalous data. Such dogmatism as there is results from economists’ commitment to economics as a separate science.
When economists first recognized the reality of preference reversals, which were successfully anticipated by a plausible psychological hypothesis, their reaction was to attempt to generalize or weaken parts of equilibrium theory rather than to utilize the theoretical contributions of psychologists. While economists were pondering implausible explanations in terms of violations of independence or the reduction postulate, psychologists were doing the theoretical and empirical work that seems to have revealed the source of the phenomenon in “procedure variance” and the “scale compatibility” hypothesis. I see no methodological mistake in attempts to reformulate equilibrium theory, as Machina (Reference Machina1987) proposes. There is a problem here only if most economists regard this as the only possible response.
There are several methodological lessons here. First, experiments can have a dramatic impact. What else plausibly could have led leading theorists to concede that individual choice could be so drastically inconsistent with utility theory?Footnote 13 Second, experimental results are (obviously) subject to interpretation. By themselves, even a long series of experiments only poses a problem. Third, economists, unlike astrologers, do not dismiss anomalous data.Footnote 14 They do not like such data, and they try to discredit them by analysis and experimentation, but they do not hide them either. Not only do some economists take the results seriously, but the major journals in the field now readily publish reports of experimental results and discussions of these results. One even finds reports of survey results, which Friedman sneered at.Footnote 15 The accusation that economics is not an empirical discipline (Rosenberg Reference Rosenberg1983) cannot be sustained. But commitment to equilibrium theory, and especially to the theoretical strategy and structure that it exemplifies, runs deep. This waning commitment has in the past closed economics to the consideration of relevant theories proposed by other disciplines and generated an unjustifiable, but fading, dogmatism.
15.3.3 Stretching Mainstream Economics
Although economists of some schools, such as the institutionalists, have been willing to make use of theories and empirical findings from other disciplines, orthodox mainstream economists have generally clung to their separate science. This claim might be regarded as merely definitional: to have behaved otherwise would automatically classify an economist as not mainstream. But it is possible to develop abstract mathematical models that incorporate both postulates of equilibrium theory and theoretical generalizations proposed by social theorists. For example, although prospect theory surrenders the hope of uniting normative claims about how individuals should choose with empirical claims about how they do choose, it permits mathematical modeling that is similar to mainstream modeling. Modifications of equilibrium theory such as these lie at the limits of economics as a separate science. They include many of the standard behavioral postulates of equilibrium theory, and, in particular, they portray individuals as maximizing a well-defined utility function, albeit in the case of prospect theory, this utility function is sensitive to context in a normatively indefensible way. In mathematical style and in willingness to abstract and to simplify, such models are orthodox. But, in attempting to model the findings of psychologists and other social scientists, they open a chink in the wall that keeps economics separate.
In a series of striking papers, George Akerlof, a Nobel laureate in economics, has demonstrated that it is possible to combine orthodox tools and modeling style with theories from other social disciplines. I focus on only one example among those that Akerlof’s work provides. In “Labor Contracts as Partial Gift Exchange” (1982), Akerlof begins by citing the results of an empirical study by the sociologist George Homans (Reference Homans1953; 1954), which showed that “cash posters” (clerks who recorded payments) at a utility company processed, on average, 17 percent more bills per hour than the company required. Yet, according to Homans, “[i]t [cash posting] was an exceedingly routine and repetitive clerical job, which could be done with little concentration by girls whose main interests were not in the job itself and who were not deeply concerned with promotion in the company.” Moreover, “[i]n view of the fact that it required no previous outside training, such as stenography, it paid well” (1954, p. 727).
If the marginal utility of effort for the cash posters is negative, then minimum satisfaction of the work rules would be utility maximizing. If, on the other hand, the marginal utility of effort is positive (albeit decreasing), then the company and the workers could both be better off if higher pay were offered for more output. The data suggest a failure of maximizing on the part of the workers or the company.
The explanation Akerlof defends draws on the work of anthropologists, sociologists, and philosophers (especially Mauss Reference Mauss1954 and Titmuss Reference Titmuss1971), and it is intuitively plausible (though by no means obviously correct). In working for a firm, employees develop attachments to one another and to the firm itself. These attachments lead employees to make a gift to the firm of extra work. Such gifts are, however, only provided if the employees feel that the firm is fairly reciprocating with gifts of its own in the form of lenient work rules that reduce the pressure on less able workers or better pay or benefits than some salient reference. Such gift exchanges can lead to wages above the market-clearing rate and hence to involuntary unemployment. The “gifts” the firm provides in exchange for more or better work from employees need not be in the form of higher wages. Indeed, Homans notes, “[o]nly one feature of the job was mentioned favorably by more than half (6) of the girls, and that was the general friendliness of the group and the “niceness” of the people in the division” (1954, p. 727).
Thus far, the explanation smacks of sociology and anthropology, which in the view of most economists in 1982 would be a damning indictment. Moreover, the data Akerlof cites in support of his claims are drawn from sociological studies including surveys (Stouffer, Suchman et al. 1949 and Stouffer, Lumsdaine et al. 1949). This work can nevertheless count as mainstream economics, because Akerlof goes on to sketch a mathematical model in which gift exchange is generated from the profit-maximizing choices of firms and the utility-maximizing choices of workers. The abstract model has three main components:
1. Quantitative effort norms. These are functions of wages, work rules requiring minimum effort, worker’s utilities, wages paid by other firms, the unemployment rate, and unemployment benefits. Firms know that effort norms depend on these factors.
2. The utility of employees. This depends on the effort norms, the employee’s individual effort, the wage, and individual tastes. Workers choose whether to be employed, which job to take, and what level of effort to exert on the job in order to maximize their utility, subject to the constraint that they satisfy a firm’s work rules. Their utility does not depend exclusively on the wages they receive.
3. The firm’s output. This depends on the number of employees and their level of effort. Given limited knowledge of workers’ individual tastes, firms decide on wages, work rules, and size of labor force in order to maximize net revenue.
The behavior of the cash posters is modeled in a more particularized model with a fixed uniform wage, specified parameter values, and the further assumption that greater differences in work rules for different kinds of workers has a negative effect on output norms. Given the existence of norms that depend on the factors mentioned and the dependence of individual effort on such norms, gift exchange can result from the choices of employees and firms.
In addition to constructing models of gift exchange that conform to the general outlines of Homans’ case study, Akerlof draws on theoretical work in social psychology, sociology, and anthropology (Etzioni Reference Etzioni1971; Festinger Reference Festinger1954; Maus 1954; Mayo Reference Mayo1949; Merton Reference Merton1957; Stouffer et al. Reference Stouffer, Suchman, de Vinney, Star and Williams1949; Titmuss Reference Titmuss1971), but the social theory in his essay is unspecific, and Akerlof’s claims about whether the cash posters engage in an exchange of gifts are not well confirmed. Compare his account of gift exchange to Avner Offer’s, which highlights the following features of a gift:
[A] voluntary transfer; an expectation of reciprocity; reciprocity is notionally open to discretion as to value and time; and is motivated by a desire for regard, over and above any gains from trade; regard is communicated by gift; personalized gift authenticates regard; gift is unpriced, often unpriceable; and gift establishes repetitive, self-enforcing bond, which facilitates trade.
Regard is an attitude of approbation. It needs to be communicated. The gift embodies that communication and carries the signal.
Although what Akerlof writes is compatible with Offer’s account, it is far less specific. Indeed, it may be that all that Akerlof means is that the cash posters provide more work than the company demands and that the company provides better wages and working conditions than the workers can find elsewhere. To be sure, he maintains that workers acquire “sentiments” for the firm and for their co-workers, which motivate their excess performance (1982, pp. 543–4), but he says little about what those sentiments might be and to whom (other than “the firm” and other workers) they are directed.
Akerlof seems mainly concerned to explain involuntary unemployment by the above-market-clearing wages paid by firms. In “primary markets” “the gift component of labor input and wages is sizable, and therefore wages are not market clearing” (1982, p. 544). But it seems that a variety of mechanisms could explain the excess compensation and performance. Indeed, Akerlof himself questions whether wages above the market rate significantly influence productivity.Footnote 16 What explains Homans’ findings is far from clear. The relatively high pay of the cash posters appears to be something of a fluke. “In the not-too-distant past, cash posting had been done at night by men and, therefore, had commanded relatively high wages, which were not changed when it was transferred to the day time and to women” (Homans Reference Homans1953, p. 6). The cash posters were paid the same wages as the more skilled ledger clerks, who resented the failure of the company to assign them a higher status. Conversely, one can speculate that the cash posters felt that they possessed a higher status than their unskilled work would otherwise have received.Footnote 17
What other mechanisms besides gift exchange could explain Homans’ observations? One possibility is that the outcome is a game-theoretic equilibrium among rational self-interested agents. In a repeated game, such as an iterated prisoner’s dilemma,Footnote 18 cooperation can be in the player’s rational self-interest. No (conditionally) altruistic motivation is required. By paying above-market wages (and retaining the capacity to fire workers who are not productive), an equilibrium is possible where employees are more productive in return for higher wages. In such an equilibrium, there need be no sentiment of kindness, gratitude, or obligation. It is just a matter of rational calculation on the part of employees and employers.
What actually motivates the cash posters and their employer matters. If the excess wages and productivity are entirely materially self-interested, then (other things being equal) firms would do well to carefully monitor the performance of their workers, while if the excess productivity is a gift, it might be withdrawn or diminished by the intrusive enforcement of work rules. Making known to the employees how significantly their excess performance enhances the firm’s output would further motivate gift exchange, while if anything encouraging self-interested employees to strike a harder bargain with management.
Instead of an exchange of gifts or an equilibrium among rationally self-interested individuals, perhaps the phenomenon is one in which the firm trusts the employees to behave well, and trustworthiness rather than kindness or gift-giving motivates their high productivity. Moreover, rational agents do not care only about money. An agent may produce more than the work rules require, not as a gift to the firm, but as a self-interested investment in greater job security and a pleasant work environment.
In truth, there are many possible explanations for apparently costly cooperative behavior – gift exchange, fairness, inequality aversion, trustworthiness, repeated-game equilibrium – and economists have explored these possibilities both with mathematical modeling (such as Rabin Reference Rabin1993) and by controlled experiments both in the laboratory and in the field. For example, there is now a large literature exploring variations on the “ultimatum game” (Güth et al. Reference Güth, Schmittberger and Schwarze1982). In ultimate games, one experimental subject
(the proposer) offers the division of a prize (usually some amount of money) between
and some other anonymous individual
(the responder), who accepts or rejects
offer. If
accepts, then the prize is divided as proposed. If
rejects the division, then both players get nothing. On the assumption that any amount of money is preferred to none, responders will accept any division. Proposers should thus offer very little, which responders accept.
Unsurprisingly, this is not what is observed. Proposers often offer an even division, and rarely offer less than a third to the responder. Very uneven offers are typically rejected. One cannot understand this behavior in terms of the exchange of monetary gifts, because an uneven offer is still a gift, and it costs the responder nothing to accept it. One might instead suggest that what is at issue is an aversion to inequality (Fehr and Schmidt Reference Fehr and Schmidt1999), which leads proposers to offer even divisions and leads responders to reject uneven divisions. But in dictator games, where the recipient of a proposed division has no choice but to accept the division, proposers offer much less (Bolton et al. Reference Bolton, Katok and Zwick1998; Camerer and Thaler Reference Camerer and Thaler1995; Guala and Mittoni 2010). The purported aversion to inequality apparently evaporates. Moreover, when the division is known to be set by a computer program and is no longer
choice, responders are more willing to accept unequal divisions (Blount Reference Blount1995). In a three-person variant in which
offers a three-way split between herself,
, and a third recipient,
, who has no choices to make,
tend to offer a fairly even division between themselves and
with little or nothing for
, which
accept. Fairness and inequality aversion seem to be significant considerations only when one is getting the short end of the stick.Footnote 19
A plausible diagnosis of behavior in the ultimatum game invokes social norms, which determine what individuals regard as fair and respectful behavior toward themselves. Hence, one sees differences in the offers and responses on the part of individuals in different cultures.Footnote 20 The contrast between offers in the ultimatum and dictator games suggests that proposers make generous offers not so much because they feel directly compelled to do so by the norm (which plausibly calls for the same offers in the ultimatum and dictator games), but because they expect (correctly) that very unequal offers will be perceived by the responder as an insulting violation of a norm, which will be punished.
Several studies have attempted to probe Akerlof’s hypothesis that gift exchange rather than some other mechanism explains excess productivity coupled with above-market wages. Fehr et al. (Reference Fehr, Kirchsteiger and Riedl1998) report a laboratory experiment in which buyers (who are analogous to employers) offer a price to sellers for some good, and sellers (like workers) then have a choice over the quality of the good they provide (i.e., their effort level). Buyers wind up offering prices that are far above the reservation prices of sellers, who in turn supply goods of higher quality, which are more costly for sellers to provide:
The fact that
does not converge towards
[the competitive equilibrium price] in this design can be interpreted in several ways: (i) It may be due to buyers’ altruism or buyers’ attempts to obey some equity norm. (ii) It may be caused by sellers’ willingness to reject prices that are close to
. If buyers anticipate sellers’ willingness to reject low offers it is in their interest to offer prices that are suficiently above
. (iii) Prices above
may also be caused by the apparent willingness of many sellers to choose a high
[quality] in response to a high
. If there is a sufficiently steep positive relation between
and
it is in the pecuniary interest of buyers to offer high prices.
Fehr et al. find evidence for the third interpretation, which to some extent supports Akerlof’s hypothesis. But to make the experiments relatively simple, they abstract from important features of the cash posters’ case. The experiment is set up to avoid reputational effects. Sellers establish no continuing bonds to one another or to the buyers. Unlike employers, whose choices have all sorts of effects on employees, all that the buyers have to offer to the sellers are the prices buyers bid for the good.
Gneezy and List report on field experiments designed to test Akerlof’s gift exchange hypothesis. Subjects were recruited for a one-time job (2006, p. 1367) either computerizing the holdings of a library or doing door-to-door fundraising. After having been trained for the task, some of the subjects were informed that they would be paid at a higher wage rate than had previously been offered to them. This higher pay is the gift they receive. What Gneezy and List found was that the gift led to appreciably higher output for the first few hours of employment, but the boost to productivity then wore off and the employer would have done better not to have offered the gift (2006, p. 1365). They conclude that whether findings such as those reported in Fehr et al. (Reference Fehr, Kirchsteiger and Riedl1998) “have implications for real labor markets [is] an open empirical issue” (2006, p. 1381). “[G]reat care should be taken before making inference [sic] from laboratory experiments, which might be deemed as hot decision making, to field environments, which typically revolve around cold decision” (Gneezy and List Reference Gneezy and List2006, p. 1379).
On the other hand, Falk (Reference Falk2007) reports on a field experiment whereby the inclusion of gifts in letters seeking contributions for the benefit of street children in Bangladesh had a very large effect on both the number and total value of contributions. However, both the papers by Falk and by Gneezy and List address cooperative circumstances that are very different from those that Akerlof is concerned with. Neither is concerned with continuing relationships, which may facilitate the reciprocation that Akerlof postulates, and the gifts in Falk’s study, which were reproductions of children’s drawings, clearly enhanced the salience of the fundraising appeal, whether or not they invoked norms of gift-giving.
Netzer and Schmultzer (2014) show that gift exchange is not possible between a self-interested employer and workers whose actions are influenced by a certain conception of fairness. If, following Rabin (Reference Rabin1993), one models a concern for fairness as a disposition to reward kind behavior with kindness and unkind behavior with an unkind response, then, if it is in the employer’s interest to pay above-market wages in response to the worker’s kind provision of excess output, then in pursuing his or her own interest, the employer is ipso facto failing to respond with kindness to kindness. Fairness calls for punishment rather than reward from the workers. “Profitable gift-exchange should cease to exist as soon as workers are fully aware that the firm’s ultimate goal is profit maximization” (Netzer and Schmultzer 2014, p. 1605). Netzer and Schmultzer point to evidence of this effect in other studies:
For instance, in his interpretation of experimental results, Charness (Reference Charness2004, p. 679) conjectures that employees might no longer perceive high wages as kind once they realize that paying these wages is in the employer’s own interest. Fehr, Goette, and Zehnder (Reference Fehr, Goette and Zehnder2009) emphasize the importance of explaining the fairness aspect of wage variations to the workers, and Bellemare and Shearer (Reference Bellemare and Shearer2011) argue that gifts should not be “clearly in the short-term interests of the firm”.
If the sentiments that Akerlof has in mind should be modeled as a return of kindness to kindness, then gift exchange is impossible for firms known by employees to be motivated by pursuit of their expected returns. However, this result does not refute Akerlof, because he is not committed to this way of modeling the sentiments of workers, and, moreover, he may suppose that firms have other objectives than maximizing profits. This argument illustrates the Pandora’s box Akerlof opened when he sought to expand the motivational repertoire of labor economics.
In the most recent examination of gift exchange of which I am aware, DellaVigna et al. (Reference DellaVigna, List, Malmendier and Rao2022) report on field experiments that attempt to determine whether workers have social preferences that make an appreciable difference in their output and whether those preferences are directed toward benefiting their employers:
As in prior gift-exchange field experiments, workers are hired for a one-time task, to shut down repeated-game incentives and thus isolate social preferences. They are then exposed to different employer actions, such as surprise pay raises (Gneezy and List Reference Gneezy and List2006), pay cuts (Kube, Maréchal, and Puppe Reference Kube, André Maréchal and Puppe2013), or in-kind gifts (Kube, Maréchal, and Puppe Reference Kube, André Maréchal and Puppe2012). We also create variation in the return to the employer from workers’ effort as in Englmaier and Leider (Reference Englmaier and Leider2020). The differences in worker effort across the treatments provide evidence on workers’ baseline levels of social preferences as well as reciprocity to the employer’s generosity.
They claim to find evidence of a “warm glow” – that workers derive “utility from doing their part by exerting effort for their employer, regardless of how the effort translates into payoffs for the employer” (2022, p. 1044) – as opposed to an altruistic preference to benefit the employer. There is some evidence of gift exchange, but its effect on worker performance, especially productivity, is small. Since the set-up of the experiment precludes the formation of any of those sentiments that arise from sustained interaction, the comparative impotence of gift exchange does not refute Akerlof’s conclusions.
This lengthy case study illustrates the incursion into economics of concepts, distinctions, and hypotheses from the other social sciences. Since understanding the factors that influence the conditions and productivity of employment is of obvious importance within economics, this literature’s challenges to a simple supply and demand model of wage and output determination are of obvious importance. At the same time, this empirical turn in labor economics (only one aspect of which I have surveyed) raises at least as many questions as it answers. Plenty of opportunities remain for formal modeling, but with a loosening of the constraints on the factors that may appear in those models, confusing and even contradictory results are unsurprising.
It might appear from this discussion that all is well with the vision of economics as a separate science. What further demonstration could one ask of the flexibility of equilibrium theory than the fact that these empirical generalizations from psychology and anthropology can be incorporated into it? But to someone committed to a separate science of economics, the generalizations that appear in the models inspired by Akerlof’s work are suspect, because they are not generated by what Lakatos calls “the positive heuristic” of the mainstream research program. One finds a fusion between rational choice modeling and social-psychological generalizations concerning gifts, kindness, anger, and norms that do not satisfy the constraints listed in Section 7.5. The additional generalizations concerning the preferences and beliefs surveyed earlier have a comparatively narrow scope. They undermine the dominance of acquisitiveness as the sole motive of consumers and workers and have no explanation in terms of equilibrium theory itself. Just as economists have objected to adaptive expectations on the grounds that they were not derived from equilibrium theory and make exploitable fools of individuals, so might the defender of the separate science of economics object to the generalizations Akerlof and others writing on these topics borrow from other social sciences. Akerlof makes this point with an elegant analogy:
[E]conomic theorists, like French chefs in regard to food, have developed stylized models whose ingredients are limited by some unwritten rules. Just as traditional French cooking does not use seaweed or raw fish, so neoclassical models do not make assumptions derived from psychology, anthropology, or sociology.
I would add that, if French chefs resembled mainstream economists, French cuisine would be more monotonous, for the chefs would use very few ingredients. They would also strenuously insist that food containing any other ingredients was not French. In stretching and opening microeconomics, Akerlof is transcending the vision of it as a separate science.
In Akerlof’s combinations of microeconomics and other theories, microeconomics plays two different roles. In its mathematical treatment of utility theory, economics provides a means to render determinant the implications of the empirical “forces” captured in the generalizations borrowed from other social sciences. But, second, it also supplies one “force” of its own: acquisitiveness. Fusing the theory of gifts or an account of resentment of unfairness with a mathematical treatment of utility theory, acquisitiveness, and profit maximization consequently has implications that the psychological or anthropological theories by themselves do not have. Akerlof shows how hypotheses formulated by other social theorists can sometimes be neatly presented within a formalism familiar to economists, but a single set of behavioral generalizations is no longer doing the work for all economic problems, and the vision of economics as a separate science has been abandoned.
It is possible to regard the behavioral generalizations offered by psychologists as simply further facts to be incorporated into microeconomics in the same way that technological knowledge or institutional constraints are incorporated. But to do so would be to countenance a radical change in strategy, because the models economists employ would then depend on substantive social and psychological theories, and economics would be a separate science in style only. If the explaining is being done by ad hoc psychological laws, and the economic framework is imposing a merely stylistic unity, then one has largely given up the aspiration of being able to separate off an economic realm, subject only to its own laws. Akerlof’s marvelous ability to incorporate the insights of psychologists and sociologists into mathematical models is subversive of the style of theorizing to which it apparently conforms.
Although models such as those discussed in this section are concerned with empirical questions of positive economic theory, they are normatively loaded; and resistance to modifying microeconomics may have political and ideological as well as methodological sources. If workers in risky industries systematically underestimate the risks involved in their employment, the bargains they make with their employers cannot be counted on to protect their interests (Akerlof and Dickens Reference Akerlof and Dickens1982). If individuals irrationally refuse to think about their retirement, then there is a stronger case for mandatory social security or nudging.Footnote 21 If output depends on whether workers believe that they have been treated fairly, management must think about fairness in addition to marginal product. If economic behavior is significantly influenced by distinctive psychological and sociological traits in addition to rational acquisitiveness, or if there is any systematic irrationality, then the identification of well-being as the satisfaction of preferences is cast into doubt and the argument for competitive markets from minimal benevolence (§4.4) no longer goes through. The case for government intervention in economic life is consequently strengthened.Footnote 22 However separable positive and normative questions – questions of is and ought – may be in principle, they are here, as in Samuelson’s overlapping-generations model and in economics generally, constantly intertwined in fact.
15.4 What Is to Be Done?
If this book’s general story is correct, what can be done to improve economics? One thing is to get economists to recognize how much the discipline has changed from the portrait textbooks provide. Given the structure of academic disciplines, including economics, fundamental change is slow and rarely the result of philosophical argument. Indeed, sensible economists will place little credence in the arguments of outsiders such as philosophers. Moreover those, if any, who are persuaded drastically to change their research profiles may find themselves exiled.Footnote 23
One should not suppose that the incentives within the discipline will automatically favor the optimal mix of methodological commitments. The “meta-level” arbitrage argument that economists who employ a better methodology will convert or bankrupt those who employ a worse methodology is even weaker than the arbitrage argument concerning the behavior of economic agents. The competitive structure of scientific disciplines (see Hull Reference Hull1988) is quite different from the structure of markets, and in economics, there is the additional complication that financial interests may bias the work of economists. Although these distortions may be weak among academic economists, economists who are employed by firms, unions, think tanks, and political parties often need to adjust their conclusions to the interests of their employers. The obstacles in the path of empirical success in economics are so profound and the costs of empirical research are so high that measurable standards of excellence in economics, such as mathematical prowess, can persist regardless of whether they contribute to or hinder progress in the discipline.Footnote 24
I am not competent to map out further reforms that economists ought to undertake. The most important step is to comprehend where the discipline has been and how it is changing. Many detailed changes still need to be made in the education of economists,Footnote 25 in the incentives in the profession, and in the tenuous relationships between economists and other social theorists. The discipline has been in need of the major overhaul that it is currently undergoing.
15.5 Epistemology, Methodology, and the Practice of Economics
Having climbed briefly and precariously on to my soap box, I am pleased to step down and address the philosophical question of whether I should have been up there. One might question the possibility or point of “external” criticisms: can any outsider grasp the constraints governing the different tasks economists undertake well enough to be able to offer sensible advice? Regardless of how well supported such advice was, could it possibly have any effect?Footnote 26 Even religious moralists with the authority of divine writ have had a hard time changing people’s behavior. How then could a philosopher, armed with little more than modus pones, change practices that are devised by smart, dedicated, and well-educated economists? One’s reaction at this point might shift from cynicism to annoyance. What impertinence! How dare Hausman pretend to legislate for economists?
These rhetorical questions, which can be asked about all normative enterprises, suggest healthy skepticism but nothing more. Consider the analogy between philosophers of science studying science and economists studying business. Can economists know enough about business to offer advice? Can their advice possibly have any effect? Is it not arrogant to suppose that economists could know more about how to run a business than individuals who have devoted their lives to some firm and whose livelihood depends on its success? Such questions counsel caution, but they do not justify repudiating economic assessment and advice.Footnote 27 The analogous doubts about normative methodology do not automatically discredit the recommendations offered by a philosopher.
With regard to the complaint that methodology is futile, remember that the influence of this book depends on you as well as me. There is no doubt that normative theorizing, like all theorizing, generally has little immediate influence on human practice. But this realization provides no better reason to dismiss methodological advice than to dismiss moral precept. No doubt, economists are and should be reluctant to change their practices, and the force of argument is limited. But arguments can persuade, and good arguments can persuade rationally. If a case is clear, cogent, and accessible, let us suppose that it can have some effect too.
More interesting than these general complaints against all methodological criticisms are arguments directed particularly against accepting the advice of philosophers (hiss!). Consider the following grounds for hesitation:
1. The philosophical theories that supported previous methodological advice have, one after another, collapsed with their own internal problems.
2. Philosophical claims to special authority in methodological matters have been progressively undermined by the development of philosophical thought itself.
3. The most influential philosophical recommendations of the twentieth century – that supplied by the logical positivists (at least in popularized versions such as Ayer’s Language Truth and Logic (1936; see §A.1) and that offered by Popper and Lakatos – are, in my view, in large part bad advice. Why should this book be any better?
These objections to philosophy of economics call for some comments on the nature of my enterprise. In doing so I also react to provocative work by Deirdre McCloskey, who has pressed such objections with particular force. In her book, The Rhetoric of Economics (1985a), McCloskey repudiates the whole enterprise of methodology.Footnote 28
McCloskey and I agree that the distinguishing feature of economic methodology is its concern with the relationship between the practices and products of economics and general cognitive ends such as truth and predictive reliability. Methodology is concerned with whether the claims of economics are predictively reliable or true and how one can judge whether they are reliable or true; and it is concerned with whether the practices of economists lead to conclusions that one ought to rely on or to believe. This is not to say that methodology is exclusively concerned with questions of theory assessment, which has obviously not been the only issue in this book. But much of methodology is normative, and the standards employed are in part “external” general standards, such as those stating when one is justified in relying on claims for practical purposes and when one is justified in regarding claims as true or close to the truth.
When McCloskey argues for an end to methodology, she is arguing against any connection between specific evaluative standards and the concerns of epistemology. Why? Obviously, there are evaluative standards; norms are unavoidable features of every human enterprise. But one might question whether any norms are better than any others in leading scientists to acquire predictively valuable or true conclusions. Paul Feyerabend comes close to rejecting all norms (1975), but this extreme view is not McCloskey’s.
A slightly more moderate view is that no norms can be shown to be better than others in achieving predictively valuable or true conclusions. One might argue for this claim by maintaining that “good” arguments in economics are simply those that accord with whatever standards happen (for whatever reason) to be prevalent among economists. Many have read McCloskey as defending this radically skeptical conclusion. I do not think this interpretation is correct, because rhetoric – her replacement for methodology – is supposed to be concerned with good arguments, that is, arguments that not only happen to persuade but which ought to persuade (1985a, p. 29). Furthermore, McCloskey criticizes the conflation between economic and merely statistical significance, which is common in leading economics journals and thus presumably persuasive to many economists (1985a, chapter 9).
If McCloskey agrees that there are grounds to believe that some norms are better than others, why does she deny that there are specifically epistemological grounds for distinguishing which are better and which are worse? Although some of McCloskey’s reasons apply to all methodological inquiry, many are directed only against a particular kind of methodology, which one might call “a priori” or “conceptualist.” Positivists or, more generally (in McCloskey’s terminology), “modernists” are often taken to believe that methodological standards are determined by the analysis of concepts.
Regardless of whether McCloskey correctly interprets traditional conceptualist methodology or justly criticizes it, her critique of specifically “modernist” methodology is not an argument against all methodology. In my view, philosophical theses about how knowledge ought to be acquired and structured, such as those defended in this book, are justified just as other theses are. One asks, “how well do they enable my body of knowledge, including my perceptual beliefs, to hold together?” Epistemology has been, in W. V. O. Quine’s terminology, “naturalized” (1969). Like physics or anthropology, it aims to improve our beliefs. Only the narrowness of its questions, which is dictated by its normative role, distinguishes epistemology from other empirical studies of the acquisition and revision of human beliefs.
The methodological inquiry in this book embodies this vision of philosophy. People acquire knowledge. To find out how, one must study what they do, without presuming that there is only one good way to learn. To find out how people have learned, and to find out which methods have been successful in which circumstances, one must study what has been done and how well it has worked. Insofar as McCloskey is only insisting that those interested in economic methodology must study how economists argue, I fully agree.
But McCloskey wants to draw more radical conclusions.Footnote 29 She wants to repudiate all methodology, not merely a priori methodology. Apart from her critique of a priori methodology, she seems to have three main reasons for denying that economists can be held to “external” cognitive standards justified by epistemological considerations.
First, McCloskey asserts, “[n]othing is gained from clinging to the Scientific Method, or to any methodology except honesty, clarity, and tolerance” (1983, p. 482). Her point seems to be that external standards are vacuous. Truly informative and substantive standards are context dependent. They will be determined by features internal to economics, not by general epistemological considerations. The failed efforts of philosophers to provide contentful context-free accounts of notions such as confirmation or scientific explanation suggest that substantive norms will be context-specific (Miller Reference Miller1987). But the conclusion that no significant transdisciplinary claims can be made and that there can be no role for epistemology depends on the false assumption that epistemological claims can make no reference to context. Just how methodological rules should depend on features of the context of inquiry is itself an important epistemological question.
Second, McCloskey argues that “[i]t would be arrogant to suppose that one knew better than thousands of intelligent and honest economic scholars what the proper form of argument was” (1985a, p. 139). But epistemology seems arrogant only if one falsely assumes on the one hand that these “intelligent and honest economic scholars” agree on an internally consistent methodology that coheres reasonably with the rest of their beliefs and on the other hand that the methodologist drops from a philosophical cloud. As we have seen, there is plenty of methodological controversy among economists, and philosophers can learn some economics before issuing edicts. Since there need be no arrogance when economic scholars themselves invoke epistemological concerns, it is not automatically arrogant of outsiders to do so.
Finally, McCloskey argues that the methodological standards defended by the epistemologist depend on a chimerical notion of Truth (with a capital “T”) (1985a, pp. 46–7; 1985b, pp. 136–7; 1988a, pp. 255–6). This claim is confused. Truth is an objective, not an evidential criterion. McCloskey argues that the standard of good argument is whatever persuades the majority of competent economists. There is no other success to be obtained and no other objective to aim for.
This argument rests on a failure to distinguish among different issues. First, it is not true that all invocations of epistemology are bound up with the notion of truth, for not all epistemologies are realist (§A.2).Footnote 30 The instrumentalist wants only predictively useful hypotheses. Second, although both instrumentalist and realist economists aim to persuade other economists, they seek to persuade not as an ultimate goal, but because they take success in persuasion to be a fallible result and indication of having made a good argument. Working among a set of corrupt and depraved colleagues, one might be disconcerted by persuasive success.
Third, the fact that truth and future reliability are not grounds for accepting conclusions is entirely consistent with the aspiration of making true or reliable claims. The truth of
is not an argument in favor of believing
. One might persuade someone that tariffs are harmful by shouting with an air of great certainty: “It’s true! It’s true!” But shouts are not arguments. One might as well simply recite: “Tariffs decrease economic welfare. Tariffs decrease economic welfare. Tariffs decrease economic welfare.” Truth is what one seeks, not one’s evidence. Similarly, the greater future predictive success of
as compared to
is no evidence or argument now in favor of
. Future predictive success, like truth, is a goal, not evidence. This platitude gives one no reason to be suspicious of the notions of truth or predictive reliability.Footnote 31
In addition, although this is not one of McCloskey’s arguments, one might argue that the proposal to find out how to do science by studying scientifically how people do science is multiply paradoxical. If one does not already know what science is, one will not know which practices to study, and if one does not already know how to do science, one will not know how to study those practices. Either one cannot start at all, or one must begin by assuming that one already knows the answers one is looking for. But, if one must beg all the significant questions, what point can the exercise have? (See Hausman 1980 or 1981a, postscript.)
The trick is to beg the questions in the right way. There is nothing wrong with beginning with the presumption that one knows how to find out what norms govern institutions such as theoretical economics, provided that one’s initial presuppositions are subject to correction in the course of inquiry (Friedman Reference Friedman1979). I began this methodological study of economics with many unavoidable presuppositions, which I have tried to scrutinize, piecemeal. Philosophy of economics, as I have attempted to practice it, thus resembles history, sociology, literary criticism, and economics itself as much as it resembles conceptual analysis.
I see no grounds to conclude that this sort of empirical methodological inquiry is misconceived. There is no good general philosophical case against the possibility of investigating empirically how people learn, what sorts of methods work best for which sorts of problems, or how one can best insure against the sorts of mistakes people are prone to make. Such questions are not purely psychological, although psychological evidence may bear on them. Investigations into the history and current state of science are also relevant to their answers. In any event, they are real questions. They appear to be answerable, and their answers may be of normative importance.
Moreover, appraisals of economics that draw on epistemological theses are inevitable. For the very terms in which one describes the practices of economists – theorizing, testing, deducing, modeling, sampling, and so forth – carry philosophical baggage. McCloskey unavoidably begins with such notions, and her object – the discourse of economists – is already penetrated through and through with philosophical influences. Could these philosophical influences on the methodological inquirer or on the object of inquiry ever be transcended? I doubt it, and, even if they could, they will be with us for a long while yet.Footnote 32
Furthermore, at least one kind of normative epistemological theory, the Bayesian view of confirmation, is itself an application of central claims of economic theory. For economists to refuse to pay attention to the norms proposed by Bayesian epistemologists would be to refuse to heed the implications of more or less their own models. Epistemology is unavoidable.
Finally, as Alexander Rosenberg has argued (1988a), if economics ever succeeded in repudiating all epistemology and answered only to its own standards, it would lose its influence on noneconomists and its rational hold on economists themselves. If the standards of acceptance among economists had no connection to epistemologically significant goals, such as reliability or truth, then the fact that a particular conclusion was accepted by most economists would be of no more interest to policy-makers than is the fact that a particular conclusion is accepted by most astrologers.
These comments do not constitute a defense of the conclusions and recommendations of this book. They stand or fall on their merits. I have only argued that there is nothing misconceived about the project of appraising mainstream economics as an intended contribution to human knowledge.
This book has focused on the epistemological peculiarities of a special human cognitive enterprise: mainstream economics. It has not broached the central problems of epistemology or of philosophy of science in their full generality. Its conclusions concern economics, and although some are of general significance, many are not relevant to other disciplines, even other social disciplines. For example, although Richard Miller in his Fact and Method eloquently insists upon the importance of the particular problems and standards that characterize different disciplines, he endorses the philosophical platitude that scientific theories postulate the existence of unobservable things to explain generalizations at the level of observations (1987, p. 135). To make sense of economic theory, one should reject this view. Similarly, the extent to which the view of models and theories defended in Section 7.3 helps to illuminate economics provides an argument for the cogency of that general view. The discussion of Milton Friedman’s methodology helps one to disentangle different positions that might be called instrumentalist (§A.2) and shows that instrumentalists need to be concerned about truth, even if their ultimate goals are purely practical. The discussion in Chapters 13 and 14 of how evidence bears on utility theory illuminates the tenuous general relations between theory and data.
It would be tedious to compile a list of examples such as these, and, in any event, my main concern is economics, not general philosophy of science. Instead, in closing, I shall attempt to bring together the main theses of this book and to show how they clarify and explain the most prominent methodological peculiarities of economics. The fact that economics is a social science – a science of human beings (§A.9) – is crucial to its distinctive methodological problems.
16.1 The Structure and Strategy of Mainstream Theoretical Economics
Part I offered a general account of the structure and strategy of theoretical economics, which is summarized in Figure 3.6. I argued that equilibrium theory – that is, the generalizations of consumer choice theory and the theory of the firm – lies at the heart of mainstream theoretical economics. Mainstream economics is the articulation, elaboration, and application of equilibrium theory.
It is unhelpful to maintain that equilibrium theory is the hard core of a Lakatosian scientific research program, because mainstream economists are free in particular inquiries to drop some of the constituents of equilibrium theory and even to replace them with contraries. A commitment to equilibrium theory is only a commitment to some subset of its components and to a modeling style. Some of the generalizations that constitute equilibrium theory are more central than others, but if one attempts to say what theoretical economics is by identifying some common core of propositions that are shared by every model or theory, one will not be able to give an informative characterization. What Lakatosians might be inclined to call the “negative heuristic” does not forbid tampering with equilibrium theory. It effectively forbids removing rational acquisitiveness and the possibility of equilibrium from their central places, but the characterization of this “pseudo-hardcore” is left open: nonsatiation can be replaced with satiation, but claims about cognitive dissonance are suspect. Incompleteness or intransitivities can be explored, but psychological generalizations about procedure variance are unwelcome. At the same time, although not specific to economics, Lakatos’ emphasis on the unity of research programs and the importance of their “positive heuristic” provides a helpful framework for characterizing equilibrium theory.
Equilibrium theory consists of the theory of consumer choice, the theory of the firm, and the thesis that equilibrium obtains. This last constituent fits awkwardly, for it appears as a theorem rather than as an axiom, but it is a central constituent nevertheless. All the parts of equilibrium theory are problematic. In the theory of the firm, only the law of diminishing returns is relatively solid. Constant returns to scale shows up to ensure mathematical coherence rather than for its empirical success. Maximization of net returns not only appears to be false, but it is in conflict with utility maximization by members of firms in most institutional settings.
The theory of consumer choice consists of utility theory, acquisitiveness, and diminishing marginal rates of substitution. It is less problematic than the theory of the firm, for acquisitiveness and diminishing marginal rates of substitution seem to be reasonable first approximations, at least with respect to market behavior, and (ordinal) utility theory is perhaps a plausible approximation. In the context of choices among a finite number of options, continuity is harmless, and, although transitivity may break down, there is obviously a good deal to be said for its correctness with respect to the limited set of options among which individuals are choosing. Completeness is the most problematic of the axioms of ordinal utility theory, though perhaps not in the context of complete certainty (but that context is itself problematic).
As I understand scientific theories, they consist of sets of lawlike statements that are systematically interconnected; and I thus identify equilibrium theory with the laws of the theory of the firm and of consumer choice, plus the assertion that equilibrium obtains. But these laws do no work by themselves. In both theoretical and empirical inquiries, laws are always combined with simplifications and specifications of relevant circumstances. Some of these simplifications, such as infinite commodity divisibility, perfect information, or perfect competition are pervasive and help determine the character of theoretical economics. But these simplifications are not assertions or discoveries of economics.
Theories in mainstream economics can be classified as partial or general equilibrium accounts. Partial equilibrium analyses treat a small number of markets in relative isolation, while general equilibrium theories attempt to deal with general interdependencies among markets, although this is less true of highly aggregative general equilibrium models with few commodities. Mathematical investigations of abstract general equilibrium models have been methodologically puzzling, since the models are not even approximately true of existing economies. I argued that they are best interpreted as conceptual investigations, investigations of possibilities, or attempts to develop heuristically useful tools.
Equilibrium theory, not general equilibrium theory, is the fundamental theory of mainstream economics. In addressing general interdependencies, as opposed to single markets or small groups of markets, general equilibrium theories augment equilibrium theory with simplifications or specifications concerning the circumstances to be studied. In addition to practical applications, as in forecasting models or input–output models, general equilibrium theories serve as a proving ground for new tools and are used to explore whether equilibrium theory will be able to serve as the core of a separate economic science.
To make sense of this theoretical enterprise requires distinguishing between models and theories. Models are predicates or definitions of predicates. The assumptions of models are clauses in definitions and not true or false assertions about the world. However, the investigation of models typically involves scrutinizing a fictitious world, conjured up so as to allow economists to pretend that there is something of which the predicate the model constitutes or defines is true. Provided that this heuristically attractive way of proceeding is understood to involve fictions, it is harmless and may be fruitful. As I argued in Chapter 6, one crucial component of science is the articulation of new concepts in terms of which to theorize. But once philosophers and economists take literally the claim that models create worlds and make possible experiments, which have the same or even superior epistemic credentials as laboratory experiments have, I jump ship, for this vessel is headed for a metaphysical shipwreck.
When one offers a general theoretical hypothesis asserting that a model is true of some realm of reality, then one is offering a theory; and in offering the theoretical hypothesis, one is committed to treating what were the assumptions of the model as assertions about the world. Despite what they sometimes say, economists do not treat equilibrium theory as merely a fundamental model, to which no empirical commitments pertain. For economists believe that they can predict and explain economic phenomena by means of equilibrium theory.
Central to mainstream economics has been the thought that economists are concerned with a set of causal factors or “laws,” which predominate in a particular domain of social life. These laws are generally well known and make up accepted economic theory, which provides a unified and complete, but inexact, account of the entire economic realm. This vision expresses a methodological commitment to what Mill called “a separate science” of economics. Other generalizations about preferences, beliefs, and constraints may be added to economic models and theories, provided that they do not conflict with the central place within the economic realm of rational acquisitiveness or make equilibrium impossible. To employ any other generalizations is ad hoc. No changes in fundamental theory itself are welcome that do not preserve its universal scope. The further features of human behavior that psychologists and sociologists discover are, from this perspective, typically ad hoc and only have a narrow scope. They are usually not suitable for inclusion in particular economic models and are typically disqualified from inclusion within fundamental theory. The methodological commitment to a separate science, which is fortunately waning, leads to the view that equilibrium theory is (at a suitable level of abstraction and approximation) the whole theoretical truth about economics.
This methodological commitment to the structure and strategy of a separate economic science explains why economists theorize as they do. I argued that it is unjustifiably dogmatic. The methodological commitment to a separate science of economics preserves the scientific appearance of economics and spares economists the maddeningly difficult and disorderly task of floundering among disparate data, attempting to identify significant causal factors. It preserves the aesthetic attractions of economics and keeps it a tractable subject for mathematical exploration. The commitment to a separate science maintains the close connection between the empirical theory of how people choose and the theory of how they rationally ought to choose, which in turn provides a strong pragmatic argument for treating apparent disconfirmation as error. And this commitment is essential to the normative argument for perfectly competitive equilibrium, which underlies both conservative defenses of laissez-faire and liberal analyses of market failures.
This vision of economics is appealing, and it is easy to understand why economists have been so deeply committed to it, but I have argued that it stands in the way of empirical progress.
16.2 Appraising Microeconomics and General Equilibrium Theory
Part II was concerning with theory appraisal, and it was as much critical as constructive. Many hundreds of pages have been written concerning how to assess economic theories, much of it unhelpful in large part because questions of theory appraisal in economics have rarely been joined to a detailed treatment of its structure and strategy. When economists fail to take apparent disconfirmations as refutations, methodological critics accuse them of adhering to an unreasonably dogmatic theory of confirmation or as failing to live up to their scientific standards. But, as I have argued, their actions may instead be consistent with an uncontroversial view of theory assessment, given the limitations in economists’ knowledge and the constraints on experimentation, which together drastically limit the evidential relevance of predictive successes or failures. If economists are sometimes unreasonably dogmatic, it is usually because of their commitment to regarding economics as a separate science, not because of their views of theory appraisal.
Further complicating the story are Milton Friedman’s influential views, which direct economists to be guided entirely by the success of the relevant predictions of their theories. This advice is impractical, quite apart from the internal problems with Friedman’s methodology. Friedman’s advice has rarely been followed, and to implement it would require that economics be radically transformed. Yet, since Friedman presents his views in defense of theoretical “business-as-usual” against critics of standard economics, one finds economists espousing Friedman’s methodology who would never dream of seriously acting on it.
Add in the influences of Popper and Lakatos and the confusions deepen. Popper was a natural authority to look to, for he was a leading philosopher of science. Moreover, his political views and his views concerning the methodology of the social sciences appeal to many economists. His message may appear at first glance consonant with Friedman’s (Blaug 1976). But, if Popper were right about how to do science, then economists would show a massive failure of methodological nerve. If one takes seriously what Popper says about falsifiability and the critical attitude, then the methodological practice of economics is not only mistaken, it is intellectually scandalous. Although this book has been critical of features of the methodology of economics, it has shown how researchers of intelligence and scholarly integrity could be committed to it.
One can better defend the honor of economists by adapting Lakatos’ methodology of scientific research programs, since it is flexible enough that few practices can be unequivocally condemned. Yet some developments in economics that are widely regarded as central theoretical advances (such as the switch from cardinal to ordinal utility theory) must be seen from Lakatos’ perspective as evidence that the “neoclassical research program” is degenerating. Furthermore, Lakatos’ categories do not fit the practices and products of economics easily and his rejection of “justificationism” is an invitation to epistemological disaster.
Friedman, Popper, and Lakatos, each in his own way, would prevent us from relying on supporting evidence. But, in interpreting experimental failures and in gambling on theories in new circumstances, economists need to make careful use of the evidence bearing on the propositions involved. It is only because of the importance of such evidence that there is a special difficulty about testing in economics. There is no special logical problem deriving predictions from economic theory and other statements. The difficulty is that the other statements referred to here are so far from the truth that the test results tell us nothing about economic theory. It is difficult even to envision an activity that paid no attention to supporting evidence. It would be nothing like the sciences we know.
To appraise economic theory sensibly, one must also come to terms with its inexactness. While inexactness is not a virtue, it is not a mortal sin either. Inexactness must be distinguished from other sorts of empirical shortcomings, and one must explain how inexact sciences can be understood, criticized, and defended. The inexactness of equilibrium theory should be understood mainly in terms of tendencies or vague ceteris paribus qualifications. Until one understands how counterfactual or qualified claims can be true or false, valuable or valueless, and confirmed or disconfirmed, one is in no position to recognize the extent to which equilibrium theory is true, valuable, or confirmed.
The theory of confirmation is a difficult and puzzling area in philosophy of science, and I do not know exactly how theories should be appraised. Neither do scientists, including economists. If progress in science depended on having scientific method exactly right, there would be no progress in science. There is a good deal of truth to the simple story that says that one derives predictions from a hypothesis and other premises and draws conclusions concerning how likely it is that the hypothesis is correct “on the basis of”Footnote 1 the success or failure of those predictions. Economists do exactly this, but they are blessed with the knowledge that there is a great deal of truth to equilibrium theory, and they are cursed with such difficulties in testing that they are rarely in any position to change their initial assessment. The weakness of the empirical control exerted by economic data provides for a legitimately large role for pragmatic factors. Choices about whether to deal with disconfirmations by means of fundamental theory modification or by the introduction of disturbances or interferences (which in many contexts do not need to be mentioned) will turn not only or mainly on the data, but on the sort of exactness one thinks obtainable, on the jobs the theory is supposed to do, and on the aesthetic, systematic, heuristic, and normative virtues the alternatives may have.
This looks like Mill’s inexact method a priori, his deductive method, but it is not a special theory of confirmation at all, and its apparent dogmatism arises from the precarious relevance of the data coupled with the good reasons economists have to find some truth in equilibrium theory.
16.3 Reasons and Causes: Rationality and Economic Behavior
If one says that Elizabeth’s preferences are complete, transitive, and continuous, one is offering empirical generalizations about her. These generalizations assert matters of fact. They are testable. And, unless Elizabeth is unlike the rest of the species, they are not all true, although there is a good deal of truth to them. Similarly, to connect Elizabeth’s actions to her preferences by asserting that Elizabeth never prefers an option she believes to be feasible to the one she chooses, is, on the ordinary understanding of the notion of preference, an empirical claim (although the theory of revealed preference mistakenly takes choice as defining preference). As many have pointed out, this generalization is not easy to test (Boland Reference Boland1981; Caldwell Reference Caldwell1983). To employ these generalizations – that is, utility theory – to explain why Elizabeth chose some option is to say only that she chose it because she most preferred it. Hamlet killed Claudius because he preferred to. Such explanations are vacuous and shallow. Utility theory can be used to explain Elizabeth’s choices only if it has something to say about what determines her preferences among the immediate objects of choice.
When economists employ consumer choice theory to explain why individuals chose some bundle of commodities, they give the agent’s reasons more clearly, and they also seem to provide a more definite causal explanation. Acquisitiveness and diminishing marginal rates of substitution are substantive “laws” of preferences. They have no obvious connection to rationality per se, and they are easier to test. The preference for more commodities and services over fewer is the motive force (“greed”), and diminishing marginal rates of substitution is a psychological constraint governing economic preferences. Utility theory guarantees the consistency of the chooser and derives a choice from acquisitiveness, diminishing marginal rates of substitution, and further facts. Both the causal story and the reason for the choice are clearer than in an explanation employing only ordinal utility theory.
Since an explanation employing consumer choice theory can be construed as a causal explanation, one might be tempted to regard the reason-giving aspect and the fact that utility theory is a theory of rationality as merely extra detail, some “local color” that is irrelevant to the explanatory logic of theoretical economics. But it seems to me that many of the distinctive methodological characteristics of economics can be understood better if the reason-giving feature of explanations in economics is taken seriously. In particular, the fact that utility theory is a theory of rationality helps to explain the following:
1. Why the notions of Pareto optimality and Pareto superiority are so pervasive and appealing in welfare economics and why ethical concerns are so often intermingled in positive economic theorizing (§4.4 and Chapter 8).
2. Why economists have such a strong empirical commitment to utility theory as the best account of how agents in fact choose (Chapters 13 and 15).
3. Why economists follow their distinctive theoretical strategy (Chapter 7).
Much of the methodological distinctiveness of economics stems from the remarkable fact that a normative theory of rationality lies at its theoretical core.
1. Pareto optimality. In Section 4.4 I sketched an argument for the moral approval (ceteris paribus) of perfectly competitive equilibrium on the basis of equilibrium theory and “minimal benevolence” (that, ceteris paribus, it is a morally good thing to make people better off). Equilibrium theory identifies people’s well-being with the satisfaction of their preferences, and it provides the premises for the welfare theorems which show that perfectly competitive equilibria are Pareto efficient and that all Pareto optima can be achieved as competitive equilibria. Rationality, in the form of utility theory, is specifically presupposed by the notion of Pareto efficiency and is central to welfare theorems and to the identification of well-being with the satisfaction of preferences. It is because preferences are reasons that the satisfaction of preferences has a claim to be a matter of prudence and thus a claim on the beneficence of others.
Consequently, any challenges to the empirical adequacy of equilibrium theory, or to the argument for the Pareto optimality of the consequences of individually rational behavior in competitive markets, bear immediately and forcefully on the argument for perfect competition that relies on minimal benevolence. Hence a positive conceptual investigation such as Samuelson’s has immediate moral reverberations that are evident in Meckling’s and Lerner’s responses. Challenges or qualifications to individual rationality or to its consequences undermine the moral claims of competitive markets.
2. The commitment to utility theory. As I argued in Chapter 15, the fact that utility theory is a theory of rationality can give one additional reason to favor it as an empirical theory of human preference and choice. The instability of irrational behavior and the educative effects of theories of rationality provide reason to believe that a theory that portrays individuals as behaving rationally is more likely to be true than a theory that depicts them as behaving irrationally. Furthermore, a theory which depicts actual behavior as rational permits explanations to be reason-giving as well as causal. These explanations can be accepted by economic agents as well as by economic theorists, and they permit actions to be appraised.Footnote 2 Such a theory may also have better effects on how people will behave than a theory that describes people as irrational. Although these last pragmatic factors must take second place to empirical adequacy, they can have a large influence on theory choice in a discipline such as economics in which empirical adequacy is so hard to judge.
3. Why economists follow their strategy. There are many reasons why it is hard to get knowledge in economics. In particular, I have stressed the inadequacies of the data with which economists have to work. Some of the limitations in these data have been economists’ own fault, since they have in the past avoided gathering or using certain kinds of evidence; but both the multiplicity of causal factors and the practical and moral problems of experimentation are serious difficulties, which can be mitigated but not eliminated.
One impediment to progress in economics, about which economists are in the course of addressing, is the hegemony of the vision of economics as a separate science. This hegemony is due in large part to the fact that equilibrium theory contains a theory of rationality. Utility theory links equilibrium theory to the plausible explanatory strategy embedded in everyday “folk” psychology. Generalizations concerning social phenomena that are not in terms of the beliefs and preferences of agents or the constraints on their actions are ad hoc and in need of explanation in these terms. Generalizations that cannot be explained in these terms are inherently unstable, because “people will learn,” or at least enough of them will that the competitive pressure of the market will bring them into line.
Theoretical constraints are necessary to focus research and to motivate the investigation of esoteric questions. But the justification for a particular paradigm or research program, like the justification for the commitment to economics as a separate science, is success and progress, including especially empirical success and empirical progress. How successful and empirically progressive theoretical economics has been is controversial. There are many nitty-gritty examinations of specific questions, but these often employ scarcely more theory than the assumption that people respond rationally to incentives. When theorists and experimenters in other disciplines have generated potentially relevant generalizations and data, economists should be eager to reach out and incorporate this material into their theorizing.
When one recognizes the centrality of rationality, one can understand better why so many economists are deeply committed to equilibrium theory and to the image of economics as a separate science. Linked to the notion of a separate science is not only the heuristic power of microeconomics but also the rational prescriptive force of utility theory and the moral argument for perfectly competitive equilibrium. Any step away from equilibrium theory weakens these links between purported facts, rational oughts, and moral oughts and surrenders the vision of a unified theory of economic phenomena. It is hard to give up so much.
If, as I believe, there are systematic failings of human rationality, and economic behavior is significantly influenced by many motive forces, apart from acquisitiveness and diminishing marginal rates of substitution, then equilibrium theory is an enfeebled theory, regardless of whether there is anything better. If factors from which the separate science of economics abstract are important influences on economic outcomes, mathematical expertise and elegance in modeling will not save the day.
The edifice of contemporary mainstream economics is gorgeous, and it can often help solve specific predictive problems. But its empirical difficulties are serious and, if the speculative thoughts of these pages are correct, it will never fully conquer them. Furthermore, concerns about rationality are unavoidable, and normative policy implications will always be close at hand to generate bias. The problems are serious, but there are ways forward. Although some theorists should keep pushing the current strategy as hard as they can, I applaud the extent to which economists have become more eclectic, more opportunistic, more willing to gather data, more willing to work with generalizations with narrow scope, and more willing to collaborate with other social scientists.