3.1 Introduction
This chapter has several goals. Among other things, we introduce terminology and facts about the Upper Paleolithic (UP); explain why conventional theories of economic growth do not adequately capture the dynamics of population and technology during this period; develop our own formal model of these matters; suggest that climate change was the driving force behind population growth and technological innovation in the Upper Paleolithic; and argue that our model can account for some empirical patterns identified by archaeologists. The ideas in this chapter provide foundations for our analysis of the transition to sedentism (Chapter 4) and the transition to agriculture (Chapter 5).
Following an overview in this section, we offer empirical information on climate, population, and technology in Sections 3.2–3.4. Section 3.4 argues that standard models of economic growth conflict with crucial facts about the Upper Paleolithic. In particular, any model predicting continuous expansion at an exponential or faster rate contradicts the observation that regional populations were commonly flat, cyclic, or subject to collapse.
This motivates our effort in the second part of the chapter to construct a formal model better suited to the archaeological evidence about the Upper Paleolithic. In our framework the equilibrium growth rate for population and technology is zero almost all of the time, with transitory periods of growth triggered by climate change. While some readers will not want to delve deeply into the mathematics in this part of the chapter, we urge all readers to look at the verbal portions, which often discuss important conceptual points. Proofs of all formal propositions are available at cambridge.org/economicprehistory.
Section 3.5 presents a model of technological innovation based on learning by doing. In our approach, technology is more likely to improve for food resources that are more intensively exploited, and it remains constant for any resources not currently in use. Section 3.6 studies the short-run allocation of labor time across individual food resources. Section 3.7 adopts a long-run framework in which population is endogenous and evolves according to Malthusian dynamics. Section 3.8 uses a very-long-run framework where technology is also endogenous and evolves according to the model of learning by doing in Section 3.5. There can be many equilibria in the very long run, and foraging societies can get stuck in what we call a “stagnation trap.” In such situations, some resources are not used because they are unattractive with existing technical knowledge, but knowledge does not improve because the resources are not used. Section 3.9 shows that changes in climate can sometimes enable foraging societies to escape from stagnation traps, leading to episodic technological innovation and population growth.
In the rest of the chapter we resume a verbal presentation. Section 3.10 suggests that our model can explain two empirical patterns in the archaeological record. One of these is the broadening of the diet in the Upper Paleolithic (the so-called broad spectrum revolution). The other is the increasing ability of human groups to colonize harsh natural environments such as Siberia. Section 3.11 is a summary and Section 3.12 is a postscript.
We begin by explaining some terminology for the benefit of non-archaeologists. Newcomers to the literature will frequently encounter references to the Lower, Middle, and Upper Paleolithic, where the term Paleolithic means “old stone age.” In all three of these periods, stone tools were used for hunting and gathering. The Neolithic period, or “new stone age,” refers to the use of stone tools for agriculture (see Chapter 5).
The boundary between the Lower and Middle Paleolithic is traditionally dated to about 250 KYA. Until recently, the accepted date for the advent of anatomically modern humans (AMHs, or Homo sapiens) was around 200 KYA, so the Lower Paleolithic was associated only with archaic humans; that is, members of the genus Homo, but not Homo sapiens. The most famous of these is Homo neanderthalensis. However, new discoveries from Morocco have pushed the appearance of Homo sapiens back to 300 KYA or earlier (see Section 1.1), creating overlap between AMHs and the Lower Paleolithic. Because very little is known about the technology used by AMHs at such early dates, we ignore the Lower Paleolithic here.
The boundary between the Middle and Upper Paleolithic is more relevant for our purposes. We will address the question of dates for this boundary below. The distinction between the two periods is based on differences in the methods used to create stone tools. The Middle Paleolithic is generally associated with a relatively primitive technique called “flake technology” while the Upper Paleolithic exhibits a more advanced technique called “blade technology” (for a description of the latter, see Fagan and Durrani, Reference Fagan and Durrani2019, ch. 5).
In blade technology, the tool producer (or knapper) starts with a prepared core of stone material, often flint. A punch stone and hammer stone are used to split fragments from the core, which are called blades. The blades are then shaped into a variety of tools including knives, scrapers, burins, and so on. The Upper Paleolithic toolkit represents an advance over the Middle Paleolithic in several ways: (a) raw materials such as flint are used more efficiently; (b) specialized tool types are made for specialized uses; (c) each tool type follows a standardized pattern; (d) stone tools are used to make further tools out of bone, antler, or ivory; and (e) composite tools can be created using multiple materials. The ability to carve bone or ivory made possible the creation of new tools like fishhooks and sewing needles. Composite techniques also widened the range of tools; for example, spears could involve a wood handle, a bone tip, and resin to attach the tip to the handle.
Later improvements in hunting methods included spear throwers as well as bows and arrows. Other markers of the Upper Paleolithic are grinding and pounding tools for processing wild plant foods; long-distance trade in raw materials; food storage facilities; structured hearths; specialized habitation areas for butchering, cooking, waste disposal, and sleeping; and artistic items such as ornaments, paintings, and musical instruments.
The traditional date for the boundary between the Middle and Upper Paleolithic is around 40–45 KYA, the period when AMHs were arriving in Europe. This amounts to a distinction between the technology used by Neanderthals already living in Europe and the technology brought to Europe by the first AMH immigrants. However, the technology of Upper Paleolithic Europe closely resembles technology used by AMHs in South Africa as early as 60–80 KYA (Mellars, Reference Mellars, Mellars, Boyle, Bar-Yosef and Stringer2007; Mourre et al., Reference Mourre, Villa and Henshilwood2010; Nowell, Reference Nowell2010). Mellars argues that the pioneering blade technology of South Africa fueled the spread of AMHs to Asia and Europe, a view that remains debatable. In any event, 40–45 KYA may represent not so much a qualitative boundary in the technical progress of AMHs as a milestone in their geographic dispersal (see our discussion in Section 2.3 on what we mean and don’t mean by the word “progress” in the context of technical change).
Archaeologists often refer to the Upper Paleolithic as a revolution in comparison with the Middle Paleolithic (Bar-Yosef, Reference Bar-Yosef2002a). The dramatic contrast in technological capabilities between the two periods warrants this label. From an economic perspective, however, the innovation process was remarkably slow. More than 200,000 years elapsed between the emergence of anatomically modern humans and the earliest widely accepted evidence of blade technology around 80 KYA. Tens of millennia more elapsed before the Neolithic transition to agriculture began around 12 KYA in southwest Asia. While spearthrowers and the bow and arrow were breakthroughs for the people who invented them, the average pace of technical progress in the Upper Paleolithic was negligible by comparison with early agricultural societies, let alone industrial societies.
Blade technology is a classic example of what economists would call a general-purpose technology (Lipsey et al., Reference Lipsey, Carlaw and Bekar2005). In this chapter we treat the existence of blade technology as given and do not attempt to explain the transition from the Middle to the Upper Paleolithic (for an intriguing demographic hypothesis about the rise and spread of UP culture see Powell et al., Reference Powell, Shennan and Thomas2009, Reference Powell, Shennan, Thomas, O’Brien and Shennan2010). Our focus is on the development of techniques specific to individual food resources. For example, nets can be used to catch birds, snares can be used to catch hares, weirs can be used to catch fish, and sickles can be used to harvest wild grasses. Our model of learning by doing in Section 3.5 addresses resource-specific technologies of this kind.
There is no uniform end date for the Upper Paleolithic. In the European context, archaeologists sometimes treat the Upper Paleolithic as terminating with the onset of the Holocene climate regime around 11.6 KYA (see Figure 1.1 in Chapter 1). For Europe, the period between the start of the Holocene and the arrival of agriculture is often called the Mesolithic. Because agriculture arrived at different times in different subregions of Europe, the timing of the shift from the Mesolithic to the Neolithic varies with location.
In the context of the Levant or other parts of southwest Asia, archaeologists often refer to the period between the Last Glacial Maximum and the Holocene (about 21–11.6 KYA) as the “Epi-Paleolithic,” meaning “Final Old Stone Age.” This terminology works well for southwest Asia because the Neolithic transition to agriculture roughly coincided with the arrival of the Holocene around 11.6 KYA (see Chapter 5). The distinguishing technological features of the Epi-Paleolithic in southwest Asia include microblades and, by about 14.5–13.0 KYA, increasing sedentism (see Chapter 4).
These terminological schemes are not necessarily a good fit for other regions of the world such as Africa and China. In the New World, the period known as “Archaic” is sometimes regarded as paralleling the Upper Paleolithic of the Old World. However, in many parts of the world the Upper Paleolithic never led to indigenous agriculture or to indigenous metal tools. In much of Africa, Asia, and the Americas, and all of Australia and the Arctic, an Upper Paleolithic toolkit remained in use until contact with the global economy in the last few centuries.
Direct evidence on Upper Paleolithic social institutions is scarce, so ethnographic analogies are helpful in painting a picture. There is broad agreement that prior to the rise of sedentary communities, the basic social unit was the mobile foraging band. Bands observed by anthropologists generally have about 25 members, with 7–8 healthy adults who can search for food on a full-time basis (Kelly, Reference Kelly2013a, ch. 7; French, Reference French2016, 176–179). These figures vary across societies, but they likely reflect a rough compromise between the insurance benefits from sharing food within the group and the fact that larger groups would deplete local resources too quickly, requiring more frequent residential moves (Kelly, Reference Kelly2013a, ch. 7). Such bands tend to make regular seasonal rounds within a familiar territory (see the description of traditional Shoshone society in Johnson and Earle, Reference Johnson and Earle2000, ch. 3). Several bands may come together annually at times and places of resource abundance or when stored food is available, offering opportunities to find mates, share information, trade exotic items, and engage in ritual activities.
Beyond the day-to-day benefits from food sharing within a band, the key strategy for coping with negative environmental shocks that are local and temporary is reciprocity among neighboring bands. This may involve sharing of food or access to territory (Kelly, Reference Kelly2013a, ch. 6). Storage technology is usually limited and tends to play a minor role in risk management. For negative shocks that are widespread and prolonged, the only practical response is a search for better resources in more distant territories.
Of course, small egalitarian bands of mobile foragers were not the only form of social organization in the Upper Paleolithic. Graeber and Wengrow (Reference Graeber and Wengrow2021, 86–92) cite a few examples from the period of settlements that include stone temples, elaborate burials, and massive monuments. But these findings are rare and are not the focus of this chapter.
3.2 Climate
Climate reconstruction (paleoclimatology) involves observations on polar ice, alpine glaciers, ocean and lake sediments, tree rings, calcite precipitated on the walls of caves, growth bands in corals, and the chemistry of marine shells. Polar ice cores have more than 40 distinct climate-related properties and provide data on temperature, rainfall, ice volume, and atmospheric CO2 concentrations (EPICA, Reference Bentley, Alexander and Shennan2004; McManus, Reference McManus2004). For an introductory review of Ice Age climate history, see Woodward (Reference Woodward2014, chs. 8–9).
Figure 3.1 summarizes climate history for the last 420,000 years using data from the Vostok ice core in Antarctica. Notice that from a climatic standpoint this covers the Late Pleistocene and the Holocene. These terms should not be confused with the terms Paleolithic and Neolithic, which are defined by technology rather than climate. The data in Figure 3.1 were published by Petit et al. (Reference Petit1999) and the graph was taken from Creative Commons (2019). The top panel shows temperature variation in degrees Celsius relative to the Holocene, the middle panel shows atmospheric carbon dioxide in parts per million by volume, and the bottom panel shows atmospheric dust concentration. The latter is a proxy for aridity, so the bottom panel can be interpreted as the inverse of precipitation.
From other Antarctic ice cores it is known that the pattern of Ice Ages depicted in Figure 3.1 extends back at least 800,000 years (EPICA, Reference Bentley, Alexander and Shennan2004). The underlying cause is a complex set of cycles involving the Earth’s orbit. Each cycle lasts about 100,000 years, with glacial periods lasting about 90,000 years separated by interglacial periods of about 10,000 years. During glacial periods, large volumes of water are locked up in ice sheets on the continents and sea levels are consequently low. In the past this has created bridges between land masses that are currently separated by ocean. The best known is Beringia, a former land bridge between Siberia and Alaska.
The three time series are not perfectly correlated, but glacial periods (or stadials) tend to have low temperatures, low CO2, and low precipitation, while interglacial periods (interstadials) tend to have the opposite conditions. It is also evident from Figure 3.1 that there are major temperature fluctuations within each glacial period. By comparison with the Holocene, the last Ice Age involved a decrease in global mean annual temperature of 5–10 degrees Celsius, a 30% reduction in atmospheric CO2 concentration, and a drop in sea level of 125 meters (Cronin, Reference Cronin1999).
Several climate events that will be significant for later discussions are visible in Figure 3.1. The Eemian period of 126–116 KYA was the most recent interglacial before the Holocene and was slightly warmer at its peak. The temperature fell rapidly starting around 116 KYA, recovered during 100–80 KYA, and fell rapidly again until 65 KYA. This was followed by a moderate phase during 65–30 KYA. The Last Glacial Maximum (LGM) brought a low point around 21 KYA. Temperature recovered quickly beginning around 19 KYA. A temperature drop called the Younger Dryas (YD) can be seen around 13.0–11.6 KYA. Although this reversal appears brief on the scale of Figure 3.1, it lasted over 1000 years and had major demographic repercussions (see Section 3.3). The Holocene brought warmer, wetter, and more stable conditions beginning at 11.6 KYA. During the Holocene there have been various global and regional climate fluctuations, which will be discussed as they become relevant in later chapters.
3.3 Population
There is a large archaeological literature dedicated to Pleistocene and Holocene demography. French (Reference French2016) reviews the many proxies archaeologists have used to infer population change. The approach in this section involves a method called “dates as data.” First, we provide a quick introduction to radiocarbon dating techniques.
Archaeologists frequently estimate the age of organic materials by measuring the concentration of a naturally occurring radioactive isotope of carbon called 14C relative to the more common stable isotopes 12C and 13C. Plants and animals absorb 14C from their environments while alive, and this isotope decays at a known rate after an organism dies. Older samples therefore have lower concentrations of 14C. In practice the true date for a sample is estimated with error. The extent of this uncertainty is sometimes expressed by a confidence interval, and sometimes by providing an entire probability distribution.
Radiocarbon dating has three limitations. First, it does not work for non-organic material such as stone. Second, dates are unreliable further back than about 40–50 KYA because earlier samples have too little 14C. Third, the atmospheric concentration of 14C varies over time, creating the need for calibration techniques that translate “radiocarbon years” into calendar years. All radiocarbon dates in this book are calibrated and can be treated as calendar years unless the contrary is explicitly stated. Additional information about radiocarbon dating methods is available in introductory archaeology textbooks.
The idea behind “dates as data” runs as follows. If the population in a geographic region in a given time period is large, it is likely to generate extensive material remains that can be discovered by archaeologists. Thus if a relatively large number of observed radiocarbon dates fall in a given time interval, one can infer the presence of a relatively large population during that interval, and conversely if there is a relatively small number of radiocarbon dates in a different time interval. This yields information about how the population changed over time in a region. Archaeologists sometimes use supplemental data to estimate absolute population levels.
Early research using “dates as data” placed individual estimated dates in temporal bins, and used the number of dates in each of the bins to generate a histogram indicating how population changed over time. The size of the bins was usually adjusted to reflect the degree of uncertainty in the dating process. For example, if a typical date had a 95% confidence interval of 1,000 years, the researcher might choose 1,000 years as the size of a bin. Since about 2010, many archaeologists have begun to exploit the entire probability distribution for each date, where the sum of the radiocarbon probabilities is treated as a proxy for population. Various sources of bias in this approach are discussed by French (Reference French2016), who also provides extensive references.
In this chapter we limit attention to regions for which data extend far back into the Pleistocene (Europe and Siberia), for which Upper Paleolithic technology continued into the Holocene (North and South America), or both (Australia). Other regions (including southwest Asia, Japan, and the Sahara) will be discussed when appropriate in Chapters 4 and 5. For brevity we focus on recent studies and often omit citations to earlier research on the same geographic region.
We begin with Europe during 30–13 KYA, a period that covers the growth and retreat of ice sheets associated with the Last Glacial Maximum. Tallavaara et al. (Reference Tallavaara, Luoto, Korhonen, Järvinen and Seppä2015) derive population estimates in two ways. First, they use a “dates as data” approach of the kind described above, where 3,718 calibrated median dates from 895 archaeological sites are placed in bins with 1,000-year durations. They compare these results with population estimates obtained through climate modeling in conjunction with population densities for recent hunter-gatherer societies. Both techniques indicate a large contraction in human population size and range due to the LGM beginning around 27 KYA, with a minimum around 23 KYA and a rapid recovery during 19–13 KYA. The simulations indicate an initial European population of 330,000 at 30 KYA, a minimum of 130,000 at 23 KYA, and 410,000 at 13 KYA. The simulations also show that even in the depths of the LGM, human populations extended north as far as central France, southern Germany, and the southern parts of Ukraine and European Russia, with around 36% of Europe remaining habitable. Tallavaara et al. (Reference Tallavaara, Luoto, Korhonen, Järvinen and Seppä2015, 8235) remark that “climatic conditions were crucial drivers of last glacial human population dynamics.”
Fernández-López et al. (Reference Fernández-López de Pablo, Gutiérrez-Roig, Gómez-Puche, McLaughlin, Silva and Lozano2019) have studied population dynamics in the Iberian peninsula for 16.6–8 KYA using summed probability distributions with 907 radiocarbon dates. The later part of this interval extends into the Holocene, but the Iberian population were still foragers at this time, so the results are relevant here. The authors found several statistically significant upward and downward deviations from a null exponential model. Three qualitative phases were apparent: (a) exponential growth in a period of improving climate called the Last Glacial Interstadial; (b) collapse followed by prolonged stagnation during the Younger Dryas and the first half of the Early Holocene; and (c) logistic growth in the second half of the Early Holocene. The population during phase (a) was positively correlated with temperature and precipitation. The population in phase (c) became stable as temperature and precipitation stabilized during the second half of the Early Holocene.
Population trends in Siberia have been studied by Kuzmin and Keates (Reference Kuzmin and Keates2005) for 46–12 KYA using 437 dates (these authors do not discuss calibration issues, and their dates appear to be uncalibrated). They group radiocarbon dates at individual sites into “occupation episodes” of 1,000 years each. On this basis, they conclude that population density was low until 36 KYA, increased gradually during 36–16 KYA, and increased more rapidly from 16–12 KYA. They find that Siberia was not depopulated at the Last Glacial Maximum. Their conclusions for the LGM are controversial (see Section 3.10).
Australian population trends from 50 KYA to European contact are examined by Williams (Reference Ashraf and Galor2013), who uses 4,575 calibrated radiocarbon dates from 1,750 archaeological sites. The median dates are assigned to 200-year bins. Williams finds low populations through the late Pleistocene, with a drop of about 60% associated with the Last Glacial Maximum during 21–18 KYA. A slow stepwise increase begins around the time of the Holocene, near 12 KYA, with population recovering to pre-LGM levels by about 9.6 KYA. Williams suggests that high-amplitude environmental shocks may have kept the population low during the Late Pleistocene, with climate stabilization in the Holocene allowing population growth. The Holocene shows several population fluctuations that are probably driven by climatic variation, with a peak population in the Late Holocene. Williams suggests that a founding group of 1,000–3,000 people around 50 KYA and a maximum population of 1.2 million around 500 years ago would be consistent with the data. The average annual growth rate for the entire 50,000–year interval is about 0.01% with a range between 0.07 and −0.03.
Several recent articles have focused on North American paleodemography. Two points should be kept in mind in this regard. First, recent evidence indicates that human settlement occurred in North America by 23–21 KYA, but radiocarbon data usable for paleodemography are not available until around 15 KYA. Second, although much of the western portion of the continent was occupied by mobile foraging populations until European contact, sedentary foraging was important in the Pacific Northwest and farming became important east of the Mississippi River as well as in the southwestern US. We focus on the findings for mobile foragers.
Peros et al. (Reference Peros, Munoz, Gajewski and Viau2010) use 25,198 calibrated dates for all of North America, where the median probability value for each date is assigned to a 200-year bin. The dates run from about 15 KYA until European contact. The most rapid growth rate was shortly before 14 KYA. Population was low and relatively constant until roughly 6 KYA, when it began to increase. Because plant domestication is known to have occurred in the eastern half of the continent starting about 4 KYA (see Chapter 5), results after that date are not relevant here. The authors do not report any population decline associated with the Younger Dryas.
Anderson et al. (Reference Anderson, Goodyear, Kennett and West2011) use summed radiocarbon probabilities to investigate the possible effects in North America from the cold and arid period of the Younger Dryas, which occurred during 12.9–11.6 KYA. Their 682 calibrated radiocarbon dates run from 14–11 KYA. They find a large population increase before the YD followed by a rapid drop (perhaps up to 30–50%) in the first few centuries of this period. There is a gradual rebound over the next 900 years in the later stages of the YD, suggesting adaptation to the adverse climatic conditions. This pattern holds for all regions of the continent and is supported by evidence on projectile point frequencies as well as quarry use. Anderson et al. describe similar results using radiocarbon data for Europe, Asia (excluding the Middle East), and Africa. Interestingly, the Middle East does not display a significant population drop with the onset of the Younger Dryas, a point to which we will return in Chapter 5.
Kelly et al. (Reference Kelly, Surovell, Shuman and Smith2013) explored the relationship between population and climate for foragers in the Bighorn Basin of Wyoming over the last 13,000 years, using the summed probabilities derived from several hundred radiocarbon dates. Fluctuations in population were complex, with several peaks and troughs where intervals of population growth were associated with cooler and wetter conditions. An increase in temperature of one degree Celsius led to a 45% decline in population, and a decrease in moisture of 50 mm led to a 26% decline in population after a lag of 300 years. However, the authors were unable to distinguish migratory effects from natural population growth or decline.
Zahid et al. (Reference Zahid, Robinson and Kelly2016) studied forager populations in Wyoming and Colorado during the last 15,000 years using summed probabilities for 7,900 radiocarbon samples. Because this is a larger geographic area than that studied by Kelly et al. (Reference Kelly, Surovell, Shuman and Smith2013), migration effects may be less of a concern. Following an early jump in population, Zahid et al. found an exponential trend for 13–6 KYA with an average annual growth rate of 0.04%, but with several large upward and downward deviations from this trend that lasted for centuries. Population leveled off and declined during 6–3 KYA. The authors attribute the earlier exponential trend to climate change or endogenous biological processes rather than technical innovation. They do not propose a specific explanation for the population decline following 6 KYA.
Our last stop is South America. Goldberg et al. (Reference Goldberg, Mychajliw and Hadly2016) use 5,464 radiocarbon dates from 1,147 sites to compute summed probabilities for 15–2 KYA. Dispersal by the original colonizers is not visible because the earliest dates are scattered throughout the continent. The authors find a rapid increase both in radiocarbon dates and occupied sites during 13–9 KYA, followed by oscillations around a constant mean during 9–5.5 KYA. They suggest that a peak around 11 KYA could reflect an overshoot associated with a megafaunal extinction. Growth resumed in the interval 5.5–2 KYA. The best fit was provided by a two-phase model with logistic growth until 5.5 KYA followed by an exponential growth trend thereafter. The authors note that the timing of exponential growth coincided with a shift toward agriculture across much of South America.
The main lessons we take away from this review of demographic research on five continents are that mobile foragers with Upper Paleolithic technology tended to converge on equilibrium populations for the environments they inhabited, and that there were often very strong climatic effects on population size. Some periods of slow population growth appear to be consistent with slow technological progress in a natural environment where resources were fluctuating around a constant mean. But steady exponential growth was certainly not the norm in the Pleistocene, and probably not in the Holocene either. Even when an exponential trend was sustained for a substantial period, large and statistically significant deviations were usually superimposed on the trend, and regional population contractions were not unusual.
3.4 Technology
If one wanted to argue that the Upper Paleolithic was a technically progressive period, one could. Direct archaeological evidence reveals considerable evolution in the design of stone tools, including the development of microblades. New hunting methods such as the spear thrower and bow and arrow were also developed. Two other empirical patterns are evident. First, people began to rely less on hunting large mammals and more on small prey like hares and birds, along with a wide array of plant foods. This process is known as the broad spectrum revolution. Second, people colonized harsh environments such as the Arctic, which required new hunting technologies as well as innovations with respect to clothing, shelter, and transport.
On the other hand, one could also argue that the Upper Paleolithic was technically stagnant. Tens of thousands of years elapsed between the beginnings of blade technology around 80 KYA and the beginnings of agriculture around 12 KYA. Precursors of modern society like metallurgy and writing are even more recent. Strikingly, much of the world continued to rely on stone-age foraging techniques until European contact in the last 500 years. This suggests that Upper Paleolithic technology was a stable way of life, and that unusual circumstances were needed to push societies toward other economic systems.
In our view, a theoretical framework that can reconcile these two perspectives will need to have the following features:
(a) Technology should be endogenous, with innovation being slow, sporadic, and frequently absent for long periods of time.
(b) Population should be endogenous, with Malthusian dynamics.
(c) Nature should be exogenous, with changes in climate, geography, and ecosystems serving as triggers for changes in technology and population.
(d) The theory should be consistent with the climate history in Section 3.2.
(e) The theory should be consistent with the population history in Section 3.3.
(f) The theory should account for the broad spectrum revolution and the expansion of human populations into harsh environments.
We will construct a model along these lines in Sections 3.5–3.9.
In this section, we review several previous efforts to endogenize technology. We begin with a model of economic growth by Kremer (Reference Kremer1993). Although this is about three decades old, it is worth discussing because it is a pioneering application of growth theory to prehistory and because it crystallizes key issues in a relatively simple way. In fairness we should say that Kremer’s central concern is the transition to modern economic growth, not economic prehistory, but he does claim that his model is relevant for pre-agricultural societies. Other efforts to construct comprehensive theories of economic growth having relevance for prehistory, such as Galor (Reference Galor, Aghion and Durlauf2005, Reference Bar-Yosef2011), will be discussed in Chapter 12.
Kremer’s starting point is a sequence of estimates for world population beginning a million years ago and ending with the twentieth century. The early figures in the series are crude and should not be taken seriously. Much more sophisticated estimates are available today (see Goldewijk et al., Reference Goldewijk, Beusen and Janssen2010), and we need not linger over measurement issues here. The more interesting questions are theoretical in nature.
Kremer’s model rests on two causal relationships. First, he argues that the size of the world population is determined by technological productivity in a Malthusian manner. His model of population is a simplified version of the one in Chapter 2, where short-run adjustment processes are ignored and population is assumed always to equal its long-run equilibrium level given the current productivity level. Equivalently, the average product of labor is always equal to the fixed long-run standard of living y* from Chapter 2.
Second, he argues that the rate of productivity growth is proportional to the level of world population. Note that this claim involves a linkage between a growth rate and a level, in contrast to the Malthusian part of the model that provides a linkage between the level of population and level of productivity. Kremer’s argument for this second causal relationship is that a larger population implies a larger number of independent inventors, and therefore more useful inventions per unit of time. Although Kremer studies several ways of specifying the determinants of productivity growth, we focus on his basic linear model. The idea that a higher population level yields a higher productivity growth rate is common among economists (see Becker et al., Reference Becker, Glaeser and Murphy1999; Galor and Weil, Reference Galor and Weil2000; Jones, Reference Jones2001; Galor, Reference Galor, Aghion and Durlauf2005; Olsson and Hibbs, Reference Olsson and Hibbs2005; Aiyar et al., Reference Aiyar, Dalgaard and Moav2008).
Putting together the causal relationships from the two preceding paragraphs gives a simple prediction: the current population growth rate will be proportional to the current population level. Kremer argues (plausibly) that this provides a good fit to the data over the last several thousand years, except in the twentieth century when the Industrial Revolution and the associated demographic transition became important. For our purposes, the main points are that the model predicts continuous growth in both population and productivity, and that the growth process will be faster than exponential.
This is clearly inconsistent with our criteria for a satisfactory theory of Upper Paleolithic technology. First, it is incompatible with the fact that technical progress was usually sporadic or absent. Second, the prediction about population growth conflicts with findings that regional populations often exhibited stasis, cycles, or collapse. And finally, the model ignores the powerful demographic effects of the natural environment.
As Richerson et al. (Reference Richerson, Boyd and Bettinger2001) have pointed out, when tiny rates of technical change are compounded for tens of millennia, they have enormous consequences for population, the standard of living, or both. Even a small founding population could bring all of Asia to modern hunter-gatherer population densities within a few thousand years. Over the 50 millennia in which humans have occupied Australia, a tiny rate of population growth can take a founding population from the low thousands to the low billions (Williams, Reference Ashraf and Galor2013). This is true even for a small constant rate of exponential growth. If the growth rate is an increasing function of the population level, as in Kremer (Reference Kremer1993), the results are still more explosive. Because there is no sign of sustained population growth (or improving living standards, or greater social complexity) before the first known sedentary communities at about 15 KYA, the equilibrium growth rate must have been zero almost all of the time.
Another problem with the Kremer framework requires comment. In a Malthusian setting it is reasonable to hypothesize that observed population growth could be explained by unobserved technological progress. But the Kremer model ignores the possibility that population growth could have been stimulated by climate improvements with technology held constant (e.g., the recovery from the Last Glacial Maximum, or the shift from the Pleistocene to the Holocene). It also ignores the fact that the colonization of new land masses can yield world population growth with a constant technology. For inferences of unobserved technological progress to be persuasive, one must argue that population grew within a fixed geographic region with fixed natural resources, and that this increase was not driven by migration from external sources.
Archaeology and anthropology also have a literature linking population size to innovation, based on social learning and cultural transmission (Shennan, Reference Shennan2001; Bentley et al., Reference Bentley, Alexander and Shennan2004; Henrich, Reference Henrich2004; Powell et al., Reference Powell, Shennan and Thomas2009, Reference Powell, Shennan, Thomas, O’Brien and Shennan2010; Boyd et al., Reference Boyd, Richerson and Henrich2011; Richerson, Reference Richerson2013; Crema and Lake, Reference Crema and Lake2015). The usual (although not invariable) conclusion is that a higher population level leads to more rapid social learning and technical change. Eventually this yields greater cultural complexity, better adaptation to the environment, and/or enhanced biological fitness. These ideas have been applied in discussions of the Upper Paleolithic (Shennan, Reference Shennan2001; Powell et al., Reference Powell, Shennan and Thomas2009, Reference Powell, Shennan, Thomas, O’Brien and Shennan2010).
The archaeological literature on social learning is consistent with the part of the Kremer model where the innovation rate is an increasing function of the population level. However, it omits the other part of the Kremer model where population is endogenized through Malthusian dynamics. If we were to add a Malthusian population equation, we would be back to the same positive feedback loop where technology and population can both grow explosively. We are not aware of any formal modeling in archaeology that tackles this issue, although the potential for positive feedback has been noted (Shennan, Reference Shennan2001, 14; Powell et al., Reference Powell, Shennan and Thomas2009, 1301; Reference Powell, Shennan, Thomas, O’Brien and Shennan2010, 145). This problem could be mitigated by a concave relationship between the population level and the innovation rate, as is usually found in social learning models, rather than the linear relationship used in the simplest version of the Kremer model.
Anthropological data have been used to test the claim that technical innovation is an increasing function of population size. Some studies support this hypothesized link while others do not (Collard et al., Reference Collard, Buchanan and O’Brien2013). We are not surprised by these mixed results. Virtually all such studies use (small) samples of recently observed foraging or farming societies. The hypothesis being tested treats population as exogenous and technology as endogenous. But on time scales where technology is endogenous (and perhaps at or near an equilibrium level), population will almost certainly also be endogenous for Malthusian reasons (and also at or near an equilibrium level). Regressing technology on population thus involves regressing one endogeneous variable on another, resulting in simultaneity bias. In a model where both technology and population are endogenous, the exogenous variables are those determined by nature (climate, geography, and ecosystems).
Our own story about technical change in the Upper Paleolithic is closely related to other theories about social learning that have been influential among archaeologists. For analytic convenience, our formal model adopts a very simple approach to social learning where learning is initiated through mistakes in copying, which can be regarded as a form of accidental experimentation. Directional learning occurs through comparisons among the accidentally generated techniques for harvesting a particular resource. However, our modeling framework could be extended to accommodate deliberate experimentation.
It is useful to compare our analysis with that in Henrich’s (Reference Henrich2016) highly influential book The Secret of Our Success. Henrich’s treatment of collective learning is much more detailed than ours. It stresses the innate ability of humans to copy from each other and the importance of social norms for the transfer of new knowledge within a society and to future generations. We accept Henrich’s richer view of collective learning and see it as complementary to our simplified account. However, we differ from Henrich in that our analysis is motivated by the observation that technological progress was dramatically uneven during the Upper Paleolithic. Why were there periods of technical improvement separated by long periods of apparent stagnation? Henrich does not address this question but elements of his analysis suggest a pathway to an answer.
Let social norms be the primary mechanism for knowledge transfer, as Henrich suggests. Strict adherence to social norms related to technology could theoretically shut down both experimentation and knowledge transfer. Under what conditions will norms be flexible enough to allow for technological change? Giuliano and Nunn (2021) find that tradition and cultural persistence are weaker in less stable environments. We can link this idea with Henrich’s (Reference Henrich2016, 301) view that climate fluctuations are a source of intensified pressure for social learning. Perhaps climate fluctuations simultaneously loosened the restrictive grip of social norms on new technologies. Our simpler approach yields a relationship between climate and technical innovation that is more easily tested, but it could potentially be expanded to include social norms as a mediating factor.
A brief comment on our relationship to the framework of Boserup (Reference Boserup1965) is also in order. Boserup famously argued that population growth puts downward pressure on living standards, which stimulates technological innovation (often called “intensification” by archaeologists and anthropologists). If population growth is viewed as an exogenous variable, we would reject this approach. However, Wood (Reference Wood1998) provides a synthesis of Boserupian and Malthusian ideas where population is treated as endogenous.
Even so, our theory diverges from Boserup. We do not treat falling standards of living (whether caused by a movement along a declining average product curve, or by a downward shift of the curve) as a causal stimulus for innovation. We do sometimes treat a negative climate shock as the trigger for technical change (see our models of agriculture and manufacturing in Chapters 5 and 10, respectively), but our causal mechanisms differ from those of Boserup. Perhaps more importantly here, we also believe positive climate shocks, which raise living standards in the short run, were a key trigger for innovation in the Upper Paleolithic. Details will be given in Sections 3.9 and 3.10. We make similar arguments regarding positive climate change as a stimulus for sedentism in Chapter 4.
We suggest the following way of thinking about Upper Paleolithic technology. Nature provides many potential food resources that could be exploited if a society had suitable technical knowledge. In a static environment, foragers become very competent at exploiting some subset of these resources, but face long-run stagnation because (a) there is an upper bound on productivity for each resource, (b) latent resources remain unexploited due to the limitations of prevailing knowledge, and (c) knowledge cannot improve for resources that are never used.
To escape from such a trap, a foraging society must be exposed to shocks from nature. For example, an improved climate tends to increase population in the long run. If this Malthusian effect is large enough it may become attractive to exploit latent resources. Once this occurs, learning by doing will generate improvements in the techniques used to harvest, process, or store these new resources. If such knowledge gains are irreversible, a series of positive and negative climate shocks can yield a ratchet effect in technological capabilities, with occasional episodes of population growth and technical innovation.
We develop these ideas formally in Sections 3.5–3.9. Our theory can explain the prolonged periods of economic stagnation found in many pre-agricultural societies, both in the distant past and among recent hunter-gatherers. In Section 3.10 we suggest that our theory can also account for some macro-level patterns in the Upper Paleolithic, including the broad spectrum revolution and the colonization of extreme environments.
3.5 Learning by Doing
The next several sections develop a formal model of Upper Paleolithic economic growth that conforms to the requirements (a)–(f) identified in Section 3.4. We begin with a model of technical innovation along lines similar to those of the social learning models used in archaeology and anthropology, where children learn techniques from adults. Our approach makes a number of simplifications. We assume children can imitate any adult, without special preferences for biological parents or prestigious role models; we assume children can compare techniques among themselves and adopt the best ones available, so there is no variation among the members of any given generation in the techniques used; and we study a limiting case where the number of imitation events goes to infinity while the mutation rate goes to zero. This yields directional learning over multiple generations, where techniques may improve or remain static but never deteriorate.
Consider a list of natural resources
. These could include blueberries, rabbits, and so forth. Each resource can be converted into food through the use of labor. We treat food as a homogeneous output, ignoring distinctions among calories, protein, vitamins, and so on. The production function for converting resource r into food is
where ar is the abundance of resource r (regarded as a flow provided by nature in a given time period); kr is the technique used to harvest it; and nr is the labor used for harvesting. The abundance of each resource is determined by climate, geography, and ecosystems. However, climate is the only determinant of resource abundance that will be varied in this chapter, so for convenience we often refer to
as “climate.”
For fixed ar and kr, each production function in (3.1) has the same general shape as the production function from Chapter 2. We impose the following technical conditions.
Assumption 3.1 The functions fr are twice continuously differentiable and satisfy
;
for all
with
as
; and
for all
.
Each function has a positive and finite marginal product
when labor input is zero. We will say that resource r is active if
and latent if
. The finiteness of the marginal products at zero will be important when we discuss conditions under which a latent resource becomes active. As in Chapter 2, the function
exhibits diminishing marginal productivity (the second derivative is negative).
Dropping the r subscript momentarily, one example of a function satisfying the requirements in A3.1 is
, where the marginal product at zero is
. This functional form is consistent with an interpretation where
is the resource flow in nature,
is the fraction of
that is potentially accessible with the current technique
, and
is the fraction of
that is actually harvested.
Techniques are modeled as binary strings of uniform length Q so
. Let
be the best method for transforming resource r into food:
for all
. The function
is increasing in the number of digits of
that match
. It does not matter which digits are matched. The number of matching digits indexes the level of technical knowledge regarding resource r.
Q is assumed to be large enough that an exhaustive search for the ideal strings is impractical. Instead, each generation inherits a repertoire of techniques from its parental generation. The repertoire available to the adults of generation t is
. This array summarizes the current state of technological knowledge for all natural resources, where this knowledge is freely available to all adults of generation t.
The repertoire
for the next generation is derived as follows. Let there be
adults in period t. Each adult is endowed with one unit of labor time. The society’s labor allocation is
where
. All adults use the same time allocation, so each person devotes
time units to resource r.
A typical child in the period t has X opportunities to watch parents or other adults harvest resources, where X is a large number.
of these observations will involve resource r (there are no observations for latent resources with
). Every time a child i sees the exploitation of resource r, the string
is copied. For each of the Q positions on the string, with probability p an error is made in copying the current digit, and with probability 1-p the digit is copied accurately. Mutations are independent across loci, observations, and agents. A copy is denoted by
where x indexes observations.
Whenever a copy of
is made, child i uses a negligible amount of labor ε
to determine the marginal product
. If the new copy achieves a higher value than the best previous copy, the child retains the new copy and discards any earlier copy. Otherwise, the best previous copy is retained and the new copy is discarded. At the end of this learning process, child i has a best string for resource r, which we denote by
.
The number of children who survive to adulthood is
. When the parents from period t die and their children become adults, these new adults compare their strings for resource r and coordinate on (one of) the best available. The result for resource r is
.
This process gives updated strings for the resources r having
. In most of this chapter, we will assume passive updating for latent resources: that is,
implies
. We discuss alternative productivity assumptions for latent resources at the end of Section 3.9. The entire updated repertoire
is freely available to all adults of generation
.
Let
be the number of correct digits in the string
. To model technical progress for active resources, we need to know the probability distribution over
.
Proposition 3.1
(learning by doing).
Assume
so resource r is active in period t. Define
to be the number of children who survive to adulthood in period
per adult in period t. Let the number of observations per child (X) approach infinity and the mutation probability (p) approach zero with
held constant. In the limit,

All other transition probabilities for qrt+1 go to zero in the limit.
Proposition 3.1 shows that technical evolution for an active resource is directional. Either the number of correct digits in the string is unchanged and so is productivity, or there is a one-step improvement in the string for resource r with an associated productivity gain.
The intuition behind this result is not hard to grasp. In order to obtain technical regress for the string for resource r, it would be necessary for every observation of
to be worse than the status quo. In the limit as the number of observations goes to infinity, the probability of this outcome goes to zero. But technical progress only requires that at least one observation be better than the status quo. The probability of this outcome does not go to zero (though the probability of multiple productivity steps does). Proposition 3.1 does not require a large population. Each child only needs to have many opportunities to observe adult behavior, which could be true even in a small foraging band.
The probability of technical progress is an increasing function of the mutation rate, the labor input for the resource, the population growth rate, and the current distance from the ideal string. Other things equal, progress is more likely for resources involving larger labor inputs because children make more observations for these resources. It is also more likely when the population growth rate is higher because then there are more children who can compare notes on the copies they have made. Finally, progress is more likely when the distance from the ideal string is greater (the current technique is worse), because then there are more opportunities to learn something useful. One corollary of Proposition 3.1 is that if the transition probabilities are continuous at
, productivity for a latent resource is constant with certainty. This provides a theoretical rationale for the idea that strings for such resources are passively updated.
Notice that if we fix the fraction of total time devoted to each resource, a higher population N implies a higher labor input nr for every active resource r, and therefore an increased probability of technical progress for every resource (as long as its current string remains imperfect). This is consistent with the idea that a larger population leads to more innovation. Moreover, the relationship between population and innovation is concave. A less standard result is that a higher rate of population growth (ρ) also stimulates technical progress. However, when a society is close to a Malthusian equilibrium this parameter is close to unity, so the population growth effect may not play a significant role in practice. We often refer to processes of technical change like those modeled in this section as learning by doing. The everyday sense of this phrase is that an individual person can become better at a task by doing it repeatedly, perhaps refining methods through trial and error or improving skills through practice. Our use of the phrase instead refers to social learning in which one generation’s techniques are an improvement on the techniques of the preceding generation. The fundamental idea is that one generation must actively use a resource or technique in order for the next generation to attain higher productivity for that particular resource or technique.
3.6 Short-Run Equilibrium
The next three sections develop increasingly inclusive equilibrium concepts. We start in the present section with the short run, in which time allocation is endogenous and population, technology, and climate are all exogenous. Section 3.7 goes to the long run, where we treat population as endogenous using Malthusian dynamics. Section 3.8 goes to the very long run, where we also treat technology as endogenous through the model of learning by doing developed in Section 3.5. At this point, the only remaining exogenous variable will be climate. Section 3.9 will show how climate change can induce changes in all of the other variables. The relationships among the short, long, and very long runs are summarized in Table 3.1.
At the moment we are only concerned with the allocation of time across various resources, and for this purpose it is useful to adopt more compact notation. Let
be the productivity of resource r when its natural abundance is
and technique
is used. As mentioned earlier,
can be interpreted as the amount of resource r that is potentially accessible for harvesting under current technology. The corresponding vector of productivities will be written A(a, K). When a and K are being held constant, as in this section, we drop these arguments and write this vector as
.
In keeping with our general premise that a small group with frequently interacting members can resolve coordination, distribution, and free rider problems (see Chapter 1), we assume a foraging band allocates the total adult labor supply N across the individual resources to maximize total food, which is shared equally. The resulting time allocation will be called a short-run equilibrium (SRE).
Definition 3.1 A short-run equilibrium for the productivities
and population
is a labor allocation
that achieves
The next proposition characterizes the optimal time allocation.
(short-run equilibrium). The solution in (3.2) is unique and continuous in (A, N). Moreover: (a) Scale effect. Fix (b) Substitution effect. Fix (c) (d) (e) Proposition 3.2
and suppose
. If
then
.
and suppose
with
for all
. Also suppose
and
for some
. Then
and
.
is strictly concave in N and
is decreasing in N.![]()
where
is the derivative with respect to N.
The scale effect in part (a) says that if population increases, with all productivities held constant, more labor time will be devoted to every active resource. The substitution effect in part (b) says that if productivity increases for a particular active resource r, with all other productivities and population held constant, more labor time will be devoted to r and less time will be devoted to all other active resources. The remaining parts say that the food per person y(A, N) falls when population rises, that food per person goes to zero as population goes to infinity, and that food per person has a finite upper bound equal to the largest marginal product among all of the resources, evaluated at zero labor time.
The model of this section shows why diet breadth might expand when population grows. Consider Figure 3.2, where the group population N is on the horizontal axis and the marginal products for two resources
are on the vertical axis. The graph is drawn such that the marginal product for resource 1 always exceeds the marginal product for resource 2 when the same amount of labor time is devoted to each. In the language of diet breadth models (Bettinger, Reference Bettinger2009), resource 1 is unambiguously more highly ranked.
If
in Figure 3.2, it is optimal to allocate all labor to resource 1 because then
and any attempt to transfer labor from resource 1 to resource 2 would result in less total food. This is what an economist would call a corner solution (or a boundary solution). However, when
this is no longer the case. If all labor were to be used for resource 1, we would have
and the marginal product for resource 1 would be lower than the marginal product for resource 2. This implies that if a small amount of labor were transferred from 1 to 2, the gain in food output from resource 2 would outweigh the loss in output from resource 1. In this case optimality requires that positive labor time be devoted to each resource (what an economist would call an interior solution). Furthermore, optimal time allocation requires equal marginal products; that is,
where
,
, and
.
It may seem counterintuitive that even though one resource clearly dominates in Figure 3.2, a group might want to use both. But because resource 1 has a diminishing marginal product, if enough labor is used to harvest this resource, its marginal product will eventually become so low that it is worthwhile to exploit resource 2 as well. This provides an elementary theory about the linkage between population and diet breadth: as population rises, a group eventually shifts from exploiting one resource to exploiting two. It should be clear that the same logic extends to three or more resources. We will return to this point in Section 3.10 in connection with the broad spectrum revolution.
3.7 Long-Run Equilibrium
This section extends the theory of Chapter 2 to determine the population N. We assume that the conditions for short-run equilibrium from Section 3.6 are satisfied in each time period, and we continue to treat technology and climate as exogenous. Accordingly, we continue to use the productivity notation
For simplicity we consider a closed population where migration issues do not arise.
Let time periods be indexed by
, where a time period is the length of a human generation (about 20 years). We assume adults live for one period. During this time, they produce food, have children, and then die. When the adults from period t die, they are replaced by their surviving children, who become the adults for period
.
Suppose a typical adult has
children who survive to adulthood, where y is the adult’s food income. We have already assumed that food is shared equally among adults, so y is the same for everyone. In a simple interpretation, each adult has the same number of surviving children. However, if there is random variation across adults in numbers of offspring, we can also think of
as the average number of children as long as there are enough adults that the random variation across individuals washes out. Integer problems involving the number of children are ignored. As in Chapter 2,
is a continuous and increasing function with a unique food income
such that
.
The population evolves according to
where
is food per adult as defined in Proposition 3.2(c). We impose the following technical restriction to simplify population dynamics.
Assumption 3.2 Monotone population adjustment (MPA). Suppose there is a population
such that
. Keep A constant over time and consider any
. If
then
for all
. If
then
for all
. In either case
.
This says that if the initial population
yields an income above the equilibrium level
, so population tends to rise, it will rise steadily along the path leading to the equilibrium at
. By the same token, if
yields an income below
, so that population tends to fall, it will fall steadily along the path leading to
. Thus MPA rules out oscillations around
along the adjustment path. It holds when the direct positive effect of
on
in (3.3) outweighs the indirect negative effect of
through
. Note that the latter indirect effect is negative because as N rises, y falls, and this dampens the rate of growth
.
We can now give a formal definition of long-run equilibrium. The central idea is that we need a constant population
that solves (3.3) for a given productivity vector A. We also require that the time allocation for this population satisfy the requirements for a short-run equilibrium.
Definition 3.2 A long-run equilibrium (LRE) for the productivity vector A is a population level
such that
, along with the associated SRE labor allocation
obtained from (3.2).
If the population is zero in period t, it must also be zero in period
, and this satisfies D3.2. Hence
is always an LRE. The more interesting question is whether there is a non-null LRE having
. If such an LRE exists, it must have
in order to have
. Proposition 3.3 determines when a non-null LRE can occur.
Part (a) shows that if at least one resource is productive enough, it is possible to support a positive population in the long run. Part (b) shows that it is impossible to have a positive population in the long run if no resource is productive enough to achieve food income
.
Remark:
For brevity we omit a formal definition of stability for LRE. When Proposition 3.3(a) is relevant, the fact that y(A, N) is decreasing in N, plus monotone population adjustment from A3.2, guarantees that the non-null LRE with
is globally asymptotically stable and the null LRE with
is unstable. When Proposition 3.3(b) is relevant, the null LRE with
is globally asymptotically stable.
In every non-null LRE, per capita food consumption is
. As the vector A varies with climate or technology, the long-run population
will vary but the long-run living standard will not. This is the Malthusian feature of the model: an improvement in climate or technology increases food consumption per person in the short run but leads to a larger population with unchanged food per person in the long run.
Figure 3.3 illustrates the results from Proposition 3.3. When some natural resources are sufficiently abundant and/or technology is sufficiently productive, the curve
has a vertical intercept above
and a positive LRE population
exists. The arrows show directions of population adjustment and indicate the stability of
. However, if resources are too scarce and/or technology is too unproductive, the vertical intercept for
is at or below
and the only LRE is the one with zero population.
A concept that will be useful later is the habitation boundary. Suppose there is just one food resource and just one technique for obtaining it. The productivity A is then a scalar rather than a vector. In general, there is a boundary productivity level
with the feature that
implies a zero long run population and
implies a positive long run population. For a given technology,
determines a boundary between a geographic region where natural resources are sufficient to support a positive human population and an adjacent region where they are not. For example, there could be a latitude in northern Asia beyond which foragers cannot survive because natural resources are too scarce. The habitable geographic region will expand when technology improves, a point to which we return in Section 3.10 when discussing the colonization of harsh environments.
3.8 Very-Long-Run Equilibrium
The models in Sections 3.6 and 3.7 showed how time allocation and population could be treated as endogenous variables. Here we continue down this path by making technology endogenous as well, leaving only climate as a source of exogenous shocks.
We begin with an informal interpretation of the model developed thus far. Let
be the number of adults at the start of period t and let
be their technology. The adults observe the climate
. Together,
and
determine the productivities
. The adults then collectively choose a time allocation
that maximizes total food output. This also maximizes the per capita food income
. Early in the period, the adults form couples and the flow of food
determines the number of live births per couple. Later in the period, these children begin to observe the production activities of the adults. At the same time the flow of food
determines how many children survive to adulthood. When children become adults, their parents die and the period ends. The
surviving children become the adults of period
. They compare techniques for harvesting each resource and choose the best possible repertoire
. The process then repeats.
We use the term “very-long-run equilibrium” (VLRE) for a situation in which the LRE requirements are satisfied and in addition, the prevailing technological repertoire K is transmitted to the next generation with probability one. Proposition 3.1 showed that for an active resource
, there is a positive probability of technical progress whenever
. This implies that in a VLRE, the equilibrium repertoire K must include the ideal string
for every active resource, so that productivity for each of these resources is at its upper bound. Moreover, given prevailing technical knowledge, it must be optimal to assign a zero labor input to each latent resource (nr = 0).
This leads to the following formal definition.
Definition 3.3 A very-long-run equilibrium (VLRE) for the fixed climate vector a > 0 is an array (K, N, n) with the following features:
| (a) | (stationary technology) | kr = kr* for all r such that nr > 0; |
| (b) | (stationary population) | N = N[A(a, K)] obeys Proposition 3.3; |
| (c) | (optimal time allocation) | n = n[A(a, K), N] solves (3.2). |
We say that (K, N, n) is a null VLRE if N = 0 and a non-null VLRE if N > 0.
One interpretive remark is important here. The term “very long run” might suggest that technology evolves on a longer time scale than population does. This is not the case. In reality, technology and population evolve on the same time scale (one human generation at a time), according to the dynamic rules described in Sections 3.5 and 3.7. However, for analytic purposes it is often useful to hold technology constant while allowing population to evolve. This is the setting of the long-run model in Section 3.7. In the very long run, by contrast, both variables are allowed to evolve simultaneously (both are endogenous).
To characterize the set of VLREs, we need additional terminology and notation. Let S ⊆ {1 . . R} be a non-empty set of resources. A VLRE is said to be of type S if kr = kr* for r ∈ S and kr ≠ kr* for r ∉ S. All opportunities for technical progress have been exhausted for the resources in the set S but such opportunities remain available for the resources not in this set. If the ideal string kr* is available for resource r, we use the notation Ar* for the associated productivity argr(kr*).
Next we construct a specific repertoire of type S. Let krmin be the string having minimal productivity for resource r. Define the repertoire KS by setting krS = kr* for r ∈ S and krS = krmin for r ∉ S. Thus, we assign the best possible technique to each resource in the set S and the worst possible technique to each resource not in S. The next proposition describes the conditions under which there is a non-trivial VLRE of type S with the specific repertoire KS. It also shows that all of the other VLREs of type S are essentially identical, in the sense that they have the same population and time allocation.
Proposition 3.4
(very-long-run equilibrium).
Fix the climate vector a > 0. Let AS be the productivity vector associated with KS, let NS = N(AS) be the LRE population level for AS, and let nS = n(AS, NS) be the SRE labor allocation for (AS, NS). The array (KS, NS, nS) is a non-null VLRE of type S if
(a) Ar*fr′(0) > y* for some r ∈ S and
(b) HN(AS, NS) ≥ Ar(krmin)fr′(0) for all r ∉ S.
Every other non-null VLRE of type S has the same population NS > 0 and the same labor allocation nS. If either (a) or (b) fails to hold, any VLRE of type S is null.
Part (a) says that there must be at least one resource for which the ideal string kr* is available, and the resource can support a positive population at the food income y*. If the latter condition is not satisfied, the equilibrium population must be zero. Part (b) says that the marginal product of labor for the society as a whole (HN) must be at or above the marginal product for every latent resource. If this were not true, one could increase food output by using some latent resource, so time allocation would not be optimal. The only differences among non-null VLREs of type S involve techniques for the latent resources r ∉ S. These must be sufficiently unproductive but are otherwise indeterminate.
A resource that satisfies Ar*fr′(0) > y* as in condition (a) of Proposition 3.4 will be called a staple. Such a resource can support a positive population even in the absence of any other resource. Every non-null VLRE of type S must have at least one staple in the set S. It can be shown that every staple in S must be active. Any resources with Ar*fr′(0) ≤ y* will be called supplements. The set S could include one or more supplements, and these resources may be either active or latent.
Every resource r ∉ S is latent, regardless of whether it is a staple or a supplement. Condition (b) from Proposition 3.4 requires that for each of these resources, there must be a harvesting method so unproductive that it would not be optimal to exploit the resource. It seems plausible that for each resource there is some highly unproductive technique of this kind. As a result, many different VLREs could exist, with different resources being active and latent (or more precisely, different specifications of the set S).
The following corollary to Proposition 3.4 provides a general test for the existence of a VLRE with positive population.
Corollary 3.1
Let K* be the repertoire where the ideal string is available for every resource, with N* = N(A*) being the associated population from LRE and n* = n(A*, N*) being the associated time allocation from SRE. If max {Ar*fr′(0)} > y* then (K*, N*, n*) is a non-null VLRE. If max {Ar*fr′(0)} ≤ y* then every VLRE is null.
The corollary says that whenever at least one staple exists, there is a non-null VLRE of the form (K*, N*, n*). We call this the maximal VLRE because although other VLREs could exist, no VLRE can support a larger population, and any VLRE with fewer active resources will have a smaller population. A VLRE with population below the maximum associated with (K*, N*, n*) will be called a stagnation trap, because a larger population could be supported if suitable technological knowledge were available. When no staple exists, even ideal technical knowledge is inadequate to maintain a viable population.
3.9 The Effects of Climate Change
We are now ready to address a central theoretical question: What environmental conditions are most conducive to technical progress? In particular, can climate change help a society escape from a stagnation trap?
The first task is to show that the system converges to a VLRE from any initial state. This is done in Proposition 3.5. The second task is to study the impact of climate shocks on technology, population, and labor allocation. Proposition 3.6 provides such an analysis for neutral shocks that affect all resources in the same proportion. We show that in this case, positive shocks can stimulate progress while negative shocks cannot.
We then discuss biased shocks in a setting with two resources. Our analysis shows that negative shocks biased toward latent resources can generate progress, but this depends on the outcome of a race between rising productivity and declining population. Finally, we discuss the possibility of regress when active resources shut down.
Before defining convergence to a VLRE, we need to spell out how techniques and population are updated over time for a fixed array of resource abundances a = (a1 . . aR) > 0 called “climate.” Let (Kt, Nt) be the state in period t. We obtain (Kt+1, Nt+1) as follows.
(a) Kt determines the productivities At = [a1g1(k1t) . . aRgR(kRt)].
(b) At and Nt determine SRE labor allocation nt and food output H(At, Nt) as in (3.2).
(c) Nt and H(At, Nt)/Nt ≡ y(At, Nt) determine Nt+1 as in (3.3).
(d) Kt, nt, and Nt+1 determine the probability distribution over Kt+1 as in Proposition 3.1. This yields a new state (Kt+1, Nt+1).
Proposition 3.1 is expressed using qrt (the number of digits in krt that match the ideal string kr*), but it generates a probability distribution for krt+1 conditional on krt because there is an equal probability of mutation at every locus of krt that does not already match kr*. We assume existing strings for latent resources are retained with probability one. The laws of motion in (a)–(d) above yield the following results.
Proposition 3.5
(convergence to very-long-run equilibrium).
Assume ρ(y) > 0 for all y > 0. Fix the climate a > 0 and consider any initial state (K0, N0) with N0 > 0.
(a) Each sample path {Kt, Nt} has some finite T ≥ 0 and K′ such that Kt = K′ for all t ≥ T. Along a fixed sample path, {Nt} → N′ = N(A′) and {nt} → n′ = n(A′, N′). We will call K′ the terminal repertoire for the sample path and A′ = A(K′) the terminal productivity vector. We also say K′ generates (K′, N′, n′).
(b) With probability one, the terminal array (K′, N′, n′) is a VLRE.
(c) If N0 < N[A(K0)] then {Nt} is increasing. If N0 = N[A(K0)] then {Nt} is non-decreasing. If N0 > N[A(K0)] then {Nt} may decrease for all t ≥ 0, or it may decrease until some T > 0 and become non-decreasing for t ≥ T.
Part (a) shows that the system always reaches a stationary technology in finite time. Once this occurs, population converges to the corresponding Malthusian level and the allocation of labor time converges to the corresponding optimum for this technology and population. Because regress is impossible for active resources and strings remain constant for latent resources, the productivity vector {At} must be non-decreasing.
We know from part (b) that the system converges to some VLRE. However, it is difficult to say much about the probability of converging to any specific VLRE because this depends in a complex way on how mutations affect the productivities, which affect population dynamics, which feed back to the mutation probabilities as in Proposition 3.1.
Part (c) does provide some information about the population dynamics. There are three possibilities. If population is initially below the LRE level, it always rises, because Malthusian forces push population in this direction and could be reinforced by technical progress. If population is initially at the LRE level, it cannot decrease and may remain at a constant level (with no technical progress) or rise (if progress occurs). If population is initially above the LRE level, it may fall forever (for Malthusian reasons), or it may fall for some finite number of periods and then never fall again (because technical progress eventually outweighs Malthusian tendencies).
We next consider responses to climate shocks and associated changes in resource abundances. Recall that technical progress enables a society to support a larger long-run population with a given set of natural resources. This definition requires us to distinguish population growth due to technical change (holding climate constant) from population growth due to climate change (holding technology constant).
Our approach is based on the following thought experiment. Let the system be in a VLRE corresponding to some initial climate state a0. Suppose the climate jumps from the initial state a0 to a new state a′ and stays in the new state for a long time. The system will eventually converge to a VLRE for the new climate state. Finally, suppose climate jumps back to the original state a0 and stays there for a long time. The system will now converge to a VLRE for the original climate. If the population in the final equilibrium is higher than in the initial equilibrium, this can only be due to technical progress along the adjustment path.
The next proposition carries out this thought experiment for a case where climate change influences all productivities in the same proportion. Hence the abundances of all plant and animal species move up or down together, so nature as a whole becomes either more generous (positive shock) or less generous (negative shock). In either situation, the shock is neutral because we are ruling out substitution effects among the resources.
Proposition 3.6
(neutral shocks).
Let
be a non-null VLRE for the climate
. Define
and consider a permanent climate change
where
.
(a) Negative shocks. Suppose
. Assume
so a positive long-run population can be supported under the new climate without any technical change. The system converges to the new
with
,
, and
. The active resources in
are a subset of the active resources in
. If the climate
is permanently restored starting from
, the system converges to the original
.(b) Positive shocks. Suppose
. The system converges to a new
where
iff(*)
for some r such that
.The new population satisfies
, where
iff (*) holds. If
is permanently restored starting from
and (*) does not hold, the system converges to the original
. If
is permanently restored starting from
and (*) does hold, the system converges to some
where
and
. The last inequality is strict iff
. In this case, some resource with
has
.
Proposition 3.6(a) shows that a negative neutral shock cannot stimulate technical progress, because it cannot change the repertoire K. Population falls and some of the previously active resources may become latent (the diet may narrow). Reversing the shock returns the system to the original VLRE and restores the previous population, as long as the society has not gone extinct in the meantime. Restoration of the status quo population reflects the absence of technical progress in response to the climate change.
Proposition 3.6(b) shows that a positive neutral shock can stimulate permanent progress. A necessary and sufficient condition for this outcome is that the shock must lead to exploitation of a latent resource whose technique can be improved. This cannot occur in the short run through substitution effects because relative resource abundances are held constant. Instead, the key causal channel is a scale effect involving population.
Without technical change, the improved climate would lead to a larger population
in the long run. This population growth could make one or more latent resources active, as explained at the end of Section 3.6. If so, the technical repertoire K improves through learning by doing, and population expands beyond the level
induced by the climate change alone. But if the scale effect is too small to activate a latent resource, or productivity is already at its upper bound for each newly active resource, technology remains static and population grows only to the extent that climate permits.
If the climate returns to its original state
and technology has not improved in the meantime
, clearly population must return to its original level
. The same is true if technology improves as a result of the climate shock, but not by enough to alter the set of active resources used in the original climate
. For progress to become visible in the population level
after climate reverts back to
, the string for at least one previously latent resource must improve to a point where the old labor allocation
is no longer optimal at the old population level
. In this case, at least one new resource will be used after the climate returns to
. Simultaneously, some of the resources that were previously active at the climate
may be abandoned due to substitution effects.
Provided that strings for latent resources are conserved, the new technical plateau will be permanent. Proposition 3.6(a) shows that subsequent negative shocks cannot force a technological retreat. The result is a ratchet effect in which technological knowledge gradually improves. But unlike conventional growth models, our framework predicts a punctuated process where intermittent productivity gains stimulated by positive climate shocks can be separated by long periods of stagnation during which technology does not change and the average population growth rate is zero. During these intervals population will fluctuate in response to climate, and individual resources may go in and out of use as a result, but there is no lasting improvement in technological capabilities.
Thus far we have made two key assumptions: that climate shocks are neutral and that strings for latent resources are conserved. In the rest of this section we consider the consequences of relaxing each assumption.
Shocks biased toward or against particular resources create short-run substitution effects that can activate latent resources even before population has time to adjust. It will be convenient to discuss these effects in the context of two resources (R = 2). In all cases we start from an initial VLRE (K0, N0, n0) associated with climate a0 in which resource 1 is active and resource 2 is latent, and we assume the new climate is permanent. Because a negative shock to a latent resource cannot affect the system, we ignore this case.
A Positive Shock to the Latent Resource:
There is an immediate substitution effect away from resource 1 toward resource 2 with N0 constant. If the shock is large enough, n20 > 0 will occur (otherwise the shock is irrelevant). In the long run, population grows and the productivity A2 rises. For both reasons, n2t increases. With probability one, the ideal string k2* is eventually identified and the ideal repertoire K* is achieved.
A Positive Shock to the Active Resource:
An immediate substitution effect keeps resource 1 active and reinforces the latency of resource 2. However, population grows in the long run and this scale effect may eventually outweigh the substitution effect to give n2T > 0 at some T > 0. If resource 2 ever becomes active, it remains so, and the analysis from that point on is identical to the preceding case.
A Negative Shock to the Active Resource:
An immediate substitution effect favors resource 2 at the expense of resource 1. If the shock is large enough, n20 > 0 will occur. There are then two possibilities: (a) n2t > 0 for all t ≥ 0 or (b) there is a T > 0 such that n2t = 0 for all t ≥ T. Case (a) occurs when technical progress for resource 2 is rapid enough to outweigh the population decline resulting from the negative shock. In this scenario, population could eventually begin to grow and the ideal repertoire K* could be achieved. Case (b) occurs when technical progress for resource 2 is too slow, so that this resource eventually shuts down due to the declining population. This aborts further progress and leads to a new VLRE in which resource 2 is again latent.
These conclusions are all tilted in favor of progress by our assumption that strings for latent resources remain intact. This assumption is appealing on grounds of tractability and because it is the limiting case of Proposition 3.1. However, techniques may deteriorate if they are passed down through oral traditions in the absence of any practical experience with the resource in question. Henrich (Reference Henrich2004) cites Tasmania as a prominent example of technical regress. Among non-literate societies, Aiyar et al. (Reference Aiyar, Dalgaard and Moav2008) mention Tasmania and Easter Island. Boyd et al. (Reference Boyd, Richerson and Henrich2011) mention Tasmania, other Pacific islands, and the Polar Inuit of northwest Greenland. These cases all involve small population sizes and isolation from external reservoirs of technical knowledge. Such conditions could have been relatively common in the Upper Paleolithic.
An extreme way to introduce regress into our model would be to assume that if a resource shuts down, the associated string drops out of the technical repertoire. Then the only way to revive the resource would be to borrow a string already in use for an active resource. Using a two-resource framework, it can be shown that in this situation negative climate shocks can lead to regress and the population may not entirely recover when such shocks are reversed. A less extreme approach would allow random mutations leading to the deterioration of previous strings (for other modeling frameworks, see Henrich, Reference Henrich2004; Aiyar et al., Reference Aiyar, Dalgaard and Moav2008).
With these ideas in mind, we can identify a number of prerequisites for sustained technical progress. First, climate shocks must be large enough to trigger experimentation with new resources. Second, they should permit the preservation of existing knowledge. Strong substitution effects that flip a society’s time allocation from one corner solution to another can interrupt the process of “remembering by doing.” Third, a new climate state must persist long enough for new techniques to develop. If a shock is brief, there is little opportunity for productivity to rise for new resources, and the system will return to the previous equilibrium once the status quo is restored. As was shown in Proposition 3.1, productivity gains are accelerated if a society allocates a large amount of labor time to newly active resources. It is also helpful for population to grow quickly in response to positive shocks and not decline too rapidly in response to negative shocks.
We conjecture that in the Upper Paleolithic, technical innovation would have been more common in temperate regions that were more vulnerable to large climate shocks, as compared with tropical regions that were probably buffered from such shocks, at least to some degree. Future archaological research may shed light on this hypothesis.
3.10 Archaeological Applications
The theoretical framework constructed in this chapter can shed light on instances of technological innovation in the archaeological record. We will consider two examples here: the broad spectrum revolution, and colonization of the Arctic.
The Broad Spectrum Revolution:
The BSR refers to expansion in the set of animal and plant species used for food. First we consider animal prey. As shown in Figure 3.1, during about 64–32 KYA the world climate entered a milder phase within the prevailing glacial period. This led to population growth in southwest Asia between 60–44 KYA (Stiner et al., Reference Stiner, Munro and Surovell2000). Stiner et al. (Reference Stiner, Munro, Surovell, Tchernov and Bar-Yosef1999, 193) observe that “in western Asia … human populations increased substantially before the remarkable and rapid technologic innovations (radiations) that mark the Upper and Epi-Paleolithic periods.” The diet began to include birds in Italy around 35 KYA. Such prey were previously available but had not been exploited. Similarly the diet began to include birds in modern Israel around 28–26 KYA (Stiner et al., Reference Stiner, Munro and Surovell2000). Stiner et al. link this dietary expansion to the diffusion and refinement of blade technology, and perhaps also population growth.
These events preceded the Last Glacial Maximum at about 21 KYA, a very large negative climate shock that depressed population in much of the world (see Sections 3.2–3.3). Climate recovery after the LGM led to substantial population growth in southwest Asia and Europe (Gamble et al., Reference Gamble, Davies, Pettitt, Hazelwood and Richards2005). In southwest Asia the period after the LGM was warm and wet, with increased population density (Bar-Yosef, Reference Bar-Yosef, Bellwood and Renfrew2002b, Reference Bar-Yosef, Fitzhugh and Habu2002c).
By 19–17 KYA hares and rabbits had been added to the diet in Italy, and the same occurred by 13–11 KYA in southwest Asia (Stiner et al., Reference Stiner, Munro and Surovell2000). As with birds, these prey species had previously been available but remained latent until the climate moderated and population grew. Stiner (Reference Richards, Pettitt, Stiner and Trinkaus2001, 6996) comments: “early indications of expanding diets in the eastern Mediterranean precede rather than follow the evolution of the kinds of tools (specialized projectile tips, nets, and other traps) needed to capture quick small animals efficiently.” Fresh water aquatic foods (fish, mollusks, waterfowl) were exploited before the LGM, but in western Asia and southern Europe their use intensified after the LGM (Richards et al., Reference Richards, Pettitt, Stiner and Trinkaus2001). By about 13 KYA, people in southwest Asia consumed a very wide range of animal prey (Savard et al., Reference Savard, Nesbitt and Jones2006).
As we explained at the end of Section 3.6 in connection with Figure 3.2, it is not hard to see how population growth can lead to a broader diet. The results of Section 3.9 add two things: the role of climate improvement as a stimulus for population growth, and the refinement of techniques after latent resources become active. To put the argument in a nutshell: A positive neutral climate shock, if sufficiently large and persistent, can start a causal cascade with (a) population growth, (b) broader diet, and (c) technical progress. The latter can feed back to population growth, generating further rounds of (a), (b), and (c) until the system settles into a new equilibrium. This does not require exact climate neutrality. All we need is for the abundance of resources to rise and fall in a correlated way, as one might expect for major climate shifts lasting thousands of years. It appears likely that this mechanism was operative at least twice for animal resources, once during the warm phase before the LGM and a second time during the recovery afterward.
Dietary breadth with respect to plant foods is more controversial. It is clear that at the LGM, inhabitants of southwest Asia relied heavily on small-seeded grasses (Weiss et al., Reference Weiss, Wetterstrom, Nadel, Bar–Yosef and Smith2004). As climate improved afterward, large-seeded grasses became more common (especially wild wheat, barley, and rye). According to a school of thought represented by Weiss et al., people increasingly specialized in large-seeded grasses, which smoothed the path to agriculture. Another school of thought represented by Savard et al. (Reference Savard, Nesbitt and Jones2006) argues that a combination of climate change and population growth led to a gradual broadening of the diet to include more plant species. In this perspective large-seeded grasses were a minor part of the Epi-Paleolithic diet, despite their later centrality for agriculture.
In a re-analysis of data from sites in southwest Asia shortly before agriculture, Savard et al. (Reference Savard, Nesbitt and Jones2006) find that the plant component of the diet was highly diverse at each site, and also that there was substantial diversity in the composition of the diet across sites. The increased abundance of large-seeded grasses after the LGM did not drive out consumption of the small-seeded grasses that were important at the LGM. The results for the region as a whole “fit with elements of … the broad spectrum model” (Savard et al., Reference Savard, Nesbitt and Jones2006, 192). In a similar vein, Hillman et al. (Reference Hillman, Hedges, Moore, Colledge and Pettitt2001) report that just before the start of cultivation, more than 100 species of edible seeds and fruits were used in southwest Asia. This broad pattern of plant exploitation was accompanied by considerable technological paraphernalia, such as querns, mortars, pestles, bowls, grinders, pounders, whetstones, choppers, and sickles. Again, this is consistent with our expectations from the model in Section 3.9. Recovery from the LGM led to population growth, use of previously latent plant resources, and improvements in the techniques used to exploit those resources.
The original BSR hypothesis from archaeology involved exogenous population pressure on ecologically marginal areas (Flannery, Reference Flannery, Ucko and Dimbleby1969). In our hypothesis the BSR is triggered by positive climate change and the resulting endogenous population growth is not “pressure.” Rather, it is a response to temporarily higher living standards and leads to greater dietary breadth as people find it more attractive to exploit latent resources. This stimulates technological innovation, which allows more efficient harvesting of the new resources. By contrast with theories based on exogenous population growth, we would not expect any systematic decrease in living standards over long time scales. We could incorporate resource depletion into the model, but this is not an essential feature of our framework. One advantage of our approach in comparison with theories based upon exogenous population growth is that we can explain the timing of BSR episodes by tracing them to shifts in the prevailing climate regime.
Colonization of the Arctic:
Occupation of the Eurasian steppe, Siberia, and the Americas required foragers to overcome severe resource constraints. Novel techniques had to be developed to keep warm, secure shelter, and hunt prey. For a vivid description of the technical problems confronting the Inuit of the Central Arctic and the sophisticated nature of their solutions, see Boyd et al. (Reference Boyd, Richerson and Henrich2011). Here we focus on northeast Asia, which includes northern China, Mongolia, and Siberia.
The relevant period begins around 47 KYA and extends beyond the Last Glacial Maximum. Numerous climate fluctuations occurred over this time (data in this paragraph are from Brantingham et al., Reference Brantingham, Kerry, Krivoshapkin, Kuzmin and Madsen2004). A cold-dry glacial event ended around 52–47 KYA, followed by a cool and wet period during 47–25 KYA. This in turn was followed by a second cold-dry glacial event, the LGM, from 24–20 KYA. The cool-wet intermediate period was associated with lake expansion, the partial replacement of deserts by steppe grasslands, and northward movement in the boundary between forest and tundra. These developments were reversed in the LGM. Also in the cool-wet phase from 47–25 KYA, there were millennial-scale oscillations, with warm events during 37–35 KYA and 28–27 KYA separated by a cold event centered around 32 KYA.
Southern Siberia was permanently occupied by at least 44 KYA. (Information in this paragraph is from Kuzmin and Keates, Reference Kuzmin and Keates2005. The radiocarbon dates given by these authors appear to be uncalibrated. To obtain dates comparable to those used elsewhere in this section, one should add about 2,000 years.) Northern Siberia was colonized around 34–27 KYA. According to Kuzmin and Keates, the number of occupation episodes for Siberia as a whole stayed roughly constant through the LGM, but then rose substantially during 16–12 KYA. Other authors argue that the LGM led to a large population decline in Siberia, as will be discussed below. Kuzmin and Keates reject the assertion that there were successive waves of colonization in Siberia and argue for steady population growth without noticeable minima or maxima, although at varying rates over time.
A transition from Middle to Initial Upper Paleolithic technology has been found at the site of Kara Bom in the Altai region of southern Siberia dating to about 43 KYA (data in this paragraph are from Brantingham et al., Reference Brantingham, Krivoshapkin, Jinzeng and Tserendagva2001, except as indicated). UP features did not completely replace those of the Middle Paleolithic (MP), but an increased emphasis on blade production is evident. Similar Upper Paleolithic technology has been found at two sites in Mongolia dated 33–27 KYA and in northern China around 25 KYA. The Initial UP technology in northeast Asia was associated with wider foraging areas and also the use of non-local raw materials. Upper Paleolithic technology was being used in the Siberian Arctic by around 32 KYA (Goebel et al., Reference Goebel, Waters and O’Rourke2008).
Brantingham et al. (Reference Brantingham, Kerry, Krivoshapkin, Kuzmin and Madsen2004) note an apparent correlation between climate events and the numbers of radiocarbon dates assigned to temporal bins, with a peak in human activity around the warming at 34 KYA, a subsequent decline at the colder spell of 32 KYA, another peak just before the LGM, and a dramatic decline (suggesting a “severe population bottleneck”) during the LGM. With respect to technology, they argue that classic Upper Paleolithic features including microblades, bone/antler/ivory technology, portable art, and indications of structures “emerged in a rapid, punctuated fashion” (270). This was especially true for microblades. The classic UP did not coalesce until 26 KYA, approximately 20,000 years after the initial steps in that direction, and only lasted from 26 to 22 KYA before being overtaken by the severe climate shock of the LGM.
Brantingham et al. (Reference Brantingham, Kerry, Krivoshapkin, Kuzmin and Madsen2004) find that traditional stone tools tend to cluster around 51o N. Microblades and the other advanced technologies of the preceding paragraph tend to cluster further north, around 53–55o N. Based on a sample of all types of technological artifacts, the authors estimate that human occupation expanded north into the sub-Arctic at a rate of 89 km per 1,000 radiocarbon years. Older types of stone tools expanded at a rate of 33–44 km per 1,000 years, while the bone-antler-ivory technology and portable art expanded at a rate of 56 km per 1,000 years. Microblades and structures expanded north at a very rapid rate shortly before the LGM.
To see how our theory can explain the colonization of extreme environments like Siberia, recall the concept of a habitation boundary introduced at the end of Section 3.7. Suppose initially there is only one food resource, and food becomes less abundant as one moves from south to north. With a given technology, there will be some latitude beyond which natural resources cannot support a positive population. But a superior technology can compensate for scarcity of resources, so if food acquisition methods improve while the environment remains unchanged, the habitation boundary will shift to the north.
Our theory is somewhat more complicated because in a very-long-run equilibrium each active resource must already be at its maximum productivity level. But if there are many distinct resources, with some active and the rest latent, the technological repertoire as a whole can improve. As we showed in Section 3.9, this can occur through a sequence of positive and negative climate shocks that generate a ratchet effect, whereby knowledge improves for each latent resource after it becomes active. This enables people to expand into latitudes that were too hostile with the previous repertoire. The northward expansion will open up Arctic food resources that were not available further south, these new latent resources can also become active, and again techniques can improve.
At present we cannot identify the particular food resources that shifted from latent to active status in the course of sub-Arctic and Arctic colonization. However, the climate fluctuations, population trends, and technological developments in northeast Asia appear to be consistent with our theoretical framework. Similar dynamics could have played out for migration into other challenging environments, such as deserts or tropical rainforests.
3.11 Conclusion
We have developed a theory to explain why technological progress was slow and sporadic during the Upper Paleolithic. The theory presented in Sections 3.5–3.9 satisfies the following criteria from Section 3.4.
(a) Technology is endogenous and the rate of productivity growth in very-long-run equilibrium is zero. However, occasional technical progress can occur.
(b) Population is endogenous and described by Malthusian dynamics.
(c) Climate change is the main driver of changes in technology and population.
(d) The model is consistent with the climate history in Section 3.2.
(e) The model is consistent with the population dynamics in Section 3.3.
(f) The model suggests explanations for the broad spectrum revolution and the gradual colonization of extreme environments.
In our framework, foragers exploit a subset of the resources provided by nature. Although they often reach high productivity levels for these resources, further advances require the use of latent resources, which is not attractive in light of existing knowledge. We call this a stagnation trap. Climate shocks can cause foragers to experiment with latent resources. This generates punctuated equilibria where foragers acquire enhanced technical capabilities and higher populations at successive plateaus. If a foraging society can retain enough knowledge of inactive techniques, it may benefit from a ratchet effect through which technology progresses over archaeological time. We will build on these ideas to explain the origins of sedentism and agriculture in Chapters 4 and 5.
3.12 Postscript
This chapter is based on the article “Stagnation and innovation before agriculture” by Dow and Reed (Reference Dow and Reed2011) in the Journal of Economic Behavior and Organization. We received advice on drafts of the original article from Matthew Baker, Cliff Bekar, Mark Collard, Brian Hayden, Stephen Shennan, Lawrence Straus, an anonymous referee, and colleagues at Simon Fraser University, the University of Copenhagen, and the Canadian Network for Economic History. Scott Skjei assisted with the research, and the Social Sciences and Humanities Research Council of Canada provided financial support. As always, the opinions are those of the authors.
Sections 3.1–3.4 are new for this volume, as are the graphs. References to the archaeological literature have been updated and Section 3.10 has been entirely rewritten. The formal model is identical to the one in Dow and Reed (Reference Dow and Reed2011). When we constructed the original model, we were aware of the climate history in Section 3.2, some of the early findings from paleodemography (although none of the specific studies in Section 3.3), and the evidence on the broad spectrum revolution in Section 3.10. We were not aware of the information about Siberian climate, population, or technology.
4.1 Introduction
From the origins of anatomically modern humans until after the Last Glacial Maximum around 21,000 years ago, almost everyone lived in small, mobile foraging bands. Starting around 15,000 years ago, foragers in some regions such as southwest Asia and Japan began to develop permanent settlements. Sedentary foraging predated agriculture by several millennia and accelerated with the onset of the Holocene 11,600 years ago, which brought a warmer, wetter and more stable climate. The best evidence for early sedentism is from temperate zones. Among modern hunter-gatherer societies, those in tropical rainforests and the Arctic have remained the most mobile (Kelly, Reference Kelly2013a).
Sedentism can be defined in various ways and is a matter of degree, so we need to clarify our use of this term. First, it is important to recognize that mobile foraging groups do not just move at random across the landscape. A common pattern involves the use of seasonally shifting base camps over an annual cycle, with hunting and gathering on trips away from each base camp. When anthropologists and archaeologists refer to sedentism, they often mean settlements that are at least partially occupied year-round.
Evidence used by archaeologists to infer sedentism at a site includes the presence of plants and animals from all four seasons; the presence of species that flourish when in frequent contact with humans (e.g., mice, rats, and sparrows); substantial investments in dwellings, earthworks, ceremonial structures, or monuments; greater use of cemeteries; and site-specific investments in food processing and storage facilities. Marshall (Reference Marshall2006) cites the durability of dwellings and the size of settlements as the most crucial markers.
While we accept these indicators of sedentism, we require more than simply the year-round occupation of a site. A time period in our model is defined to be one human generation. We are therefore interested in the conditions under which adult children stay in the same location as their parents, even when facing multi-year environmental shocks that reduce the abundance of local food resources. We call a community sedentary if it is robust to such negative shocks over periods lasting decades or centuries. We sometimes refer to this phenomenon as sustained sedentism to differentiate it from more temporary or fragile forms of sedentism. A time interval lasting generations or centuries makes it possible for major structures to be built, for burial practices to develop, for the regional population to grow, and for new technologies and institutions to evolve.
Why was sustained sedentism an important transition, one that (as in the words of our subtitle) shaped the world? The development of large permanent communities was a massive social upheaval relative to the ancestral lifestyle of small mobile bands. We will unpack some of the implications in what follows.
Ethnographic data indicate that group sizes among sedentary foragers are much larger than for mobile foragers (Kelly, Reference Kelly2013a, 171–172). Sedentism is also correlated with other important variables. Using original data from Keeley (Reference Keeley1988, Reference Keeley, Barton, Roberts and Roe1991), Rowley-Conwy (Reference Rowley-Conwy, Panter-Brick, Layton and Rowley-Conwy2001, 40–44) shows that ethnographically known foragers fall into two distinct clusters. Those with low sedentism have low population relative to natural productivity, low use of food storage, and low stratification. Those with high sedentism have high levels on these other dimensions. A classic example of the latter kind is provided by the societies on the northwest coast of North America (Ames and Maschner, Reference Ames and Maschner1999). Such societies illustrate Kelly’s point that sedentary foraging is correlated with “social hierarchies and hereditary leadership, political dominance, gender inequality, and unequal access to resources” (Reference Ferguson and Fry2013a, 104). Anthropological evidence strongly suggests that even if sedentary foraging had never led to agriculture, it would still have led to economic inequality, and it would probably have led to greater warfare over land. We pursue these ideas in Chapters 6–8.
The other (and larger) reason why sustained sedentary foraging shaped the world is that at least in some cases, it was a precursor to agriculture. We want to be careful in our comments on this matter because there is a long and unhelpful tradition of conflating sedentary foraging with agriculture. Sloppy language such as “the transition to sedentary agriculture” should therefore be avoided. It is essential to recognize that the transition to sedentary foraging was distinct from the transition to agriculture, and requires a distinct theoretical explanation (Marshall, Reference Marshall2006). Indeed, we will propose in this chapter that the transition to sedentism was largely driven by positive climate trends, while we argue in Chapter 5 that transitions to agriculture were often driven by negative climate shocks.
Sedentary foraging is clearly not a sufficient condition for a subsequent transition to agriculture. For example, millennia of sedentary foraging never led to indigenous agriculture in Japan or along the northwest coast of North America. When domesticates finally arrived, they were imported from elsewhere. We discuss the case of Japan further in Section 4.3. Nor is year-round sedentism a necessary precondition for agriculture. In the American Southwest, for example, imported domesticates like maize appear to have been integrated into a foraging system involving seasonal mobility, with truly sedentary villages arising a millennium or more later (Wills, Reference Wills1988).
Nevertheless, we believe that sustained sedentism contributed to several pristine agricultural transitions. One leading example is southwest Asia (see Section 4.3). In this region, sedentism supported a large population and led to technological innovations that were pre-adapted for cultivation. For reasons to be explained in Chapter 5, both factors made a subsequent transition to agriculture more likely. Sedentism may also have led to institutional innovations with respect to property rights that increased the probability of an agricultural trajectory, as some economists have asserted (see Section 4.2). However, robust foraging communities arose 3,000 years before agriculture and were clearly not the proximate cause of this trajectory in southwest Asia. Although sedentism helped set the stage, we will argue that climate and geography took the leading roles. Our hypothesis about the agricultural transition in southwest Asia will be presented in Chapter 5.
This chapter assumes that people in a region can freely migrate from one location to another. Thus, as discussed in Section 1.2, we are studying a transition along the main sequence from cell A to cell B in Figure 1.2, where institutions allow open access for both mobile and sedentary foragers. To be sure, the sedentary foragers described by anthropologists often exercise group property rights over valuable locations, exhibit stratification, and engage in warfare. We take up these subjects in Chapters 6–8, where we explore sedentary foraging with closed access (cell B′ in Figure 1.2) or stratification (cell B″ in Figure 1.2). But when mobile foragers began their journey toward sedentism, their institutions probably did not yet involve rigid property rights or social stratification. These are more likely to be effects of sedentism rather than causes. Therefore, we would like to know whether an initial impetus toward sedentism can develop in a setting of free mobility. We will answer in the affirmative.
What caused mobile foragers to become sedentary foragers? A common answer involves direct effects of nature. The idea is that people are mobile when the location of food resources is constantly shifting, or when important resources are not available all in one place, or when local resources are rapidly depleted. People become sedentary when nature provides a rich, diverse, and reliable assortment of resources at a single location, and these resources are not easily depleted.
We don’t doubt that this is part of the story, but it is far from the whole story. As we will see in Section 4.3, sedentism was not just a matter of people settling down in one place. It also involved new food sources, larger settlements, larger regional populations, and technological innovation. These changes often included more use of plant or aquatic foods as compared with prey animals; fixed investments in mortars, ovens, and storage facilities; and more durable housing (Kelly, Reference Kelly2013a, 122–128). How can we explain the recurrent pattern of greater dietary breadth, population growth, and technical change?
We develop a formal model that addresses these questions. We consider a region with many individual production sites, where the weather at each site can be good or bad. Good weather is associated with abundant food resources and bad weather is associated with scarce food resources. In our model, a climate regime is defined by the probability distribution over weather conditions (temperature, precipitation, etc.) at each site. For a given climate these random draws are independent across sites and time periods, so a site that is good in one period can be bad in the next. A change in climate refers to a change in the probability distribution for weather events. The rate of sedentism is the fraction of the local population that remains in place when a site switches from good to bad weather. This is our measure of how robust communities are to natural shocks.
The shift from the Last Glacial Maximum to the Holocene involved both better mean weather and decreased variance in weather (see Section 3.2; Richerson et al., Reference Richerson, Boyd and Bettinger2001; Woodward, Reference Woodward2014, chs. 8–9). These trends were sometimes associated with sedentism well before the Holocene (see the discussion of southwest Asia in Section 4.3), although more often sedentism developed in the early Holocene. Climate shifts of this kind could have led to a positive rate of sedentism through three causal mechanisms.
First, let total regional population be constant in the short run. The lower variance shrinks the productivity gap between sites with good weather and sites with bad weather. To take the simplest case, imagine that the productivity of good sites does not change but bad sites become better. If the bad sites improve enough, diminishing returns at the good sites could make it attractive for some agents to start using bad sites. It follows that when a site switches from good weather to bad weather it will no longer be entirely abandoned.
Second, in the long run the better mean weather conditions result in population growth through Malthusian dynamics. Even if the decreased variance in weather is not sufficient to cause sedentism by itself, a higher regional population could lead to some use of sites with bad weather due to diminishing returns at sites with good weather. It can thus yield a positive rate of sedentism. Neither of these two mechanisms requires technical change, and each is reversible in the sense that if climate reverts to its earlier mean and variance, the rate of sedentism will eventually return to its original level.
The third causal mechanism involves technological innovation. We assume that agents can use two methods of food collection. We will refer to these metaphorically as “hunting” (a shorthand term for mobile methods) and “gathering” (shorthand for stationary methods). Suppose that a society initially uses only hunting, and suppose climate change leads to Malthusian population growth. This will reduce the marginal product of labor in hunting. At some point gathering may begin, with new food resources being targeted. If so, learning by doing could increase the productivity of gathering over time. The rising productivity of gathering will then reinforce the population growth generated by climate change. This trajectory can lead to sedentism even when climate change and population growth alone would not. Assuming that new knowledge about food acquisition methods is retained over time, this process creates a ratchet effect such that even if climate reverts to its original state, greater dietary breadth, increased population, and new techniques of gathering can persist. The sedentism rate can thus remain above its original level.
The rest of the chapter is organized as follows. Section 4.2 reviews a number of existing theories about the origins of sedentism from both anthropology and economics. Section 4.3 reviews archaeological evidence on sedentary foraging, including the leading cases of southwest Asia and Japan.
Sections 4.4–4.10 develop our formal model. We consider three different time spans: the short run, the long run, and the very long run. In the short run, the allocation of labor across sites and food procurement methods is endogenous, while the aggregate regional population, technology, and climate are exogenous. In the long run, the regional population becomes endogenous while technology and climate remain exogenous. In the very long run, technology becomes endogenous and only climate is still exogenous. This parallels the structure of our modeling efforts in Chapter 3.
Section 4.4 describes food acquisition at an individual site. Section 4.5 considers the regional economy, which has a continuum of production sites. The sites are exposed to idiosyncratic variations in food availability due to weather shocks. Property rights are characterized by open access. As explained above, this is a logical starting point for the development of sedentism, and we see it as a reasonable approximation when population density is low and the agents can move relatively easily from one site to another through kinship ties. A short-run equilibrium (SRE) is an allocation of regional population across production sites such that no individual agent can gain by changing sites.
Section 4.6 adopts the Malthusian assumption that the population growth rate is an increasing function of food per capita, as was discussed in Chapter 2 and Section 3.7. A long-run equilibrium (LRE) is an SRE such that the regional population stays constant over time. Section 4.7 adds technological innovation where the gathering technology can improve through learning by doing, along lines discussed in Section 3.5. A very-long-run equilibrium (VLRE) is an LRE such that technological capabilities remain constant over time, as was discussed in Section 3.8.
Section 4.8 constructs a baseline VLRE that captures central features of the Last Glacial Maximum. These include a climate where weather conditions have a low mean and high variance, a low population, the use of mobile food acquisition techniques, and complete abandonment of sites whenever natural resources become locally scarce. We then consider a permanent shift to a new climate regime with a higher mean and a lower variance. Section 4.9 studies three causal mechanisms by which climate amelioration can lead to sedentism. The first operates in the short run through the reduction of variance in weather conditions across sites. The second operates in the long run through Malthusian population growth, and the third combines population growth with technical innovation.
Section 4.10 examines the consequences of sedentism for demography and living standards. Assuming sedentism made children less costly, as is often said to be true, our theory predicts a shift toward increased fertility, increased child mortality, decreased food per capita, and increased regional population. Section 4.11 follows with some comments on how our model relates to theories based on resource depletion.
Section 4.12 reviews further archaeological evidence. Section 4.13 summarizes our conclusions and discusses a few extensions of the theory. Section 4.14 is a postscript. Proofs of the formal propositions are available at cambridge.org/economicprehistory.
4.2 Theories of Sedentism
There are five main alternatives to our explanation for the origins of sedentism, three from anthropology and two from economics. Price and Brown (Reference Price and Brown1985) describe two traditional theories from anthropology, which they label as “pull” and “push.” According to the pull hypothesis sedentism arose due to resource abundance, which encouraged people to reduce their mobility. According to the push hypothesis foragers were forced to adopt sedentism by resource scarcity, which caused a shortage of food relative to population. This led to greater dietary breadth and more time devoted to harvesting and processing wild food resources, which in turn led to lower mobility. These theories have various empirical inadequacies (Kelly, Reference Kelly1992).
A third anthropological explanation is summarized by Kelly (Reference Kelly2013a), who argues that sedentism was driven by regional population growth, which led to group packing and made it harder for groups to relocate when local resources were depleted. Sedentism by one group raised the cost of moving for other groups, and therefore sedentism tended to spread. Sedentism typically emerged in places where the local resource base was hard to deplete (e.g., on coasts or rivers where aquatic resources were available). Sedentism led to further population growth through increased fertility, so once the process was set in motion it became self-reinforcing. Technology evolved toward the use of lower-value but less-easily-depleted food resources.
We agree with Kelly that population growth and technical change were mutually reinforcing in the transition to sedentism. Our model does not include resource depletion because we want to show that the qualitative evidence on the transition to sedentism can be explained without appealing to this variable. Indeed, we will argue that sedentism was triggered by the greater resource abundance brought about by a better climate. We will, however, discuss resource depletion issues further in Section 4.11. Another contrast with Kelly is that we do not emphasize the group ownership of production sites in this chapter, although we discuss and model it extensively in Chapters 6–8. We also depart from Kelly by endogenizing population through Malthusian dynamics, and by emphasizing technical ratchet effects along the lines of Chapter 3.
Acemoglu and Robinson (Reference Acemoglu and Robinson2012, 137–143) offer an economic and political story about the origins of sedentism. Their account centers on Natufian society in southwest Asia, which we will discuss in Section 4.3. Acemoglu and Robinson note that the Natufians were sedentary foragers well before they began to domesticate plants and animals. They observe that mobility was costly for the young and old, made food storage difficult, and prevented use of productive but heavy implements such as grinding stones. However, “while it might be collectively desirable to become sedentary, this doesn’t mean that it will necessarily happen. A mobile group of hunter-gatherers would have to agree to do this, or someone would have to force them” (139). “In order for sedentary life to emerge, it therefore seems plausible that hunter-gatherers would have had to be forced to settle down, and this would have to have been preceded by an institutional innovation concentrating power in the hands of a group that would become the political elite, enforce property rights, maintain order, and also benefit from their status by extracting resources from the rest of the society” (139). “The emergence of political elites most likely created the transition first to sedentary life and then to farming” (140). “Institutional changes occurred in societies quite a while before they made the transition to farming and were probably the cause both of the move to sedentarism [sic], which reinforced the institutional changes, and subsequently of the Neolithic Revolution” (141).
One can argue that there was mild inequality among the Natufians after sedentism and before agriculture, although some archaeologists call them egalitarian (Grosman and Munro, Reference Grosman, Munro, Enzel and Bar-Yosef2017, 704). There is also evidence for mild inequality in southwest Asia after the spread of agriculture during the Holocene. We will return to both points in Chapter 6. However, there is no evidence at all that the mobile foragers who preceded the sedentary Natufians had either political elites or stratification. In fact there is no evidence that any prehistoric or more recent society of mobile foragers has had a political elite or economic stratification. Acemoglu and Robinson appear to believe that an elite could have emerged prior to sedentism through random institutional drift, but they offer no further explanation as to how or why this could have happened. They also do not explain how an elite in a mobile band could have forced other band members to settle down, or what their motivation for this would have been. Chapter 6 will develop a theory about the origins of inequality that is more consistent with archaeological evidence. In this chapter, we ignore the alleged role of coercion as a causal factor in the transition to sedentary foraging.
Bowles and Choi (Reference Bowles and Choi2013, Reference Feder2019) view sedentism in the context of the transition to cultivation, which eventually led to farming communities dependent upon domesticated crops. They think sedentism initially arose at atypically rich production sites where wild resources were sufficiently concentrated to allow sedentary living, and to make defending the resources worthwhile. They also emphasize the reduction of climate volatility, which lessened the pressure for mobility.
Their causal sequence runs as follows. Initially production technology was based on mobile hunting and gathering. Climate change led to warmer and more stable weather conditions favorable to plant growth, which resulted in a few particularly well-endowed production sites. The existence of these sites motivated a transition to greater sedentism. Finally, sedentism created an environment where the coevolution of property rights and agriculture could occur. We will return to the ideas of Bowles and Choi when we discuss the origins of agriculture in Chapter 5.
4.3 Some Archaeological Evidence
This section provides background information about the rarity of sedentism at the Last Glacial Maximum and its emergence as climate conditions improved. We then take a closer look at two regions where early sedentism is well documented: southwest Asia and Japan. These cases offer guidance for our formal model. We conclude this section with some remarks on the relationship between sedentism and agriculture.
In the first three chapters, we used the notation “KYA” to abbreviate dates going back tens of thousands of years. In this chapter and those to come, it will be convenient to switch to the notation “BP,” meaning years before the present. This naturally raises the question of what is meant by the present. In archaeological publications from the second half of the twentieth century, the present was usually defined by convention to be 1950 AD (or CE, meaning “current era”). However, in the twenty-first century some scholarly work has begun to treat the year 2000 CE as the definition of the present. For the time scales of this book a difference of 50 years is negligible, because 95% confidence intervals for radiocarbon dates are often ±100 years or more. Readers can therefore enjoy the simplicity of round numbers and treat 10,000 BP as 10,000 years before 2000. As always we use calibrated (calendar) years rather than uncalibrated (radiocarbon) years unless otherwise stated.
The Last Glacial Maximum (LGM) occurred around 21,000 BP in calendar years, or around 18,000 BP in uncalibrated radiocarbon years. The contributors to two volumes edited by Gamble and Soffer (Reference Gamble and Soffer1990) and Soffer and Gamble (Reference Gamble and Soffer1990) survey archaeological evidence from this period for 30 regions of the world. Many regions had settlement sizes ranging from small (probably temporary hunting camps or tool-making workshops) to medium (probably seasonal base camps) and large (probably residential camps with occasional population aggregation). Some foraging groups in areas of high ecological diversity may have collected food within relatively compact territories, and such groups probably had higher population densities than elsewhere.
Nevertheless, in the 600+ pages of the two volumes edited by Soffer and Gamble, only one author claims evidence for a sedentary lifestyle (involving permanent structures in Siberia; see Madeyska, Reference Madeyska, Gamble and Soffer1990, 32). Archaeological evidence for sedentary communities at the time of the LGM may not have survived or may have been overlooked. But if there were such communities during this period, they were apparently rare and ephemeral.
Aside from southwest Asia and Japan, which will be discussed in detail below, sedentism arose early in the Holocene in Portugal, the Baltic region, and parts of Africa. For Portugal in the first 5,000 years of the Holocene, Straus (Reference Aikens, Akazawa, Straus, Ericksen, Erlandson and Yesner1996) reports the appearance of cemeteries, dugout structures, and large shell middens, with a highly diversified diet from hunting, fishing, other aquatic resources, and probably abundant nuts, berries, and tubers. Straus describes the settlements as “semi-sedentary.” For Eastern Europe and the Baltic region, Dolukhanov (Reference Dolukhanov, Straus, Ericksen, Erlandson and Yesner1996) doubts that year-round settlements existed in the late Pleistocene, but by the early Holocene climate change had led to “comparatively large settlements of permanent and semi-permanent character in Baltic lagoons and inland lacustrine depressions, with an economy based on the exploitation of a wide spectrum of wildlife resources” (168). Sedentary foraging arose in North Africa and East Africa around the same time (Barham and Mitchell, Reference Barham and Mitchell2008, 351–355).
Such evidence for early sedentism is important, but it is necessary to emphasize that in other places there was no such transition. For example, Hiscock (Reference Hiscock2008, 252–253) doubts that sedentary villages existed in Australia prior to European contact. The same was probably true for most tropical rain forests, deserts, and the Arctic.
Southwest Asia:
We begin with a short summary of climate in the Levant, which is defined to include modern Israel, Palestine, Jordan, Lebanon, and Syria, plus areas of Turkey and the Sinai Peninsula. The information in this paragraph is from Robinson et al. (Reference Acemoglu and Robinson2006). Data from many proxies such as lake levels and sediments, palaeosols, fluvial sediments, palaeobotanical records, speleothems, mollusk chemistry, calcretes, and deep-sea cores tell the following story. From the Last Glacial Maximum (23–19,000 BP) until Heinrich event 1 (16,000 BP), the Levant was cool and dry. The Bølling-Allerød interval (15,000–13,000 BP) was warmer and wetter. The Younger Dryas (12,700–11,500 BP) reverted to cold and extremely arid conditions. The early Holocene (9500–7000 BP) was warm, and also the wettest phase over the last 25,000 years. This regional climate history is consistent with the climate history for the Northern Hemisphere revealed in Greenland ice cores. For more on climate in southwest Asia in this period, see Roberts et al. (Reference Roberts, Woodbridge, Bevan, Palmisano, Shennan and Asouti2018).
The dates for initial sedentism in southwest Asia are still somewhat controversial. Some authors argue that sedentism evolved in a gradual way with foundations that trace back to the Last Glacial Maximum (Maher et al., Reference Maher, Richter and Stock2012). Others are skeptical about any evidence for sedentism before farming economies emerged in the early Holocene (Boyd, Reference Boyd2006). But most experts believe a sedentary lifestyle developed in the Bølling-Allerød period and was associated with an archaeological culture called the Early Natufian.
Kuijt and Prentiss (Reference Kuijt, Prentiss, Prentiss, Kuijt and Chatters2009) date the start of the Early Natufian to 14,900/14,600 BP in the Jordan Valley. Byrd (Reference Byrd2005, 255–262) believes sedentism began in the resource-rich Mediterranean woodland. He comments that sites were located at junctions between two environmental zones, were close to springs, and may have had about 60 inhabitants. He stresses the heightened importance of plant foods, an elaborate array of ground stone tools, the use of baskets for storage, permanent stone architecture, and complex mortuary practices. Hunting of large mammals and birds, as well as freshwater fishing, were also important. Grosman and Munro (Reference Grosman, Munro, Enzel and Bar-Yosef2017) cite evidence for permanent architecture, heavy non-mobile ground stone tools, and domestic mice, rats, and sparrows.
The Early Natufian period was accompanied by a ten-fold increase in settlement sizes (Bellwood, Reference Bellwood2005) and a “fertility explosion” despite the absence of any cultivation (Bocquet-Appel and Bar-Yosef, Reference Bocquet-Appel, Bar-Yosef, Bocquet-Appel and Bar-Yosef2008, 4–5). The central innovation with respect to lithic tools was the sickle blade, used for harvesting wild cereals (Grosman and Munro, Reference Grosman, Munro, Enzel and Bar-Yosef2017). The Natufian culture expanded northward out of its core area in Palestine/Israel through the Mediterranean forest zone at around 14,000 bp and subsequently into western Syria and Lebanon (Ibáñez et al., Reference Ibáñez, González–Urquijo, Terradas, Enzel and Bar-Yosef2017).
Valla et al. (Reference Valla, Khalaily, Samuelian, Bocquentin, Bridault, Rabinovich, Enzel and Bar-Yosef2017) provide detailed information for the site of Ain Mallaha, one of the main Natufian settlements in the Levant, which enjoyed a permanent spring, a rivulet, and marshes along the shore of a lake. Ain Mallaha is dated to 14,326 ± 266 BP. It was a hamlet with semi-buried buildings having circular walls and postholes. Several of the buildings have superimposed floors, suggesting that they were rebuilt in the same location. Graves were either under floors or in fills. The hypoplasia found on some skeletons indicates dietary deficiency or infectious diseases in childhood. Dental caries may be related to a diet rich in vegetal food. Traces of trauma on the bones are rare. The site had a broad-spectrum economy including large and small mammals, birds, reptiles, crustaceans, fish, and mollusks. Hares may have been trapped using novel techniques. Small-seeded grasses are more prominent than barley and wheat, perhaps because the latter had to be gathered 12–16 km away. The authors infer that the people of Ain Mallaha were sedentary based on the numerous graves, the use of heavy basalt pounding tools, and the wealth of resources in the surrounding area.
Using climate data from Greenland ice cores and archaeological data from the southern Levant, Maher et al. (Reference Maher, Banning and Chazan2011, 16) express skepticism about the causal connection between the onset of the Bølling-Allerød and the start of the Early Natufian, suggesting that the latter began first. But they also comment that “populations may have increased and settlements became quite large during the Early Natufian, perhaps taking advantage of the Bølling-Allerød’s higher precipitation and associated increases in available food resources, even if … the Early Natufian actually started somewhat earlier than the Bølling-Allerød” (21). They acknowledge that climate impacts may have varied across the Levant, and that the lithic artifacts used to date cultural periods may be imperfectly correlated with changes in settlement patterns or the economy (22). Grosman and Munro (Reference Grosman, Munro, Enzel and Bar-Yosef2017) agree that Early Natufian culture began before the Bølling-Allerød, but assert that it mainly corresponded to this climate phase. Roberts et al. (Reference Roberts, Woodbridge, Bevan, Palmisano, Shennan and Asouti2018, 62) argue that the rejection of causal linkages between climate phases and cultural periods by Maher et al. (Reference Maher, Banning and Chazan2011) and Henry (Reference Henry, Bar-Yosef and Valla2013) is unpersuasive given the limited temporal precision of the data.
With the onset of the Younger Dryas around 12,700 BP, climate became colder and drier for over 1,000 years. According to Kuijt and Prentiss (Reference Kuijt, Prentiss, Prentiss, Kuijt and Chatters2009), this period was associated with more mobility, less elaborate architecture, smaller buildings, shallower cultural deposits, and fewer burials. Many people abandoned previous forms of food storage, adopted mobile hunting and gathering, and had smaller group sizes. At the same time, at least one favorable site in the northern Levant, Abu Hureyra, saw an increase in population (see Chapter 5). Likewise sites in the southern Levant with locally favorable conditions, such as locations in river valleys and on Mount Carmel, continued to support sedentary communities (Bocquet-Appel and Bar-Yosef, Reference Bocquet-Appel, Bar-Yosef, Bocquet-Appel and Bar-Yosef2008, 6). Thus, it is difficult to generalize about the effect of the Younger Dryas on sedentism for the region as a whole.
Maher et al. (Reference Maher, Banning and Chazan2011) again express skepticism about the causal connection between the Younger Dryas and the Late Natufian developments of the sort described by Kuijt and Prentiss (Reference Kuijt, Prentiss, Prentiss, Kuijt and Chatters2009), but agree that these events occurred relatively closely in time. Grosman and Munro (Reference Grosman, Munro, Enzel and Bar-Yosef2017) paint a more complex picture. In the Mediterranean zone, the shift from Early Natufian to Late Natufian culture occurs before the Younger Dryas, but in more arid regions, the Late Natufian coincides with the Younger Dryas. Late Natufian sites are sometimes mobile and seasonal, but in other cases Early Natufian sites continue in use during the Late Natufian. Sedentary or semi-sedentary sites with large structures were still being used in the Jordan Valley during this period, for example.
At the start of the Pre-Pottery Neolithic A (PPNA), which coincides with the start of the Holocene around 11,600 BP, climate changed drastically, becoming much warmer and wetter. This resulted in dramatically better growing conditions for plants. At about the same time groups showed greatly decreased residential mobility, the construction of buildings requiring more labor investment, and food storage. Economic practices were now centered on expanding communities of forager-collector-cultivators. Subsistence strategies continued to include hunting of wild game and collecting of wild plants, but agriculture spread widely across the region (Kuijt and Prentiss, Reference Kuijt, Prentiss, Prentiss, Kuijt and Chatters2009).
This chronology is supported by evidence from paleodemography. For southwest Asia as a whole, Roberts et al. (Reference Roberts, Woodbridge, Bevan, Palmisano, Shennan and Asouti2018) found that population shows a downward deviation relative to a fitted exponential curve in the later part of the Bølling-Allerød and the early Younger Dryas. They found an upward deviation early in the Holocene. The latter event could have been driven partly by the spread of agriculture, although climate amelioration was almost certainly a contributing factor (see Section 5.9 for a more detailed discussion).
Several features of this case are worth highlighting. First, climate change was strongly correlated with shifts between mobile and sedentary lifestyles, and the climate effects involved changes in mean temperature and precipitation rather than merely lower variance. Second, lifestyle transitions went in both directions: when the regional climate deteriorated, sedentism tended to be abandoned, except at locations that were buffered from climate shocks. Third, climate change was associated with changes in population densities, reflected in settlement sizes. Finally, sedentism involved a distinct repertoire of techniques with respect to diet, food storage, and housing. These techniques began to develop more than 3,000 years before domesticated cereals emerged in the PPNA.
Japan:
Another case of sedentary foraging involves the Jomon of Japan, a society that lasted over 10,000 years. Aikens and Akazama (Reference Aikens, Akazawa, Straus, Ericksen, Erlandson and Yesner1996) use tree pollen data to argue that a warmer climate in the late Pleistocene and early Holocene led to the spread of oak forests from south to north through the Japanese archipelago. The new oak forests were followed by a broad-spectrum diet with greater reliance on plant and aquatic foods, early development of pottery for cooking, storage facilities, and durable residences.
The remaining information on Jomon society in this section is taken from Habu (Reference Habu2004) except where noted. While some non-staple plants were probably domesticated, most of the Jomon diet consisted of wild animals, wild plants, and aquatic resources (see ch. 3 of Habu, Reference Habu2004). The Jomon display no evidence of warfare or political complexity. There is little evidence of inequality and this is only found in the Late/Final periods.
The dating of Jomon periods is complicated by regional diversity and radiocarbon calibration problems (37–42), so the dates given below are approximate. The Incipient Jomon period began around 12,000 BP and was characterized by mobile foraging (246–247). The Initial Jomon period is dated from 11,000 to 6900 BP and corresponded to the transition “from the Final Pleistocene to the warmest part of the mid-postglacial period” (42). Most settlements in Honshu remained small (generally 3–5 pit dwellings) and there is little evidence for food storage, so it is likely that mobile foraging continued as the dominant food acquisition strategy. However, there is evidence for larger sedentary communities in Hokkaido and southern Kyushu.
The Early and Middle Jomon periods (roughly 6900–4000 BP) overlap with the “Climatic Optimum,” the warmest part of the postglacial (Holocene) period. The Early Jomon had more sedentism, larger settlements, and food storage technologies. Aggregate population reached a maximum around the Middle period, and in some regions increased by a factor of 50 or more relative to previous levels (254). Likewise, site densities and settlement sizes reached a high point (133). One site had continuous occupation by 200–500 people for over 1,500 years (114). Such settlements had hundreds of labor-intensive pit dwellings.
The Late/Final Jomon, starting around 4,000 BP, was associated with a cooling trend. Over 600–700 years, population fell to levels prevailing before the Early/Middle growth phase (197, 254). The number of large settlements declined, along with site density, to levels similar to those of the Early Jomon, although this was not accompanied by a full return to mobile foraging. The Final Jomon dates to 3,300–2,100 BP. During the second half of this period, influences from the Korean peninsula led to rice cultivation.
Crema et al. (Reference Crema, Habu, Kobayashi and Madella2016) study population dynamics between 7,000–3,000 BP for the Jomon. Relative to null hypotheses with uniform or exponential distributions, they find a sequence with rapid initial population growth, a high-density plateau, a decline, and then renewed growth. They suggest (but do not test) the hypothesis that these fluctuations were associated with climate change. In general, their results are consistent with what one might expect based on the relevant part of the narrative presented by Habu (Reference Habu2004).
As in southwest Asia, climate appears to have played a key role in determining the subsistence strategies of the Jomon. Favorable climate was closely associated with population growth, and a deteriorating climate was associated with population decline. Sedentism involved both an increased reliance on food storage and greater residential construction, with some communities lasting for many centuries. Another parallel with the case of southwest Asia is that sedentary foraging became widespread in Japan more than 3,000 years before the arrival of agriculture from Korea. Finally, we note that the lack of political complexity, warfare, and inequality in the early stages of sedentism is consistent with our modeling assumption of relatively free individual mobility.
Sedentism and Agriculture:
This chapter is concerned with pristine transitions from mobile to sedentary foraging, where the term “pristine” means that the transition was not caused by a diffusion of sedentary lifestyles from other societies. Specifically, we are not addressing cases where sedentism was triggered by the diffusion of agriculture from a neighboring society.
A society that experiences a pristine transition to sedentary foraging may follow various subsequent trajectories: (a) after a potentially long lag it may undergo a pristine transition to agriculture, as in southwest Asia; (b) after a potentially long lag it may adopt agriculture through diffusion from outside sources, as in Japan; or (c) it may never adopt agriculture, as along the northwest coast of North America prior to European contact. A society of mobile foragers that does not undergo any transition to sedentary foraging may likewise follow various trajectories: (a) it may eventually adopt sedentism and agriculture simultaneously due to the diffusion of agricultural technology; (b) it may be displaced by farming populations, as in parts of Europe (Shennan, Reference Shennan2018) and southern Africa; or (c) it may engage in mobile foraging indefinitely, as in Australia prior to European contact.
The next several sections build a formal model of the conditions that could trigger a pristine transition from mobile to sedentary foraging. The conditions that could trigger a subsequent pristine transition to agriculture will be left for Chapter 5.
4.4 The Production Site
We consider a site with L agents. Each agent is endowed with one unit of labor time, and each agent is of negligible size relative to the population at the site. The food produced at the site is shared equally among the agents, who allocate their labor time to maximize total food output. Our use of the term “site” will be flexible and may involve a relatively large geographic area within which mobile foragers search for food.
Food can be obtained by hunting or gathering. We use “hunting” as a generic label for mobile food procurement activities involving a portable toolkit, minimal processing, and quick consumption. Aside from literal hunting, this could include the collection of berries or other easily harvested wild plant foods. We use “gathering” as a generic label for stationary activities involving fixed assets, extensive processing, or storage. Fixed assets could include grinding stones, ovens, pottery, or storage pits. Further examples outside the literal definition of gathering include animal traps and fishing weirs.
Total food output (in calories) is given by
where f and g are the production functions for hunting and gathering respectively;
is the weather;
is the labor input for hunting;
is the labor input for gathering; and
is the productivity of gathering. The parameter
is neutral between the two activities and reflects the general abundance of food resources at the site.
The production functions have the following properties:
Assumption 4.1
;
for all
;
for all
;
for all
; and
for all
. The marginal products
and
are positive and finite at zero labor input. We also assume
.
One example of such a production function is
where
and A can be interpreted as the natural supply of some wild food resource available for harvesting.
At a benchmark site where
, the maximum food output is
The solution in (4.2) is unique due to the strict concavity of the objective function. The optimal labor allocation and the function H have the following properties.
Proposition 4.1
(optimal labor allocation).
(a) Suppose
. Define
by
.(i) For
the solution has
and
with
.(ii) For
the solution has
and
with
.
(b) Suppose
. Define
by
.(i) For
the solution has
and
with
.(ii) For
the solution has
and
with
.
(c) Suppose
. For all
the solution has
and
with
.(d) H(L, k) is continuous, increasing, and strictly concave in L for any fixed
.(e) Food per capita
is decreasing in L for any fixed
.(f) (i) As
,
for any fixed
.(ii) As
,
for any fixed
.
Part (a) of Proposition 4.1 says that if gathering productivity (k) is small enough, only hunting is used at low levels of L and both activities are used at higher levels of L. Conversely, part (b) says that if k is large enough, only gathering is used at low levels of L and both activities are used at higher levels of L. Part (c) is a boundary case. Part (d) shows that the marginal product of labor is positive and decreasing. Part (e) shows that food per capita h(L, k) decreases as labor input rises. Part (f) shows that food per capita is finite at
and goes to zero as L goes to infinity. For an unoccupied site, food per capita is defined to be
.
As in Chapters 2 and 3, for the production function f(Lf) the marginal product of labor and the average product of labor are equal to the same finite constant at zero input. The production function g(Lg) has the same feature. This extends to the function H(L, k), which has
where HL(0, k) is the marginal product of labor at zero and h(0, k) is the average product of labor at zero, as in Proposition 4.1(f)(i).
4.5 Short-Run Equilibrium
This is the first section of the book in which we need to distinguish between sites and regions. We think of a region as a contiguous geographic area for which there is no in- or out-migration. This may be true because the area is circumscribed by mountains, deserts, or oceans, or because heavily populated zones are far apart and it makes sense to ignore migration between such zones. In empirical discussions we often refer to regions like southwest Asia, sub-Saharan Africa, or Mesoamerica that correspond to areas studied by archaeological specialists. In such contexts we assume that events in separate regions evolved independently.
Within a region there are generally multiple production sites. In Chapters 4–5 we assume that individuals can move freely among these sites. For simplicity we ignore the costs associated with travel over intra-regional distances. We also assume there are no social barriers to movement among sites, either because population densities are too low to make exclusion feasible, or because kinship links enable population to flow relatively freely between sites when this is economically advantageous to the individuals involved. Property rights therefore involve open access. In Chapters 6–8 we consider physical and social barriers to mobility across sites.
The number of sites in a region depends on the model. In Chapters 4–6 it will be convenient to assume a continuum of sites. In Chapters 7–8 we will study models where a region has only two distinct sites or territories. Finally, in Chapter 10 we will consider a region with one large site and many small sites.
Now consider a continuum of production sites uniformly distributed on the unit interval and indexed by
. The weather at an individual site s is binary:
Assumption 4.2
with
. The probability that a site s has good weather
is
and the probability that the site has bad weather
is
. Weather is independent across sites. The fraction of sites with good weather is always
and the fraction of sites with bad weather is always
.
We think of the regional economy as having a fixed total supply of food resources under a given climate regime. The weather in the current period determines how this supply is distributed across sites. There is no aggregate uncertainty.
We define the short run to be a time period over which regional population
does not change (one human generation, or about twenty years). Due to open access, it must be true in equilibrium that no agent can obtain a higher food income by changing sites. This motivates the following definition.
Definition 4.1 Fix the parameters
and the regional population
. Let
be total labor at a production site having weather
, and let y be food per capita. The combination
is a short-run equilibrium (SRE) if
(a)
with 
(b)
with equality if 
(c)

The logic behind this definition runs as follows. Due to
, if we had
we would need to have
. The resulting food per agent at a site of type B would be
, which is less than
, the food an agent could obtain by moving to an unoccupied site of type A. To prevent such incentives for movement, we must have
as in condition (a) of D4.1. The associated food income per agent is denoted by y.
If
then we must have
to avoid the movement of agents from sites of type A to unoccupied sites of type B. If
then sites of type B must provide the same income as sites of type A, because otherwise agents would move from sites of one type to sites of the other. This gives condition (b) in D4.1. The local populations at the individual sites must add up to the regional population N, so we require
as in condition (c).
There is a unique SRE for each
. To see why, let η be the inverse function h‒1, where we temporarily suppress the parameter k to simplify notation. This gives
The regional “demand” for labor is
. The equilibrium food income y must satisfy
The demand D(y) is zero for
. It is continuous and increases as y decreases for
, with
as
. This implies the existence of a unique food income y(N) such that (4.3c) holds for the given
. More formally:
In the rest of this section we show how the qualitative nature of SRE depends on the parameters N and k. It is easy to see from the definition of SRE that either
or alternatively
. We assume throughout this section that
, because we want to focus on situations where an initially low population density implies that only the hunting technique is employed. It follows from Proposition 4.1(a) that either
and
(only hunting is used at sites of type A) or
and
(both hunting and gathering are used at sites of type A). When sites of type B are active,
implies
due to the greater population at sites of type A. Thus if gathering is used anywhere, it must be used at sites of type A.
These considerations yield four possibilities for the structure of SRE:
(I) Only type A sites are active
and only hunting is used
.(II) All sites are active
and only hunting is used
.(III) Only type A sites are active
and both techniques are used
.(IV) All sites are active
and both techniques are used
.
Figure 4.1 illustrates these four regions in (N, k) space for a fixed climate regime
. Regions I and II where only hunting is used are associated with low values of the gathering productivity k, while regions III and IV where both hunting and gathering are used are associated with higher values of k. Regions I and III where sites of type B are inactive correspond to low values for the regional population N, while regions II and IV where all sites are active correspond to higher values of N.
The locations of the regions in Figure 4.1 depend only on the productivity ratio
and not the absolute productivity levels
. The boundary for the use of the gathering technique (labeled
in Figure 4.1) is determined using Proposition 4.1(a). The relevant equation is
When only sites of type A are active (regions I and III), the local population density
is obtained directly from condition (c) in the definition of SRE. Substitution into (4.4) gives the desired relationship between N and k. When all sites are active but gathering is not used, (a) and (b) in the definition of SRE imply
. This and (c) determine local populations
and
. Substituting
into (4.4) gives the relationship between N and k corresponding to the boundary
in Figure 4.1.
The boundary for use of sites of type B (labeled
in Figure 4.1) is derived as follows. We use the notation h(0, k) and η(⋅, k) to indicate that these functions depend on the gathering productivity k. Consider the population threshold
When
only sites of type A are active
. When
all sites are active
. This boundary is vertical at
in Figure 4.1 when gathering is not used, because then the parameter k is irrelevant and
is a constant. When gathering is used at sites of type A, the boundary
slopes up as indicated in Figure 4.1. This occurs because higher gathering productivity drives a wedge between the A and B sites in a way that favors the type-A sites, which are already gathering. As we explain later, our concept of sedentism focuses on the question of whether bad sites are active, so a key task will be to determine how
can be achieved if it does not hold initially.
4.6 Long-Run Equilibrium
We turn next to population dynamics. Time is discrete with each period equal to one human generation. Agents come to adulthood at the beginning of a period, observe the weather conditions at each site, choose a site at which to work, produce food, have children, and then die at the end of the period. The regional (adult) population in period t will be denoted
. All adults have the same food income
in period t regardless of the site at which they work. This income is obtained from
using Proposition 4.2, where we now include k as an explicit argument in all relevant functions.
Assumption 4.3 An adult in period t has
children who survive to become adults in period
. The function
is continuous and increasing with
. There is some
such that
.
The relationship between food and surviving children arises because more parental food leads to higher fertility, lower child mortality, or both. This is the Malthusian aspect of the modeling framework used in this chapter.
As in Chapter 3, population evolves according to
This equation implies that a non-null stationary population
must yield
or equivalently
. We formalize this idea in the following definition.
Definition 4.2 Fix the parameters
. The combination
with
is a non-null long-run equilibrium (LRE) if
and y* form a SRE for N.
In order to support a positive population in the long run, food income must achieve the level y* needed for demographic replacement. This gives an existence result.
The following monotone convergence assumption simplifies later analysis.
Assumption 4.4 Monotone population adjustment (MPA). Suppose there is a population
such that
. Keep the parameters
constant over time and consider any
. If
then
for all
. If
then
for all
. In either case
.
As with A3.2 in Section 3.7, this rules out oscillations around
along the adjustment path. MPA holds if the direct positive effect of
on
in (4.6) outweighs the indirect negative effect operating through
.
Figure 4.1 can be used to determine how the structure of SRE changes along the adjustment path
. For example, imagine that a horizontal line associated with a fixed value of k crosses regions I, II, and IV because gathering productivity is low. Suppose the initial population N0 from A4.4 is in region I and the LRE population N* from A4.4 is in region IV. This implies that population will increase over time and the system will move horizontally from left to right in Figure 4.1. At first only sites of type A are active and only hunting is used; after a while, sites of type B become active but gathering is not yet used; and eventually both hunting and gathering are used, at least at the type-A sites. Other chronological sequences could occur depending on the productivity level k, the initial population N0, and the final population N*.
4.7 Very-Long-Run Equilibrium
This section is based on the model of learning by doing described in Section 3.5 and the concept of very-long-run equilibrium defined in Section 3.8. To avoid repetition, the discussion here will be kept brief. Suppose the resources for the hunting production function f have been exploited for many millennia, and there are no further opportunities to increase hunting productivity through learning by doing. On the other hand, resources associated with the gathering production function g have not previously been exploited. If gathering begins, learning by doing also begins, and in each period there is a positive probability that the productivity coefficient k will jump by a discrete amount. All such innovations are freely available throughout the region. This process can end in one of two ways: either (i) the gathered resources are no longer used, which terminates learning by doing and prevents k from rising any further or (ii) k reaches an upper bound
and there are no further opportunities for technical innovation in gathering.
These ideas lead to the following definition.
Definition 4.3 Fix the parameters
and let
be the maximum achievable value of k. The combination
with
is a non-null very-long-run equilibrium (VLRE) if
(a) (LA, LB, N) form an LRE for the given k; and
(b) Either
, or
, or both.
Condition (b) ensures that the productivity level k remains constant with probability one.
As in Chapter 3, “very long run” does not imply that population adjustments occur first (in the long run) and then technical adjustments occur second (in the very long run). The population and technology (Nt, kt) adjust simultaneously. This terminology is meant simply to highlight that the productivity level k is exogenous in the definition of LRE but endogenous in the definition of VLRE.
Proposition 4.4
(very-long-run equilibrium).
Fix the parameters
.
(a) For a given
, there is a unique non-null VLRE iff
and the value of
such that
gives
. This determines the equilibrium value of
. If
then
satisfies
; otherwise
. Finally,
.(b) For
, there is a unique non-null VLRE iff
. In equilibrium
satisfies
. If
then
satisfies
; otherwise
. Finally,
.
Part (a) deals with VLREs in which gathering productivity is below its maximum possible value. In order to avoid learning by doing that raises k with positive probability, we need
so the gathering technology is not used at any site. For the VLRE to be non-null
, it must be possible to support a positive population LA at the good sites with hunting alone. Bad sites may or may not be active, depending on their natural productivity
. The requirements for a VLRE of this kind parallel those for an LRE in which hunting is the only technology, but with the additional requirement that the latent productivity k be low enough to ensure that gathering is not used. Chapter 3 referred to such a VLRE as a “stagnation trap”; gathering is not used because its productivity is too low, but its productivity cannot increase because gathering is never used.
Part (b) deals with VLREs where gathering productivity is at the maximum k*. This removes the need for any restriction on the use of gathering. In general, VLRE may involve hunting, gathering, or some combination of the two. The condition for VLRE to be non-null
is identical to the condition for an LRE to be non-null with gathering productivity k* (see Proposition 4.3(a)). When k* is small enough, part (b) reduces to an equilibrium in which only hunting is used, as in part (a).
4.8 The Baseline Equilibrium
A site’s natural productivity is θA with probability λ and θB with probability 1−λ. These draws are independent across sites and periods. Thus the mean productivity of a site is
and the variance is
. These equations give
In our analysis of climate change, we keep λ constant and only consider changes in the productivity levels
and
.
Between the Last Glacial Maximum and the Holocene, mean weather conditions improved and the variance of weather declined. We thus contrast two climates: an initial bad regime
and a subsequent good regime
, where
and
. From (4.7) this unambiguously raises
and the ratio
. The productivity
may go in either direction. Here we assume
rises, because it is reasonable to think that the Holocene was beneficial at both good and bad sites. The ratio
must rise, so worse sites must improve in greater proportion.
Assumption 4.5 The climate amelioration from
to
yields
,
, and
. V is positive in both regimes so
and
. We ignore possible changes in the fraction of good sites λ.
Our concept of sedentism is that agents remain at a location even when resources become scarce at that location. This contrasts with a fully mobile lifestyle where agents always move to the locations where resources are most abundant. To formalize this idea, recall that in a long-run equilibrium there is a constant regional population N with local populations
at the individual production sites. Suppose a site changes from A (abundant resources) in period t−1 to B (scarce resources) in period t. In LRE each adult has one child who survives to adulthood so the
adults at the site in period t−1 leave
surviving adult children at the site at the start of period t. Due to the diminished local abundance of resources in period t, there is an outflow of
agents in period t, with LB agents remaining behind. When
, a production site is always occupied by some agents even if weather conditions turn bad at that site.
We define the rate of sedentism to be
because this is the fraction of the local population that remains in place when there is an adverse weather shock. The sedentism rate is zero if sites are entirely abandoned when resources are locally scarce
. In a long-run equilibrium where type-B sites are active, we have
and
. This implies that for a fixed value of
, the sedentism rate
is increasing in
with
as
. In other words, the sedentism rate rises when the variance in weather conditions drops. Under our definition, sedentism can be positive even when only “hunting” (mobile food acquisition) is used, as long as sites are not fully abandoned when local weather worsens. “Gathering” (stationary food acquisition) is not a necessary condition for a positive sedentism rate.
Next consider a baseline VLRE reflecting the circumstances of the Last Glacial Maximum. We want a VLRE in which only sites of type A are active and only hunting is used. Whenever an individual site shifts from good to bad weather conditions, all agents move away from that site.
Condition (4.9a) ensures that a site of type A can support a positive workforce LA0 at the food per capita y*. Condition (4.9b) ensures that agents with access to income y* will not move to a site of type B. Condition (4.9c) ensures that gathering is unattractive when the workforce at a site is LA0. It is easy to see that such parameter values exist: choose a large enough value of θA0 to satisfy (4.9a), a small enough value of θB0 to satisfy (4.9b), and a small enough value of k0 to satisfy (4.9c).
4.9 The Effects of Climate Change
Starting from the baseline equilibrium constructed in Section 4.8, suppose there is a permanent improvement in climate to (θA*, θB*) in period t = 0 as described in A4.5. In reality the shift from the LGM to the Holocene was lengthy, non-monotonic, and uneven across regions of the world, but the qualitative effects of climate change are most easily studied using this simple framework.
We are interested in causal mechanisms that yield LB > 0 and thus a positive rate of sedentism. Three mechanisms are considered. The first, called “climate only,” operates in the short run. The second, called “climate plus population,” operates in the long run. The third, called “climate plus population and technology,” operates in the very long run.
Proposition 4.6
(short run; climate only).
Start from a baseline VLRE as in Proposition 4.5. Let the climate change described in A4.5 occur at the start of period t = 0.
(a) Gathering is not used at any site in period t = 0.
(b) Sites of type B become active in period t = 0 iff θA*y*/θA0 < θB*f′(0).
Part (a) results from the following facts: (i) LB is zero in the baseline equilibrium; (ii) the regional population is fixed in the short run at N0; (iii) the new equilibrium must therefore have LA no larger than its baseline level, and smaller if LB becomes positive; (iv) there is a corner solution with no gathering at type-A sites in the baseline equilibrium and LA does not increase, so the new SRE also has a corner solution for these sites; (v) if gathering is not used at type-A sites, it cannot be used at type-B sites either, because LB < LA must hold in SRE. Part (b) is straightforward. In the short run, regional population is fixed at N0. If the type-B sites improve enough relative to the type-A sites, some of this population will spread out into the B sites.
In the rest of this section we assume that the condition in Proposition 4.6(b) does not hold so sedentism does not arise immediately. We first want to show that even when climate change does not lead to sedentism directly, it can trigger a process of population growth that leads to sedentism indirectly. The key idea involves Malthusian population dynamics. In the short run, higher productivity at the type-A sites makes all agents better off. Because food per capita y0 now exceeds y*, the regional population increases.
Let the population adjustment path be {Nt}. The climate change is permanent so (θA*, θB*) remains in effect along this path. Temporarily we also assume that k0 remains in effect, and ignore the learning by doing issues raised in Section 4.7. Due to A4.4, {Nt} is increasing and converges to a new LRE population N* > N0 as t → ∞. Food per capita {yt} is decreasing and converges to y*.
Now consider Figure 4.2, where all of the curves reflect the new ratio θB*/θA*. The boundaries LB = 0 and LAg = 0 intersect at (N, k), where LA, N, and k must satisfy
(4.10)At the point (N, k) we have a SRE in which only type-A sites are active and only hunting is used. Agents are indifferent about moving to an unoccupied type-B site, and gathering would come into use at the type-A sites if there were any increase in its productivity.
One possibility is that the gathering productivity k0 from the baseline equilibrium may be below k. In this case we start from a point like P in region I. As population rises, the system moves to points like Q in region II, where type-B sites are active but gathering is still not used. With enough population growth, the system eventually moves to points like R in region IV, where type-B sites remain active and both hunting and gathering are used at the type-A sites.
Another possible sequence arises when latent gathering productivity k0 is above k. Now we start from a point like S in region I. Population growth first leads to points like T in region III where hunting and gathering are both used at the type-A sites, while type-B sites remain inactive. Further population growth moves the system to points like U in region IV where type-B sites become active.
Whether these transitions occur depends on the new long-run population level N*, which is determined by the absolute level of the new climate parameters (θA*, θB*). We can hold the ratio θB*/θA* constant while scaling up the degree of climate improvement by multiplying the productivities of both A and B sites by an identical factor μ > 1. This raises the LRE population N* without altering the location of the regions in Figure 4.2.
The following proposition describes the range of potential outcomes.
Proposition 4.7
(long run; climate plus population).
Start from a baseline equilibrium as in Proposition 4.5. Let the climate change described in A4.5 occur at the start of period t = 0. Define LA* > 0 using θA*h(LA*, k0) ≡ y*. Assume that the inequality in Proposition 4.6(b) does not hold so sites of type B do not become active at t = 0. Let k0 remain constant.
(a) Suppose k0 < k. There are three possible cases.
(i) If θB*f′(0) ≤ y* then type-B sites are never active and gathering is never used. The sedentism rate S remains at zero.
(ii) If θB*f′(0) > y* and f′(LA*) ≥ k0g′(0) there is some T > 0 such that type-B sites are not active for t < T but are active for t ≥ T. Gathering is never used. The sedentism rate St approaches a limit S* with 0 < S* < 1.
(iii) If θB*f′(0) > y* and f′(LA*) < k0g′(0) the results in (a)(ii) apply, except that there is some T′ ≥ T such that gathering is used at sites of type A for t ≥ T′.
(b) Suppose k0 > k. There are three possible cases.
(i) If f′(LA*) ≥ k0g′(0) then type-B sites are never active and gathering is never used. The sedentism rate S remains at zero.
(ii) If f′(LA*) < k0g′(0) and
then type-B sites are never active. There is some T > 0 such that gathering is not used for t < T but is used at sites of type A for t ≥ T. The sedentism rate S remains at zero.(iii) If
and
the results in (b)(ii) apply, except that there is some T′ ≥ T such that type-B sites become active for t ≥ T′. The sedentism rate St approaches a limit S* with 0 < S* < 1.
(c) Suppose k0 = k. There are only two possible cases.
(i) If
and
then type-B sites are never active and gathering is never used. The sedentism rate S remains at zero.(ii) If
and
there is some T > 0 such that type-B sites are not active and gathering is not used for t < T, but type-B sites are active and gathering is used at type-A sites for t ≥ T. The sedentism rate St approaches a limit S* with 0 < S* < 1.
Proposition 4.7 makes several points. First, when the climate amelioration is small, population does not grow enough to cause any qualitative change relative to the baseline VLRE. In the new LRE, there is still no sedentism and still no gathering. This is true for cases (a)(i), (b)(i), and (c)(i). Second, population growth can lead to sedentism while at the same time all food continues to be obtained from hunting. This occurs in case (a)(ii). Third, population growth may fail to generate sedentism because the type-B sites remain inactive, but there may be enough population growth to stimulate the use of gathering at the type-A sites. This occurs in case (b)(ii). Finally, when the climate amelioration is sufficiently large, Malthusian population growth eventually leads to both sedentism and gathering. This is true for cases (a)(iii), (b)(iii), and (c)(ii).
We conclude this section by showing that even if climate change plus population growth does not stimulate sedentism, these factors can trigger a process of technological innovation that does. The key idea is that population growth may not directly make sites of type B active, but it may cause agents at sites of type A to use gathering as well as hunting. Once gathering begins, learning by doing raises gathering productivity over time. This can eventually make the type-B sites active, thus yielding sedentism.
It is not hard to show that once gathering comes into use, it must remain in use. This is true for three reasons. First, even if gathering productivity remains constant, the fact that regional population is growing means that the local population at type-A sites is also growing. This scale effect implies that gathering continues to be used at these sites. Second, if learning by doing raises gathering productivity, there is a substitution effect that increases the incentive to use gathering at a given local population level. Third, the productivity growth leads to further regional population growth beyond what is caused by climate change alone. This reinforces the growth of local population at the type-A sites. Thus gathering cannot shut down at the type-A sites after it begins.
For this reason, the only way to reach a new very-long-run equilibrium is for the gathering productivity to reach a maximum value
at which opportunities for technical improvement are exhausted. It can be shown that as time goes to infinity, the probability of reaching the maximum gathering productivity
approaches one. The question we want to address here is whether type-B sites are active in the new VLRE associated with
. Proposition 4.8 resolves this issue.
Proposition 4.8
(very long run; climate plus population and technology).
Make the same assumptions as in Proposition 4.7, except that now the learning process from Section 4.7 begins whenever gathering is used, and therefore k is no longer constant. Assume case (a)(iii), (b)(ii), (b)(iii), or (c)(ii) from Proposition 4.7 applies, so population growth leads to gathering. In the cases (a)(iii), (b)(iii), and (c)(ii), sites of type B are active in VLRE. In case (b)(ii), sites of type B are active in VLRE if and only if
.
The results for cases (a)(iii), (b)(iii), and (c)(ii) are not surprising. We already saw in Proposition 4.7 that for these cases Malthusian population growth alone makes the type-B sites active, and therefore yields some sedentism even without technical change. The rise in gathering productivity induced by technical innovation reinforces this result.
What is more interesting is the result for case (b)(ii). From the definition of this case we have
, so it is not possible to support a positive population at type-B sites by hunting alone. Thus population growth by itself, in the absence of technological change, cannot lead to sedentism. However, population growth does lead to gathering at the type-A sites. The resulting rise in gathering productivity (from k0 to k*) can make the type-B sites active. This only occurs if
. The inequality indicates that it must become possible to support a positive population at type-B sites by gathering alone.
We can also make a definite prediction about the type-A sites that applies to all of the cases in Proposition 4.7. The following result holds regardless of whether or not there is any technological innovation with respect to gathering.
Proposition 4.9
(persistence of mobile activities).
Starting from the baseline VLRE defined as in Proposition 4.5, suppose the climate change described in A4.5 leads to a new VLRE. In the new VLRE, agents at the type-A sites must do some hunting.
One might imagine that if gathering became productive enough, the substitution effect would be so powerful that the type-A sites would abandon hunting completely and only gather. This argument is incorrect because it overlooks the fact that higher productivity for gathering also leads to a larger regional population in the long run, which increases the supply of labor at the type-A sites. This scale effect maintains the use of hunting at these sites, no matter how productive gathering may become.
4.10 Demography and Living Standards
Many authors argue that sedentism raises fertility (Bocquet-Appel and Naji, Reference Bocquet-Appel and Naji2006; Kelly, Reference Kelly2013a). In southwest Asia, sedentism was accompanied by a “fertility explosion”; and in both southwest Asia and Japan, sedentism was associated with larger settlements, higher site densities, and increased regional population (see Section 4.3). We show in this section that there are strong theoretical reasons why sedentary foraging will be associated with higher population levels than mobile foraging, other things being equal.
If we have higher fertility, clearly we must also have higher mortality in order for regional population to be constant in long-run equilibrium. It is often said that a lifestyle involving large permanent settlements increases mortality by causing greater exposure to infectious disease (see for example Scott, Reference Scott2017, 96–113). While this is no doubt true, our theory predicts worse nutrition and shorter life expectancy even in the absence of disease.
Suppose that for any given level of food per capita, a sedentary lifestyle increases the number of surviving offspring per adult. This could result from less physical exertion by women, shorter periods of breastfeeding, greater paternal contributions in households, lower economic costs or higher economic benefits derived from offspring, or diminished infant mortality (Kelly, Reference Kelly2013a, 193–202, 209–213). Any of these factors would shift up the entire ρ(y) function in Section 4.6 at each value of y.
As shown in Figure 4.3, this decreases the long run level of food per capita y* at which the population is stationary (ρ = 1). Thus it lowers the long-run standard of living measured in food units, by comparison with a long-run equilibrium where foragers are more mobile. The intuition is that any factor apart from food income that raises fertility must be offset by a long-run reduction in food income in order to ensure the stationarity of the population. With technology and resources held constant, this reduction in food per capita requires a higher regional population by comparison with mobile foragers.

Figure 4.3. Effects of sedentism on long-run food income
Further insights can be gained from a simple formal model (readers who prefer a verbal presentation can skip to the summary at the end of this section). For concreteness we focus on the cost of children. Suppose that women in mobile foraging groups find it difficult to carry two or more young children simultaneously, and try to limit these costs by engaging in prolonged breastfeeding or infanticide (see Section 2.6). A transition to sedentism might thus encourage women to have more children. Our model can also be interpreted in a context where the mechanisms are physiological and do not require any conscious choice about breastfeeding or infanticide, as we will discuss below.
We want to derive the function ρ(y, v) where y is food income for an adult, v is the cost of a child to an adult, and ρ is the number of surviving offspring. Assume that an adult requires a ≥ 0 units of food to stay alive. If y < a, the adult dies and has no children. If y ≥ a, the adult survives for one period with certainty. We only consider the latter case.
The adult must expend v > 0 units of food for each child. These are costs directly incurred by the adult, such as the calories used for carrying the child during infancy. The number of children is q ≥ 0. After these expenses, the adult has the net food income y − a − vq, which is divided equally among the q children. Therefore the food intake per child is z(q) = (y − a − vq)/q.
Let p(z) be the probability that a child having food intake z survives to adulthood, where p(0) = 0 and p(∞) = 1 with p′(z) > 0 and p′′(z) < 0 for all z ≥ 0 so that the survival probability is increasing and strictly concave. We disregard any non-food-related reasons for childhood mortality. In one time period the adult collects food, has children, and dies. The children inherit their food bequests from the adult, and each child then survives with the probability p(z) to become an adult in the next period.
Each child’s survival is independent, conditional on food, so there is a binomial distribution for surviving children. The expected number of such children is qp[z(q)]. Ignoring integer problems, the adult chooses the quantity of children q to achieve
where the upper bound on q comes from z(q) ≥ 0 (children cannot have negative food).
Even if one does not believe that women in prehistory had significant conscious control over fertility (q), nature would have provided physiological mechanisms through which fertility responds to (y, v) in a manner that solves (4.11), because this maximizes the expected number of surviving offspring and would be favored by natural selection. Such mechanisms may include suppression of ovulation, a reduced probability of fertilization, and an increased rate of miscarriage when food is scarce and the cost of children is high.
In the case y > a, from p(∞) = 1 we have qp[z(q)] → 0 as q → 0. Likewise from p(0) = 0 we have qp[z(q)] = 0 for q = (y − a)/v. Thus there is an interior solution q(y, v) > 0. The objective function is strictly concave in q so this solution is unique. We refer to q(y, v) as the quantity of children and p[z(q(y, v))] as the quality of children. We also sometimes call q(y, v) fertility and 1 – p[z(q(y, v))] child mortality.
The first-order condition for (4.11) can be written as p(z) = p′(z)(z + v). Treating this as an identity, it can be shown that z is an increasing function of v when food income (y) is held constant. Therefore when the cost per child (v) falls, as we expect with greater sedentism, the food intake per child (z) also falls. Hence p(z) falls, and each child is less likely to survive. Because there is an inverse relationship between z and q, it follows that q(y, v) must rise when the cost per child declines. Accordingly, sedentism causes adults to substitute in the direction of higher quantity and lower quality of children. As a result both fertility and child mortality increase at a fixed level of food income.
It can be shown that an increase in food income (y), holding the cost of children (v) constant, causes a linear increase in quantity but has no effect on quality. It can also be shown that the maximum value ρ(y, v) is increasing in y and decreasing in v. From an economic standpoint ρ(y, v) is just an indirect utility function that is increasing in income (y) and decreasing in price (v).
Continuing to assume that sedentism leads to a lower cost per child v, we want to know the consequences for long-run equilibrium. LRE requires ρ(y*, v) = 1, where y* is the level of food per adult that keeps population constant. In a comparison among such equilibria, the cost v and food income y* must move in the same direction to maintain the expected number of surviving offspring constant at one per adult. Specifically, suppose a transition from mobile to sedentary foraging causes the cost per child to decrease from v1 to v2 < v1. This shifts up the entire function ρ(y, v) as shown in Figure 4.3, which implies that y2* < y1* must hold in order to maintain ρ(y1*, v1) = ρ(y2*, v2) = 1. Therefore, in the long run, food income per adult also decreases. Other things equal, a lower y* means that regional population N must be larger in order to obtain a lower average product of labor. Thus we predict a larger population when the cost per child is lower, as compared to a parallel situation where the cost per child does not change (that is, v1 = v2 so y1* = y2*).
We know from the first-order condition for (4.11) that if sedentism decreases v, it must also decrease z and p(z). Because qp(z) = 1 holds for every LRE, this means that in the long run a transition from mobility to sedentism causes q(y, v) to rise. In the notation of the preceding paragraph, q(y1*, v1) < q(y2*, v2) so fertility is higher under sedentism.
Summary:
We can now pull these conclusions together. Start from a reduction in the cost of children caused by sedentism. At any given food income, this raises fertility. In the long run the increase in fertility must be offset by a reduction in food income per adult because the number of surviving offspring per adult must stay fixed at one (this is the definition of long-run equilibrium). The decreased food income and lower cost per child cause adults to substitute toward more children with a lower survival probability for each child. This reduces life expectancy calculated at birth.
These effects are distinct from the health effects connected to more crowding and more transmission of infectious diseases in permanent settlements, although such effects could also occur. Indeed, the decreased food intake per child arising in our theory would weaken the immune systems of children and make them more vulnerable to disease.
A transition to sedentism may initially increase the rate of population growth but the growth rate falls to zero in LRE. However, sedentism does have a permanent effect on the level of population, because for any given technology and natural resources, the population must become larger in order for long-run food per capita to become smaller.
The model of this section is too simple to include effects of sedentism on adult diet and mortality (adult food consumption was fixed at a subsistence level and adults always lived for one period). In a more sophisticated model, sedentism could yield lower adult food consumption and higher adult mortality. Although a subsistence minimum is a helpful simplification for the purposes of this section, this notion is not part of our overall theoretical framework and will not be used in later chapters.
All of the effects described in this section are a matter of degree. A society could utilize a combination of mobile and sedentary food collection techniques, and in this case the cost per child (v) would depend on the average rate of sedentism.
4.11 Resource Depletion
Archaeologists and anthropologists often use arguments about resource depletion to explain increases in dietary breadth, the transition to sedentism, and the transition to agriculture (see our discussion of Kelly, Reference Kelly2013a, in Section 4.2). A common story is that exogenous population growth leads to overharvesting of highly ranked prey such as large mammals. Eventually this leads to a focus on lower-ranked resources that are less easily depleted, such as rapidly reproducing smaller mammals, plants, or aquatic resources. For similar reasons, people might switch from mobile to stationary food resources, leading to greater sedentism, or from wild to cultivated plants, leading to agriculture.
There is an interesting symmetry between the resource depletion theories used in archaeology and our own Malthusian theory. Conventional archaeological theories treat population as exogenous, resource stocks as endogenous, and rely on exogenous changes in population to drive the depletion process, leading to social and technological change. Conversely, we treat resource availability as exogenous, population as endogenous, and rely on exogenous changes in resources to drive population dynamics, again leading to social and technological change.
In our view, the main problem with the traditional archaeological approach is the premise of exogenous population growth. The transitions with which we are concerned played out over centuries or millennia. On such time scales population must be treated as endogenous for the reasons discussed in Sections 2.6, 3.3, and elsewhere. We handle this problem through a Malthusian feedback loop where falling living standards lead to lower fertility and higher mortality, putting the brakes on population growth and leading to an equilibrium long-run population. These dynamics will tend to restrain the overharvesting of wild resources envisaged in the archaeological literature. There is no parallel problem for our theory because the effect of climate on resource abundance is clearly exogenous.
Before we consider the interaction between Malthusian population and resource depletion in more detail, it is important to be clear about the kind of depletion involved. First, in our theory we put great emphasis on exogenous changes in resource abundance due to climate change, so effects of this kind are included in our models. Second, to the extent that depletion effects are local and play out within one human generation, they are included through the diminishing marginal and average productivity of foraging labor. However, in our models the food resources grow back from one generation to the next. Therefore, depletion effects at the regional level or across generations are not included. Intergenerational resource depletion can lead to complex dynamics involving interactions between resource stocks and human populations. One influential example is the model of resource depletion on Easter Island constructed by Brander and Taylor (Reference Brander and Taylor1998).
A reasonable economic model of a transition to sedentism or cultivation typically has the feature that local population (the number of people at a production site) must pass a threshold level in order to trigger the transition in question. In this context, depletion has two effects going in opposite directions: (a) a downward shift in the marginal product curve for the highly ranked resource, which tends to accelerate the transition by reducing the population threshold required for it to occur and (b) a lower long-run population due to the decreased productivity resulting from depletion, which makes it less likely that the population threshold will be crossed. The net effect is indeterminate, so it is difficult to obtain sharp predictions.
Agent-based simulation methods have recently been used to study the interactions among mobility, population density, and resource depletion (Gallagher et al., Reference Gallagher, Shennan and Thomas2019). This framework endogenizes both population and resources. Not surprisingly, the authors find that when resources have high replenishment rates and low vulnerability to depletion, this tends to support more sedentism. They also find that sedentism is associated with lower mean food income, despite the absence from the model of the cost of children, crowding, or disease. The reason is that mobile foragers have more variance in their food incomes, and are at greater risk of falling below the subsistence level of food needed for survival. In equilibrium this is offset by a higher mean food income.
Although we agree that such interactions are of interest, we simplify our models by ignoring endogenous resource depletion and emphasizing the direct determination of resource abundance by nature. From a theoretical standpoint this enables us to show that resource depletion is not logically necessary to explain the transitions with which we are concerned. From an empirical standpoint we have not yet seen persuasive evidence that endogenous resource depletion was an important factor in the transitions to sedentism or cultivation. However, in our opinion the evidence favoring a causal role for exogenous climate change is quite convincing.
We do believe endogenous resource depletion was important for other phenomena in prehistory. For example, it seems likely that humans had a role in the mass extinctions of large mammals in Australia and the Americas (Riahi, Reference Riahi2020). It also seems quite likely that resource depletion contributed to numerous instances of social collapse.
We suggest a few ways in which empirical research might proceed. First, if an endogenous resource depletion process unfolds over centuries, traditional archaeological theory predicts that this should be accompanied by population growth because exogenous population growth drives the depletion. Our theory, on the other hand, predicts declining population or at least slower growth for Malthusian reasons, other factors held constant. Another distinction is that the traditional archaeological approach predicts falling living standards in the interval leading up to a transition involving sedentism or cultivation. By contrast, our Malthusian framework predicts that long-run living standards should remain roughly constant before such a transition, although living standards might fall afterward for the reasons we discussed in Section 4.10. We look forward to future archaeological findings that can disentangle these issues.
4.12 More Archaeological Evidence
This section briefly summarizes what is known (or sometimes, not known) about non-agricultural sedentism in several regions beyond those discussed in Section 4.3. All of these cases will be discussed further in Chapter 5. After describing the evidence, we will offer a few thoughts on the implications for our theory.
Northern China:
According to Shelach-Lavi (Reference Shelach-Lavi2015, 50–56), fewer than 20 very late (terminal) Pleistocene sites have been found in north China. Small stone tools, grinding stones, and ceramic vessels may have been part of a broad-spectrum hunter-gatherer strategy. The fact that ceramics are easily breakable and hard to transport suggests a “context of decreased mobility” (54). There are even fewer early Holocene sites in north China, and none is well documented.
Liu and Chen (Reference Liu and Chen2012, 46–51) believe that terminal Pleistocene sites show a high degree of hunter-gatherer mobility, with no evidence for storage facilities or residential structures. Based on site sizes, burials, and a range of technological features, they argue that early Holocene sites reflect a higher degree of sedentism (51–58).
Southern China:
For the late Pleistocene, evidence of human occupation is found in caves (information in this paragraph is from Shelach-Lavi, Reference Shelach-Lavi2015). Ceramic production dates back to 15,000–20,000 BP, the earliest in the world. The plant and animal species used were quite diverse. Open-air sites became common during the early Holocene, with some reflecting substantial expansion in community size. “House structures are found at most of the sites in the Yangzi River basin, together with formal burial pits, suggesting a transition to full-scale sedentism and the incipient development of village life” (59). There is an active debate about whether rice from these sites was domesticated (60).
Liu and Chen (Reference Liu and Chen2012, 61–64, 70–73) describe the early Holocene site of Shangshan in the Yangzi River region. They believe its “subsistence economy was characterized by the broad-spectrum strategy, relying on fishing, hunting, and collecting wild plants … [it was] a perhaps semisedentary or almost fully sedentary village” (72). They also state that technological innovations involving processing and storage for acorns and other nuts may have been prerequisites for a sedentary lifestyle. According to the authors, many of the features of this site parallel those of Jomon sites in Japan (see Section 4.3).
Saharan Africa:
Manning and Timpson (Reference Manning and Timpson2014) investigate population trends for the Sahara in the Holocene. The Sahara was much wetter between 12,000–6000 BP than today. This is known as the African Humid Period (AHP). In the early Holocene the region was occupied by mobile foragers, and in the mid-Holocene by pastoralists with domesticated cattle. The demographic trends include slow population growth before the AHP, a large increase shortly after 11,000 BP, a decline between 7600–6700 BP, a recovery during 6700–6300 BP, and a major collapse during 6300–5200 BP. The authors were unable to show a significant correlation between population and the available climate proxies, but argue that the synchronous nature of demographic change over a large geographic area indicates a causal role for climate.
Garcea (Reference Garcea2006) provides details on subsistence technology for the central Sahara and upper Nile Valley. She asserts that in the early Holocene a combination of hunting, gathering, and fishing made it possible for foraging groups to develop settlements that were continually occupied and semi-permanent. This lifestyle evolved along perennial rivers (the Nile) or seasonal watercourses. Food included fish, plants, and land animals attracted to riverbanks by drinking water. Evidence for greater sedentism includes pits, house foundations, deep wells, and clusters of stone structures (western Egypt); pottery, large combustion structures, intra-site organization, and walled huts (southwest Libya); and thickness of deposits, frequency of burials, density of potsherds and animal bones, and the construction of huts (Sudan). “Late foragers in the Sahara and Sudan continually occupied the same sites for long periods of time throughout the year” (214). Greater sedentism facilitated pregnancy, birthing, nursing, and childcare, which caused population growth. This led to new technology (pottery by 9000 BP, ground stone tools by 6000 BP, and also bone harpoons and fish hooks). Grinding stones were used for processing wild grain, dried meat, and fish, as well as cracking nuts. In the 7th millennium BP, climate deteriorated and some Saharan groups moved to the Nile Valley. Around the beginning of the 6th millennium BP, some groups returned to the desert where they herded domestic animals. This transition to pastoralism occurred several millennia before farming of pearl millet developed in the southern Sahara and Sahel (see Chapter 5).
South America:
The earliest stage of human occupation in the Americas is called the Paleo-Indian period and is associated with the hunting of now-extinct big game. The Archaic period comes after the Paleo-Indian period but before agriculture. Information in the next paragraph is from Moore (Reference Moore2014, ch. 4).
There is scattered evidence for Archaic sedentism in South America. At the end of the Pleistocene, people in southern Peru and northern Chile moved seasonally between the highlands and the coast, relying on fishing and mollusks from the latter. In the early Holocene, around 10,900–8900 BP, more permanent settlements emerged along the coast, indicated by post molds and a house floor. Travel into the sierra appears to have ceased. A group of 55 small sites emerged within one area, suggesting “more permanent coastal adaptations” (107). Similar settlements based on marine resources and sea birds arose elsewhere along the coast. At some coastal locations with freshwater sources in northern Chile, multiple lines of evidence indicate sedentism, including cemeteries with complex mummification practices. These traditions began around 7000 BP and lasted for nearly four millennia. Another well-watered location in southern Peru had three cemeteries, 75 terraced residential areas, and extensive shell middens. It was seasonally occupied in the Middle Archaic but became a permanent settlement in the Late Archaic. Moore does not mention any use of domesticated food sources at these sites.
Mesoamerica:
The Archaic period in Mesoamerica (about 9000–4000 BP) has traditionally been seen as a transitional interval between Paleo-Indian big-game hunting and the emergence of pottery. The semi-arid highlands of central Mexico appear to have been occupied by mobile foragers throughout much of the Archaic (information is from Kennett, Reference Kennett, Nichols and Pool2012). Although domesticated maize is not found in this area until the Late Archaic or after, domesticated squash may go back as far as 7900 BP. Similarly, in the Oaxaca Valley the main strategy seems to have been mobile foraging, despite dating of domesticated squash to 10,000 BP and domesticated maize to 6200 BP. These findings raise questions about the relationship between foraging and cultivation, although squash may have been used as a container rather than a food source. Matters are little clearer in the tropical lowlands, where many sites once located along the coast are now underwater. Some sites along the Pacific coast were used seasonally for mollusk harvesting, but these people may also have been swidden farmers. Because domesticated plants go back to the early Holocene (see Chapter 5), it is uncertain (at least to us) whether sedentary foraging without cultivation ever existed in Mesoamerica.
Eastern North America:
The discussion here is based on Gibson (Reference Gibson2006). A long tradition in the lower Mississippi Valley of building earthen mounds dates back to 7400–7300 BP. Most mounds could have been built by a few workers living in communities of a few dozen people. This practice peaked around 5600–5300 BP and then appears to have stopped for a long time. By far the largest such earthwork was built in 3700–3600 BP at Poverty Point in northeastern Louisiana. Construction is estimated to have taken at least a generation or two, and possibly a century or more. Over 2000 people, the Tamaroha, lived in the area, with more than four dozen villages and camps spread across 1800 square kilometers. No maize or other signs of cultivation have been found. Staple foods included fish and wild aquatic roots. Architectural evidence strongly suggests that the earthworks at Poverty Point were created by sedentary foragers who lived at the site year-round. The Tamaroha engaged in considerable technological innovation, including a new cooking oven, redesigned hunting equipment, improved fishnets and hoes, and stationary facilities. The causes for the rise of the Tamaroha are unclear, but they provide a striking example of sedentism without agriculture.
Although the evidence is fragmentary, events in some of the regions described in this section are consistent with our theory. China, Saharan Africa, and South America all developed sedentary foraging in the early Holocene, perhaps in response to an improved climate. This process often appears to have been associated with population growth and technological innovation. The situations for Mesoamerica and the Mississippi Valley are murkier. In all cases, we anticipate that archaeological research will supply new data that can be used to evaluate our theoretical framework.
4.13 Conclusion
We have defined sedentism to mean a willingness by some people to remain at a production site when weather conditions worsen. In our framework a time period is one human generation, so our theory is about the evolution of settlements that remain in use for decades or centuries, not merely years. Within a period we consider two kinds of sites, good and bad. Due to random weather shocks, sites that are good in one period may become bad in the next or vice versa. There are two methods of food collection, which for convenience we label “hunting” and “gathering.” The former involves portable tools (whether aimed at plants or animals) and the latter involves stationary tools (again, whether aimed at plants or animals). In principle, sedentism as we define it is logically compatible with exclusive reliance on portable tools or “hunting.”
We believe that climate change was the ultimate reason for increasing sedentism in the period leading up to the early Holocene. Our analysis starts from a climate where weather conditions have a low mean and high variance as at the Last Glacial Maximum. In the associated baseline equilibrium, only currently good sites are exploited and only “hunting” is used. Agents abandon a site whenever local weather conditions change from good to bad. We then consider an improvement in climate involving a higher mean and lower variance with respect to the distribution of weather conditions at individual sites.
Climate amelioration can lead to sedentism through three causal mechanisms. First, lower variance in weather conditions reduces the productivity gap between sites that are currently bad and sites that are currently good. Part of the regional population may therefore begin to use sites where the weather is currently bad. This implies that if the weather switches from good to bad at a particular site, some individuals will stay at that site, which fits our definition of sedentism. Second, in the long run a better climate yields population growth, which can lead directly to sedentism. Third, population growth can lead indirectly to sedentism through technological innovation.
The first two mechanisms are fully reversible, in the sense that if climate returns to its baseline state, this will set in motion an adjustment process that returns the system to the previous baseline equilibrium. This need not be true for the third mechanism that involves technical change. If innovations are always preserved once they are achieved, there is a ratchet effect where the productivity of gathering remains above its baseline value, population can remain above its baseline level, and sedentism can be preserved even if climate returns permanently to its baseline state.
Our theory is consistent with the fact that sedentism arose earliest and most often in temperate zones, where climate amelioration probably had the greatest proportional impact on natural productivity, rather than in tropical rain forests, deserts, or the Arctic. This would be expected for two reasons. First, the initial gap in productivity between good and bad sites was probably substantial in temperate areas. The subsequent large reduction in this gap would lead to sedentism through the first causal mechanism above. Second, the large proportional gain in mean productivity in temperate areas led to large responses in the form of Malthusian population growth. This provided more scope for our second and third causal mechanisms to operate. These implications seem consistent with the chronology of sedentism in southwest Asia and Japan set out in Section 4.3, and possibly with events in some of the regions discussed in Section 4.12.
We have addressed issues involving demography and living standards (in Section 4.10) and resource depletion (in Section 4.11). We close with a few comments on dietary breadth, property rights, investment, and site heterogeneity.
Dietary Breadth:
The model has implications for dietary breadth among foragers. A transition to sedentism involving our short-run causal mechanism (direct improvement in the quality of previously poor sites) does not imply any change in the food resources being exploited. A transition involving our long-run mechanism (Malthusian population growth without technological change) likewise does not imply any change in the nature of food resources. However, in our analysis a transition with technological innovation is always associated with the exploitation of new food resources. Our model predicts that this will occur first at sites with good weather conditions, and that previously exploited resources will remain in use at these sites as technology improves. Thus dietary breadth increases at such sites. This is consistent with the frequent comments by archaeologists that sedentism is associated with broad-spectrum food acquisition strategies.
Property Rights:
Reliable concentrations of natural resources tend to encourage both sedentism and the defense of property rights (Baker, Reference Baker2003; Bowles and Choi, Reference Bowles and Choi2013, Reference Feder2019). Allowing insiders to exclude outsiders from favorable sites would reinforce our conclusions, because agents trapped at sites where weather conditions have become bad will then face more difficulty in moving to a better site. Our definition of sedentism is based on the degree to which people remain at sites when weather conditions deteriorate, so closed access would tend to raise the rate of sedentism for the region as a whole. This is related to the idea of population packing in anthropology (see Kelly, Reference Kelly2013a). For more on the origins of group property rights over production sites, see Chapter 6.
Investment:
Another factor that can reinforce a transition to sedentism is the role of investments in durable capital. Returning to our metaphorical definition of “gathering” as a technology involving stationary assets, here we have treated such assets as automatic by-products of technological knowledge about “gathering.” In reality, labor time has to be taken away from food collection in order to create durable equipment and structures. The existence of secure property rights would encourage these investments. In turn, reliance on stationary assets would encourage agents to stay at a site despite weather shocks, thus reinforcing sedentism.
Site Heterogeneity:
In this chapter we emphasized positive climate trends as a basis for sedentism. However, suppose some of the sites within a region are consistently better than others, perhaps due to more reliable fresh water or more diverse ecosystems. A negative region-wide climate shock could disproportionately decrease productivity at sites of lower quality, causing migration from locations with poor permanent resources to those with richer permanent resources (e.g., from arid zones to river valleys). The short-run effect would be a local population spike at the higher-quality sites. As in Section 4.9, this local population spike can lower the marginal product of mobile food collection, lead to use of new food resources, and promote innovation involving the new resources. The result could be a tendency toward sedentism at the refuge sites. This is consistent with our narrative for southwest Asia in Section 4.3, where the cool and arid conditions of the Younger Dryas seem to have led simultaneously to more mobility for people in locations having few permanent geographic or ecological advantages, but continued or expanded settlements in favorable locations. We suggest in Chapter 5 that a dynamic of this kind can help explain the transition to agriculture in southwest Asia.
4.14 Postscript
This chapter is based on an article entitled “The origins of sedentism: Climate, population, and technology” in the Journal of Economic Behavior and Organization (Dow and Reed, Reference Dow and Reed2015). Funding was provided by the Social Sciences and Humanities Research Council of Canada (SSHRCC) and the Human Evolutionary Studies Project (HESP) at Simon Fraser University. We extend thanks to Peter Stahl and the audience at a conference of the Canadian Economics Association for comments on the original paper. The resulting opinions are our own.
Section 4.1 has been completely rewritten. Sections 4.2 and 4.3 include updates on the relevant literatures. We are especially grateful to Andrew Garrard of the Institute of Archaeology at University College London for advice about the literature on southwest Asia discussed in Section 4.3. The model in Sections 4.4–4.9 is formally identical to the one in the JEBO article but the text has been lightly edited. Sections 4.10, 4.11, and 4.12 are entirely new. We were generally aware of the archaeological information in Section 4.3 when we constructed the model, although not all the details from recent publications. We did not know about the archaeological evidence in Section 4.12. The conclusion in Section 4.13 has been rewritten to reflect changes elsewhere in the chapter.
5.1 Introduction
The importance of the transition to agriculture is almost impossible to overstate. Agriculture stimulated the spread of hereditary inequality and organized warfare, and was necessary for cities and states. Modern civilization could not exist without it. While the development of agricultural societies was slow, this process deserves the label “Neolithic Revolution” for the sheer scale of its impact.
The origin of agriculture is probably the most debated question in archaeology (Bellwood, Reference Bellwood2005, 14–28). It has also become a central concern for economists studying economic growth in the very long run. Before agriculture, technological innovation and population growth were unusual. After agriculture, they became commonplace. Further, as we will discuss in Section 12.3, there are strong correlations between the timing of the Neolithic Revolution across regions of the world and the modern economic performance of the nations located in those regions. A substantial group of economists believes that these correlations reflect causal dynamics unfolding over many millennia.
There is wide agreement that the most influential domestications were wheat and barley in southwestern Asia, rice in southern China, and maize in Mesoamerica. Some authors identify around 8–10 independent centers of plant domestication (Balter, Reference Balter2007); others identify as many as 24 (Purugganan and Fuller, Reference Purugganan and Fuller2009). We consider nine pristine centers: southwest Asia, northern China, southern China, Mesoamerica, South America, highland New Guinea, sub-Saharan Africa, eastern North America, and India.
From these pristine centers, agriculture diffused to most parts of the globe. We will not examine diffusion mechanisms here. For an archaeological study of the diffusion of agriculture from southwest Asia to Europe, see Shennan (Reference Shennan2018). For economic studies of diffusion, see Ashraf and Michalopoulos (Reference Ashraf and Michalopoulos2015) and Matranga (Reference Matranga2017).
Among pristine centers, southwest Asia has the most complete data documenting the transition from (a) gathering of wild plants to (b) cultivation to (c) domestication and finally to (d) fully agricultural societies. Cultivation refers to activities such as planting, weeding, and otherwise tending plants prior to harvest, rather than simply harvesting the plants supplied by nature. Domestication refers to genetic changes in plants caused by the practice of cultivation (Nesbitt, Reference Nesbitt, Cappers, Bottema and Baruch2002; Tanno and Willcox, Reference Tanno and Willcox2006; Weiss et al., Reference Weiss, Kislev and Harmann2006; Balter, Reference Balter2007). Following Price and Bar-Yosef (Reference Price and Bar-Yosef2011, S165) we use the term “farming” to mean the utilization of domesticated plants and/or animals for food and other purposes. We use the term “agriculture” to mean that farming and/or herding predominate among the activities of a community and provide its main diet.
The overall process from (a) to (d) took many thousands of years. Domesticated plants and animals were frequently accompanied for millennia by continued hunting and gathering before a completely agricultural economy arose. Gathering wild plants was a necessary first step in this process. Once cultivation began, from an economic standpoint it is not hard to understand the rest of the events in the sequence: the productivity of plant and animal exploitation increased over time due to learning by doing and domestication, and habitat for wild plants and animals was gradually destroyed by farming. Eventually this led to the displacement of foraging activities by agriculture. The origin of cultivation is the critical puzzle to be solved and provides the focal point for this chapter.
How can one explain the long lag between the evolution of modern humans and the shift to cultivation? Consider southwest Asia, which had a significant population of modern humans by 50,000 BP. Numerous grass species, including cereals that were later domesticated, were being gathered for food by at least 23,000 BP (Weiss et al., Reference Weiss, Wetterstrom, Nadel, Bar–Yosef and Smith2004). It then took over 10,000 years to move from gathering to cultivation (see Section 5.3). The transition from cultivation to fully domesticated plants and animals may have taken one or two millennia, with the transition to farming and finally agriculture requiring another one or two millennia.
The southwest Asian transition, like several other agricultural transitions, dates to around the Pleistocene–Holocene boundary. This suggests a common cause, and indeed a number of authors have pointed to climate change as a leading suspect behind the origins of agriculture. Richerson et al. (Reference Richerson, Boyd and Bettinger2001) assert that agriculture was impossible in the most recent Ice Age because prevailing climate conditions (low temperatures and atmospheric CO2 levels, extreme aridity, and large fluctuations on time scales of a decade or less to a millennium) severely limited returns on investments in agriculture. They argue that the warm, wet, and stable conditions of the Holocene encouraged cultural evolution toward reliance on specialized plant resources, leading ultimately to agriculture.
We agree that climate change was the trigger for cultivation in southwest Asia. However, we believe the role of climate was more complex than just a simple transition from poor conditions in the Pleistocene to good conditions in the Holocene. Following the Last Glacial Maximum, warmer and wetter conditions indeed led to abundant wild resources, population growth, and a sedentary lifestyle in southwest Asia as discussed in Section 4.3. This idyllic period ended around 13,000 years ago with an abrupt climate reversal, the Younger Dryas, in which colder and drier conditions returned. As we will discuss in Section 5.3, the timing of the first cultivation in southwest Asia is a matter for archaeological debate, but we believe that on balance the evidence supports initiation in the Younger Dryas.
Our causal explanation for the origins of cultivation during the Younger Dryas can be summarized as follows. Consider a region having many food production sites of varying quality, suppose there are many sedentary settlements in the region, and assume settlements are small enough to allow community decision-making with regard to food acquisition strategies. For simplicity let cereals be the only source of food and assume there are only two ways to obtain food: Gathering cereals or cultivating cereals. Because food is shared equally, the goal of each community is to maximize total food output.
Gathering and cultivation can be used in any combination by having some people engage in one and some in the other (or having individuals divide their time between the two). Each activity exhibits diminishing returns to labor: the additional food produced by an additional worker (the marginal product of labor) falls as more labor is devoted to that activity. Suppose that initially the marginal product of gathering exceeds the marginal product of cultivation. As long as this remains true, food output is maximized when all members of the community engage in gathering. But as more workers are added to the gathering activity, eventually the marginal product of gathering declines to a point where it is below the marginal product of the first worker who could potentially be employed in cultivation. Beyond that point, the maximization of total food output requires that some labor be used for cultivation.
For a graphical illustration of these ideas, see Figure 3.2 in Chapter 3, where we interpret the higher marginal product curve as gathering and the lower marginal product curve as cultivation, rather than interpreting the curves as referring to different resources. For any community size below the threshold Na shown in Figure 3.2, all labor is allocated to gathering, and for any community size above this threshold, some (but not all) labor is allocated to cultivation.
The question is what caused the sizes of settlements to increase so that it became optimal for some people to engage in cultivation. In our framework this resulted from migration among communities, where the migration process was triggered by climate change. We argue that a long period of good climate with abundant rainfall supported a high regional population, although the sizes of individual settlements were not yet large enough to induce cultivation. A climate shock in the direction of decreased rainfall (the Younger Dryas) reduced productivity throughout the region, but by proportionally more at sites with limited access to surface water relative to sites with rivers, lakes, marshes, or springs. This caused migration toward the latter sites, which served as refuge locations. Because the large regional population was forced through a narrow geographic bottleneck due to the limited number of refuge sites, there was a spike in local labor supply at these sites. This pushed such settlements beyond the size threshold required for cultivation.
We also explore long-run dynamics. One might think that with a recovery from the Younger Dryas and higher rainfall during the Early Holocene, population would once again disperse across the region, refuge settlements would lose population through out-migration, and cultivation would be abandoned in favor of a renewed focus on gathering. But after cultivation began, learning by doing gradually raised its productivity relative to gathering and this process preserved incentives for cultivation. This technological factor was reinforced by regional population growth in the Holocene, which tended to keep the sizes of many settlements above the threshold for cultivation. The resulting trajectory in southwest Asia led to domestication and fully agricultural societies.
In our model of this process, climate and geography are the exogenous variables. Climate change serves as the trigger, and geography is important because heterogeneity across production sites within a region is needed in order for climate change to generate migration effects. Cultivation is a short-run response to these migration effects, while in the long run, population and technology respond endogenously to climate.
We do not claim that the mechanism outlined above applies in every case, even if one accepts the fundamental role of climate change. We showed in Chapters 3 and 4 that technological innovation could be triggered by climate shocks of various kinds, such as biased negative shocks, biased positive shocks, and neutral positive shocks. The process we suggest for the southwest Asian region combines a neutral negative climate shock at the regional level with a bias generated by migration effects at the local level.
Similar dynamics may help to explain agricultural origins in China, sub-Saharan Africa, and other regions (see Section 5.10). However, we do not argue that the Younger Dryas was the specific climate trigger in all cases. For example, pristine agriculture arose in sub-Saharan Africa many millennia later, and we suggest that this occurred in response to a different negative climate shock, although the causal processes leading to cultivation appear to resemble those that played out much earlier in southwest Asia. Furthermore, in certain regions the Holocene itself may have functioned as a large positive shock that led to population growth and local densities sufficiently high to trigger cultivation. While we view climate change in some form as the most likely exogenous trigger, the causal details may well have varied considerably from one region to another.
Whether one prefers causal explanations based on climate change or other factors, in our view an adequate theory of the agricultural transition should address the following questions. Why were there relatively few pristine transitions? Why were there no such transitions in previous interglacial periods, in the last glacial period, or during the initial warming that followed the Last Glacial Maximum? Why were pristine transitions rare in tropical rainforests? Why did agriculture not emerge in the Holocene in some promising regions with large sedentary populations, sophisticated technologies, and complex social organization? Why did agriculture begin in places where natural resources were locally abundant, rather than marginal areas? We argue that our theory offers coherent answers to these questions, while competing explanations often fail to do so (see Section 5.11).
The rest of the chapter is organized as follows. Section 5.2 reviews a number of proposed causal explanations for the origins of agriculture. Section 5.3 summarizes the archaeological evidence for southwest Asia. We begin the construction of our formal model in Section 5.4. Short-run and long-run equilibrium are studied in Sections 5.5 and 5.6. We use the model to study the dynamics of the southwest Asian case in Section 5.7. Following this, Sections 5.8 and 5.9 discuss how the model addresses population trends and living standards.
We survey other agricultural transitions in Section 5.10 and important examples of non-transition in Section 5.11. These cases provide evidence that can be used to assess competing theories. Section 5.12 addresses the role of biological endowments, a factor emphasized by Diamond (Reference Diamond1997). Section 5.13 briefly summarizes our contributions. These include creating an empirically grounded explanation for pristine cultivation; showing that biased climate change, a frequent assumption in other work, is unnecessary and problematic; identifying heterogeneous production sites and migration as key causal factors; and endogenizing population and technology, which have often been treated as exogenous in previous theories. Section 5.14 is a short postscript. Proofs of the formal propositions are available at cambridge.org/economicprehistory.
5.2 Theories of Agriculture
Due to the vast scale of the literature, we will be highly selective in our review. We first consider theories from archaeology and then theories from economics.
Among archaeologists, one early advocate of a link between climate change and agriculture was V. Gordon Childe (Reference Childe1951, 67–72), who argued that reduced precipitation in arid sub-tropical regions forced animals, plants, and humans to retreat to springs and streams where water remained available. Childe believed that this process likely led to animal domestication. He thought that plant cultivation began earlier, perhaps in river valleys subject to annual flooding such as the Nile. Childe’s approach fell out of favor because he lacked data on climate as well as the location and timing of early cultivation.
During the 1960s and 1970s, archaeological attention shifted to theories based on population pressure (for references, see Weisdorf, Reference Weisdorf2005; Price and Bar-Yosef, Reference Price and Bar-Yosef2011). The “marginal zone” hypothesis suggested that population pressure would be felt first in areas with fewer foraging resources, and thus such areas would be the first to engage in agriculture. The “nuclear zone” hypothesis suggested the opposite: areas having a rich resource endowment would allow foragers leisure time to experiment with agriculture. Both arguments stressed the alleged effects of living standards on technical change.
From our standpoint, theories relying on population pressure as an exogenous variable are problematic for several reasons. First, in the short run people may migrate from areas with low food per capita to areas with high food per capita, and this tends to erode differences in living standards across sites. Second, in the long run, differences in living standards across sites tend to disappear due to Malthusian dynamics. Third, in the absence of external shocks, population stabilizes at an equilibrium level as in Chapters 2–4.
In recent decades numerous archaeologists have returned to climate explanations. Bar-Yosef and Meadow (Reference Bar-Yosef, Meadow, Douglas Price and Birgitte Gebauer1995) and Bar-Yosef (Reference Bar-Yosef2011) argue that cold and dry conditions during the Younger Dryas caused yields from wild cereal stands to decrease in southwest Asia, which provided incentives for cultivation. Bar-Yosef (Reference Bar-Yosef2011) extends this argument to explain the origin of cultivation in China. Similarly, Hillman et al. (Reference Hillman, Hedges, Moore, Colledge and Pettitt2001) suggest that the Younger Dryas drove foraging populations to cultivate staples in southwest Asia.
A common climate story among archaeologists is that climate change caused the natural range of valuable wild plants to move away from existing settlements, leaving the inhabitants with an unpleasant choice between staying in place and giving up important staple foods, or moving to follow these foods while giving up other advantages of their current location (perhaps reliable fresh water in a river valley). An alternative solution was to stay in place and cultivate the desired plants through increased inputs of human labor, as a substitute for inputs such as generous rainfall previously supplied by nature.
From an economic standpoint, the problem with this story is that it assumes the marginal product of cultivation labor for the desired plants somehow increased, at least in comparison with the marginal product of foraging, because otherwise cultivation would not become attractive. However, climate change was taking away complementary inputs such as rainfall, and thus shifting down the entire marginal product curve for cultivation. To meet this objection, one could maintain that the marginal product curve for foraging shifted down even more. But arguments based on biased climate shocks of this kind run into other difficulties, as we will discuss later in this section and in Section 5.13.
Another strand in the literature involves human behavioral ecology (HBE). This school of thought borrows concepts like marginal valuation and opportunity cost from microeconomic theory. HBE has a close connection to dietary breadth models and sees agriculture as a rational response to changes in resource availability (Winterhalder and Kennett, Reference Kennett and Kennett2006). Such changes could arise through climate shifts, population pressure, technological innovation, or resource depletion.
We share HBE’s emphasis on constrained optimization by individual agents. Our approach differs primarily by devoting attention to longer time periods, social aggregates, and system-level dynamics. We sympathize with the view that archaeologists should make greater use of economic concepts such as risk, discounting, economies of scale, and transaction costs in understanding the development of agricultural societies (Winterhalder and Kennett, Reference Winterhalder and Kennett2009), although our own model of initial cultivation does not employ these concepts. In particular, we assume agents are risk neutral, in contrast to a common focus on risk aversion among archaeologists (e.g., Bar-Yosef, Reference Bar-Yosef2011), largely because we want to construct tractable models and do not need risk aversion in order to obtain our results. We also want to avoid explanations based on unobservable risk preferences.
Some archaeologists highlight cultural or institutional factors, such as religion or status competition, as potential prime movers for agriculture. As Zeder and Smith (Reference Zeder and Smith2009) point out, proposals along these lines have not garnered much support. Zeder and Smith, like many archaeologists, view all “single-factor models” or “universal causes” as dubious. They stress the unique features of particular regional transitions and prefer multivariable explanations that incorporate climate, demography, economics, biology, social structures, and cultural belief systems.
At this point we would remind readers of our arguments in Chapter 1 about the virtues of simple models. But we also note that our own framework is not a single-factor model, despite the prominent role we give to climate change as a trigger. It also includes geographic, technological, and demographic factors, and requires that multiple necessary conditions be satisfied in order for a pristine transition to occur (see Section 5.11).
We now turn to research by economists. The main ideas in this literature involve property rights and resource depletion, technological innovation, and climate change. For convenience we organize the discussion according to these categories. However, some of the hypotheses discussed below involve multiple factors. For surveys of earlier work by economists, see Locay (Reference Locay1989, Reference Locay1997), Pryor (Reference Pryor1983, Reference Pryor1986, Reference Pryor, Field, Clark and Sundstrom2004), and Weisdorf (Reference Weisdorf2005).
Property Rights and Resource Depletion:
Economists have sometimes argued that foraging societies suffered from overharvesting of wild resources due to an open access property regime, and that this tragedy of the commons was exacerbated by population growth. As a consequence, private property and agriculture became attractive (Smith, Reference Smith1975; North and Thomas, Reference North and Thomas1977). Marceau and Myers (Reference Marceau and Myers2006) develop a more elaborate resource depletion story. They suggest that modest foraging capabilities were compatible with the existence of a grand coalition that restrained overharvesting. After technology advanced to a certain point, the grand coalition collapsed, overharvesting prevailed, and it became attractive for smaller coalitions to abandon foraging in favor of agriculture.
Riahi (Reference Riahi2020, Reference Riahi2021a, Reference Riahi2021b) focuses on domestication of animals rather than plants. He argues that the extinction rates for large mammals were tied to the timing of hominin dispersals across continents. The resulting coevolution of hominins and megafauna influenced the pristine emergence and diffusion of agriculture. In contrast to archaeological emphasis on the broad-spectrum revolution (Section 3.10) and resource depletion (Section 4.11), Riahi examines the suitability of the surviving megafauna for domestication. He also provides an econometric analysis of pristine agricultural transitions.
Bowles and Choi (Reference Bowles and Choi2019) continue in the tradition of linking property rights with agriculture, although without resource depletion. They argue that the decreased climate variance of the Early Holocene led to greater sedentism, that some sedentary groups with highly productive and concentrated resources developed private property institutions, and that some sedentary groups with private property also adopted farming. Bowles and Choi use evolutionary game theory to model these processes and emphasize two factors. First, a critical mass of individuals had to adopt private property institutions; and second, there was a positive feedback loop between farming and private property. This loop emerged because private property raised the payoffs from farming investments, while the outputs from farming were more conducive to the definition and defense of property rights than the outputs from foraging.
We agree with Bowles and Choi on some matters and disagree on others. We accept their view that early cultivation was no more productive than foraging and thus that the transition to cultivation did not have a technological trigger. We also agree that cultivation was not triggered by regional population pressure. However, we do not see private property as a necessary condition for early agriculture, and our formal model in this chapter shows how it could have developed under open access. We also differ from Bowles and Choi in our interpretation of the empirical evidence surrounding cultivation in southwest Asia, as will be discussed in Section 5.3.
The game-theoretic framework used by Bowles and Choi leads to two questions. First, how would it be possible to have two stable equilibria, one with foraging/common property and the other with farming/private property? Second, given two such equilibria, what would trigger a transition from one to the other? In contrast we rely on optimization at the local level and have unique equilibria at the regional level. Our central question is how a climate shock could move the system from a corner solution without cultivation to an interior solution with it. Unlike Bowles and Choi, we endogenize local and regional population, and we model technological innovation through learning by doing.
Our approach yields sharper predictions about the times and places of agricultural transitions, including some triggered by climate events later in the Holocene that were not related to the Younger Dryas (see Section 5.10). It also adds predictive content regarding population and productivity. On the other hand, Bowles and Choi offer predictions about family-level property institutions while we do not. Our theory of property rights, where groups rather than individuals or families control land, will be presented in Chapter 6.
Technology:
Agriculture is sometimes portrayed as a technological breakthrough analogous to the light bulb. In this view, agriculture was invented by an astute group of pioneers and diffused rapidly as others recognized its benefits. This scenario is no longer taken seriously for several reasons. First, anthropological research has made it clear that foragers are knowledgeable botanists and certainly grasp the feasibility of planting seeds in order to obtain a harvestable crop. Second, there are many examples of foragers who lived close to farmers or traded with them but did not embrace farming (Bellwood, Reference Bellwood2005, 28–43). Third, the hypothesis does not explain why agriculture arose in specific places and times, or why many millennia were needed to invent it.
Nevertheless, one can argue that a gradual accumulation of technical knowledge eventually made cultivation productive enough that it became attractive by comparison with foraging. Olsson (Reference Olsson2001) suggests that improvements in agricultural productivity could have arisen as a by-product of foraging. Hibbs and Olsson (Reference Hibbs and Olsson2004) and Olsson and Hibbs (Reference Olsson and Hibbs2005) develop a model of long-run growth in which the pace of innovation in a region is an increasing function of that region’s biogeographic endowment, including its suite of potentially domesticable plants and animals.
Weisdorf (Reference Weisdorf2003) combines exogenously improving food production with satiation in food consumption to model the emergence of a non-food-producing sector, which he identifies with the creation of food surpluses under agriculture. Weisdorf (Reference Weisdorf2009) argues that rising agricultural productivity was the central reason for the adoption of agriculture, and Guzmán and Weisdorf (Reference Guzmán and Weisdorf2011) assume explicitly that productivity for agriculture was higher than for foraging.
Baker (Reference Baker2008) presents a model where agricultural adoption depends on population density, technological sophistication, and natural resources. A key assumption is that the improvement of a general-purpose technology tends to increase total factor productivity more for agriculture than for foraging. Ashraf and Michalopoulos (Reference Ashraf and Michalopoulos2011, Reference Ashraf and Michalopoulos2015) propose that mild climate stress led to changes in foraging techniques, causing more investment in tools, infrastructure, and habitat clearance. In turn, this caused the latent productivity of agriculture to rise until a threshold was crossed and agriculture was adopted.
We agree that foraging activities could yield knowledge that would be useful for agriculture (for example, the development of sickles to harvest wild cereals). But in a foraging society, learning by doing seems more likely to raise foraging productivity than agricultural productivity. We doubt that agricultural productivity in a given region would have risen significantly in relation to foraging before cultivation had begun in that region. Accordingly, we are skeptical about hypotheses where increases in the latent productivity of cultivation play a central role in triggering a transition.
Climate Change:
As discussed earlier, archaeologists often treat climate change as a trigger for the Neolithic transition. However, such arguments can run into difficulty. If the climate improves, why doesn’t productivity rise equally for foraging and cultivation? If the climate deteriorates, why doesn’t productivity fall equally for both?
The model constructed by Olsson (Reference Olsson2001) includes the possibility that changes in climate were biased toward agriculture or against foraging. In particular, Olsson favors the view that conditions in southwest Asia deteriorated less for agriculture than foraging. Observe, however, that if this is true, the climate recovery of the Holocene should have restored foraging, which it did not. Other hypotheses involving direct climate biases are vulnerable to similar objections, unless they incorporate some kind of path dependence. Ashraf and Michalopoulos (Reference Ashraf and Michalopoulos2011, Reference Ashraf and Michalopoulos2015) rely on a different type of bias by arguing that mild climate stress promotes technological innovation biased toward cultivation. Given the lack of persuasive archaeological evidence for either direct or technically mediated climate biases, we remain doubtful about hypotheses of this sort.
Matranga (Reference Matranga2017) emphasizes a different climate dimension: seasonality. He notes that Earth’s orbital parameters lined up in the Early Holocene in a way that tended to raise seasonal variation in temperature and precipitation across the Northern Hemisphere. The causal argument is that greater seasonality increased incentives for sedentism and storage, and that sedentism facilitated a transition to agriculture. He stresses that seven centers of pristine agriculture were all highly seasonal, and establishes that agriculture spread more quickly in areas where the weather had more pronounced seasonal patterns. A limitation of Matranga’s approach is that he only offers a story about the shift from mobile foraging to sedentary foraging with food storage, not the full shift from sedentism to agriculture.
5.3 The Transition in Southwest Asia
Roberts et al. (Reference Roberts, Woodbridge, Bevan, Palmisano, Shennan and Asouti2018) provide a detailed history of climate conditions in southwest Asia during 16,000–9000 BP (see also Robinson et al., Reference Robinson, Black, Sellwood and Valdes2006; Bar-Yosef, Reference Bar-Yosef2011; Maher et al., Reference Maher, Banning and Chazan2011; Henry, Reference Henry, Bar-Yosef and Valla2013, and Section 4.3). Roberts et al. synthesize lines of evidence such as stable isotope measurements from speleothems and lake and marine sediments, along with pollen and charcoal data for a variety of species.
There are four phases. The coldest and driest spans 16,000–15,000 BP, and is associated with the Heinrich I climate event. Archaeologically, this corresponds to the Kebaran culture. Climate became warmer and wetter in the Bølling/Allerød interstadial of 15,000–13,000 BP, which corresponds to early Natufian culture. The third phase is the Younger Dryas stadial, bringing a reversal to colder and drier conditions during 13,000–11,700 BP and corresponding to late Natufian culture. Finally, warm and wet conditions returned in the Early Holocene starting around 11,700 BP. Archaeologically, the period 11,700–10,500 BP is known as the Pre-Pottery Neolithic A (PPNA).
There is some debate about the exact timing of the Younger Dryas in the Levant. For example, according to Bar-Yosef (Reference Bar-Yosef2011, S180), worsening conditions probably began around 12,600/12,500 BP rather than 13,000 BP (see also Rohling et al., Reference Rohling and Hassan2002; Mayewski et al., Reference Mayewski2004; Sima et al., Reference Sima, Paul and Schulz2004). The duration of the Younger Dryas in the Levant could thus have been shorter than what is indicated by ice cores from Greenland or Antarctica. Such ambiguities are unsurprising because the confidence intervals for regional climate proxies in southwest Asia are on the order of ±200–500 years (Roberts et al., Reference Roberts, Woodbridge, Bevan, Palmisano, Shennan and Asouti2018, 51).
Roberts et al. believe climate changes were synchronous across the region subject to uncertainties about dates. There were two major warming steps, one at approximately 14,500 BP and the other at approximately 11,700 BP. “These abrupt and sustained shifts in climate, with mean temperatures rising by up to 1°C per decade, had dramatic consequences for environmental resource availability and would have been felt directly within an individual human lifetime” (51).
Ohalo II, a site dating to 23,000 BP, provides the earliest evidence for extensive gathering of wild plants in southwest Asia. This site was submerged under the Sea of Galilee and discovered in 1989 after water levels dropped dramatically. The inhabitants gathered a broad range of grains (Weiss et al., Reference Weiss, Wetterstrom, Nadel, Bar–Yosef and Smith2004). A significant portion of their diet consisted of small-seeded grasses, far smaller than the wild cereals we associate with the origin of agriculture (which were utilized as well). At this time of extreme aridity, much of southwest Asia was thinly populated or unoccupied. People lived in small campsites in groups ranging from one to five families and pursued a nomadic lifestyle.
Early Natufian communities flourished in southwest Asia in the warm and wet Bølling/Allerød period (Bar-Yosef, Reference Bar-Yosef, Bellwood and Renfrew2002b, Reference Bar-Yosef, Fitzhugh and Habu2002c). Population grew substantially and some people became sedentary (see the references from Section 4.3). Food was plentiful and skeletal evidence shows few signs of trauma, suggesting little conflict among groups. Migration and settlement patterns closely followed changes in biomass, which in turn were largely determined by the amount and distribution of winter rains. This way of life lasted more than 1,500 years.
The formal model to be developed in Sections 5.4–5.7 is largely inspired by data from Abu Hureyra, a village located on the ecotone between the Euphrates River Valley and the woodland-steppe in what is today northeast Syria. Hillman et al. (Reference Hillman, Hedges, Moore, Colledge and Pettitt2001, 383) date the start of the village to approximately 13,500 BP, although others have dated it to about 13,100 BP (Colledge and Conolly, Reference Colledge and Conolly2010). In either case there is agreement that the initial occupation occurred during the late Bølling/Allerød. Thereafter Abu Hureyra provides a sequence of archaeological deposits extending through the Younger Dryas and into the Early Holocene, spanning more than 3,000 years. The information below is from Moore et al. (Reference Moore, Hillman and Legge2000) and Hillman et al. (Reference Hillman, Hedges, Moore, Colledge and Pettitt2001). We will review a number of debates about their arguments later in this section.
Excavation uncovered two superimposed settlements. Abu Hureyra 1 (AH1) was inhabited by sedentary foragers. The remains of food plants from 13,500 BP reflect a diverse diet typical of hunter-gatherer societies. There is no evidence for the cultivation of crops here or at any Natufian site before the Younger Dryas.
The use of several wild foods, including some caloric staples, fell rapidly in the early stages of the Younger Dryas. The order in which species declined was (i) drought-sensitive fruits and nuts, (ii) lentils and other large-seeded legumes, (iii) wheats and ryes, (iv) feather grasses, and (v) chenopods. This is consistent with advancing desiccation. Foods of types (i) and (ii) disappeared entirely while those of (iii), (iv), and (v) were used less heavily. Some staple foods from the river valley did not decline at all, probably due to regular over-bank flooding.
According to Moore et al. and Hillman et al., several lines of evidence indicate that cultivation began at Abu Hureyra 1 during the Younger Dryas. These include:
(a) Weeds. The decline in wild cereals was followed by the rapid rise of a weed flora typical of arid-zone cultivation involving substantial tillage. These weeds, which include small-seeded legumes, small-grained grasses, and dryland gromwells, are drought-sensitive and would not have replaced other plants without cultivation.
(b) Cereals. Wheats and ryes continued in use, albeit in smaller quantities, despite the YD having “almost certainly eliminated all wild stands from the area” (387). The authors argue that even under modern conditions no extensive stands of wild wheat or rye can grow within 60–70 km of Abu Hureyra. In the drier conditions of the YD, the nearest stands would have been even farther away.
(c) Legumes. Later in the YD lentils and other large-seeded legumes reappeared and increased in abundance, despite local absence for centuries and continuing aridity that would have prevented the natural re-establishment of wild stands.
The authors argue that cultivation of cereals and legumes would have been possible with clearance of competing scrub, soil tillage, and seeds obtained from distant wild stands. In their view, cultivation was precipitated by the decline in wild cereals and environmental stress was the trigger.
We note in passing that Hillman et al. (Reference Hillman, Hedges, Moore, Colledge and Pettitt2001) advanced a claim for domesticated rye seeds at AH1 early in the YD. This alleged finding was eventually rejected due to a lack of corroborating evidence from sites elsewhere in the region, possible intrusion from higher levels of the excavation, doubts about the radiocarbon dating, and difficulties in distinguishing large wild seeds from domesticated seeds (Nesbitt, Reference Nesbitt, Cappers, Bottema and Baruch2002; Balter, Reference Balter2007; Colledge and Conolly, Reference Colledge and Conolly2010). Accordingly, we will not comment further on this matter.
During the Younger Dryas, part of the Natufian population returned to a nomadic lifestyle (see Bar-Yosef and Meadow, Reference Bar-Yosef, Meadow, Douglas Price and Birgitte Gebauer1995; Bar-Yosef, Reference Bar-Yosef, Bellwood and Renfrew2002b, Reference Bar-Yosef, Fitzhugh and Habu2002c). An alternative option was to migrate to sites in river valleys and on lake shores where water was nearby, and where expanses of fertile soils became available as rivers and lakes shrank due to aridity (Bar-Yosef, Reference Bar-Yosef, Bellwood and Renfrew2002b, 116; Mithen, Reference Mithen2003, 53). The migration of population to Abu Hureyra 1, implied by growth in its geographic size, is an example. The population of Abu Hureyra 1 rose to perhaps as many as 100–300 people at its maximum during the Younger Dryas. The absence of evidence for any violence or fortifications at Abu Hureyra 1 suggests that this migration process was peaceful.
Access to reliable water supplies and fertile soils also characterized the other sites in southwest Asia that had an early shift to agriculture (Smith, Reference Smith1998). Bellwood (Reference Bellwood2005, 57) describes the sites in southwest Asia with the oldest potentially domesticated cereals as follows: “[A]ll were located near springs, lakes, or riverine water sources. Such sites include Jericho, Netiv Hagdud, Gilgal, Tell Aswad, Abu Hureyra, and Mureybet. Early agricultural sites elsewhere, such as Ali Kosh in Khusistan, had similar advantages.”
Abu Hureyra 2 is dated from roughly 11,400 BP, that is, within the PPNA. The villagers were farmers who also collected wild plants and hunted game, but eventually became wholly dependent on domesticated plants and animals. The population increased rapidly to levels more than twenty times the population of Abu Hureyra 1 and surpassed almost all other contemporary sites in southwest Asia.
In the preceding narrative we outlined three central arguments for cultivation at Abu Hureyra during the Younger Dryas. We turn now to archaeological debates about these claims, especially controversies over the role of arable weeds as an indicator.
Colledge and Conolly (Reference Colledge and Conolly2010) suggest that some “weeds” identified by Hillman et al. (Reference Hillman, Hedges, Moore, Colledge and Pettitt2001) at AH1 during the YD could actually have been food items obtained without cultivation. In particular, they suggest that small legumes and small-seeded grasses were low-ranked foods that replaced other increasingly scarce high-ranked foods. The former were common in steppe ecosystems and would have been available as climate worsened (along with feather grasses and chenopods). They maintain that this provides a more parsimonious explanation than invoking cultivation, although “[w]ithout discounting entirely the possibility of cultivation of wild cereals and legumes” (124–125).
We agree that dietary substitution toward small legumes and small-seeded grasses is a plausible explanation for the increasing frequency of such seed remains. But we note that Colledge and Conolly do not make a similar argument for the third weed category of dryland gromwells, which were “stony seeded” and inedible, as well as drought-sensitive and therefore unlikely to rise in frequency in the absence of cultivation. We also note that they do not dispute the assertions by Hillman et al. regarding the absence of uncultivated wild cereals within a reasonable distance of Abu Hureyra during the Younger Dryas, or the return of large-seeded legumes at a time when arid conditions persisted. Therefore these lines of evidence for cultivation at AH1 still stand.
Partly in response to Colledge and Conolly’s argument that some of the “weeds” at AH1 could have been food items, Willcox (Reference Willcox2012) identified 19 weed taxa that have been recognized as arable weeds at later Neolithic and Bronze Age agricultural sites, and have no known human use. He argues that “arable weeds are probably the best indicator of pre-domestic cultivation when the crops were morphologically indistinguishable from their wild ancestors” (163). His main results are as follows. The LGM site of Ohalo II had two weed taxa. The Natufian sites of AH1 and Dederiyeh each had six. A group of PPNA sites with no domestication had between 9 and 16, while a group of later Neolithic sites with domestication had between 12 and 17. Willcox asserts that the similarity in the counts for the latter two groups indicates cultivation at the PPNA sites. For the Natufian sites of Abu Hureyra and Dederiyeh, he comments that the weed frequencies “are low but nevertheless present. Because of this we should not totally discount the possibility of cultivation at Natufian sites” (166).
Bar-Yosef (Reference Bar-Yosef2011) adds Tel Qaramel, Mureybet, and Jerf el-Ahmar to the list of Natufian villages where arable weeds suggest cultivation in the Younger Dryas. In the last decade, the weed taxa identified by Willcox (Reference Willcox2012) have become widely accepted as a way of inferring cultivation prior to domestication (see the references from Weide et al., Reference Weide2021, 2) and have been used to support claims of cultivation during the Younger Dryas.
However, Weide et al. (Reference Weide2021) have recently criticized the conclusions of Willcox (Reference Willcox2012). They compare the weeds in modern cultivated fields with weeds in modern wild cereal stands, all located in the Levant. They show that the identification of weeds at the species level does allow one to distinguish statistically between the cultivated and wild stands. However, the Willcox taxa are at the broader genus level and fail to distinguish between the two types of habitat. Hence, knowing whether the Willcox weed taxa were present or absent at a site would not assist with inferences about the presence or absence of cultivation at that site.
The issue for us is whether this critique sheds any light on the presence or absence of cultivation during the Younger Dryas. In considering this issue, one should recognize that different authors are asking different questions. Moore et al. (Reference Moore, Hillman and Legge2000) and Hillman et al. (Reference Hillman, Hedges, Moore, Colledge and Pettitt2001), along with Colledge and Conolly (Reference Colledge and Conolly2010), are concerned with the changing plant frequencies at one site (AH1) and whether these dynamics support the hypothesis of cultivation at that location. Willcox (Reference Willcox2012) does not address these dynamics and instead compares the numbers of weed taxa across different sites observed at different points in time, although he does suggest that cultivation at AH1 should not be discounted.
Weide et al. (Reference Weide2021) make cross-sectional comparisons of modern sites with and without cultivation. They find that Willcox’s taxa are not helpful in drawing inferences about the presence of cultivation, which calls into question the recent archaeological use of these taxa to infer cultivation at various PPNA and Younger Dryas locations. Susan Colledge (private communication) points out that there is a mismatch between Weide et al. and Willcox in the data on which their results are based. Weide et al. use present-day ecological differences at the species level. But it is rarely possible at pre-Neolithic sites to identify wild taxa beyond the genus level because preservation is usually too poor. New data from older sites will be required to refute or confirm Willcox’s work.
We do not know how archaeologists will resolve this issue. But whatever the resolution may be, the Weide et al. findings have no relevance for the changes in plant frequencies observed at AH1 over time. For reasons discussed above, we believe that three lines of evidence for cultivation advanced by Hillman et al. (Reference Hillman, Hedges, Moore, Colledge and Pettitt2001) remain intact.
We now turn to two further issues: the timing of domestication, and the length of the lag between cultivation and domestication. For cereals, peas, lentils, bitter vetch, and chickpeas – the early domesticates in southwest Asia – a key difference from wild varieties is that domesticated crops cannot reseed themselves. Therefore, if domesticated seeds are found, this indicates the existence of cultivation because the seeds in question must have been both planted and harvested, not just harvested. Harlan (Reference Harlan1995), Smith (Reference Smith1998), and Moore et al. (Reference Moore, Hillman and Legge2000), provide details on the morphological markers of domestication. According to Willcox (Reference Willcox2012), the first domestic cereals in southwest Asia date to about 10,500 BP. Roberts et al. (Reference Roberts, Woodbridge, Bevan, Palmisano, Shennan and Asouti2018, 48) date the PPNA and PPNB to 11,700–10,500 and 10,500–9000 BP, respectively. Thus the first domesticated seeds appear at the boundary of the PPNA and PPNB.
As we discussed above, claims for cultivation in the PPNA or the Younger Dryas often rely upon arguments about arable weeds, crops found far from their natural habitats, or other circumstantial evidence. A separate line of argument involves the lag from cultivation to domestication. Artificial selection (which may be unintentional) takes time, so there is inevitably some interval between the point at which cultivation begins and the point at which it results in observable domestication. For cereal crops in southwest Asia, in the early 1990s this lag was estimated at no more than a few centuries, perhaps much less (Hillman and Davies, Reference Hillman and Stuart Davies1990). However, subsequent archaeological research and field experiments overturned this view, resulting in a new consensus that the lag was at least a millennium (Tanno and Willcox, Reference Tanno and Willcox2006; Balter, Reference Balter2007).
Allowing a time lag of 1000 years from cultivation to domestication places initial cultivation at 11,500 BP, within two centuries of the boundary between the PPNA and the Younger Dryas. If the lag was actually 1500 years then cultivation began at about 12,000 BP. Larson et al. (Reference Larson2014) estimate that for southwest Asia, “documented exploitation” or “necessary lead-time to domestication” began by 12,000 BP for wheat, barley, lentil, flax, sheep, goat, and pig, and by 11,500 BP for peas and cattle. The date of 12,000 BP falls squarely within the Younger Dryas.
Next we return to our earlier discussion of Bowles and Choi (Reference Bowles and Choi2019) from Section 5.2. We agree with these authors that morphologically domesticated seeds do not appear in southwest Asia until about 10,500 bp. But as discussed above, recent scholars accept lengthy lags between cultivation and domestication. Bowles and Choi characterize the relevant lag as “a matter of centuries” and are only prepared to grant that cultivation may have begun “at the very end of the Younger Dryas in Southwest Asia” (2194), although they do not think this is likely. But the recent consensus puts the lag at a millennium or more, providing substantial evidence that cultivation began in the Younger Dryas.
Bowles and Choi (Reference Bowles and Choi2019, 2215–2216) interpret Abu Hureyra as a case in which sedentism led to private property, which then led to farming. We are not aware of any evidence for the evolution of family-level property rights at AH1 within the relevant time frame. But there is strong evidence for a negative climate shock at AH1, and there is also evidence for local population growth, which is consistent with the model we will develop in the next several sections.
In their discussion of Dow et al. (Reference Dow, Reed and Olewiler2009), Bowles and Choi (Reference Bowles and Choi2019, 2220) suggest that we do not need climate adversity to obtain the local population growth that triggers cultivation in our story. This is correct, and in Section 5.10 we suggest that some pristine transitions to agriculture may have resulted from long-run Malthusian population growth generated by positive climate trends. But taking the various lines of evidence as a whole, we think it is more likely than not that initial cultivation in southwest Asia was triggered by the Younger Dryas. We recognize that future archaeological research may lead to a different conclusion. Nevertheless, we think the evidence is strong enough to warrant the exploration of causal mechanisms through which a negative climate shock could have led to initial cultivation. Sections 5.4–5.7 study one such mechanism.
5.4 The Production Site
Our goal in the next several sections is to construct an initial equilibrium in which there is no cultivation at any site, and identify conditions under which a new equilibrium emerges with positive labor time allocated to cultivation at some sites. This parallels the analytic approach taken in Chapters 3 and 4.
We treat a region as having a continuum of food production sites, where people and technological knowledge can flow freely across sites. Informally, two sites are in the same region if the cost of moving between them is relatively low, and different regions if the cost of moving between them is relatively high. These costs could involve physical distance; geographic barriers such as mountains, deserts, or oceans; social barriers such as territoriality, strong norms against intermarriage, or differences in culture or language; technological barriers involving an inability to exploit unfamiliar natural resources; or informational barriers involving a lack of knowledge about the features of remote sites.
When regions are defined in this way, exogenous shocks such as climate change may cause populations to move among the sites within a region but not between regions. Changes in the aggregate population of a region are therefore driven by natural increase or decrease, not migration. On the other hand, migration is crucial for determining the local populations at particular sites within a region. Although our formal modeling will simplify by allowing any individual agent to locate at any site in a region, in practice we do not need this stark assumption to obtain our qualitative results. It is sufficient to have local populations move freely between adjacent sites, so that from an economic point of view all sites in the region are “in the same market” and adjustments in local populations rapidly erode differences in food per capita across sites.
We do not regard cultural diffusion or trade as sufficient evidence that two sites are in the same region. Our migration arguments involve changes in the distribution of population across production sites, and transmission of culture or objects is not enough to establish that this occurred. Moreover, our model of technology from Chapter 3 involves learning through observation, so transmission of a technique to a new location generally requires the physical movement of knowledgeable individuals.
A region may have many different plant species that are available for cultivation. We will aggregate all such plant species and consider only two sources of food: foraging and cultivation. The food from these sources is identical in consumption. This clarifies the main causal mechanisms at work, but one could build a similar model in which each species has unique ecological and technological characteristics.
Each agent is endowed with one unit of labor time. The production function for foraging at an individual site is
and the production function for cultivation at a site is
, where
is foraging labor,
is cultivation labor,
is climate, and
is site quality. Climate is identical for all sites in the region and refers to average temperature and precipitation. Site quality is a permanent feature of a particular location, perhaps reflecting access to fresh water or fertile soil.
Assumption 5.1
and
are twice continuously differentiable. When any input is zero, output is zero. When all inputs are positive, each input has a positive first derivative and a negative second derivative.
The adult population
at a given site allocates labor time to maximize the total food obtained from the site, where we define the maximum food to be
(5.1)Proposition 5.1
(optimal time allocation).
Fix
. The optimization problem in (5.1) has a unique solution. H is continuous in (n, c, s) and strictly concave in n for any given
. The optimal
and the maximum food H are differentiable functions of (n, c, s) except possibly at the boundary
defined in (5.2) below.
We assume that when the local population n is sufficiently small, the marginal product of foraging exceeds the marginal product of cultivation even when all labor is allocated to foraging. However, when the population of a site becomes large enough, the diminishing marginal product of foraging eventually makes it attractive to put some labor into cultivation. Assumptions of this sort are standard in the economic literature on the origins of agriculture (Weisdorf, Reference Weisdorf2005). Similar assumptions were used in Chapters 3 and 4 (see Section 3.6 and Figure 3.2). We formalize this idea as follows.
Assumption 5.2 Fix
.
and
, where the subscript n denotes differentiation.
is positive and finite, with
.
Next define the population threshold
by
A5.2 implies that for
it is optimal to set
and
(zero cultivation), but for
it is optimal to set
and
(positive cultivation).
As discussed in Section 5.2, archaeologists have sometimes relied on exogenous population growth to explain the origins of agriculture. This might seem to be an obvious way in which to attain a population beyond the threshold na(c, s) and trigger cultivation. But as we argued in Chapters 2–4, explanations based on exogenous population growth are unsatisfying. Instead, we endogenize local population through migration among sites in the short run, and we endogenize regional population through Malthusian dynamics in the long run. Migration will provide the trigger needed to initiate cultivation.
We have established in Chapters 3 and 4 that climate changes need not be directly biased for or against particular resources or techniques in order to affect time allocation. Similarly, in the present context we do not want to claim that the negative shock of the Younger Dryas was directly biased in favor of cultivation relative to foraging. This is partly because there is insufficient archaeological evidence to support such a claim, and partly because we want to make the theoretical point that such biases are unnecessary to explain the transition to agriculture. An indirect causal channel where the climate shock leads to migration across sites can do the job.
We say that climate and site quality are neutral if na is a constant that does not depend on (c, s). This is true if the production functions have the following multiplicative form:
Assumption 5.3 ![]()
where k > 0 is a productivity parameter. A5.3 implies that the population threshold na is a decreasing function of the cultivation productivity k, so it is easier to cross this threshold when the latent productivity of cultivation is higher. A5.3 also implies that total output has the form
and that the optimal labor allocation (nf, ng) for a given n is independent of (c, s).
Food output per capita with the climate quality
and site quality
is
By the strict concavity of H, output per capita (y) is decreasing in local population (n).
Lemma 5.1
Suppose A5.1–A5.3 hold. For any fixed
we have
(a)

(b)

As we will explain in Section 5.6, this guarantees that it is possible to support a positive regional population in the long run. In contrast to Chapters 3 and 4, there is no need to consider cases where the equilibrium population is zero.
5.5 Short-Run Equilibrium
Because the region as a whole has many individual sites, and agents can migrate freely among sites, the definition of short-run equilibrium must take into account both the optimal time allocation from (5.1) for each individual site and the idea that in equilibrium, no agent wants to move to a different site. Within a given time period t, events unfold in the following sequence.
(i) Let the initial number of agents at a typical site of quality s at the start of period t be nsto. All sites of the same quality level have the same number of agents.
(ii) The current climate ct and the site qualities s are observed by all agents.
(iii) Each agent decides whether to stay at her current site or move elsewhere. These decisions determine final population levels for each site, denoted by nst.
(iv) The agents at each site obtain food by allocating labor to foraging and cultivation as in equation (5.1). Food is shared equally at each site, yielding yst per agent.
The location decisions at step (iii) are made by individual agents and involve a comparison of food per capita across all sites. If food per capita is unequal across sites, the agent goes where the most food is available. The agent is indifferent between sites that offer the same amount of food. The optimization decisions at step (iv) are made by local groups and involve a comparison of marginal products for foraging and cultivation at a given site with a given population. We assume that agents can anticipate their food income at step (iv) when making location decisions at step (iii). An individual agent is “small” relative to the population of a site. Therefore, she ignores her own influence on the group time allocation and the food per capita at a site when choosing among sites.
We assume an open access property rights regime where insiders cannot exclude outsiders. If there are no mobility costs and an individual agent treats the post-migration population at each site (nst) as parametric, then in equilibrium food per agent (yst) must be equal across sites. Otherwise, some agents would switch to different sites. Hence short-run equilibrium in period t requires a uniform food per capita
for all
.
The open-access assumption is a convenient way to generate a positive short-run relationship between site quality and local population, as in Proposition 5.2(b) below, but a similar relationship could arise under more complex property rights systems that impede full food equalization. For example, foragers can frequently move to better locations by exploiting kinship networks, generating a tendency for population to concentrate at good sites (Kelly, Reference Kelly2013a). Our qualitative results survive as long as this tendency exists.
Let Nt be the overall population mass for the region as a whole in period t. This fixed population is distributed across sites to equalize food per capita. We will write the inverse of y(n, c, s) as n(y, c, s), where the latter function indicates the local population n that yields the food per capita y when climate and site quality are
.
Definition 5.1 Fix the climate
and the regional population
for period t. The food per capita yt and associated local populations n(yt, ct, s) for
are a short-run equilibrium (SRE) for period t if
where q(s) is the density function for site quality (assumed positive everywhere).
The function q(s) gives the number of sites for each quality level s and is exogenously determined by the geography of the region. The equilibrium population densities
ensure that all agents throughout the region have the same food per capita yt and therefore do not want to change sites. This gives the following results.
Proposition 5.2
(short-run equilibrium).
(a) For any
, the SRE condition (5.4) has a unique solution
where yt is decreasing in Nt and increasing in ct.(b) Write
. For a fixed regional population
and a fixed climate
, the local population nst is increasing in the site quality s.
Part (a) shows that in the short run, food per capita across the region is lower when the regional population is higher (due to diminishing returns to labor), and food per capita is higher when climate is better. Part (b) gives the intuitive result that when agents are free to move, the higher-quality sites have higher populations.
Recall that cultivation will occur at a site if and only if that site has a population exceeding the threshold na. Because the best sites
have the highest populations in SRE, a necessary and sufficient condition for cultivation to arise somewhere in the region is
. If cultivation occurs at all, it must occur at the best sites.
We want a causal mechanism that links climate change with the local populations at the individual sites. For this purpose we think of the climate parameter (c) as indexing rainfall and the site quality parameter (s) as indexing surface water (springs, marshes, rivers, lakes). Under this interpretation, the climate quality c and site quality s should be good substitutes, because water from the sky is a good substitute for water on the ground. With this idea in mind, we impose more structure on the production technology.
Assumption 5.4 The function A(c, s) from A5.3 has constant returns to scale and an elasticity of substitution greater than unity.
A5.4 is a technical way of saying that rainfall and surface water are close substitutes. If the climate shifts toward greater rainfall at all sites, this is more beneficial for poor sites that have little surface water than for good sites that have abundant surface water. Therefore, productivity rises proportionately more at the poor sites. In order to maintain a uniform food per capita across all sites in the region, it is necessary for some population to move away from the sites where the productivity increase was relatively small, and toward the sites where the productivity increase was relatively large. For this reason, population at the best sites (those with s = 1) must fall in the short run.
The converse is also true. If the region as a whole suffers from reduced rainfall, this reduces productivity everywhere, but productivity declines more in relative terms at inferior sites that were more dependent on rainfall. It declines less in relative terms at the best sites, which are buffered from the shock by their proximity to rivers, lakes, and other permanent water sources. To preserve equal food per person, some population must shift from the relatively poor sites to the relatively good ones. This is the mechanism we will use to link climate change with cultivation in Section 5.7.
5.6 Long-Run Equilibrium
The next step is to make the regional population N endogenous. We do this in the standard Malthusian way. As in earlier chapters, a time period is one human generation. The number of surviving adult children
for an individual adult agent is an increasing function of that agent’s food income (y).
Assumption 5.5
is continuous and increasing with
and
. There is a unique
with
.
Let the number of adults at a typical site of quality s at the start of period t be nsto. The sequence of steps in each period is the same as in (i)–(iv) from Section 5.5, except that to model population dynamics we now add a fifth step:
(v) The current adults die at the end of period t. At the start of period t+1 the initial adult population at a typical site of quality s is
.
The food income yt at step (v) is determined according to D5.1. Because all sites have the same food per capita and the same population growth rate, we can write
where
is food per capita in SRE for the population Nt and climate ct.
We define long-run equilibrium as follows.
Definition 5.2 Fix the climate
. N*(c) is a long-run equilibrium (LRE) population for c if
. The LRE is non-null if N*(c) > 0.
The intuition behind this result is simple. To have a stationary regional population, food per capita must be y*. To compute the size of the stationary population, we must find the value for N that yields y* in short-run equilibrium. We therefore substitute
in the definition of SRE from D5.1, which determines what N*(c) must be in order to give y*. It is clear from (5.6) that the solution for N*(c) is unique, and it is non-null when
. The fact that N*(c) is an increasing function comes as no surprise: a better climate raises food output, and by Malthusian reasoning this allows the region to support a larger population.
As in Chapters 3 and 4, we simplify by assuming that the population converges to LRE along a monotonic path. This will be useful for the graphical analysis in Section 5.7.
Assumption 5.6 Monotone Population Adjustment. Let
so
. Consider some initial period arbitrarily labeled
in which the regional population is
, where
. If the climate
prevails in period
, the regional population moves in the direction of N*(c) in period
. Hence if the initial population is below the LRE level, we have
. If it is above the LRE level, we have
. If
is constant over time, the sequence
converges monotonically to N*(c) starting from any initial population
.
As long as the expression ρ[z(N, c)]N from (5.5) is increasing in N (that is, the direct positive effect of N exceeds the indirect negative effect operating through z and ρ), population adjustments will be monotonic.
5.7 The Effects of Climate Change
This section applies the formal model to the case of southwest Asia discussed in Section 5.3. We consider the temporal climate sequence {cIA, cWA, cYD, cHO} where IA = Ice Age, WA = Initial Warming, YD = Younger Dryas, and HO = Holocene. The loose term “Ice Age” refers to the Last Glacial Maximum or the Heinrich I event, depending on how one chooses to use the model, and the shorthand term “Initial Warming” refers to the Bølling/Allerød (B/A) interstadial. In order of climate quality, the parameter values are ranked from worst to best as
.
For reasons discussed earlier, we are especially interested in changes in rainfall. Roberts et al. (Reference Roberts, Woodbridge, Bevan, Palmisano, Shennan and Asouti2018) estimate that the Heinrich I phase (our “Ice Age”) had mean monthly precipitation around 60% of the modern level, the B/A (our “Initial Warming”) had mmp around 120% of the modern level, the Younger Dryas had mmp about 75% of the modern level, and the Early Holocene had mmp about 130% of the modern level. These estimates are consistent with the ranking we adopt for the four levels of our climate parameter.
We will use graphs to explain the qualitative features of the model. We simplify by treating climate changes as a series of abrupt jumps in the parameter
, with each jump followed by a period of stasis until the next jump. In reality, climate changes were considerably more complex and erratic. Even so, the speed of the warming events at 14,500 BP and 11,700 BP gives some justification for this stepwise approach (Roberts et al., Reference Roberts, Woodbridge, Bevan, Palmisano, Shennan and Asouti2018). The analysis in this section ignores effects on population caused by shifts from a mobile to a sedentary lifestyle, or vice versa. We discuss this issue and related demographic matters in Section 5.8.
Ice Age:
Denote long-run regional population at the LGM by
. The absence of cultivation in the last Ice Age indicates that even at the best sites
, local population n(y*, cIA, 1) was below the cultivation threshold na. We will use this long-run equilibrium as a starting point.
Initial Warming:
In the short run, a climate improvement from cIA to cWA would increase food per capita at all sites according to the function
, with regional population fixed at NIA. This results in a move from A to B in Figure 5.1. According to Proposition 5.3, cultivation cannot be a short-run response to this climate improvement, because the local population falls at the best sites:
If cultivation is not used at the best sites, it cannot be used at inferior sites either. The short-run loss in population for
is shown as a move from A to B in Figure 5.2.

Figure 5.2. Population density at the best sites
If the climate cWA had remained in effect permanently, in the long run population would have risen to a higher steady state
and food per capita would have returned to y* (a move from B to C in Figure 5.1). But even with population growth at the regional level, the local population at the best sites n(y*, cWA, 1) was not enough to stimulate cultivation (see the move from B to C in Figure 5.2).
Younger Dryas:
Now consider the deterioration in climate from cWA to cYD due to the Younger Dryas. In the short run, the regional population stays fixed at the level NWA inherited from the Initial Warming. Food per capita diminishes (a move from C to D in Figure 5.1). More significantly, a migratory response causes local population to rise at the best sites (a move from C to D in Figure 5.2), crossing the na threshold and yielding initial cultivation. Algebraically, we have
The reasoning involves another application of Proposition 5.3. As precipitation declined, climate refugees fled sites that lacked permanent water sources and sought sanctuary at better locations. This is consistent with the fact that many sites were abandoned during the Younger Dryas, and also with the fact that the sites in continued use (such as Abu Hureyra) were less dependent on rainfall. As noted in Section 5.3, the population of Abu Hureyra expanded in this phase, probably due to in-migration. Bar-Yosef (Reference Bar-Yosef, Bellwood and Renfrew2002b, 116) likewise suggests that as marginal areas became drier, kinship-based relocation caused population to rise in the fertile belt of the Levant.
Whether a climate reversal triggers cultivation in the short run depends on the size of the regional population when the negative shock hits. For a given climate c, let
be food per capita when the local population at the best sites is precisely at the threshold required for cultivation. If
, we must have
so cultivation occurs at the best sites. Substituting ya(cYD) into (5.4) determines the regional population threshold Na(cYD) beyond which cultivation would begin due to the migratory effects set in motion by the Younger Dryas.
Using (5.6) to compute the long run population
inherited from the Initial Warming phase, cultivation is a short run response to the Younger Dryas if and only if
This inequality says that the regional population associated with the Initial Warming was large enough that when the Younger Dryas arrived, the population at refuge locations like Abu Hureyra grew sufficiently via migration to make cultivation attractive. Because the inequality (5.9) states a necessary and sufficient condition for the transition to cultivation, we pause to provide some interpretive details.
First, a larger pre-existing regional population makes a transition more likely, all other things being equal. This follows immediately from the appearance of
on the left-hand side of (5.9). Second, a more severe climate reversal is more likely to trigger an agricultural transition. This follows from the fact that the right-hand side is smaller when cYD is smaller.
Such a transition is also more likely when the latent productivity of cultivation is higher, other things equal. Recall that the local population threshold na is a decreasing function of the cultivation productivity parameter k introduced in A5.3. This implies that ya(cYD) is increasing and Na(cYD) is decreasing in the parameter k, so (5.9) is more likely to hold when cultivation productivity is high.
If technical progress tends to raise the latent productivity of cultivation relative to the actual productivity of foraging (Ashraf and Michalopoulos, Reference Ashraf and Michalopoulos2011, Reference Ashraf and Michalopoulos2015), it becomes easier to satisfy (5.9). Conversely, if latent cultivation productivity had been low enough, even a large climate shock like the Younger Dryas would not have triggered a transition. A biological endowment favorable for cultivation (e.g., several species of large-seeded wild grasses; see Section 5.12) acts through the same causal channel. A more favorable endowment implies a higher productivity parameter k, implying a lower local population threshold na for cultivation. This makes it easier to satisfy inequality (5.9).
Finally, a transition is more likely when there are relatively few good sites and many poor ones, again holding other things equal. In this case a negative climate shock forces a large regional population through a few local bottlenecks, putting more pressure on the refuge sites and making it more likely that these sites will adopt cultivation. More formally, fix the existing population N*(cWA) on the left-hand side of (5.9) and consider the site density function q(⋅) on the right-hand side. The function n[ya(cYD), cYD, s] does not depend on q(⋅) and is increasing in s. Thus, a shift toward a less favorable site distribution in the sense of first-order dominance will lower the right-hand side and make it easier to satisfy inequality (5.9) for the given N*(cWA) on the left-hand side.
Until recently, received archaeological wisdom held that regional population fell during the Younger Dryas due to colder and drier conditions and resulting food scarcity. In the rest of this section, we explain how this scenario would play out within our model. In Section 5.8, we consider an alternative scenario where regional population rose during the Younger Dryas, along the lines suggested by Roberts et al. (Reference Roberts, Woodbridge, Bevan, Palmisano, Shennan and Asouti2018).
Assuming regional population declined during the Younger Dryas, in the long run this would have alleviated migratory pressures on the refuge sites. If the climate reversal had been permanent and technology had been constant, eventually the population would have dropped to NYD = N*(cYD) and food per capita would have returned to y* (see the dashed curve from D to E in Figure 5.1). Because the long-run population in the Initial Warming was insufficient to support cultivation, the still lower long-run population of the Younger Dryas would also have failed to do so. Thus, cultivation would only have been a temporary stopgap along a path leading back to universal foraging (see the dashed curve in Figure 5.2).
In reality, however, the practice of cultivation in southwest Asia led to increases in productivity through better knowledge about important technological details: optimal times for planting and harvesting, optimal locations, correct spacing and depth of seeds, the best methods of weeding, fertilization, irrigation, etc. Eventually artificial selection on the genetic characteristics of plants culminated in full domestication. In our model, this productivity growth shows up as an increase in the parameter k defined in A5.3.
We assume that productivity gains were freely available to everyone in the region. The process of productivity improvement can be modeled in many ways (see Chapters 3 and 4), and only broad qualitative issues are pertinent here. Suppose before cultivation starts, its productivity is k0. Let the regional population engaged in cultivation be
where S = S(N, c, k) is the marginal site quality for cultivation and ng is the optimal input of cultivation labor given the local population n[z(N, c, k), c, k, s]. A simple and general model for productivity growth is
where ϕ is increasing in Mt and kt+1 = kt when Mt = 0. Thus, when cultivation is active its productivity rises, and this occurs more rapidly when aggregate labor input to cultivation is larger. When cultivation is inactive, its productivity remains constant. These ideas are consistent with the model of learning by doing in Chapter 3.
Whether cultivation persists during a climate reversal depends on the outcome of a race between population and technology. As the regional population falls, populations at individual sites also fall, decreasing the number of sites using cultivation as well as the amount of labor allocated to cultivation at each site. This restrains technical progress and could shut down cultivation entirely. On the other hand, as productivity improves, more sites adopt cultivation and more labor is allocated to it where it is already in use. These effects promote more technical progress and slow the loss of population by cushioning the blow to living standards.
The resulting dynamics are shown in Figure 5.3 (climate is held constant at cYD). For each cultivation productivity level (k), the curve Na(k) gives the maximum regional population that is consistent with universal foraging. This is obtained by computing the cultivation threshold na(k) as a function of k, finding the food per capita ya(k) such that the local population is exactly na(k) at the best sites, and finally computing the regional population Na(k) that would be consistent with ya(k). The result is
We omit k as a separate argument of n(y, c, s) because only foraging is relevant in (5.12). To the left of this curve, labor is allocated entirely to foraging at all sites and cultivation productivity stays constant. To the right of this curve, labor is allocated to cultivation at some sites and its productivity therefore rises over time.
The second curve N*(k) in Figure 5.3 depicts the long-run regional population corresponding to a given cultivation productivity k. This is derived by finding the local population for each site consistent with the food per capita y*, given the productivity k, and then substituting these local populations into (5.6):
Below the Na(k) curve, N*(k) is a vertical line at NYD because k is irrelevant if all labor is used for foraging. Above Na(k), cultivation occurs and N*(k) is increasing in k.
The starting point P involves the long-run population NWA inherited from the Initial Warming phase and the latent cultivation productivity k0. During the Younger Dryas, this combination was sufficient to induce cultivation at good sites, so P is above the locus Na(k). Depending on the rate of productivity growth, two possibilities arise. If learning by doing is relatively slow, the system trajectory hits the Na(k) curve below Q, at a point like R. Cultivation then shuts down and the system moves horizontally to the left, approaching a point like S with the population NYD.
If learning by doing is fast enough, however, the trajectory eventually rises above the productivity level kQ. In this case it is impossible to return to pure foraging because Na(k) is to the left of N*(k). Hence, cultivation becomes permanent and its productivity continues to rise. This growth process is limited only by the possibility of a ceiling kmax on cultivation productivity at which technological opportunities are exhausted.
The archaeological evidence suggests that this latter trajectory was followed in southwest Asia during the Younger Dryas. In Figures 5.1 and 5.2, we assume that k has an upper bound kmax and so na has a lower bound namin. Improved cultivation technology implies a higher long run population in Figure 5.1 (point E′ rather than E with NYD′ rather than NYD). In Figure 5.2, population density at the best sites remains above the threshold na(k), which declines to namin as k increases to kmax, resulting in a move from D to E′.
The Holocene:
Finally, suppose the climate recovers. The short-run effect on food per capita is given by the upward jump from E′ to F in Figure 5.1. As with Initial Warming, the local populations drop at the best sites due to out-migration (the downward jump from E′ to F in Figure 5.2). This need not shut down cultivation because cultivation technology improved during the Younger Dryas, decreasing the threshold density na. The long-run result is a larger regional population NHO than under any previous regime (from F to G in Figure 5.1) and higher populations at the best sites (the path starting from point F in Figure 5.2). The combination of better technology and higher population led to the spread of cultivation across the region.
5.8 Demography
Few people doubt that agriculture led to population growth. Indeed this growth has acquired a name: the Neolithic Demographic Transition (Bocquet-Appel and Naji, Reference Bocquet-Appel and Naji2006; Bocquet-Appel, Reference Bocquet-Appel2008a, Reference Bocquet-Appel, Bocquet-Appel and Bar-Yosef2008b, Reference Bocquet-Appel2009, Reference Bocquet-Appel2011; Bocquet-Appel and Bar-Yosef, Reference Bocquet-Appel, Bar-Yosef, Bocquet-Appel and Bar-Yosef2008). Evidence is not hard to find. In most of the regional cases reviewed by Bellwood (Reference Bellwood2005), early agriculture was associated with more settlements, larger settlements, or both, often involving increases on the order of 10-fold to 50-fold. Bocquet-Appel uses the fraction of skeletons from ancient cemeteries that were 5–19 years old at death to infer population dynamics before, during, and after agriculture, defined by the date at which domesticates are first observed in the local area. Using data from Europe, North Africa, and North America, he finds that (a) population growth was positive but showed moderate slowing in the millennium before the transition and (b) population growth increased dramatically in the millennium after the transition. Notice that these population trends are defined in relation to the onset of domestication, not cultivation. If the gap between the two lasted for a millennium (see Section 5.3), it is quite possible that population growth could have slowed during the first millennium of cultivation as described in (a).
Bowles (Reference Bowles2011) and Bowles and Choi (Reference Bowles and Choi2019) argue that early cultivation was no more productive than foraging. Economic theory supports this assertion. Optimal time allocation implies that the marginal products of labor in foraging and cultivation will be equal when both activities are used. Also, when cultivation is just starting, the average and marginal products of cultivation are equal to each other, as well as to the marginal product of foraging. And finally, the average product of foraging exceeds its marginal product due to the concavity of the production function. Together these points imply that at an early stage of cultivation, the average product of foraging labor exceeds the average product of cultivation labor, as Bowles and Choi claim on empirical grounds.
This does not, however, imply anything about rates of population growth in early agricultural societies as compared with foraging societies. In general, the relative growth rates depend upon fluctuations in the natural environment, rates of productivity growth in cultivation and foraging, the economic costs and benefits of children in each society, and similar factors. Bettinger (Reference Allen, Bettinger, Codding, Jones and Schwitalla2016) argues that as an empirical matter, population growth in early agricultural societies was no faster than in foraging societies. He points to evidence from paleodemography that annual growth rates for foragers in the Rocky Mountains and Australia (in each case, about 0.04%) were similar to growth rates for farmers in Europe and North America. Nevertheless, in the millennia following the Neolithic Revolution, technical innovations like domestication and metallurgy eventually led to demographic domination by agricultural societies in most areas of the world.
In what follows, we discuss population trends specific to southwest Asia. We will focus on the earliest stages of cultivation in an attempt to shine further light on the causal mechanisms behind the Neolithic transition.
Roberts et al. (Reference Roberts, Woodbridge, Bevan, Palmisano, Shennan and Asouti2018) use summed calibrated radiocarbon probabilities to construct a proxy for population in southwest Asia during 16,000–9000 BP, with 1917 radiocarbon dates. The authors fit an exponential growth curve for this proxy and look for statistically significant deviations from the fitted curve to identify periods when the actual population was above or below the predicted level. This procedure reveals two important deviations from the trend. First, population is below expectations during 13,600–12,700 BP, which corresponds to the later part of the Bølling/Allerød and the start of the Younger Dryas. Population subsequently recovers to the expected range by 12,500 BP, well before the end of the Younger Dryas. Second, the population is above expectations from 11,700 BP (the end of the Younger Dryas) until 11,100 BP (the Early Holocene). The latter result is unsurprising given the major climate improvement associated with the Holocene.
Roberts et al. divide their data into three sub-regions: (a) the southern and central Levant, (b) the northern Levant and upper Mesopotamia, and (c) south-central Anatolia. All regions show significant departures from the pattern for southwest Asia as a whole. For the southern Levant, there is a significantly higher population during 14,500–13,500 BP, corresponding to the Bølling/Allerød climate period and the early Natufian cultural period (see Section 5.3). There appears to have been a brief period of population growth near the middle of the Younger Dryas, but this subsequently leveled off. Further growth had to await the close of the Younger Dryas and the onset of the Holocene.
Trends for the northern Levant/upper Mesopotamia region look quite different. Population was below the regional trend during 16,000–13,400 BP. This was followed by steady growth starting in the Bølling/Allerød and continuing through the Younger Dryas, with populations significantly above the regional expectation during 11,700–10,500 BP (the PPNA). The authors estimate that this sub-region had a roughly 20-fold population increase in the period 13,400–11,500 BP. They note that all sites in the area dating to the Younger Dryas were at intermediate elevations that include the “northern hilly flanks” of the Fertile Crescent.
Finally, south-central Anatolia had a period of about 500 years beginning around 15,200 BP in which population exceeded regional expectations. However, over 12,700–11,200 BP (most of the Younger Dryas and the first 500 years of the Holocene), this sub-region had a flat population that was consistently below predicted levels.
Before Roberts et al. (Reference Roberts, Woodbridge, Bevan, Palmisano, Shennan and Asouti2018), many archaeologists specializing in southwest Asia thought that regional population probably fell during the Younger Dryas. Our own views were similar, as indicated in Figure 5.1. The findings summarized above are recent and it remains to be seen whether they will be widely accepted among specialists. However, we will take them at face value and ask how they could be interpreted in light of our model.
If we ignore sub-regions and return to the population proxy for southwest Asia as a whole, the main result to be explained is why the negative deviation from the trend line ended at about the middle of the Younger Dryas (12,500 BP), rather than continuing until the arrival of the Holocene. This does not appear to be a great puzzle in our framework. We have argued that cultivation began during the Younger Dryas. We have also argued that cultivation leads to learning by doing, with associated productivity gains. Assuming that cultivation began early in the Younger Dryas, these gains could have begun to offset the adverse climate by the middle of the Younger Dryas, resulting in population growth.
The trends for the southern Levant also seem consistent with our expectations. It is not surprising that population would have grown during the warm and wet conditions of the Bølling/Allerød. Indeed, this is consistent with our arguments about sedentism in Section 4.3. It is likewise unsurprising that population leveled off in the Younger Dryas.
It is more startling that population in the northern Levant and upper Mesopotamia exhibited sustained growth throughout the Younger Dryas when climate was poor. Some of this growth could reflect migration from the southern Levant or south-central Anatolia. This is suggested by the fact that in the former case population was largely flat during the Younger Dryas, while in the latter case it was below regional trends. It is also consistent with hints that the Younger Dryas may have been less severe in the northern Levant and upper Mesopotamia than elsewhere in southwest Asia (Roberts et al., Reference Roberts, Woodbridge, Bevan, Palmisano, Shennan and Asouti2018, 62–64). The migratory story is likewise consistent with the finding that for southwest Asia as a whole, population remained below the trend until 12,500 BP and did not exceed the trend for the rest of the Younger Dryas.
Having said this, we still need to explain why the region-wide population grew at all in the Younger Dryas. Two factors consistent with our Malthusian framework could have counteracted the negative climate effect: Increasing sedentism and the productivity gains from cultivation. We established in Section 4.10 that sedentism will generally lead to higher population levels than would otherwise occur. If a large fraction of the regional population responded to the Younger Dryas by switching from mobile to sedentary living due to a reliance on refuge sites, this could account for some of the population growth. If these refuge sites were also disproportionately located in the northern Levant and upper Mesopotamia, population growth due to increased sedentism would tend to occur largely in this sub-region. Unfortunately, we are unaware of any data that illuminate this issue.
Another way to account for population growth in the Younger Dryas is through the effects of cultivation. Three effects could be important: (a) cultivation could provide an additional incentive for sedentism; (b) cultivation could decrease the economic cost of children, leading to a higher population, as will be explained in Section 5.9; and/or (c) cultivation could start a process of learning by doing that would raise productivity and therefore also population. The concentration of population growth in the northern Levant during the Younger Dryas is consistent with Bar-Yosef’s (Reference Bar-Yosef2011) argument that cultivation began within this sub-region and then spread southward. If cultivation caused population growth during the Younger Dryas, it would have had to arise at a number of sites in order to generate an appreciable aggregate effect.
We should emphasize that we interpret the concept of “learning by doing” broadly in the context of early cultivation. This no doubt included narrow technical matters such as the timing of planting, the depth and spacing of planted seeds, and so on. But social and cultural processes were probably also involved, through participation in communal activities surrounding plant processing, detoxification, cooking, and storage (Asouti and Fuller, Reference Asouti and Fuller2013). Cultural evolution of this sort probably enhanced the productivity of cultivation relative to hunting and gathering.
If one believes that cultivation did not begin until the Early Holocene, then the full burden of explaining population growth in the Younger Dryas falls upon increased sedentism. Although it is conceivable that sedentism alone could have been sufficient, in our opinion the evidence from Section 5.3 makes it more plausible that cultivation was a factor, and perhaps the primary factor, generating population growth despite the adverse climate regime. In general, the finding of population growth during the Younger Dryas, if accepted, strengthens the inference that cultivation began in this period.
5.9 Living Standards
Most archaeologists believe that early farmers were worse off than their foraging ancestors in the sense that they had poorer nutrition, more disease, and shorter lives. The relevant evidence from teeth and skeletons is extensive (see Cohen, Reference Cohen2009; Lambert, Reference Lambert2009; and the references cited there). The literature is often unclear about whether the decrease in living standards was associated specifically with sedentism, cultivation, or agriculture. Furthermore, some researchers emphasize poor diet and a weaker immune system, while others emphasize greater exposure to infectious diseases when people are sedentary, live in large settlements, and/or have domesticated animals.
The first question raised by these findings is why people would voluntarily adopt a technology that made them worse off. Our theory suggests some answers. We believe that in the short run, cultivation was triggered by a negative climate shock and migration among sites in a region. Even if inhabitants at each site always respond optimally to the climate they currently face, people will become worse off when climate becomes worse (economists will recognize this as an application of the envelope theorem). Thus a short-run drop in living standards is to be expected.
In the long run, a simple Malthusian model predicts a return to the previous level of food per adult through a decrease in regional population. Food per adult will continue to remain constant in the long run regardless of further changes in climate or technology. To explain a long-run reduction in living standards, we require a change in the economic costs and benefits of children. If agricultural technology makes children less costly or more productive (perhaps because children can now add to food output through sowing, weeding, and harvesting), the same mechanisms as in Section 4.10 lead to lower long-run food per adult (lower y*). This reinforces whatever long-run decline in living standards resulted from sedentism alone. If children become increasingly valuable in production as learning by doing and domestication proceed, equilibrium food per adult will continue to decline. This can yield a gradual decline in nutrition for both adults and children. If poor diet compromises immune function, it can also lead to more disease. These effects are in addition to disease effects resulting from larger communities and animal domestication.
A second question is how lower living standards can be reconciled with increased fertility and population growth (see Lambert, Reference Lambert2009). The model sketched in Section 4.10 also resolves this problem. We showed there that if the “price” of children to their parents drops (either due to lower costs or higher productivity benefits) the long-run consequence is higher fertility, higher child mortality, and lower life expectancy at birth. At the same time, regional population can grow, because in long-run equilibrium, lower food per adult is associated with higher population for any given climate and technology. Moreover, for the usual Malthusian reasons, increasing agricultural productivity over time implies rising regional population over time. Thus our theoretical framework can account for evidence that agricultural populations grew even as living standards fell.
Guzmán and Weisdorf (Reference Guzmán and Weisdorf2011) cite evidence about the decline in living standards under agriculture, stress the greater productivity of children in agriculture as compared to foraging, and reach conclusions similar to those described above. Robson (Reference Robson2010) focuses on biological factors including disease in early Neolithic communities. In contrast to our interpretation, Rowthorn and Seabright (Reference Rowthorn and Seabright2010) attribute the reduction in living standards to costs associated with the defense of agricultural output (see Chapters 7 and 8).
5.10 Other Transitions
The task of this section is to review evidence about pristine transitions other than the one in southwest Asia. Although the particular climate episode of the Younger Dryas may have been relevant in certain other regions, some of the pristine transitions discussed in this section occurred much later. Thus our interest is less in the Younger Dryas per se, and more with the potential applicability of the general causal mechanism investigated in Sections 5.4–5.7, involving a large pre-existing regional population, heterogeneous sites, a negative climate shock, and migration to refuge locations. As we explain in more detail at the end of this section, we do not believe that a negative climate shock was the trigger in all cases and we are open to the possibility that positive climate trends in the Holocene may have led to pristine cultivation in some regions.
Our survey of regional cases proceeds in rough chronological order from earlier to later transitions. In all cases, it is essential to distinguish dates for initial cultivation from dates for full domestication. There is often a very long gap between the two. Larson et al. (Reference Larson2014) provide a broad overview of dates for initial cultivation across regions, along with dates for domestication. Purugganan and Fuller (Reference Purugganan and Fuller2009) offer a similar overview for domestication dates. Even with such estimates in hand, uncertainties about the timing of climate change and initial cultivation make causal inferences very challenging. At most, some of the cases described below offer tentative support for our theory. We are unaware of any cases that directly contradict our theory.
Northern China:
There were two main centers of pristine agriculture in China, one in the north around the Yellow River basin involving millet, and another in the south around the Yangzi River basin involving rice. Some archaeologists regard these as aspects of a single overarching transition process rather than as two distinct centers (Bellwood, Reference Bellwood2005; Liu and Chen, Reference Liu and Chen2012, 67; Shelach-Lavi, Reference Shelach-Lavi2015, 66–67). We accept Cohen’s (Reference Cohen2011) view that cultural interactions between the north and south were extensive. However, we treat them as separate cases here because they involved the domestication of different plants in different natural environments. Hunting, gathering, and fishing remained important for several millennia in both the north and south before full dietary reliance on domesticates (Cohen, Reference Cohen2011; Liu and Chen, Reference Liu and Chen2012; Shelach-Lavi, Reference Shelach-Lavi2015).
Liu and Chen (Reference Liu and Chen2012, 34) comment that “The widespread monsoon maximum in north China ca. 9000 cal. BP provided favorable conditions for the flourishing of early Neolithic villages along the Liao and the middle and lower Yellow River valleys.” To the north of the Yellow River, evidence for domesticated millet comes from the Cishan culture (8000–7700 BP). To the south of the river, the Peiligang culture also provides evidence of millet cultivation (8500–7500 BP). In all, there are five known early centers of millet farming in north China (Bettinger et al., Reference Bettinger2010a). The early Neolithic villages in this region had a few hundred people each (Liu and Chen, Reference Liu and Chen2012, 71).
Some authors assert that millet domestication occurred earlier than these dates would suggest. Larson et al. (Reference Larson2014) believe that foxtail millet was “exploited,” although not domesticated, as early as 11,500 BP. Lu et al. (Reference Lu2009) provide phytolith evidence for domesticated broomcorn millet at Cishan by 10,300 BP. Crawford (Reference Crawford2009) and Bettinger et al. (Reference Bettinger, Barton and Morgan2010b) accept the claim for millet farming at Cishan near 10,300 BP. Zhao (Reference Zhao2011) criticizes the methods used by Lu et al. (Reference Lu2009) but agrees that millet domestication might have begun by 10,000 BP. Cohen (Reference Cohen2011) expresses caution and calls for further research.
Much depends on the lag between cultivation and domestication, because if the domestication date of 10,300 BP is accepted, then a lag of 1500 years or more puts initial cultivation in the Younger Dryas. This seems reasonable, given current views about the length of this lag (see Section 5.3). Indeed, Bar-Yosef (Reference Bar-Yosef2011) proposes that the Younger Dryas led to millet cultivation by causing hunter-gatherers to retreat to favorable habitats such as river valleys, and generating “population pressure” at these locations.
A major problem for north China is that there are very few Early Holocene sites, and none has been extensively excavated or documented (Shelach-Lavi, Reference Shelach-Lavi2015, 54–56). One (Nanzhuangtou) has been dated to ca. 12,000–10,000 BP, and another (Donghulin) has been dated to ca. 11,000–9000 BP. Shelach-Lavi states that there is a chronological gap of about one thousand years, as well as a geographic gap, between these sites and the earliest known Neolithic villages in north China. Yang et al. (Reference Yang2012) date the occupation of Nanzhuangtou to 11,500–11,000 BP and the early phase of occupation of Donghulin to 11,150–10,500 BP. For both sites, they use evidence involving starch grains found on tools to argue that domestication of millet had begun by these dates.
Another intriguing site is Dadiwan, in the western Loess Plateau near the valley of the Qing Shui River (a tributary of the upper Wei River). Bettinger et al. (Reference Bettinger2010a, Reference Bettinger, Barton and Morgan2010b) suggest that prior to the end of the Pleistocene, microlithic hunters were located in deserts along the upper Yellow River about 300–400 km north of Dadiwan. Around the time of the Younger Dryas (12,800–11,500 BP), cold and dry episodes caused the upper Yellow River environment to deteriorate, and these hunters were pushed further south toward the upper Wei River and the gallery forests of the western Loess Plateau. By 8000 BP, and possibly earlier, the inhabitants had begun to engage in millet cultivation. A key problem is the absence of radiocarbon dates at Dadiwan for 18,500–8000 BP (for a skeptical view, see Cohen, Reference Cohen2011). More generally, at present there is no definite connection between the Younger Dryas and initial cultivation in north China, but the question remains open.
Southern China:
The crucial crop in the south was rice. No sites show the complete transition from foraging to agriculture. Molecular data is consistent with a single origin of domesticated rice between 13,500–8200 BP (Shelach-Lavi, Reference Shelach-Lavi2015, 60). Zhao (Reference Zhao2011) believes that rice cultivation began around 10,000 BP, and that the earliest site in China clearly exhibiting rice agriculture is Jiahu, located on the upper Huai River and dated to 9000–7800 BP. This is consistent with the date of 10,000 BP given for the earliest rice cultivation by Larson et al. (Reference Larson2014). By 8500 BP agricultural villages of 200–300 people were distributed across the middle and lower Yangzi valley, and in regions to the north (Shelach-Lavi, Reference Shelach-Lavi2015, 50). These communities had permanent houses, public structures, and cemeteries. Rice agriculture gradually replaced hunting and gathering over several millennia (Zhao, Reference Zhao2011).
Rice phytoliths have been used to estimate the timing of domestication in the Yangzi valley (data in this paragraph come from Zuo et al., Reference Zuo2017). Phytoliths have been found in marine sediments dating to 13,900–13,000 BP, but their wild or domesticated status is unclear. Rice remains from the Shangshan site dated to about 11,000–8600 BP may reflect cultivation, but there has been controversy over whether these remains were wild or domesticated, as well as the reliability of previous dating methods. More recent dating indicates that Shangshan was occupied by about 9400 BP and the nearby site of Hehuashan by about 9000 BP. Zuo et al. report morphological evidence from phytoliths that rice domestication was underway by 9400 BP, and they suggest that this process can be linked to the climatic transition from the Pleistocene to the Holocene.
Mesoamerica:
The information on this region is from Piperno and Smith (Reference Piperno, Smith, Nichols and Pool2012) except where stated. In addition to the crucial domestication of maize from wild teosinte, other domesticates included two squash species, common beans, lima beans, two pseudo-cereals, avocado, chili pepper, and some trees. Tomato and cacao were native to South America but were probably domesticated in Mexico. Based on genetic research, maize and one squash species were domesticated once within a circumscribed region. Some other species were domesticated on multiple occasions either within Mesoamerica, or both there as well as South and/or North America.
The earliest archaeological evidence of domestication in the lowlands is from the Central Balsas region of southwestern Mexico where the wild ancestors of maize and one type of squash are found. Starch grain and phytolith evidence indicate the domestication of both by 8700 BP. The context was a seasonally dry tropical forest (see the discussion of such forests in South America below). Matsuoka et al. (Reference Matsuoka, Vigouroux, Goodman, Sanchez, Buckler and Doebley2002) infer from genetic data that maize was domesticated around 9000 BP. In the case of maize, Larson et al. (Reference Larson2014) offer an estimated date of 10,000 BP for “documented exploitation before domestication or posited as necessary lead-time to domestication.” Larson et al. cite the same date for morphological evidence for the domestication of squash.
Evidence for domestication in highland areas comes mainly from five dry caves excavated between 1954 and 1966. Domestic bottle gourds from 10,000 BP have been found at Guila Naquitz in Oaxaca (Flannery, Reference Flannery1986; Marcus and Flannery, Reference Marcus and Flannery1996; Smith, Reference Smith2001). This species is a “container crop” rather than a food crop, and is thought to have a wild ancestor in East Asia (Piperno and Smith, Reference Piperno, Smith, Nichols and Pool2012, 158). Another domesticated species of squash has been found at the same site. The earliest evidence for domestication of the common bean, which appears to have occurred in western Mexico, is not until 2300 bp, around the same time as the domestication of the turkey.
According to Piperno and Smith, “It is now clear that in both the highlands and tropical lowlands, plant cultivation and domestication emerged during the early Holocene period” (152). They see the chronology as similar to that of southwest Asia and China. Humans are known to have migrated into Mesoamerica around 15,000 years ago (Goebel et al., Reference Goebel, Waters and O’Rourke2008). However, any connection of initial maize cultivation to the Younger Dryas is purely conjectural. We have no evidence about the timing of initial cultivation, and a scenario of this kind requires a lag of up to 3000 years from cultivation to domestication of maize, or earlier dates for domestication than are currently accepted.
South America:
This case is complex, and the dates given by Larson et al. (Reference Larson2014) for cultivation and domestication vary widely by species. South America does not appear to have had a single center for agriculture. Piperno (Reference Piperno2011) believes there may have been two or three independent centers and that the data suggest the development of cultivation in the Early Holocene (11,000–7600 BP). Plant domestication is evident by 7600 BP with a significant part of the diet coming from crop plants by this time. During 7600–7000 BP both site numbers and artifact densities increase, probably due to higher carrying capacity associated with horticulture and slash-and-burn cultivation.
The wild ancestors of domesticated plants were often native to seasonal tropical forests that had 4–7 months each year with little or no rainfall. Food production is often associated with rock shelters or small open-air occupations near secondary watercourses. Settlements were small and many were probably seasonal, but they tended to be in areas of resource abundance. Unlike the Near East and China, cultivation did not involve large permanent villages in major river valleys. Piperno believes that the Near East trajectory was “a product of ecological and demographic circumstances very different from those associated with the beginnings of Neotropical food production” (S462).
Piperno does say that “the shift from foraging to food production began within contexts of rapid and significant changes of climate, vegetation, and fauna occurring at the close of the Pleistocene” (S465). She believes the main factor was a change from savanna-like scrub to seasonal tropical forest. This lowered the efficiency of hunting and gathering, and led to greater diet breadth due to the prevalence of smaller animals as well as plants that required extensive processing. Hence, cultivation became a more attractive strategy relative to full-time hunting and gathering. Piperno believes the key period was 11,000–9000 BP and that this process was especially relevant to “highly seasonal types of tropical forest, where the end-Pleistocene environmental perturbations would have impacted foraging return rates most strongly” (S465).
There is a distinct literature on the origins of agriculture in the Andes. Dillehay et al. (Reference Dillehay, Rossen, Andres and Williams2007) report evidence for horticultural economies in a dry forest valley in the Andes at 10,000 BP but their claim of domesticated squash, peanuts, and other plants has been questioned. Well-accepted evidence for sedentary settlements, pottery, and agriculture dates to around 5000 BP (Balter, Reference Balter2007, 1833; Quilter, Reference Quilter2014, 64, 76–78). Domesticated crops included squash, achira, and beans, but not maize (Bellwood, Reference Bellwood2005, 159–164).
El Niño-Southern Oscillation (ENSO) refers to oscillations in the Pacific Ocean between a warm tropical-water phase and a cold-water phase. The gradual onset of this cycle and a general increase in climate variability began around 5800 BP (Sandweiss et al., Reference Sandweiss, Maasch and Anderson1999). Archaeological evidence shows that in this period, major debris slides wiped out coastal foraging villages, and local fish populations disappeared due to migration to cooler water. Any causal connection with plant cultivation in the Andean highlands is uncertain (Bellwood, Reference Bellwood2005, 149; Quilter, Reference Quilter2014, 30–31, 105–106), but coastal populations may have migrated inland, raising population densities there and inducing cultivation.
New Guinea:
Highland New Guinea was an independent center of domestication for bananas and taro, and perhaps also yams and sugar cane (Bellwood, Reference Bellwood2005, 142–145). Larson et al. (Reference Larson2014) use 10,000 BP as a date for “documented exploitation” or “necessary lead-time to domestication” for bananas, taro, and yams, with management or cultivation by 7000 BP but still without morphological indications of domestication.
The information below is taken from Denham (Reference Denham2011, S383–S387) except where indicated. Agriculture emerged out of foraging practices sometime in the Early Holocene but few sites dating to this period have been excavated. There are “questionable” claims for Pleistocene settlements in the highlands. However, most settlements postdate 4000 BP. Knowledge about the phenotypic or genotypic transformation from wild to domestic plants is weak, as is knowledge about the relationship between sedentism and agriculture. Pottery and domesticated animals are absent. The case for early agriculture rests mainly on evidence for agricultural technology and its environmental effects.
The most detailed data come from the Upper Wahgi Valley and relate to bananas, taro, and yams. Denham believes these plants were of lowland derivation and brought to the highlands by people, although they could have grown wild in the highlands during the Early Holocene. By 7000–6500 BP, mounded cultivation occurred on the wetland margin of Kuk Swamp. By 4000 BP, ditched fields and greater domestication existed. Denham hypothesizes that managed plants were translocated from lower altitudes to the floor of the Upper Wahgi Valley, where wild stands of the same species were rare or absent.
Bellwood (Reference Bellwood2005) points out that the domestication of bananas, taro, and yams was centered in highland valleys and that the same plants could have been domesticated in the lowland rainforests but were not. He suggests that this occurred because highland valleys were at the edge of the wild range for these plants, so foragers in the highlands were more exposed to environmental stresses than foragers in the tropical lowlands.
Sub-Saharan Africa:
Northern Africa has shifted from a “green Sahara” in the Early Holocene to a large desert today (information in this paragraph is from Manning and Timpson, Reference Manning and Timpson2014; see also Section 4.12). The “African Humid Period” (AHP) began around 12,000 BP, with an initial occupation by hunter-gatherers and massive growth in population density starting shortly after 11,000 BP. Pastoralists with domestic livestock appear around 8000–7500 BP. The population density decreased during 7600–6700 BP but rebounded in 6700–6300 BP, reaching a Holocene maximum. A major population collapse followed during 6300–5200 BP at the end of the AHP.
Data from Lake Yoa in northern Chad show that increasing aridity began around 5600 BP, with windblown sand appearing by about 3700 BP and a true desert ecosystem by 2700 BP (Kröpelin et al., Reference Kröpelin2008). Annual rainfall was about 250 mm in 6000 BP, less than 150 mm by 4300 BP, and less than 50 mm by 2700 BP. Brooks (Reference Brooks2006, Reference Brooks2013) offers comments on the cultural consequences of greater aridity in the Sahara after 6000 BP.
Africa differs from other examples of pristine agriculture in having domesticated animals, particularly cattle, long before the domestication of indigenous plants (Manning and Fuller, Reference Manning, Fuller, Stevens, Nixon, Murray and Fuller2014). Plant domestication occurred in five geographic zones: three in West Africa (the Sahara/Sahel, grassy woodlands, and forest margins), and two further to the east (East Sudanic grasslands and Ethiopian uplands). Important crops included pearl millet, finger millet, African rice, cowpea, yam, sorghum, and tef, with domestication occurring around 4500 bp or later (Fuller and Hildebrand, Reference Fuller, Hildebrand, Mitchell and Lane2013). We omit discussion of the estimated dates for cultivation from Larson et al. (Reference Larson2014) in this case due to an overly broad definition of the geographic region.
Domesticated pearl millet has been found in the Tilemsi Valley in northeast Mali (information is from Manning and Fuller, Reference Manning, Fuller, Stevens, Nixon, Murray and Fuller2014). This valley “provided a fertile and accessible corridor into sub-Saharan Africa at a time of increasing aridification and southward displacement of Saharan populations” (73). The wild progenitors of this crop are found nearby in the Saharan zone of West Africa. Sites occupied during 4500–4000 BP offer evidence of domesticated pearl millet. Initial inhabitants of the Tilemsi Valley brought domesticated millet as well as livestock (cattle, sheep, and goats) from elsewhere. Manning and Fuller suggest a lag between cultivation and domestication of about 1000–2000 years and infer that cultivation began around 6000–5000 BP. They also suggest that pearl millet was suitable for cultivation by mobile pastoralists due to its minimal water requirements and high productivity over a short growing season.
Another important African crop, sorghum, had a wild progenitor located in the savannahs of what is presently the Sahara Desert (Fuller and Stevens, Reference Fuller, Stevens, Mercuri, D’Andrea, Fornaciari and Hohn2018). These wild stands retreated south as the Sahara dried and expanded, and this process may have been accompanied by southward migration. Fuller and Stevens suggest a protracted period of sorghum cultivation during 5500–3700 BP that eventually led to domestication. This was probably associated with a relatively sudden increase in settlement sizes after 5800 BP and a shift to a more sedentary lifestyle (Winchell et al., Reference Winchell2018). The inhabitants were harvesting mixed stands of wild and domestic sorghum by about 5500–5000 BP.
This is consistent with earlier archaeological views about climate change as a driver of domestication in sub-Saharan Africa. According to Smith (Reference Smith1998, 110), some researchers believe “the timing of initial domestication of millet and sorghum was tied to the southward expansion of the desert, which intensified about 4000 years ago, displacing people south.” Smith goes on to state that African rice may also fit this model (112). Bellwood (Reference Bellwood2005, 103) describes a similar archaeological consensus. Smith observes that settlements in the savanna zone from 5000–3000 BP were located on the shores of lakes, that fish were an important food source, and that wild rice, millet, and sorghum were probably harvested at these sites. We infer that such locations may have played a refuge role in relation to increasing aridity.
Eastern North America:
The dates for initial cultivation reported by Larson et al. (Reference Larson2014) are generally around 6000 BP. The information here is taken from Smith (Reference Smith2011). Domesticated seeds for four plants, including squash and sunflower, have been found in seven sites scattered across the oak-savannah and oak-hickory forests of North America. Three of the plants are dated to 5000–4400 BP. The fourth is dated to 3800 BP but Smith believes that further research will likely push this back to the earlier interval. Three more plants were probably cultivated but evidence for domestication is lacking. This occurred alongside hunting and gathering. Wild food resources included white-tailed deer, small mammals, and turkeys; oak, hickory, and walnuts; and fish, bivalves, and waterfowl.
Around 6500–6000 BP, many river systems developed meandering patterns with oxbow lakes, backswamps, and shoals. This increased the abundance and diversity of floodplain resources, both plant and animal; “at the same time an apparent decrease in effective precipitation resulted in a deterioration of upland resources” (S477). This led to intensified human occupation of river and stream corridors. Domestication probably occurred first in resource-rich river valleys associated with small secondary or tertiary tributaries of the Mississippi River. Three of the four species involved are floodplain weeds that colonize disturbed soil exposed by spring floods. The fourth (sunflower) is found in the same setting but also thrives in other environments.
Smith believes the cultural context involved river valley base camps that were permanent or semi-permanent and reoccupied annually by a half dozen or more extended families. When these camps flooded, extended families would move to the uplands for shorter-term occupations. There is no evidence of ascribed status differentiation or any organization beyond the level of the extended family.
In Smith’s opinion, domestication cannot be explained by population pressure or resource depletion, greater territoriality and competition, or an environmental downturn. However, he does not offer an explicit causal story that would account for the transition. The hint that lower rainfall caused population to shift from uplands to river valleys, and that this led to domestication, seems consistent with our theory.
India:
The case of the Indian subcontinent is complex. A number of indigenous plants were domesticated there, but some domesticates arrived from outside the region, and some indigenous plants may have been domesticated in response to these external influences. Sites showing the transition from hunting and gathering to cultivation have not yet been found. When sedentary villages are observed, dependence upon cultivation already exists. Most of the hard evidence for cultivation and domestication dates from about 5000 bp onward. Due to space constraints and the absence of direct relevance for the theoretical issues with which we are concerned, we omit a detailed review here. The interested reader can consult Fuller (Reference Fuller2011).
It should be clear from this brief survey that there is no smoking gun among the pristine transitions outside southwest Asia. However, environmental stresses appear to be plausibly connected with the origins of agriculture in several regions. Our theory may have some application to northern China and sub-Saharan Africa, where refuge locations could have been important. Initial cultivation in southern China and Mesoamerica could be connected with the Pleistocene–Holocene transition but clear evidence is lacking. The role of seasonally dry tropical forests in South America could be consistent with a biased climate shift that depressed the returns to foraging and made cultivation relatively more attractive, even without migration to refuge locations. The development of agriculture in highland New Guinea rather than the tropical lowlands could reflect greater vulnerability to climate shocks in the highlands. We find only a vague suggestion that our framework might apply to eastern North America and no apparent application to India.
We close this section by emphasizing that we do not see a negative climate shock as being necessary for the start of cultivation in all cases. There is at least one other way for cultivation to begin that is consistent with our model: A positive climate trend leading to region-wide Malthusian population growth, which could eventually cause populations at the local level to exceed the threshold needed for cultivation. This mechanism is quite similar to the impacts of climate amelioration discussed in Chapters 3 and 4, which led to technical innovation and sedentism through long-run population growth. In this chapter we have focused on the scenario of a negative shock and resulting migration among sites because we believe this scenario best fits the currently known facts about southwest Asia. But it is certainly possible that in other parts of the world, the Holocene served as a large and protracted positive shock, and induced population growth that led to cultivation.
The main difference in the two causal channels is that a negative shock relies on short-run migration to generate local population spikes, while a positive shock generates more gradual local population growth due to long-run natural growth for the region as a whole. Accordingly, transitions involving negative shocks are likely to have sharper correlations with the timing of initial cultivation. Another distinction is that negative shocks tend to initiate cultivation at a small set of refuge sites, while climate amelioration and long-run population growth tend to initiate cultivation across a broader subset of sites (although the best sites in the region will lead the way in either case). Further empirical research may make it possible to distinguish between these two mechanisms.
5.11 Non-Transitions
A general theory of agricultural origins should not only be consistent with known transitions. It should also be consistent with cases where no transition occurred. Ideally, as we suggested in Section 5.1, it should address the following questions.
Why a small number of pristine cases?
Consider the conditions that must hold in order for the model of Sections 5.4–5.7 to predict initial cultivation. A period of “good” climate must last long enough for a large regional population to accumulate, diet must be strongly dependent on wild plants vulnerable to climate change, a negative climate shock must be sudden enough that gradual population decline through lower fertility does not solve the problem, a few sites must remain habitable while others are abandoned when the climate turns “bad,” migration flows must be sufficiently free, and the productivity of cultivation must rise quickly enough once it begins. Multiplying the probabilities of the numerous necessary conditions together yields a low probability of agriculture, which is consistent with the small number of pristine cases in the archaeological record.
Why no agriculture in earlier interglacials?
The last interglacial period comparable to our own, the Eemian, occurred between about 126,000–116,000 BP, with glacial conditions gradually intensifying during 116,000–78,000 BP. Anatomically modern humans did not live outside Africa in large numbers until around 50,000–70,000 BP, and we do not know to what degree Africa would have been affected by climate shocks around the time of the Eemian. We also do not know whether anatomically modern humans from 100,000 years ago had the cognitive or technological capabilities needed for agriculture.
Why no agriculture during the last Ice Age?
There is no doubt that modern cognitive abilities existed by 40,000–13,000 BP. Richerson et al. (Reference Richerson, Boyd and Bettinger2001) attribute the absence of agriculture at this time to the low mean and high variance of weather during the last Ice Age, which deterred investments in agriculture. This may have played a role, but our approach suggests a further factor: climate instability ruled out long mild periods during which large regional populations could accumulate. There was no precedent for the 1,500–2,000 years of warm and moist climate during which Natufian society arose in southwest Asia. Without this large pre-existing population, a climate reversal would not have caused a sufficient spike in local populations at the best sites, and the cultivation threshold would not have been crossed.
Why no agriculture in the initial warming after the Last Glacial Maximum?
The climate reversal of the Younger Dryas was apparently a necessary condition for initial cultivation. The Natufians failed to embrace cultivation despite 1,500 years of favorable conditions prior to this negative shock, and might never have done so in the absence of such a shock. We attribute this to the insufficiency of local population densities arising through long-run Malthusian growth alone.
Why no pristine agriculture in tropical rainforests?
Bellwood (Reference Bellwood2005) observes that pristine agriculture never emerged in the rainforests of Africa (or for that matter, anywhere south of the equator in Africa), or in southeast Asia. Higham (Reference Higham, Douglas Price and Gebauer1995) takes a similar view of southeast Asia. Pristine agriculture also failed to arise in lowland New Guinea or tropical Australia. Piperno (Reference Piperno2011) makes a strong case for domestication in tropical South America, but she stresses the prominent role of forested areas subject to prolonged seasonal droughts. From our perspective, the obvious reason for the rarity of pristine agricultural transitions in tropical rainforests is that such areas are buffered from climate shocks involving periods of aridity lasting for a generation or more. Our theory requires protracted aridity in a populous region with production sites of varying quality. This was probably a rare conjunction in rainforest environments.
Why no pristine agriculture in Japan or on the northwest coast of North America?
These two cases drive home the point that a rich natural setting, a sedentary lifestyle, a high level of technological sophistication, and a complex society are insufficient to bring about an agricultural economy. Furthermore, both had severe Younger Dryas events and experienced the cooling of 8200 BP, making them especially good points of comparison. For these reasons, we will discuss each case in some detail.
The Jomon period in Japan is dated from about 16,000 BP (see Section 4.3). It is characterized by very early pottery, relatively early sedentism, and a rather late transition to full-scale rice farming (around 2500 BP). When cultivation finally arrived it appears to have been borrowed, along with other cultural characteristics, from the Mumun culture of Korea. It is uncertain whether this involved major population flows from Korea. Rice cultivation could easily have been borrowed from China or Korea at much earlier dates but was not (Bellwood, Reference Bellwood2005, 114)
The coastal inhabitants of the Pacific Northwest in North America are often used as the textbook example of a complex society not based on agriculture (see Johnson and Earle, Reference Johnson and Earle2000, 204–217). Fish were a key food source, especially the rich seasonal runs of salmon and eulachon. Many other wild foods were used, such as shellfish, waterfowl, marine and land mammals, roots, and berries, but by 8000 bp salmon was the dominant food. Despite a promising environment with mild temperatures and ample precipitation, agriculture never became a core component of the economy (Deur, Reference Deur, Hirt and Goble1999, Reference Deur2002).
Our explanation for the non-transition to agriculture in both of these societies is that they were cushioned from negative climate shocks by the availability of fresh water and marine dietary resources. This averted a shift to cultivation despite high population densities (according to Habu, Reference Habu2004, this is a standard archaeological explanation for the failure of the Jomon to adopt agriculture). Moreover, differences in site qualities might have been less extreme as compared to southwest Asia (or China, or sub-Saharan Africa), so climate fluctuations would have affected most sites in a parallel way. This would have restrained any migratory responses to such shocks, dampening the local population spikes needed to trigger cultivation.
These two cases highlight the fact that our model requires more than just a large climate reversal. If the reversal does not have a major impact on staple foods, or does not occur in a geographic setting with large differences in site qualities, it will not stimulate a transition. The Jomon and Northwest Coast examples had large pre-existing populations and were exposed to significant climate shocks, but lacked these other necessary factors.
5.12 Biological Endowments
Diamond (Reference Diamond1997) argues that agriculture began in regions having wild plant and animal species that were easy to domesticate. The leading case is southwest Asia, which had an outstanding array of grasses, legumes, and animals that were excellent candidates for domestication. China also had a favorable biological endowment. By contrast, New Guinea and eastern North America had substantially less favorable endowments. Some regions that have high agricultural productivity today, such as California, southwestern and southeastern Australia, southern Africa, and Chile and Argentina, were disqualified from pristine transitions because they had little or nothing to domesticate. As a result, agriculture had to diffuse to such locations from other parts of the world. These ideas have been influential among economists (see Sections 1.10 and 12.3).
When Diamond seeks to explain the timing of the Neolithic in southwest Asia, he appeals to an assortment of factors including resource depletion, increasing availability of domesticable wild plants, technological innovation, and population growth. He says that prior to 8500 BC (10,500 BP), these factors had not yet come into play. Climate is only mentioned in the context of an argument that the transition from the Pleistocene to the Holocene expanded the geographic range for domesticable wild cereals (1997, ch. 6).
Our general view is that Diamond’s story may help to explain where agricultural transitions occurred, but says little about when they occurred. Having a good biological endowment does not by itself shed any light on the question of whether cultivation would begin at 12,000 BP or 8000 BP or 4000 BP. Another problem with Diamond’s theory is that it focuses on ease of domestication. In our view, the key question is why cultivation began at all, so what matters most is the marginal productivity of cultivating wild plants relative to the marginal productivity of foraging. The ease of domestication is important in determining the rate at which agricultural productivity rises after cultivation begins, but that is a separate issue.
Despite these objections, there is some empirical support for Diamond’s thesis. Olsson and Hibbs (Reference Olsson and Hibbs2005) and Bleaney and Dimico (Reference Bleaney and Dimico2011) find that regions with more potentially domesticable plant and animal species tended to have earlier transitions to agriculture. However, most of the observations in their data sets involve the diffusion of agriculture, rather than the pristine transitions with which we are concerned. For a study of pristine transitions, albeit with a different focus from Diamond, see Riahi (Reference Riahi2020).
We think of Diamond (Reference Diamond1997) as providing the “supply side” for a theory about the origins of agriculture, but lacking the “demand side” that would account for the timing of decisions to cultivate. In our approach the supply side consists of the latent productivity of cultivation before it begins and the rate of productivity growth after it gets underway. The demand side involves climate shocks, migration to refuge sites, and the diminishing marginal product of foraging. These factors determine the timing of pristine cultivation. Our model from Sections 5.4–5.7 has both supply and demand components.
To see why biological endowments do not provide a full explanation, consider the failure of the wild-rice-eating people of tropical southeastern Asia to initiate agriculture, in contrast to the people of southern China who did. These two populations had similar biological endowments, but a pristine transition only occurred in the area that was more exposed to climate shocks and had more heterogeneous sites. A similar example is the contrast between lowland (tropical) New Guinea with no transition, and highland (non-tropical) New Guinea where domestication of bananas, taro, and yams occurred. This can be explained by the greater vulnerability of the highlands to climate shocks and the tendency of tropical climates to mute variations in site quality. Although one can argue that Japan and northwest North America lacked good candidates for domestication (the supply-side story), they also had abundant foraging opportunities that were relatively insensitive to climate shocks (the demand-side story).
We close this section by emphasizing that our theory is open to refutation. If future research shows that pristine agriculture evolved in a region where important staple foods were not vulnerable to climate shocks or where the initial cultivation of staples did not coincide with a climate reversal, the version of our theory involving negative climate shocks and migratory responses would not apply, so other explanations would have to be sought. A similar verdict would follow if pristine agriculture occurred in a region with a low initial population density or many high-quality sites, if it occurred first at the lower-quality sites, or if it was not accompanied by migration from poor sites to good ones.
Our survey of pristine transitions in Section 5.10 suggests that our theory is likely to be more relevant for some regions than others. We also suggested that some regional cases might be better explained by positive climate change accompanied by Malthusian population growth in the long run. We leave it for future researchers to decide whether these ideas provide useful insights in particular cases.
5.13 Conclusion
We have developed a formal model to explain the origins of agriculture, and we have provided empirical support for our theory. This chapter contributes to the economic literature on the Neolithic Revolution in three main ways: by showing that biased climate change was not necessary for the transition, by highlighting the importance of migration among heterogeneous local sites, and by making population and technology endogenous. We briefly summarize each point.
Biased Climate Change:
Much of the literature assumes that agriculture arose due to climate changes that favored cultivation over foraging (see Section 5.2). While this could be true in certain cases, economists rarely provide any evidence that such biases actually existed. If a positive climate shock occurred, why would foragers want to start cultivating rather than enjoying rich foraging opportunities? If a negative climate shock occurred, why would foragers start cultivating at a time when cultivation productivity probably fell? Arguments appealing directly to climate biases raise further puzzles. If a good climate leads to cultivation, why didn’t cultivation start in southwest Asia during the Bølling/Allerød period? If a bad climate leads to cultivation, why didn’t cultivation start during the Last Glacial Maximum, and why didn’t cultivation end in the Early Holocene?
Our formal analysis was carried out on the assumption that climate changes are neutral with respect to the choice between foraging and cultivation. The migration effect from a negative climate shock removes the need for a direct bias, and provides additional empirical content by predicting that the best sites within a region should start cultivating first. Our theory gets the timing right because it is not a specific climate state that causes agriculture. Rather, it is the entire sequence of climate events. The model in this chapter can be extended to accommodate biased shocks if future archaeological research shows that such shocks were important in particular regions.
Heterogeneity and Migration:
Theories about the agricultural transition often do not distinguish between processes occurring at the regional and local levels. In our view, this distinction is critical. It is entirely possible that population could be increasing at the regional level while decreasing at some individual sites or vice versa. Models that fail to take this point into account will overlook the role of scarce refuge sites as agricultural incubators. To see the importance of this factor, consider a counterfactual. If all of the sites in southwest Asia had been identical, the Younger Dryas would not have triggered any migration among sites, there would have been no short-run spike in local population anywhere, and there would have been no reason to allocate any labor to cultivation.
Population and Technology:
Many economists and archaeologists have debated the relative importance of climate, geography, population, and technology as factors in the origins of agriculture. We view climate and geography as the underlying exogenous variables, with climate change functioning as the trigger that accounts for the timing of initial cultivation, and geography contributing the necessary condition of heterogeneous sites. However, population and technology cannot be ignored. Without a large regional population, a climate reversal cannot generate local population levels sufficient to trigger cultivation. Without rapid technical change once cultivation gets underway, a declining regional population during the reversal could take a society back to foraging rather than forward to a farming economy. By endogenizing population and technology, our model helps to clarify these crucial interactions.
5.14 Postscript
This chapter is based on the journal article “Climate reversals and the transition to agriculture” published in the Journal of Economic Growth (Dow, Reed, and Olewiler, Reference Dow, Reed and Olewiler2009). Our co-author Nancy Olewiler, an environmental economist at the School of Public Policy at Simon Fraser University, contributed to early drafts of that article. Doug Allen, Matthew Baker, Ofer Bar-Yosef, Cliff Bekar, Sam Bowles, Sue Colledge, Patrick Francois, Oded Galor, Brian Hayden, Hillard Kaplan, Gordon Myers, Arthur Robson, and four anonymous referees gave helpful comments on drafts of the original article. We are also grateful to audiences at Simon Fraser University, the University of British Columbia, the 2005 SSHA conference, the 2006 AEA conference, the 2006 SFU workshop on the Neolithic transition, and the 2006 Conference on Early Economic Developments at the University of Copenhagen. The Social Sciences and Humanities Research Council of Canada provided financial support. All are absolved of responsibility.
This chapter benefited from feedback at a 2019 seminar given at the Institute of Archaeology, University College London. We are very grateful to Stephen Shennan for hosting our visit to UCL and to Andrew Bevan, Sue Colledge, Dorian Fuller, and Andy Garrard for their expert advice. We also thank Rowan Flad of Harvard for his guidance with regard to China. Sam Bowles, Sue Colledge, and Stephen Shennan commented on a preliminary draft of this chapter. All are likewise absolved of responsibility.
Sections 5.1 and 5.2 have been largely rewritten, with updating in 5.2 to reflect developments in the economic literature since 2009. We have also updated the evidence on southwest Asia in Section 5.3. The model in Sections 5.4–5.7 is unchanged from the original journal article but the text has been revised to clarify various conceptual points. Sections 5.8 and 5.9 are new. Section 5.10 on other transitions has been updated and expanded. Sections 5.11–5.13 have been substantially rewritten.
The formal model in Sections 5.4–5.7 was based almost entirely on data for Abu Hureyra in Moore et al. (Reference Moore, Hillman and Legge2000) and Hillman et al. (Reference Hillman, Hedges, Moore, Colledge and Pettitt2001). When writing the JEG article, we had a general awareness of the Neolithic transition in regions such as China and sub-Saharan Africa, roughly at the level of Bellwood (Reference Bellwood2005), but the archaeological evidence from outside southwest Asia contributed almost nothing to the construction of the model. We had no knowledge of the sources from 2009 and later discussed in Sections 5.8–5.10.















