Hostname: page-component-89b8bd64d-nlwjb Total loading time: 0 Render date: 2026-05-08T03:37:26.385Z Has data issue: false hasContentIssue false

Using Topic Modelling to Reassess Heritage Values from a People-centred Perspective: Applications from the North of England

Published online by Cambridge University Press:  15 June 2023

Martina Tenzer
Affiliation:
Department of Archaeology University of York King's Manor Exhibition Square York YO1 7EP UK Email: martina.tenzer@york.ac.uk
John Schofield
Affiliation:
Department of Archaeology University of York King's Manor Exhibition Square York YO1 7EP UK Email: john.schofield@york.ac.uk
Rights & Permissions [Opens in a new window]

Abstract

The historic environment—comprising a palimpsest of landscapes, buildings and objects—carries meaning and plays a crucial role in giving people a sense of place, identity and belonging. It represents a repository of ever-accumulating collective and individually held values—shared perceptions, experiences, life histories, beliefs and traditions. These social or private values are mostly ascribed by people to familiar places within this environment based on the ontological security which this everyday heritage provides. However, these values are notoriously hard to capture and categorize. This makes it difficult to incorporate them into heritage-management strategies, which typically rely on objective, fact-based datasets. In this paper, we present a new methodology to capture those elusive values, by combining Topic Modelling with the principles of Grounded Theory. Results show that our novel approach is viable and replicable and that these important values can be effectively and meaningfully integrated, thus creating more inclusive approaches to heritage management than exist currently.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
Copyright © The Author(s), 2023. Published by Cambridge University Press on behalf of the McDonald Institute for Archaeological Research
Figure 0

Figure 1. Detailed methodology developed in this research. The aim is to create the topics based on the ‘stories’ of survey participants and correlate these with value categories as set out in Historic England's Conservation Principles (English Heritage 2008), extending the current value catalogue. (Green: manual process; blue: computed process; red: outlook; yellow: visualization.)

Figure 1

Figure 2. Study areas: Peak District National Park and Sheffield city. Residents of these areas were invited to participate in a survey and provide up to five favourite places within the study areas and their ‘stories’ of personal connection. (Map created in QGIS; data contain OS data © Crown copyright and database right 2022. Map tiles by Stamen Design, under CC BY 3.0. Data by OpenStreetMap, under OdbL.)

Figure 2

Figure 3. Interactive ‘story map’, showing the favourite places provided by residents of the two study areas who participated in an online survey. The participants were asked to provide up to five places that matter most to them and give an up to 300-word description of the reasons for the connection to these places. (Map created in QGIS/Leaflet, © OpenStreetMap contributors; data contain OS data © Crown copyright and database right 2022.)

Figure 3

Figure 4. The Topic coherence gives the degree to which documents in one cluster are closely related. This depends on the size of the cluster and the number of topics chosen. For example, if we subdivide the documents into four different topics, the documents have less coherence—similar semantics and supporting the statements of each other—than at the point of 35 clusters, where the number of documents in one cluster is smaller but the relationship between the documents higher. At 35 topics the coherence is highest, flattening out with more subdivisions. Therefore, 35 topics will provide the best first insight into the latent topics within the documents (tqx94 2022).

Figure 4

Figure 5. The Cluster Dendrogram shows the 35 topic clusters with their respective labels chosen by the modelling process. The levels (Height) show the similarities of topics. Topics in the lower level of the diagram on branches close together show topic clusters that have themes closely related to each other. For example, topic 10: ‘great_walk’ and topic 12: ‘part_life’ consist of documents or ‘stories’ with content that support each other and can be summarized under the respective label. Similarly, topic 4: ‘national_park’ and topic 18: ‘green_space’ consist of documents with a similar theme. At a higher level, all four clusters are related to each other—to a lesser degree than at a lower level but more closely related than, for example, topic 3: ‘forge_dam’, again, a level higher up.

Figure 5

Table 1. Following the Topic Modelling process, the optimal number of topics (35) was chosen to create labels for the topic clusters. The labels are based on the most frequent key terms in the documents/stories of each topic cluster. Labels vary in their quality, which makes a manual evaluation necessary.

Figure 6

Table 2. Historic England's value categories as set out in the Conservation Principles (English Heritage 2008). Additional nuance to these categories is proposed based on the underlying themes identified in survey data. The additional values (shaded cells) address societal trends and the changing and dynamic demands and needs of residents in the Peak District National Park and the city of Sheffield following the COVID-19 pandemic.

Figure 7

Figure 6. Result of manual evaluation of the topic modelling process. Document count over topics, showing the number of documents (stories) allocated to the corresponding value categories as defined by Historic England. High counts (red) show the dominant value category in a topic cluster. Rows with yellow cells (low counts) show incoherent topics with high variation in values. The proposed category for ‘Green Space’ value, including ‘Health’ value, dominates in the assessment.

Figure 8

Figure 7. The category of communal values can be subdivided into private, spiritual and commemorative values, as set out in Historic England's Conservation Principles. Themes emerging from the survey data show a trend to connect places based on green space, health and arts & culture. Yellow colours in the heatmap represent low numbers of stories, while darker red colours represent higher numbers of stories categorized in the respective categories. The graph shows that survey respondents based individual place attachment predominantly on aspects of communal, green space and health aspects.

Figure 9

Figure 8. Mapped overview of categories across the study areas. Distinct areas show clusters. (Map created in QGIS; data contain OS data © Crown copyright and database right 2022. Map tiles by Stamen Design, under CC BY 3.0. Data by OpenStreetMap, under ODbL).

Figure 10

Figure 9. Detail of the overview map (Fig. 8). The plague village Eyam (lower left) is predominantly valued by survey participants for the ‘Historical’ value (n = 5). Padley Gorge (centre), one of the temperate rainforests of Britain (Shrubsole 2022), is valued for the ‘Green Space’ qualities (n = 4) and ‘Communal’ values (n = 4), similar to the National Trust's Longshaw Estate (upper right) with recognition given predominantly to ‘Green Space’ value (n = 3) and ‘Communal’ value (n = 3). (Map created in QGIS, © OpenStreetMap contributors; data contain OS data © Crown copyright and database right 2022.)

Figure 11

Figure 10. Detail of the overview map (Fig. 8). Notable is the difference between the public perception of the two valleys: Rivelin Valley (upper centre) and the Porter Valley (centre). Rivelin Valley, with its deep early industrial history present in the various sites of ‘Wheels’ along the river, is predominantly valued for its ‘Historical’ value (n = 6), while the Porter Valley in the heart of the city is predominantly valued for its ‘Green Space’ value (n = 14). (Map created in QGIS, © OpenStreetMap contributors; data contain OS data © Crown copyright and database right 2022.)

Figure 12

Figure 11. Detail of the overview map (Fig. 8). The legend provides a more fine-grained categorization. ‘Historical’ (n = 12) and ‘Arts & Culture’ values (n = 12) dominate. (Map created in QGIS, © OpenStreetMap contributors; data contain OS data © Crown copyright and database right 2022.)

Supplementary material: File

Tenzer and Schofield supplementary material 1

Tenzer and Schofield supplementary material
Download Tenzer and Schofield supplementary material 1(File)
File 135.5 KB
Supplementary material: File

Tenzer and Schofield supplementary material 2

Tenzer and Schofield supplementary material
Download Tenzer and Schofield supplementary material 2(File)
File 93.8 KB
Supplementary material: File

Tenzer and Schofield supplementary material 3

Tenzer and Schofield supplementary material
Download Tenzer and Schofield supplementary material 3(File)
File 125.9 KB