Ecotourism

The Spatio-temporal Characteristics of Shanghai Tourist Flow Network Based on Change Point Detection

  • XIA Shuang ,
  • ZHANG Yao ,
  • FANG Tianhong , *
Expand
  • School of Art Design and Media, East China University of Science and Technology, Shanghai 200237, China
* FANG Tianhong, E-mail:

XIA Shuang, E-mail:

Received date: 2024-04-03

  Accepted date: 2024-08-01

  Online published: 2025-03-28

Supported by

The Key Project of National Natural Science Foundation of China(42130510)

Abstract

Taking Shanghai as an example, this study obtained the online travel notes data from Xiaohongshu and Qunar in the past 10 years to construct the Shanghai tourist flow network (STFN) and used the methods of change point detection (CPD) and complex network analysis (CNA) to reveal the spatial structure characteristics of Shanghai tourism flow and the dynamic evolution process of STFN. The results showed that: (1) In the past 10 years, Shanghai tourist market had experienced a process of evolution from stable and orderly to short-term fluctuation and then gradual recovery, and the year of 2019 was the turning point of tourist flow network evolution. (2) The small-world and approximate scale-free characteristics of STFN were verified, and the network changed from disassortative to temporary assortative, showing a development trend of external expansion and internal separation. (3) While the centrality indicators of tourist flow network remained stable as a whole, the attention to cultural nodes was also increasing with the emergence of new nodes; (4) In terms of spatial connection, new popular nodes emerged and the relationship between them and the surrounding nodes was strengthened; (5) The spatial pattern of tourist flow network presented an inverted “V” shape and gradually expanded to southwest and southeast, forming a network with core nodes as the center and radiating outward. At the same time, newly emerging nodes at the periphery had formed relatively independent clusters.

Cite this article

XIA Shuang , ZHANG Yao , FANG Tianhong . The Spatio-temporal Characteristics of Shanghai Tourist Flow Network Based on Change Point Detection[J]. Journal of Resources and Ecology, 2025 , 16(2) : 546 -557 . DOI: 10.5814/j.issn.1674-764x.2025.02.022

1 Introduction

The tourism industry is one of the crucial industries in Shanghai, and closely related to the well-being of citizens. It plays an important supporting role in enhancing the urban level and core competitiveness of Shanghai. In June 2021, the Shanghai Municipal People’s Government issued the “Shanghai’s 14th Five-Year Plan for Deepening the Construction of a World-Famous Tourist City”, and continuously promoted the construction of a high-quality world-famous tourist city. With the continuous improvement of Shanghai’s recreational and livable functions, Shanghai’s tourist attractions present a situation of quality improvement and optimization in the central urban area, and the emergence of new points in the suburbs. Tourist flows, as a collective spatial movement phenomenon generated by similar tourism demands, can reflect the competitiveness of cities and the composition of the attraction system. Building a tourist flow network helps to adjust the tourism supply structure in a targeted manner, optimize resource allocation, enhance tourists’ travel experience and advance the integrated development of regional tourism. The impact of the COVID-19 pandemic on people’s travel behaviors is profound. Considering this additional factor, has the tourism flow network in Shanghai shown fluctuations and new characteristics? This paper intends to use online travelogue data as a basis to analyze the spatiotemporal evolution of Shanghai’s tourism flow network over the past decade, fully grasping tourists’ preferences for Shanghai’s tourist attractions and the evolutionary patterns and characteristics of the tourist travel network. This is of reference significance for the optimization planning of Shanghai’s tourist attractions and the efficient allocation of tourism resources in the post-pandemic era.

2 Literature overview

2.1 Tourist flow

Tourist flow, within the scope of “flow research” (including goods flow, traffic flow, tourist flow, information flow, etc.), was closely related to “mobility” (Zhong et al., 2010). Because tourism resources are generally immobile and unevenly distributed in space, tourists generate “flow” between different attractions to meet their own recreational needs. This phenomenon of tourists’ migration among scenic spots was called tourist flow (Zhang et al., 2005). The study of tourist flow began as early as the 1960s, mainly focusing on the spatial pattern and impact of tourist flow (Williams and Zelinsky, 1970). China’s research on this issue emerged at the end of the last century, initially focusing on the system, spatial distribution (Ma and Li, 2000; Zhang, 2000), and spatial structure (Wu and Bao, 2005; Zhong et al., 2009) of tourist flow. The construction and definition of tourism flow system were also still hot topics of research (Yuan, 2005; Xue, 2006) at that time. Later, the research focused on the spatial effect (Ma et al., 2008; Wang et al., 2010), spatial distribution characteristics, spatial transfer (Liu and Ma, 2008; Zheng et al., 2010), and flow quantity and quality analysis of tourist flow (Wang and Wu, 2016; Kang et al., 2020), mainly concentrating on the characteristics of spatiotemporal distribution (Yan et al., 2017) and network structure of tourist flow (Wang et al., 2023). Tourist flow had been widely valued by the academic community (Wang et al., 2022a). To this day, tourist flows have been widely valued by the academic community, with continuous in-depth research making certain progress in aspects such as spatial patterns, impacts, system construction, spatiotemporal distribution characteristics, and network structures. However, there is a lack of in-depth analysis of the dynamic changes in tourist flows and individual behaviors, and discussions on the social motives and psychological mechanisms behind tourist flows are not deep enough. This has limited the precision of tourist flow forecasting and the formulation of management strategies.

2.2 Tourist flow network

Tourist flow not only referred to the spatial movement trajectories of tourists but also encompassed the collection of various economic and social impacts resulting from this movement. It also reflected the connections between tourism destinations. Various spatial scales had been involved in the research on the spatial structure of tourist flow networks. The research questions included the structural characteristics (Sun et al., 2023), influencing factors (Xu et al., 2018; Li et al., 2021), evolutionary mechanisms (Yang and Wu, 2015), spatial patterns (Wang et al., 2020), comparative analysis of urban tourist flow network evolution (Yang and Wu, 2015), and cross-over studies with other societal hotspots (Cheng and Jia, 2020), etc. Various research methods were employed, such as GIS spatial analysis (Wang et al., 2021; Sun et al., 2023), social network analysis (Fang et al., 2023), QAP method (Zhou et al., 2020), etc. In terms of data acquisition, early studies heavily relied on survey questionnaires and official statistics, facing challenges in terms of quantity, adequacy, and timeliness. With the development of smart tourism, the mining of digital footprint data had made up for the defects of traditional data, and the ways of obtaining digital information such as online travel notes of tourism websites (Gholamhosseinzadeh et al., 2021), Baidu Index (Liang and Ma, 2023), Tencent population migration big data (Wu et al., 2020), Flickr website (Laura and Balzan, 2023), etc. have become new hotspots of attention. Overall, research on tourist flow networks covers aspects such as spatial structure, characteristics, influencing factors, evolutionary mechanisms, and urban evolution features. It is in a transitional phase from traditional data acquisition to the mining of digital footprint data, with the analysis of the complexity and dynamics of tourist flow networks not being comprehensive enough.

2.3 Change point detection (CPD)

Change point detection (CPD), as an emerging interdisciplinary research method, was often applied to the research fields of statistics, signal processing, time series, etc., and was most commonly used for the prediction and anomaly detection of time series, which specifically referred to finding the smallest set of breakpoints in the given time series data, where the distribution or statistical properties of the data underwent a significant change at each breakpoint.
Assuming there was an ordered data sequence: y1, ${{y}_{2}},\cdots,{{y}_{n}}$, a change occured at some point in time, resulting in different statistical properties before and after the change, this point was considered a change point (Wang and Huang, 2023). Several advanced algorithms had been proposed, mainly divided into three categories: statistical methods represented by autoregressive models (AR), moving average models (MA), pruned exact linear time (PELT), and simple exponential smoothing (SES); classical machine learning methods such as K-means clustering - subsequence time series clustering (STSC) (Keogh and Kasetty, 2003), one-class support vector machine (OC-SVM) (Tax and Duin, 1999), and local outlier factor (LOF) (Breunig et al., 2000); and deep learning methods, mainly including multilayer perceptron (MLP), residual neural network (Resnet) (He et al., 2016), and long short-term memory (LSTM) networks (Van Houdt et al., 2020), which widely were applied in various fields. With the maturity of the models and methods of complex network analysis (CNA), scholars had conducted extensive research in areas such as global transportation network characteristics (Del Mondo et al., 2021; Peng et al., 2021), landscape ecological security (Zhou et al., 2021), evolution of commodity trade patterns (Jiang and Wang, 2020), and more. The research on tourist flow network at the international (Wang et al., 2022b), domestic (Zhou and Xu, 2019) and urban agglomeration levels (Wang and Liu, 2022; Zhang and Yuan, 2023), a series of achievements had been made (Zhou et al., 2023). However, there is no research on applying the CPD to the tourism market stage division. Through this method, it is expected to identify the key time points more accurately in the evolution process of the tourist flow network and reveal the evolution process of the tourism network structure and the overall network characteristics. However, there has not yet been any research applying change point detection methods to the segmentation of tourism market stages. Through this method, it is expected to more accurately identify the key time points in the evolution of tourist flow networks, thereby revealing the evolution process of the tourist network structure and the overall network characteristics.

2.4 Complex network analysis (CNA) and tourist flow

Currently, most of the research on tourist flow network relies on web text mining of tourism information data. Research methods included social network analysis (SNA) (Ren et al., 2023), Geography information system spatial analysis (GIS), Girvan-Newman algorithm (GN) (Zhang et al., 2023), among which social network analysis method has the largest proportion.
SNA, as a classic method of social relation research, mainly focused on the relationship and social structure (Zeng and Yu, 2022) between individuals and emphasizes revealing the group relationship and individual position in these relationships in the society. In SNA, more attention was paid to the relationship between members, and the individual attributes of network members were easily ignored. However, complex network analysis (CNA) can more widely study the general properties of network structure, including physical system, biological system, and social system, etc., and included various connection relationships between nodes, not limited to social relations. Applying CNA into the research of tourist flow network can use more types of measurement indicators, including degree distribution, clustering coefficient, network diameter, etc., to describe the structure and behavior of the network more comprehensively.
In summary, although there were abundant research achievements on tourist flow network, few studies had applied complex network methods to urban scale, and previous studies mostly divided time nodes by events, paying little attention to the natural evolution of the network. Tourist flow network is a dynamic system, using CPD to divide the time series can improve the accuracy of the time series division, and help to identify the turning points in the network time series, thus more accurately depict the spatial-temporal evolution characteristics of the network. During the outbreak of the COVID-19 pandemic, the tourism industry was severely impacted, and the radius of people’s travel was greatly restricted. Currently, there has been no scholarly research comparing the changes in the tourism flow network before and after the pandemic. With the pandemic factor superimposed, the changes in Shanghai’s tourism flow network from both spatial and temporal perspectives are of practical significance for further understanding the resilience of Shanghai’s tourism industry development and for more scientifically planning and distributing tourism resources in the post-pandemic era. Therefore, this paper collects the footprint data of online travelogues in Shanghai over the past decade, applies change point detection and complex network analysis methods, constructs a spatiotemporal evolution model of Shanghai’s tourism flow network, reveals the structural characteristics and spatiotemporal evolution laws, portrays its long-term trends and short-term fluctuation characteristics in evolution, and explores the endogenous driving forces of its evolution, providing more scientific and effective support for the spatiotemporal planning and management of tourism in Shanghai.

3 Methods

3.1 Study area

Shanghai, as the economic, financial, and trade center of China, is also a renowned tourist city in the country. It possesses unique advantages in terms of location, transportation, resource endowment, factor markets, and consumer markets for the development of the tourism industry.
Shanghai not only boasts rich cultural and tourism resources, including Red Culture, Jiangnan Culture, and Shanghainese Culture, but also gathers high-quality resources in finance, technology, and talent, along with numerous outstanding tourism enterprises. There are currently 130 national A-level tourist attractions, 11 national historical and cultural towns, 3449 immovable cultural relics, 1058 excellent historical buildings, and 44 historical preservation areas. Additionally, the city is home to 153 museums, 94 art galleries, 217 theaters and new performance spaces, and 385 cinemas.
In 2019, Shanghai received 361.4051 million domestic tourists throughout the year, with a total revenue of 552.2 billion yuan, with the tourism sector contributing over 6% to the city’s GDP. Despite the impact of the COVID-19 pandemic, Shanghai welcomed 236 million domestic tourists in 2020, generating 280.95 billion yuan in domestic tourism revenue. These figures represent recovery rates of 65.3% and 58.7% compared to 2019 levels for tourist numbers and revenue, respectively. Notably, Shanghai’s recovery rates exceeded the national average. In the first half of 2023, Shanghai hosted 144 million domestic tourists, with revenue totaling 172.4 billion yuan. These numbers indicate growth of 55% and 69% compared to 2020, respectively. Shanghai’s tourism industry had rebounded to 83% and 78% of the levels during the same period in 2019, respectively, surpassing the national average recovery rates by 6% and 8%, respectively. These statistics highlighted Shanghai’s resilience and efforts in revitalizing its tourism sector despite the challenges posed by the pandemic. These advantages demonstrate that Shanghai, as a vibrant and modern tourist destination, maintains a unique appeal to visitors despite the challenges faced by the tourism market due to external factors.
Shanghai’s tourism industry leads the nation and serves as a model for other domestic cities (Li and Qu, 2021). Therefore, this study selects Shanghai as the research subject, intending to reveal the structural characteristics and evolution patterns of the STFN. The findings will contribute to the theoretical basis for optimizing the spatial layout of urban tourism.

3.2 Data collection

The data collection involved mining online travelogue data from both Xiaohongshu (Little Red Book) and Qunar, two prominent platforms. Xiaohongshu, a rising social e-commerce platform in China, primarily attracts a young user base, boasting a current monthly active user (MAU) count of 260 million. In contrast, Qunar is a well-established domestic online travel service platform with a large user base. Filtering travelogues with Shanghai as the destination on the two platforms from 2014 to 2023, a total of 4326 footprint records after cleaning had been obtained. The basic geographical data was sourced from the National Basic Geographic Information Center, while the geographical coordinates of tourism nodes were obtained from the Gaode Maps API, using the WGS1984 coordinate system. Annual data on the number of domestic tourists received by Shanghai was derived from the “Shanghai Statistical Yearbook” and the Shanghai Tourism Industry Statistical Bulletin, covering a period from 2014 to 2022.

3.3 Model construction

The STFN is constructed based on visitor footprints. This network is represented as an undirected weighted graph denoted by G=(V, E, W), where $V=\{{{v}_{1}},{{v}_{2}},\cdots,{{v}_{n}}\}$ represents the set of nodes, $E=\left\{ {{e}_{ij}} \right\}$ represents the set of edges, and $W=\left\{ {{w}_{ij}} \right\}$ represents the set of weights. Here, ${{e}_{ij}}$ indicates whether there is a connection between node ${{v}_{i}}$ and node ${{v}_{j}}$; if a connection exists,${{e}_{ij}}$=1; otherwise, ${{e}_{ij}}$=0. $W=\left\{ {{w}_{ij}} \right\}$ represents the set of weights, quantified by the number of likes received in travelogues. This network structure aids in identifying key nodes in tourist activities, understanding the relationships and connection strengths between nodes, and exploring the evolution of the tourist flow network over time.

3.4 Change point detection (CPD)

Existing evaluations of univariate time series data indicates that statistical methods perform best in detecting point anomalies and collective anomalies. Not only do these methods provide more accurate anomaly detection, but they also execute faster, with shorter training and prediction times (Braei and Wagner, 2020). One notable statistical algorithm is the Pruned Exact Linear Time (PELT) algorithm proposed by Killick et al. (2012). This dynamic programming-based statistical method identifies structural change points in non-stationary sequence data. It achieves this by minimizing a cost function to determine the positions of change points. The expression is as follows:
$Cost(t)=\underset{0\le s\le t-1}{\mathop{\min }}\,[Cost(s)+Penalty(s+1,t)]$
where, Cost(t) represents the total cost of a split at position t, Cost(s) represents the total cost of a split at position s, Penalty(s+1, t) is the penalty term between positions s+1 and t, which is typically associated with the magnitude of the change between segmentation points.

3.5 Complex network analysis (CNA)

Since the 1990s, complex network theory has rapidly developed and permeated various disciplines. Many scholars have applied it to model the tourist flow network, aiming to reveal the underlying mechanisms within systems. These models provide insights for data analysis, system optimization, and decision-making (Table 1). By building tourist flow networks connecting numerous tourist nodes, complex real- world phenomena can be simulated and explained (Zhang and Liu, 2014; Zhang and Liu, 2015).
Table 1 Formulas and meanings of statistical indicators for complex networks
Indicators Formula Mathematical meaning Concept definition
$\sigma $ $\sigma =\frac{C}{Crand}\times \frac{Lrand}{L}$ σ is a metric used to quantify the small-world characteristics of a network; C represents the actual clustering coefficient of the network; L represents the actual average shortest path length of the network; Crand represents the average clustering coefficient of a random graph with the same number of nodes and edges; Lrand represents the average shortest path length of a random graph with the same number of nodes and edges Compare the network’s clustering and short path properties with the corresponding values in a random network. If σ is greater than 1, it indicates that the network exhibits small-world characteristics
$\omega $ $\omega =\frac{Lr}{L}-\frac{C}{Cl}$ ω is a metric used to quantify the small-world characteristics of a network; C and L represent the average shortest path length, Lr represents the average shortest path length of the equivalent random graph, and Cl represents the average clustering coefficient of the equivalent lattice graph The clustering coefficient measures to what extent a network resembles a lattice or a random graph. $\omega $ close to 0 implies small-world characteristics
Degree ${{k}_{i}}=\sum\limits_{j}{{{A}_{ij}}}$ ki is the degree of node i; Aij indicates whether there is a connection between node i and node j, 1 if there is, or 0 if there is not It represents the number of edges connecting a specific node to other nodes in a network, reflecting its centrality within the network
Degree of weighting ${{k}_{wi}}=\sum\limits_{j}{{{w}_{ij}}}$ kwi is the weighted degree of node i; wij represents the weight of the edge between node i and node j It represents the weight of the edges between nodes in a network. It reflects the sum of the weights of the edges connecting a specific node to other nodes, indicating its connectivity strength within the network, i.e., its influence
Cumulative degree
distribution
${{P}_{cum}}(k)=P(K\ge k)$ ${{P}_{cum}}(k)$ represents the proportion of nodes with degree at least k; K represents the degree of the node Describe the probability distribution of node degrees in a network. It represents the cumulative probability of nodes with at least k connections in the network
Average
clustering coefficient
${{C}_{avg}}=\frac{1}{N}\sum\limits_{i=1}^{N}{{{C}_{i}}}$ Cavg is the average clustering coefficient of the entire network; Ci represents the clustering coefficient of node i, and N represents the total number of nodes in the network It reflects the aggregation of nodes in the network, which represents the network’s clustering tendency.
Average shortest path ${{L}_{avg}}=\frac{1}{N(N-1)}\sum\limits_{i\ne j}{{{d}_{ij}}}$ Lavg is the average shortest path length of the network; N represents the total number of nodes in the network; dij represents the shortest path length between node i and node j It characterizes the average distance between any two nodes, reflecting the degree of separation among nodes in the network
Network
density
$D=\frac{2\times L}{N\times (N-1)}$ D represents the value of the network density; L represents the actual number of connections (edges) in the network, and N is the total number of nodes in the network The greater the number of edges in the network, the denser the network is.
Network
diameter
$D=\text{ma}{{\text{x}}_{i,j}}{{d}_{ij}}$ D represents the value of the network diameter; dij represents the shortest path length between node i and node j This indicates the maximum distance for information propagation within the network, which reflects the longest shortest path length between any two nodes in the network
The compatibility of node degrees $r=\frac{\mathop{\sum }_{i,j}\left( {{A}_{ij}}-\frac{{{k}_{i}}{{k}_{j}}}{2m} \right)\delta ({{k}_{i}},{{k}_{j}})}{\mathop{\sum }_{i,j}\left( {{A}_{ij}}-\frac{{{k}_{i}}{{k}_{j}}}{2m} \right)(1-\delta ({{k}_{i}},{{k}_{j}}))}$ r represents the compatibility of node degrees; Aij indicates whether there is a connection between node i and node j; ki represents the degree of node i ; m represents the number of edges in the network. δ (ki, kj) represents the Leopold Kronecker δ sign, which is 1 when k = kj and 0 otherwise If assortativity is positive, it indicates that the network tends to connect nodes with similar degrees; conversely, it tends to connect nodes with different degrees
Near centrality ${{C}_{i}}=\frac{1}{\frac{1}{N-1}\sum\limits_{j\ne i}{{{d}_{ij}}}}$ Ci represents the near centrality value representing node i. N represents the total number of nodes in the network; dij represents the shortest path length between node i and node j It measures the average distance from a node to other nodes in the network, indicating the centrality of the node within the network
Centrality of intermediate numbers $CB(v)=\sum\limits_{\begin{smallmatrix}
s\ne v,\ v\ne u \\
s,t\in V
\\
s\ne t
\end{smallmatrix}}{\frac{{{\sigma }_{st}}(v)}{{{\sigma }_{st}}}}$
CB (ν) represents the betweenness centrality of node ν; σst represents the number of shortest paths from node s to node t; σst (ν) represents the number of shortest paths through node ν It reflects the frequency with which a node lies on the shortest paths connecting other nodes in the network, thus indicating the node’s influence in information propagation within the network
Centrality of eigenvectors $Ax=\lambda x$ A represents the adjacency matrix of the network, representing the connections between nodes; x represents the eigenvector centrality of nodes; λ represents the eigenvalue corresponding to the eigenvector x It is an index to measure the importance of nodes in a network, considering the connection strength between nodes and their neighbors

4 Results

4.1 Network timing division

Figure 1 illustrated the phase division of the Shanghai tourist market. It can be observed that the previous phase remained stable and orderly, while the subsequent phase experienced brief fluctuations followed by gradual recovery. The number of domestic tourists grew steadily during the previous phase, indicating that the scale of the tourist network continued to expand, with overall growth being relatively orderly and stable. However, upon entering the subsequent phase, the network’s complexity and uncertainty became evident. Disruptions caused by the COVID-19 pandemic led to a sharp decline in tourist numbers, increasing the overall instability of the tourism system. In 2021, as tourist numbers began to rise, the Shanghai tourist market faced a recovery, and a new evolutionary trend was budding.
Figure 1 Points of variation and stages of the Shanghai tourist market

4.2 Verification of network complexity features

The cumulative probability distribution of the two phases of the STFN was calculated (as shown in Figure 2) and compared it with an equivalent-sized (same number of nodes and edges) random network. The cumulative degree distribution of the STFN for both periods, when plotted on a log-log scale with logarithmic binning, approximately followed the scale-free characteristics typical of complex networks. In contrast, the random network exhibited a power-law distribution, highlighting significant differences between the two.
Figure 2 Scale-free characteristic metrics for the two-phase network
Quantitative indicators related to the small world characteristics of the STFN were statistically analyzed (as summarized in Table 2) (Barrat and Weigt, 2000). The values of $\sigma $ and $\omega $ were obtained by averaging results from ten random generations, and their standard deviation and coefficient of variation were calculated (Telesford et al., 2011). According to the table,$\sigma $>1 and $\omega $ falled within the range of (-1, 1), both standard deviations were below 10%, and the coefficients of variation were below 15%, confirming the validity of the computed results. In summary, both networks exhibited small-world network characteristics, indicating that the STFN possessed small-world characteristics and approximated a scale-free network, aligning with the fundamental features of complex networks.
Table 2 Table of indicators of small world characteristics for the two phases
Statistical stage Average Standard deviation Coefficient of variation
σ ω σ ω σ ω
2014-2019 1.150405 -0.38931 0.051929 0.051414 0.04514 -0.13207
2020-2023 2.315617 -0.89681 0.091601 0.045524 0.039558 -0.05076

4.3 The overall structure of the network

4.3.1 The network transitioned from disassortative to temporarily assortative characteristics

To gain deeper insights into the network structure, the assortativity of node degrees was calculated to measure the correlation between a node’s degree and the degrees of its neighboring nodes. The computed results revealed that the assortativity coefficient (r value) for the period 2014-2019 was -0.0421, indicating disassortative behavior. However, during the period 2020-2023, the assortativity coefficient increased to 0.1669, signifying assortative behavior. This shift suggested that in the previous phase, Shanghai’s prominent tourism nodes, acting as essential hubs for connectivity and information dissemination, were more likely to connect with destinations of lower degrees. It also implied that tourists enjoyed considerable freedom in choosing diverse and heterogeneous destinations during their travel, validating findings from previous research by Xu et al. (2022). In contrast, the subsequent phase exhibited assortative features, where tourists tended to select nodes with similar popularity levels. Destination choices became less diverse, but lesser-known, niche destinations began to attract attention.

4.3.2 Network exhibits overall expansion and internal separation trend

Analysis of node and edge counts (as shown in Table 3) revealed significant growth in the STFN, indicating an expanding scale with the addition of new nodes. This expansion implied the establishment of novel connections between nodes and the emergence of new tourism routes, leading the network toward greater complexity and diversity. Examining the average clustering coefficient, during the stable growth phase (2014-2019), the network exhibited an average clustering coefficient of 0.356. However, during the chaotic and transitional phase, the clustering coefficient dropped to 0.247, indicating reduced clustering. This weakening of clustering suggested that the network was undergoing a transformation, with more dispersed node connections and relatively fewer inter-node links. Analyzing the average shortest path length, the average shortest path for the period 2014-2019 was 3.452, while for 2020-2023, it increased to 5.031. This indicated that relationships between nodes in the subsequent phase became sparser, corroborating the network’s overall expansion and internal separation. This trend was primarily attributed to several factors. The addition of new tourism nodes, including remote villages like Haishen Village and Wufang Village, led to fewer connections with existing nodes, resulting in greater dispersion and reduced clustering. The COVID-19 pandemic disrupted travel, causing a decrease in overall tourist numbers in Shanghai. Consequently, connections to popular tourism nodes weakened. Changing tourism patterns, demands, and behaviors, along with restrictions on local movement, prompted city residents to explore lesser-known destinations. Additionally, online platforms such as Little Red Book and Qunar influenced tourists’ destination choices (Claire and Hoi, 2023), contributing to shifts in the network structure.
Table 3 Statistics on the amount of network features in the two phases
Statistical stage Number of nodes Number of edges Average clustering coefficient Average shortest path Network density Network diameter
2014-2019 411 1350 0.356 3.452 0.010 12
2020-2023 1474 3606 0.247 5.031 0.002 20

4.4 Evolution of key node functions

Key nodes were typically nodes within a network that hold significance in maintaining stability and connectivity. These nodes played a crucial role in the overall structure and functionality of the network. By identifying and analyzing the characteristics of these nodes, a better understanding of the network’s structure and functions could be achieved (Loureiro et al., 2020). The weighted degree, closeness centrality, intermediate centrality, and eigenvector centrality of the top 10 tourist destinations in the STFN were calculated (as shown in Table 4) to analyze the key nodes changes of the network.
Table 4 Statistics on the amount of key node features of the two-phase network
Statistical period Key nodedw Degree of weighting Key nodecin Centrality of
intermediate
numbers
Key nodecc Closeness centrality Key nodece Centrality of
eigenvectors












2014-2019
The Bund 2960553 Nanjing road walkway 0.0889 The Bund 0.2629 The Bund 0.2414
Nanjing road walkway 2662445 The Bund 0.0781 Wukang road 0.2601 Nanjing road walkway 0.2221
Wukang road 2203503 Wukang road 0.0525 Nanjing road walkway 0.2591 Wukang road 0.2084
Yu Garden 1672685 Lujiazui 0.0516 Shanghai
Disney Resort
0.2556 Shanghai Disney Resort 0.2070
Town god’s temple of
Shanghai
1521739 Humin road 0.0457 Lujiazui 0.2552 Lujiazui 0.2018
Oriental pearl
TV tower
1514918 Grand
gateway 66
0.0441 Jing’an Temple 0.2475 Yu Garden 0.1811
Lujiazui 1496821 Caoxi road 0.0431 Town god’s temple of Shanghai 0.2471 Town god’s
temple of
Shanghai
0.1711
Tianzifang 1233904 Shanghai Disney Resort 0.0422 Yu Garden 0.2468 Wukang building 0.1687
Shanghai
Disney Resort
1180176 Jing’an
Temple
0.0392 Shanghai
museum
0.2467 Shanghai
museum
0.1676
Anfu road 990447 Yu Garden 0.0371 Oriental pearl TV tower 0.2442 Tianzifang 0.1630













2020‒2023
The Bund 1041680 The Bund 0.1769 The Bund 0.4386 The Bund 0.2465
Nanjing road walkway 763636 Tianzifang 0.1057 Nanjing road walkway 0.4245 Town god’s
temple of
Shanghai
0.2306
Town god’s temple of
Shanghai
612702 Nanjing road walkway 0.0970 Tianzifang 0.4229 Tianzifang 0.2263
Tianzifang 488072 Town god’s temple of Shanghai 0.0846 Town god’s temple of Shanghai 0.4224 Nanjing road walkway 0.2233
Yu Garden 396304 Oriental pearl TV tower 0.0636 Oriental pearl TV tower 0.4098 Neo world of Shanghai 0.2071
Oriental pearl
TV tower
388425 Lujiazui 0.0513 Neo world of Shanghai 0.4064 Lujiazui 0.2051
Lujiazui 280850 Nanxiang ancient town 0.0452 Lujiazui 0.4030 Oriental pearl TV tower 0.2023
Neo world of Shanghai 253747 Wukang road 0.0419 Yu Garden 0.3956 Yu Garden 0.1799
Shanghai
Disney Resort
227849 Qibao ancient town 0.0402 Shanghai world expo park 0.3915 Shanghai
museum
0.1683
People’s Square of Shanghai 199626 Jing’an
Temple
0.0388 1933 Shanghai 0.3893 Shanghai world expo park 0.1657
From the Table 4, it can be observed that:
(1) Various nodes experienced rapid growth in their weighted degrees, and Anfu road, emerging as a popular node in recent years, had standed out. This indicated that while the area around the Bund and Nanjing road walkway remained the core of the Shanghai tourism network, Anfu road’s influence had significantly increased due to evolving consumer demands.
(2) The centrality indicators for most nodes remained relatively stable, suggesting the network structure and the function of the key nodes were relatively stable, and traditional tourist destinations maintained consistent popularity levels.
(3) The Bund-Nanjing road walkway consistently served as a vital network resource hub. However, the strengthening intermediary effects of nearby ancient towns and their elevated status as “critical channels” had shifted the landscape. Nanxiang ancient town and Qibao ancient town, representing innovative models of cultural and tourism integration, played pivotal roles in enhancing network connectivity.
(4) In addition to traditional tourist destinations, the centrality of nodes such as the Shanghai world expo park and the Shanghai museum had further increased. This suggested that cultural nodes have become new popular choices, and the demand for cultural experiences had gradually become one of the essential factors influencing destination choices for tourists.

4.5 Network space evolution

To further explore the spatial evolution characteristics of the STFN, ArcGIS 10.8 was used to transform the topological relationships of the Shanghai tourism network into spatial connections. The strength of the Shanghai tourist network linkages during the period 2014-2023 were sorted and visually represented in space, as shown in Figure 3. Referring to relevant threshold research results (Wu et al., 2015) and considering the quantity of connections in the STFN, linkages were extracted for 0-0.25%, 0.25%-0.5%, 0.5%- 1.0%, 1.0%-2.5%, 2.5%-5.0%, and 5.0%-10% of the main contact strengths. These were then merged and simplified into three types of connection networks: core network, skeleton network, and regional network (Jiang et al., 2023).
Figure 3 Spatial evolution of network linkages in the two phases
The connections within the core network had significantly increased, indicating connections between core nodes gradually strengthened, leading to the emergence of new popular tourism nodes. Initially, network connections were concentrated in the central area around the Bund. Over time, the network expanded southeastward, ultimately forming a pattern with the Bund as the core and Shanghai Disney resort as a secondary center.
The weighted degrees of various skeleton network nodes had increased to varying extents. This reflected the rising importance and influence of these nodes within the tourist flow network. The overall structure of the network remained relatively stable, exhibiting an inverted “V” shape. The southeastward expansion trend began to emerge.
The tendency for expansion was more pronounced within the regional network, with network linkages gradually extending from the central core nodes towards the southwest and southeast, forming an outward-radiating network. Simultaneously, this expansion extended beyond core-to- core connections and included links between core nodes and adjacent non-core nodes, such as the relationship between the world expo park and Century park. Nodes like the Shanghai astronomical museum and Dishui lake had formed relatively independent pairs of connections, which was due to their geographically distant locations from the city center.

5 Discussion

This study explored the spatio-temporal evolution of the tourist flow network in Shanghai, and extended and expanded on the aspects of time division, network complexity verification, etc. It had made some progress on the aspects of the evolutionary stages, overall structure identification, and spatial characteristics analysis of the STFN. However,
there are still some limitations:
(1) The experiment of using domestic tourist data for CPD had certain limitations. Tourist flow should include not only tourist flow, but also information flow, capital flow, material flow, energy flow and cultural flow, etc. To obtain more objective conclusions, it is necessary to correlate and verify with different industries in the future.
(2) The dimensions of network travelogues were insufficient, and the accuracy was limited. In the future, big data of intelligent mobile devices can be used to capture tourists’ real-time behavior more accurately and understand the spatio-temporal evolution law of tourist flow more comprehensively.
(3) The evolution characteristics of STFN had received enough attention, but the analysis of the influencing factors was still insufficient. It is necessary to further explore the driving mechanism behind the network evolution.
Tourist flow network has the characteristics of dynamism and complexity. To better promote the development of Shanghai’s tourism industry, firstly, the government and tourism enterprises need to continuously monitor the changes in the tourism market, adapt to the changing market demand and new tourist flow trends, and timely adjust the tourism management and operation strategies to ensure the stability and health of the tourism market. Secondly, for the heterogeneity of the nodes in the tourist flow network, the government and tourism enterprises need to provide personalized services and marketing, promote the integrated development of culture and tourism, provide tourists with a more comprehensive experience. Finally, the development of the emerging nodes on the edge needs more support, so that they can become the new core of the outward radiation network and enhance the attractiveness and competitiveness of the whole network.

6 Conclusions

Using CPD and CNA to identify critical turning points, the dynamic evolution characteristics of the STFN are explored. The conclusions are as follows:
(1) The evolution of STFN in the past 10 years can be divided into two stages: 2014-2019 and 2020-2023. It underwent a phase of steady orderliness, a brief fluctuation, followed by a gradual recovery stage. The year of 2019 was the turning point of its network change. The outbreak of the COVID-19 pandemic in early 2020 had a huge impact on the tourism industry, causing the overall change of STFN. The tourist flow network in Shanghai experienced fluctuations after the outbreak of the pandemic, but then gradually recovered, indicating that the tourist flow network in Shanghai exhibits different development patterns at different times, and also reflects its sensitivity and adaptability to external shocks.
(2) STFN had small-world and approximate scale-free network characteristics. Under the joint influence of various factors such as the pandemic, policies, and tourists’ travel preferences, the network changed from disassortative to temporarily assortative, and the trend of external expansion and internal separation was significant. These network evolution characteristics reflected the dynamism and complexity of STFN. Under the joint influence of multiple factors such as the pandemic, policies, and tourists' travel preferences, the network transitions from a disordered state to an ordered state, indicating that the tourist flow network in Shanghai can adapt to environmental changes by adjusting its structure. This also reflects the dynamics and complexity of Shanghai’s tourist flow network.
(3) Shanghai’s tourism network was stable overall, while new nodes had also emerged. Traditional popular destinations were still important, and the rise and diversification of emerging cultural nodes had added new vitality to the network. The Shanghai tourist flow network is overall stable, and at the same time, the rise of emerging cultural nodes and the diversification of tourism demands also continuously inject new vitality into the network. This indicates that while maintaining stability, the Shanghai tourist flow network is also constantly adapting to and integrating new elements.
(4) With the emergence of new popular nodes and the strengthening of the connection with the surrounding nodes, the tourist flow network presented an inverted “V” pattern and gradually expanded southwest and southeast. While forming an outward radiation network, the edge also generated relatively independent connections of new nodes. This indicates that while maintaining the cohesion of the central areas, the Shanghai tourist flow network is also diffusing to the peripheral regions, forming a more diverse and extensive set of connections.
In summary, the research reveals the dynamic evolution of the network characteristics and structure of the Shanghai tourist flow network at different time stages, as well as its adaptive development in response to external shocks (such as the COVID-19 pandemic) and internal updates (such as the growth of new tourism nodes and changes in tourists' preferences). It also verifies the complexity of the network and its resilience in the face of shocks. This not only strengthens the understanding of the structural characteristics and development trends of the Shanghai tourist flow network but also helps planners to formulate more flexible strategies, improve the efficiency of resource allocation, and respond to increasingly diverse market demands and potential risk crises, to better meet challenges, seize opportunities, and promote the sustainable development of Shanghai's tourism industry.
[1]
Barrat A, Weigt M. 2000. On the properties of small-world network models. The European Physical Journal B—Condensed Matter and Complex Systems, 13(3): 547-560.

[2]
Braei M, Wagner S. 2020. Anomaly detection in univariate time-series: A survey on the state-of-the-art. ArXiv, 2004(00433). DOI: 10.48550/arXiv.2004.00433.

[3]
Breunig M M, Kriegel H P, Ng R T, et al. 2000. LOF: Identifying density-based local outliers. Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, (5): 93-104.

[4]
Cheng Z, Jia T F. 2020. Study on the coupling and coordination relationship between the structure of tourism flow network and tourism environment in Shanghai. Modern Urban Research, 2020(3): 106-112. (in Chinese)

[5]
Claire L, Hoi T C. 2023. Social media engagement and impacts on post-COVID-19 travel intention for adventure tourism in New Zealand. Journal of Outdoor Recreation and Tourism, 44(A): 100612. DOI: 10.1016/j.jort.2023.100612. (in Chinese)

[6]
Del Mondo G, Peng P, Gensel J, et al. 2021. Leveraging spatio-temporal graphs and knowledge graphs: Perspectives in the field of maritime transportation. ISPRS International Journal of Geo-Information, 10(8): 541. DOI: 10.3390/ijgi10080541.

[7]
Fang Y L, Wang Q Y, Wu Y N, et al. 2023. Spatial distribution characteristics of tourist flow network structure for cities along Shanghai-Kunming high-speed railway. Geography and Geo-Information Science, 39(5): 138-144. (in Chinese)

[8]
Gholamhosseinzadeh M S, Chapuis J, Lehu J M. 2021. Tourism netnography: How travel bloggers influence destination image. Tourism Recreation Research, 48(23): 1-17.

[9]
He K, Zhang X, Ren S, et al. 2016. Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, (12): 770-778.

[10]
Jiang J L, Lu J Y, Ge D Z, et al. 2023. Research on spatial-temporal evolution characteristics of urban economic networks in Yangtze River Delta. Resources and Environment in the Yangtze River Basin, 32(1): 24-39. (in Chinese)

[11]
Jiang W X, Wang Y. 2020. Research on structural change characteristics and Influencing factors of electronic products trade network along the Belt and Road: Based on complex network analysis method. International Business Research, 41(5): 26-40. (in Chinese)

[12]
Kang L, Li Z T, Li W R. 2020. The Q-Q model analysis of China’s provincial inbound tourism market structure. Journal of Shaanxi Normal University (Natural Science Edition), 48(4): 20-27. (in Chinese)

[13]
Keogh E, Kasetty S. 2003. On the need for time series data mining benchmarks: A survey and empirical demonstration. Data Mining and Knowledge Discovery, 7(4): 349-371.

[14]
Killick R, Fearnhead P, Eckley I A. 2012. Optimal detection of changepoints with a linear computational cost. Journal of the American Statistical Association, 107(500): 1590-1598.

[15]
Laura C, Balzan M V. 2023. Characterizing nature-based recreation preferences in a Mediterranean small island environment using crowdsourced data. Ecosystems and People, 19(1): 2274594. DOI: 10.1080/26395916.2023.2274594.

[16]
Li L, Tao Z, Lai Z C, et al. 2021. Analysis of the Internet attention and tourism flow network structure of red tourism resources in Long March National Cultural Park. Journal of Natural Resources, 36(7): 1811-1824. (in Chinese)

DOI

[17]
Li Q, Qu L Y. 2021. The network structure and influencing factors of the tourist flow within the city: The case of Shanghai. World Regional Studies, 30(1): 114-124. (in Chinese)

DOI

[18]
Liang X Y, Ma L J. 2023. The spatial pattern and formation mechanism of domestic tourist flow circulations: Analysis based on network attention data. Tourism Tribune, 38(9): 104-117.

[19]
Liu H Y, Ma Y F. 2008. Coupling analysis of the linkages between inbound tourism flows and tourism economy: A case study of the westward diffusion of tourists through Shanghai. Resources Science, 30(8): 1162-1168. (in Chinese)

[20]
Loureiro G B, Ferreira J C E, Messerschmidt P H Z. 2020. Design structure network (DSN): A method to make explicit the product design specification process for mass customization. Research in Engineering Design, 31(2): 197-220.

[21]
Ma Y F, Li Y J. 2000. Spatial analysis of the inbound tourism flow to and in China. Journal of Shaanxi Normal University (Natural Science Edition), 28(3): 121-124. (in Chinese)

[22]
Ma Y F, Wang G X, Zhang Y Y. 2008. Empirical research on spatial field effects of inbound tourist flows in China: Case study of Sichuan Province. Resources Science, 30(11): 1747-1753. (in Chinese)

[23]
Peng P, Cheng S F, Yang Y, et al. 2021. Research on characteristics and evolution of global LNG transportation network. Geographical Research, 40(2): 373-386. (in Chinese)

[24]
Ren H K, Wei W, Wang K H. 2023. Structural evolution of tourism flow network in Ningxia under the influence of COVID-19. Arid Land Geography, 46(2): 316-324. (in Chinese)

DOI

[25]
Sun Y M, Liu P X, Zhang J X, et al. 2023. Spatial network structure and influencing factors of tourism flow in Northwest China based on online ordering data. Journal of Shaanxi Normal University (Natural Science Edition), 51(6): 123-133. (in Chinese)

[26]
Tax D M J, Duin R P W. 1999. Support vector domain description. Pattern Recognition Letters, 20(11-13): 1191-1199.

[27]
Telesford Q K, Joyce K E, Hayasaka S, et al. 2011. The ubiquity of small- world networks. Brain Connectivity, 1(5): 367-375.

DOI PMID

[28]
Van Houdt G, Mosquera C, Nápoles G. 2020. A review on the long short-term memory model. Artificial Intelligence Review, 53(8): 5929-5955.

[29]
Wang C, Qiao H, Zhang S J, et al. 2022a. Evolution of spatial pattern of inbound tourism flows and enlightenment of high-quality development of metropolitan tourism: A case study of Shanghai. Journal of Natural Resources, 37(12): 3167-3182. (in Chinese)

[30]
Wang G W, Wu T H. 2016. Temporal and spatial distribution of the quantity and quality of the inbound tourism flow to Inner Mongolia Autonomous Region. Journal of Arid Land Resources and Environment, 30(12): 199-203. (in Chinese)

[31]
Wang J W, Wang G Q, Liu Y, et al. 2021. Spatio-temporal distribution and network structure of red tourism flow in Jinggangshan. Journal of Natural Resources, 36(7): 1777-1791. (in Chinese)

DOI

[32]
Wang Q Y, Gao W, Wei J J, et al. 2022b. Global tourism network landscape and its influencing factors. Tourism Tribune, 37(8): 133-149. (in Chinese)

[33]
Wang R Y, Zhang W K, Cheng S W. 2023. Study on the changes of spatio-temporal characteristics of tourism flow in Guangzhou City before and after COVID-19. Journal of Natural Science of Hunan Normal University, 46(4): 133-143. (in Chinese)

[34]
Wang X Y, Huang J W. 2023. Change-point detection based on gamma distribution. Information Technology and Informatization, (8): 73-78. (in Chinese)

[35]
Wang X Y, Liu X Y. 2022. Development and evolution of urban tourism economic network structure and its optimization under the influence of high-speed rail in the Yellow River Basin. Economic Geography, 42(9): 211-218. (in Chinese)

DOI

[36]
Wang Y M, Ma Y F, Wang M X. 2010. Research on spatial field effects on YRV of inbound tourist flows in Shanghai City. Economic Geography, 30(5): 854-858. (in Chinese)

[37]
Wang Z H, Tang C S, Qiao H H, et al. 2020. Spatial structure characteristics of rural tourism flow based on digital footprint: Taking Huzhou City in Zhejiang Province for example. Economic Geography, 40(3): 225-233, 240. (in Chinese)

[38]
Williams A, Zelinsky W. 1970. On some patterns in international tourist flows. Economic Geography, 46(4): 549-567.

[39]
Wu J F, Bao H S. 2005. Research on the distance decay of the tourist flow. Human Geography, 20(2): 62-65. (in Chinese)

[40]
Wu K, Fang C L, Zhao M X. 2015. The spatial organization and structure complexity of Chinese intercity networks. Geographical Research, 34(4): 711-728. (in Chinese)

[41]
Wu S S, Wang L C, Liu H Y. 2020. Study on the structural characteristics of tourism flow network in the Yellow River Basin. Economic Geography, 40(10): 202-212. (in Chinese)

[42]
Xu M, Huang Z F, Cao F D, et al. 2018. The network structure features and influence factors of tourism flows based on online data analysis—Taking the Yangtze River Delta Region as an example. Economic Geography, 38(6): 193-202. (in Chinese)

[43]
Xu Y, Peng P, Lu F, et al. 2022. Exploring spatio-temporal characteristics of tourist travel behavior before and after COVID-19: A case study in Hong Kong, China. Geographical Research, 41(12): 3199-3213. (in Chinese)

[44]
Xue Y. 2006. On the regional implosion of tourist flows: From self-organization to organization. Tourism Tribune, 21(4): 47-54. (in Chinese)

[45]
Yan S S, Liang L K, Suo Z H, et al. 2017. Big data based spatiotemporal distribution characters of tourism flow in Luoyang. Economic Geography, 37(8): 216-224. (in Chinese)

[46]
Yang X Z, Wu J. 2015. A diachronic comparison of the characteristics of the structure of tourism flow network in Nanjing. Tourism Science, 29(4): 35-46. (in Chinese)

[47]
Yuan Y J. 2005. An approach to tourism flow and tourism double-flow system construction. Tourism Science, 19(1): 6-11. (in Chinese)

[48]
Zeng B D, Yu T. 2022. Analysis on the network structure and quarterly difference of Chinese tourist flows in Japan. Tourism Research, 14(6): 81-95. (in Chinese)

[49]
Zhang F F, Liu J. 2014. Reviews of complex networks topology structure and evolution models research (Ⅰ). Journal of Systems Science, 22(2): 55-57. (in Chinese)

[50]
Zhang F F, Liu J. 2015. Reviews of complex networks topology structure and evolution models research (Ⅱ). Journal of Systems Science, 23(1): 52-55. (in Chinese)

[51]
Zhang G H, Yuan H Y. 2023. Spatial network pattern and complexity of urban tourism information flow in China. Economic Geography, 43(1): 197-205. (in Chinese)

DOI

[52]
Zhang H. 2000. An analysis of the tourism movement spatial distribution of foreign tourists in Chinese famous tourism cities. Human Geography, 15(2): 56-57. (in Chinese)

[53]
Zhang J H, Zhang J, Li N, et al. 2005. An analysis on spatial field effect of domestic tourist flows in China. Geographical Research, 24(2): 293-303. (in Chinese)

[54]
Zhang Y D, Li C, Chen Y, et al. 2023. The community organization model and hierarchy in Xinjiang’s tourism flow network. Scientia Geographica Sinica, 43(9): 1619-1628. (in Chinese)

[55]
Zheng C, Ma Y, Wu S. 2010. Research on dynamic rules of inbound tourist flow westward transference in typical cities of China—A case of the tourist flow from Shanghai to Xi’an. Journal of Huazhong Normal University (Natural Sciences), 44(2): 337-340. (in Chinese)

[56]
Zhong S, Zhang J, Han G, et al. 2010. Flow: Problems and prospects. Human Geography, 25(2): 31-36. (in Chinese)

[57]
Zhong S, Zhang J, Ren L, et al. 2009. Concentration on problems within spatial pattern of tourist flow in the way of critical thinking. Progress in Geography, 28(5): 705-712. (in Chinese)

[58]
Zhou H L, Xu C X. 2019. Analysis on spatial network structure’s characteristics of China’s urban tourism information flow. Statistics & Decision, 35(20): 91-94. (in Chinese)

[59]
Zhou J L, Fan Y, Di Z R. 2023. Progress in complex networks. Journal of Beijing Normal University (Natural Science), 59(5): 691-704. (in Chinese)

[60]
Zhou J Z, Chen Y, Zhou H, et al. 2021. The landscape ecological security of a mining city: A two-layer complex network analysis method. China Environmental Science, 41(12): 5817-5826. (in Chinese)

[61]
Zhou L, Wu D T, Yu H, et al. 2020. Evolution of urban tourism flow network structure based on network travel notes: A case study of Beijing city. Scientia Geographica Sinica, 40(2): 298-307. (in Chinese)

DOI

Outlines

/