Elsevier

Journal of Computational Science

Volume 10, September 2015, Pages 338-350
Journal of Computational Science

A data-driven agent-based model of congestion and scaling dynamics of rapid transit systems

https://doi.org/10.1016/j.jocs.2015.03.006Get rights and content

Highlights

  • Develop a full-scale data-driven agent-based model (ABM) of rapid train system (RTS).

  • Model integrates a route choice model and results are validated using smart card data.

  • Analyze congestion and scaling dynamics of RTS using the developed ABM.

  • Propose a procedure that captures the penultimate station effect in RTS.

Abstract

Investigating congestion in train rapid transit systems (RTS) in today's urban cities is a challenge compounded by limited data availability and difficulties in model validation. Here, we integrate information from travel smart card data, a mathematical model of route choice, and a full-scale agent-based model of the Singapore RTS to provide a more comprehensive understanding of the congestion dynamics than can be obtained through analytical modelling alone. Our model is empirically validated, and allows for close inspection of congestion and scaling dynamics. By adjusting our model, we can estimate the effective capacity of the RTS trains as well as replicate the penultimate station effect, where commuters travel backwards to the preceding station to catch a seat, sacrificing time for comfort. Using current data, the crowdedness in all 121 stations appears to be distributed log-normally. We find that increasing the current population (2 million) beyond a factor of approximately 10% leads to an exponential deterioration in service quality. We also show that incentivizing commuters to avoid the most congested hours can bring modest improvements to the service quality. Finally, our model can be used to generate simulated data for statistical analysis when such data are not empirically available, as is often the case.

Introduction

To tackle rising population density in urban cities, transportation planners often construct train rapid transit systems (RTS) as a first step. Yet continued population growth forces the RTS to evolve towards increased complexity with more train lines added to satisfy demand. With the increased complexity, planners are confronted with the difficulty of predicting commuter ridership, route choices, and also the various outcomes of the system during disruptions. Moreover, increased station and train crowdedness in RTS lead to congestion, commuter discomfort, trip delays, and lowered overall service quality standards. It is therefore imperative that modern transportation models be capable of investigating not just the issues of efficient, robust and scalable transportation, but also of commuter comfort and satisfaction.

The introduction of smart card ticketing in RTS has serendipitously enabled large-scale data analytics into commuter travel behaviour [1], [17]. Analytical and regression models have been developed to estimate commuters’ spatio-temporal density [20], identification of boarded trains [10], travel patterns [4], and transit use variability [14]. Yet, it is noted that the information captured by smart cards has limitations [17]; for example, most datasets do not contain routing information as they capture information only at the entry and egress points of journeys.

In contrast to analytical and regression models, agent-based models (ABM) strive to model each individual agent in a manner most natural to the system at hand [3]. Essentially, an agent is autonomous and formulates decisions and interacts with other agents directly. By directly replicating the mechanics of the system, an ABM permits the observation of emergent phenomena that arise from the interactions of the agents with each other [3] – provided the mechanics are correctly characterized and the model is well-calibrated.

ABM has seen recent success in modelling large-scale transportation [7], [15], [21]. However, there are not many studies which incorporate smart card data to drive RTS demand for better calibration. In our previous work [11], we had leveraged upon anonymized travel smart card transactional data to synthesize travel demand for a smaller-scale agent-based model of the Singapore transit system involving only one of the operational train lines, and achieved a very close match between the simulated and empirical travel duration distributions. In that work, we also investigated the dynamics of the smaller-scale system with regard to population growth.

Here, we extend our previous work [11] by: (1) expanding the model to cover all seven operational lines; (2) adding a route-choice mechanism inferred statistically from empirical travel duration distributions [13]; (3) incorporating station-specific walk-times; (4) investigating dynamics that were not directly measurable in our dataset, such as station crowdedness; (5) estimating the effective train capacity; (6) modelling the penultimate station effect; and (7) running further population scaling scenarios. We validate our model by ensuring the travel duration distributions generated from our simulations are well-calibrated to the empirical dataset. This would lend strength to any inferences derived from our scenarios. Apart from these goals, ultimately, we strive to construct a simulation platform that can be used to evaluate the efficacy of proposed strategies in tackling current and future urban transportation issues.

Our experiments in this work are focused on the Singapore rail transit system, which began operations in 1987 and is now one of the busiest RTS in the world. Despite the focus, our approach can be applied to other rail transit systems in the world, as we do not utilize any Singapore-specific mechanics or assumptions in our model.

Section snippets

Data

Our model is dependent on data for the following purposes: (1) to construct the transit infrastructure, (2) to instantiate the commuter agents corresponding to the actual travel demand, (3) to calibrate the travel time components of the network, and (4) to accurately model the commuters’ decision making (e.g., route choice).

We model the Singapore rail transit system in our experiments. The Singapore RTS comprises two train systems: the Mass Rapid Transit (MRT) system consisting of four

Computational model

Our approach to modelling the RTS comprises two aspects: (1) the modelling of the trains as they traverse the rail network, and (2) the modelling of commuters as they travel from their origins to their destinations. The first aspect, the modelling of trains, is straightforward as we do not fully model the physical mechanisms of the trains, and it is only sufficient that our simulated train arrivals can fit the publicly available train schedules (i.e., first-train timings and train arrival

Evaluation of agent-based model and simulation

Having validated the model in the previous section, here, we evaluate the model and determine if it can adequately replicate the mechanics of the rail-transit system. First, we investigate the effective train capacity, which can be lower than the operational capacity of the train. Second, we observe dynamics recorded in our simulation that are not explicitly manifested in the empirical data, including the penultimate station effect. Lastly, we explore how the penultimate station effect can be

Scenario descriptions

A major concern to urban planners is the scalability of their transit systems with regard to commuter population growth. In our population scaling scenarios, we adjust the transit demand in our model to predict how population growth may affect commuter experiences with respect to travel duration and number of trains missed. Here, population refers to the number of journeys simulated. An actual commuter may take multiple journeys in a single day. In these scenarios, we utilize the Monday

Conclusion

In this work, we had incorporated empirically derived travel demand data into a full-scale agent-based model of the train rapid transit system in Singapore. Our approach granted us a more comprehensive view of congestion dynamics than afforded by analysing the dataset directly. We were able to synthesize highly detailed measurements, including crowdedness and number of trains missed. With these measurements, transport operators can accurately estimate the comfort and satisfaction of commuters,

Acknowledgements

We thank the Land Transport Authority of Singapore for providing the anonymized smart card data. We thank Dr. Gary Lee Kee Khoon and Dr. Terence Hung Gih Guang for discussions. This research is supported by the Science and Engineering Research Council of the Agency for Science, Technology and Research (A*STAR) of Singapore (Complex Systems Programme grant number 122 45 04056); and by the A*STAR Computational Resource Centre through the use of its high performance computing facilities.

Nasri Bin Othman is a Research Engineer at the Institute of High Performance Computing, Singapore, where he investigates urban systems. He received his B.Eng. in Computer Science from Nanyang Technological University, Singapore. His research interests include modelling and simulation, urban transportation, evolutionary computation, and interactive visualization.

References (21)

There are more references available in the full text version of this article.

Cited by (16)

  • Real-time agent-based crowd simulation with the Reversible Jump Unscented Kalman Filter

    2021, Simulation Modelling Practice and Theory
    Citation Excerpt :

    Recent efforts have been made to develop methods that will allow ABMs to react to real-world events as they happen. Whilst promising, they exhibit a number of limitations such as: the need for manual calibration [13] (which is infeasible in most cases); implementations that contain only a few agents and/or limited interactions [14,15]; the use of agent-based models that are simple enough to be approximated by an aggregate mathematical model [3,16]; parameters that can be dynamically optimised but a model state that is not updated [17]; or the use of DA methods that do not scale to large model size [10,15]. Also, importantly, none of the previous approaches have attempted to assimilate data for categorical variables.

  • Challenges, tasks, and opportunities in modeling agent-based complex systems

    2021, Ecological Modelling
    Citation Excerpt :

    As a result, it is almost impossible to reproduce model results or to interpret them. For instance, Othman et al. created an ABM of the rail network in Singapore, which used only train ticket purchasing data as its input (Othman et al., 2015). There is an implicit assumption that models built using new forms of micro-data will capture the essential processes that are taking place in these systems.

  • Improved estimation of commuter waiting times using headway and commuter boarding information

    2018, Physica A: Statistical Mechanics and its Applications
    Citation Excerpt :

    Many modern cities around the world have implemented Automated Passenger (fare) Collection (APC) data and Automated Vehicle Location (AVL) procedures that seek to improve the overall efficiency of bus services. In addition, the analysis of datasets resulting from these platforms have increased our understanding of the dynamics of bus operations [6–10]. Unfortunately, the dynamics of actual commuter waiting times remains difficult to accurately record.

  • Modeling and evaluation of highly complex computer systems architectures

    2017, Journal of Computational Science
    Citation Excerpt :

    In the design phase, modeling is generally adopted as a means to explore the solutions before developing prototypes: for HCCS, prototypes may be very expensive or not a viable possibility. For example, systems that must strictly adhere to safety standards and require a proof of conformance in order to be viable [4], such as railways or aerospace systems [5–7], in which models may help in reducing the use of expensive prototypes and lower the costs of the overall design phase; or, in the case of very large computing facilities, adopting a prototype in the small will not reveal all problems that may arise in the large scale, such as communication problems, energy consumption effects [8,9], workload interactions [10,11], scheduling effects at various scales [12–14], and all the adaptations to the evolutions of the workload in the lifespan of the system [15], and in this case models may help in choosing between alternative approaches that show their effects in the large only, after big investments, or may signal in advance emerging problems such as saturation, competition for resources or side effects. In the integration phase, the problem lies in the possible mismatches or, again, interactions between components, that may lead to unexpected problems.

View all citing articles on Scopus

Nasri Bin Othman is a Research Engineer at the Institute of High Performance Computing, Singapore, where he investigates urban systems. He received his B.Eng. in Computer Science from Nanyang Technological University, Singapore. His research interests include modelling and simulation, urban transportation, evolutionary computation, and interactive visualization.

Erika Fille Legara, a former scholar at the Santa Fe Institute Complex Systems Summer School, obtained a Ph.D. in Physics at the National Institute of Physics at the University of the Philippines, Diliman. Bulk of her work has been on the diverse applications of statistical mechanics and network theory in understanding the mechanisms behind various socio-economic systems and paradigms such as multi-level markets, telecommunication companies, and news and media framing. She is currently a Scientist at the A*STAR Institute of High Performance Computing (IHPC) working on big data and visualization, urban complexity, and complex networks. More details on her research can be found at: http://www.erikalegara.com

Vicknesh Selvam is a scholar with the Agency of Science, Technology and Research (A*STAR) currently working as a Research Engineer at the Institute of High Performance Computing (IHPC). He received his Bachelor of Sciences in Mathematics, with a specialization in Computing, at the University of California, Los Angeles. His research interests include Network Science, Image Processing, Cryptography and Mathematical Modelling. He is currently working on the data analysis and modelling of Urban Transport networks under the Urban Systems Initiative.

Christopher Pineda Monterola is a Senior Scientist at the Institute of High Performance Computing in A*STAR Singapore. He is currently the Capability Group Manager (CGM) of the Complex Systems (CxSy) Group at the IHPC. Chris is also the Principal Investigator of the Complexity Science Programme of the IHPC under the CxSy. Prior to his stint in Singapore, he was a postdoctoral fellow at the Max Planck Institute for the Physics of Complex Systems. He obtained a Ph.D. in Physics in 2002 from the National Institute of Physics, University of the Philippines Diliman (UPD) where he was an Associate Professor 7. More details on his works can be found at: http://www.chrismonterola.net

This article is an extension of [16].

View full text