A new study has revealed a measurement system that detects subway travellers' movements to help public transport planners make smarter decisions around future investment and management.
The study was released as part of the 21st ACM SIGKDD Conference on Knowledge Discovery and Data Mining in Sydney this week, and is titled Traffic Measurement and Route Recommendation System for Mass Rapid Transit (MRT).
The system has been operating for several months on Singapore's Mass Rapid Transit (MRT) network, which has 92 stations and five train lines. It looks at the number of people built up on station platforms, the flows of people making transfers at stations, and the expected travel time for each possible route.
The authors of the study, from DataSpark (part of SingTel), developed an algorithm that detects individual subways trips from anonymised real-time location-based data from SingTel.
As Singapore's largest telecommunications company, it was able to tap into the mobile phone location data of 3.98 million customers, which was anonymised ID, latitude, longitude, timestamp and service type.
Privacy was taken into account when conducting the study by anonymising the original unique ID for each mobile customer through a two-step non-reversible AES Encryption and Hashing process.
"[This] makes it impossible to trace back to the original unique ID or the mobile customer," the authors said.
They also complied with Singapore's Personal Data Protection Act (PDPA), and consent from customers was also given through SingTel's Data Protection Policy.
To ensure that the selected customer base is representative of Singapore's population, the authors compared the distribution of residences of SingTel customers with Singapore's 2010 census data. The two distributions highly linearly correlated, meaning it was quite a representative population.
Looking at the most densely populated stations, which are indoors, the algorithm was able to distinguish SingTel cell towers for the indoor subway from all others when detecting individual trips and their locations.
"We define an MRT trip as the sequence of indoor stations s = [s1, . . . sn] a passenger α passes during their ride and the corresponding timestamps t = [t1, . . . tn] when each of these stations is reached. s1 is the boarding station, sn the station of disembarkation and s2, . . . sn - 1stations α passes during their ride."
To test the measurements that the algorithm produced against some 'ground truth' or real measurements, the authors compared with counts taken at popular shopping spot, Orchard MRT station, engaging a market research company to do this.
"We determined the number of SingTel subscribers departing and arriving at Orchard MRT station during the same three days on an hourly basis using our MRT trip detection algorithm and market extrapolated these numbers with the scaling factor A. This takes into account both SingTel's market share in Singapore as well as the mobile device penetration rate in Singapore.
Sign up for CIO Asia eNewsletters.