Data Sandbox+

We have created Data Sandbox+ to enable you to familiarise yourselves with some of the rail datasets that are available for the purpose of this research competition. We will keep it live and add as many datasets as possible. 

There may be other data that is not currently collected by the industry but that you think should be - in which case, please make this clear on your proposal.

There will be discussion with successful project teams about providing further access to datasets required to carry out their projects.

PLEASE READ THE T&Cs CAREFULLY 

By downloading the datasets below you agree to the following terms:  

  1. The sample datasets shall be downloaded and used solely for the purposes of preparing a bid for the RSSB Data Sandbox+ competition and for NO other purpose and/or project whatsoever;
  2. If having downloaded the datasets you decide not to submit and/or withdraw a bid to the RSSB Data Sandbox+ competition or your bid is unsuccessful you shall as soon as is reasonably practicable delete all records and/or copies made of such datasets (or parts thereof); and agree not to save, copy, and/or store any such datasets (or part thereof) on your systems and/or use for any other purpose; and
  3. Not to forward and/or disclose the datasets (or parts thereof) to any other party for any purpose. 



TfL Open Data Portal

The TfL data portal provides a range of data feeds and guidelines for using them. Including timetable data, station locations and facilities, departure boards, line status and station status, and network statistics.

RDG Customer Heartbeat™ & Promises - presentation

The Customer Heartbeat™ is a tool which maps 108 segments or ‘touchpoints’ of a rail journey and helps us to understand where the service provided does not yet meet customer expectations. This presentation provides and update on customer insight and satisfaction/sentiment monitoring.

RDG Customer Heartbeat™

The Customer Heartbeat™ is a tool which maps 108 segments or ‘touchpoints’ of a rail journey and helps us to understand where the service provided does not yet meet customer expectations.

Leaf fall report - 221 fleet (Virgin Trains)

Leaf fall report for Virgin Trains 221 fleet, which individually lists top WSP distances recorded that morning from the fleet matched to balise location; summary of the last 7 days WSP slides over 0.75 miles; the sanding seconds for each cab vehicle with a sanding risk based on sand hopper capacity.

Leaf fall report - 390 fleet (Virgin Trains)

Leaf fall report for Virgin Trains 390 fleet, which individually lists top WSP distances recorded that morning from the fleet matched to balise location; summary of the last 7 days WSP slides over 0.75 miles; the sanding seconds for each cab vehicle with a sanding risk based on sand hopper capacity.

Leaf Fall Mapping (Virgin Trains)

Leaf fall map - track sector distances within the West Coast route are created and the summarised results of the OTDR “WSP activation distance” are mapped against them using either balise locations or journey time reports for the headcode. 

Autumn Performance – Network Rail

This presentation contains an analysis of autumn performance data from 2015. The graphs provide a comparison of autumn performance in 2015 to 5 year average. The slides contain a comparison of leaf fall percentages, safety KPIs (such as wrong side track circuit failures and SPADs) and delay minutes.

Leaf-fall data - Met Office

This Met Office data provides information regarding the amount of leaves that have fallen from trees. Daily leaf-fall data measures quantities day-by-day, whilst cumulative leaf-fall data adds days together to measure how total leaf-fall has progressed over the season.

To access this dataset, please contact transport@metoffice.gov.uk.

Great Western Railway data

GWR can make a range of data feeds (including PPM and RT data) available for interested academic partners for the purpose of this research competition. The academic organisation will have to sign an non-disclosure agreement (NDA) in order to obtain such data.

Please get in touch via researchcompetitions@rssb.co.uk if you are interested in collaborating with GWR.

South Western Railway data

SWR can make a range of data feeds available for interested academic partners for the purpose of this research competition. The academic organisation will have to sign an non-disclosure agreement (NDA) in order to obtain such data.

Please get in touch via researchcompetitions@rssb.co.uk if you are interested in collaborating with SWR. 

Weather data (Met Office)

We will be able to make sample weather data (temperature; wind speed / direction; humidity; precipitations; snow depth; dew point etc.) available to individual projects upon request. Users will be asked to agree and sign specific T&Cs. 

Please get in touch via researchcompetitions@rssb.co.uk if you are interested in this data.


Network Model (Network Rail)

The Network Model is a geospatial representation of the rail network. The model is managed in ArcGIS and is split into links and nodes. Each link is indicative of the centreline of the tracks, not the railheads. These links are split between intersections or to a buffer stop/end block. 


Manager drop in (Arriva Trains Wales)

Manager drop in data relates to incidents solely attributable to ATW functions (i.e. drivers, conductors, stations) and allows to calculate each function's impact on overall performance. It is possible to filter the data further within each function, for example narrowing the results to a specific area within each function.

Function Performance Reports (Arriva Trains Wales)

Function Performance Reports for the period 15 Oct - 11 Nov. It includes: delay minutes summary; cancellations summary; PPM report; reports for specific route areas; drivers/conductors/fleet performance reports; depot performance report; staffed and unstaffed stations performance reports.

Haulage journeys (TS Catapult)

TS Catapult can provide haulage journeys data from detailed telematics fleet providers. Differentiation between HGV and LCV is visible and will give overview of freight transit patterns together with port of entry and exit.


Mapping Grids (TS Catapult)

These are detailed mapping grids with road direction markers, speed limits and vehicle flow that would allow to build robust models of towns or regions at a granular level.


Online Journey Planner (OJP) (RDG)

Online Journey Planner is the engine used to plan routes, calculate fares and establish ticket availability on National Rail Enquiries digital channels. The OJP accesses real-time information directly from DARWIN, meaning all journey plans take account of all delays, schedule changes and last minute cancellations made by the train companies. [Open data]

Knowledgebase - KB (RDG)

Knowledgebase is the content engine and database of the National Rail Enquiries website. It contains a wealth of static and real-time information about traveling by train on the GB rail network, such as information about station facilities, service disruption, and engineering work. [Open access]

Darwin (RDG)

Darwin provides real-time arrival and departure predictions, platform numbers, delay estimates, as well as real-time schedule changes and cancellations. It powers all NRE and train operator customer facing real-time information tools, including websites, mobile apps and train station departure board screens. [Open access]


List of datasets (by organisation)

Network Rail

  • Attributed Delay Data 
  • Network Rail Open feeds, which include: SCHEDULE, MOVEMENT, TD (train positioning), TSR (Temporary Speed Restrictions), VSTP (Very Short Term Plan), RTPPM (Real-Time Public Performance Measure), SMART, Corpus, BPLAN, Train Planning Network Model 
  • TD (train describer) data from Dec 2016 - May 2017 (upon request)
  • TRUST data from Dec 2016 - May 2017 (upon request)
  • Station data (upon request)
  • Network Model (upon request)
  • Line Speed data (upon request)
  • Tonnage data (upon request)
  • GPS feeds
  • Seasonal contracts report
  • Autumn base plan report 
  • Circuit trains report
  • Missed site report
  • Seasonal site treatments
  • Autumn performance 
  • CIF timetable data (national) Dec 2017, and May 2018 (May until Dec 2018) (upon request)
  • Delay measures (upon request)
  • VSTP (Very Short Term Plan) from Jan 2017 to May 2017 (upon request)
  • On time measures, for Period 13 2018/19
  • PPM, for Period 13 2018/19

Rail Delivery Group (RDG)

  • Darwin
  • Knowledgebase
  • Online Journey Planner 
  • LENNON database
  • National Rail Enquiries (NRE) data feeds
  • Customer Heartbeat, plus a status update and quantified data

Virgin Trains

  • Genius allocations data
  • Genius diagram data
  • Bugle train running data
  • Bugle incident data
  • Bugle list of delays
  • On Train Data Recorder - station dwells
  • On Train Data Recorder - journey events
  • TMS 
  • Orbita
  • Web Gemini
  • Passenger numbers - airbag data
  • Passenger numbers - TM counts
  • Reservations/ ticket sales
  • Leaf fall 221 fleet
  • Leaf fall 390 fleet
  • Leaf fall mapping

Greater Anglia

  • Bugle data
  • DayOne timing data from TD.net
  • Nexala R2M screenshots
  • Nexala R2M remote train monitoring (door opening/interlock times)
  • Autumn slide data
  • Track diagrams
  • Route map

Mersey Rail

  • Train running data
  • Passenger count data

Southeastern Railway

  • Unit movements data
  • Driver compliance System Retrieved Data
  • Warning Systems Data

Arriva Trains Wales

  • Function Performance Reports
  • Daily Performance Report
  • Daily incident drop in
  • Daily journey drop in
  • Manager drop in
  • Bugle incident report
  • Period to date (PTD) incident data
  • Wheel Slide Protection (WSP) Benefit

South Western Railway

  • Wessex performance data
  • Other data - upon request (upon request)

Great Western Railway

  • PPM and RT data (upon request)

Northern

  • Autumn heat map 
  • Route comparison 2016-17
  • Station overruns 2013-17
  • Unit tyre turning frequencies
  • LNE autumn review

Angel Trains

  • Fuel volume
  • Acoustic footage of a Pendolino train
  • Sample of networker fleet RCM system
  • Vibration monitors from inside the saloon of East Coast High Speed Trains
  • EC4T data - Part 1 & 2
  • Saloon temperature data from class 166's
  • Engine data from a class 166 train

Transport for London

  • Open Data Portal

TS Catapult

  • Mapping Grids (upon request)
  • Mobile network data (upon request)
  • National roadworks data (upon request)
  • Haulage journeys data (upon request)
  • Sentiment mapping data (upon request)

Met Office

  • Weather data (upon request)
  • Weather observation (upon request)
  • Leaf fall (upon request)
  • Adhesion forecasting (upon request)