Data Sandbox
This Data Sandbox is designed to enable you to familiarise yourselves with some of the rail datasets that are available within the industry. We are currently in the process of adding as many datasets as possible.
PLEASE READ THE T&Cs CAREFULLY
By downloading the datasets below you agree to the following terms:
- The sample datasets shall be downloaded and used solely for the purposes of preparing a bid for the RSSB Data Sandbox+ competition and for NO other purpose and/or project whatsoever;
- If having downloaded the datasets you decide not to submit and/or withdraw a bid to the RSSB Data Sandbox+ competition or your bid is unsuccessful you shall as soon as is reasonably practicable delete all records and/or copies made of such datasets (or parts thereof); and agree not to save, copy, and/or store any such datasets (or part thereof) on your systems and/or use for any other purpose; and
- Not to forward and/or disclose the datasets (or parts thereof) to any other party for any purpose.
Search the Data Sandbox using keywords or scroll to the end of the page for a list of datasets (by organisation).
Greater Anglia track diagrams
Greater Anglia track diagrams from February 2019 indicating track & signalling, station locations and crossing sites in Anglia.
TfL Open Data Portal
The TfL data portal provides a range of data feeds and guidelines for using them, including timetable data, station locations and facilities, departure boards, line status and station status, and network statistics.
On Time Measures for Period 13 2018/19 (Network Rail)
On Time Measures for Period 13 2018/19 (Network Rail)
VSTP (Very Short Term Plan) data (Network Rail)
VSTP (Very Short Term Plan) data from Jan 2017 to May 2018 (Network Rail)
TD data for C-class (Network Rail)
TD data for C-class for Jan 2017 to May 2018 (Network Rail) - train positioning data at signalling berth level. Please get in touch via researchcompetitions@rssb.co.uk if you are interested in this data.
Delay Measures (PSS Network Rail database)
Delay Measures (PSS Network Rail database) - provides approximately ten years of historic attribution data to the end of 2017-18. Please get in touch via researchcompetitions@rssb.co.uk if you are interested in this data.
CIF - Timetable data (national) (Network Rail)
CIF - Timetable data (national) - contains two files, December 2017 and May 2018 (May to Dec 2018), with scheduling information. Please get in touch via researchcompetitions@rssb.co.uk if you are interested in this data.
RDG Customer Heartbeat™ & Promises - presentation
The Customer Heartbeat™ is a tool which maps 108 segments or ‘touchpoints’ of a rail journey and helps us to understand where the service provided does not yet meet customer expectations. This presentation provides an update on customer insight and satisfaction/sentiment monitoring.
RDG Customer Heartbeat™
The Customer Heartbeat™ is a tool which maps 108 segments or ‘touchpoints’ of a rail journey and helps us to understand where the service provided does not yet meet customer expectations.
OTDR (Virgin Trains)
On-Train Data Recorder data (Virgin Trains) - including a slide event on 12 Oct
Leaf fall report - 221 fleet (Virgin Trains)
Leaf fall report for Virgin Trains 221 fleet, which individually lists top WSP distances recorded that morning from the fleet matched to balise location; summary of the last 7 days WSP slides over 0.75 miles; the sanding seconds for each cab vehicle with a sanding risk based on sand hopper capacity.
Leaf fall report - 390 fleet (Virgin Trains)
Leaf fall report for Virgin Trains 390 fleet, which individually lists top WSP distances recorded that morning from the fleet matched to balise location; summary of the last 7 days WSP slides over 0.75 miles; the sanding seconds for each cab vehicle with a sanding risk based on sand hopper capacity.
Leaf Fall Mapping (Virgin Trains)
Leaf fall map - track sector distances within the West Coast route are created and the summarised results of the OTDR “WSP activation distance” are mapped against them using either balise locations or journey time reports for the headcode.
Autumn Performance – Network Rail
This presentation contains an analysis of autumn performance data from 2015. The graphs compare autumn performance in 2015 with the 5-year average. The slides compare leaf fall percentages, safety KPIs (such as wrong side track circuit failures and SPADs) and delay minutes.
Wheel Slide Protection (WSP) Benefits (Arriva Trains Wales)
This dataset from Arriva analyses the benefits of WSP technology
Unit Tyre Turning Frequencies 2017 (Northern)
This Northern dataset records unit tyre turning frequencies during 2017.
TRUST Leg summaries (Northern)
This Northern dataset records TRUST Leg summaries covering incidents, primary and reactionary delays, ppm fails, cancellations and total delays.
Station Overruns 2013-17 (Northern)
This Northern dataset records station overrun incidents by station from 2013-17.
2016-17 Route Comparison (Northern)
This geographical data from Northern maps autumn delays by line.
Leaf-fall data - Met Office
This Met Office data provides information regarding the amount of leaves that have fallen from trees. Daily leaf-fall data measures quantities day-by-day, whilst cumulative leaf-fall data adds days together to measure how total leaf-fall has progressed over the season.
To access this dataset, please contact transport@metoffice.gov.uk.
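The relationship between daily and cumulative leaf-fall described above is a running total. The following is a minimal sketch in Python; the values and the variable names are illustrative assumptions, not actual Met Office data or its file format.

```python
from itertools import accumulate

# Hypothetical daily leaf-fall values (e.g. percentage of leaves fallen per day).
# Illustrative numbers only, not taken from the Met Office dataset.
daily_leaf_fall = [0.5, 1.0, 2.5, 4.0, 3.0]

# Cumulative leaf-fall is the running sum of the daily values,
# showing how total leaf-fall has progressed over the season.
cumulative_leaf_fall = list(accumulate(daily_leaf_fall))

print(cumulative_leaf_fall)  # [0.5, 1.5, 4.0, 8.0, 11.0]
```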
Adhesion forecasting data - Met Office
This is generated to predict adhesion levels on the tracks. It is presented as a colour-coded map, with different colours indicating different levels of expected adhesion.
To access this dataset, please contact transport@metoffice.gov.uk.
Weather observation data - Met Office
This data is collected and quality controlled by the Met Office. It captures standard weather parameters such as temperature; wind speed/direction; humidity; precipitation; dew point and other general weather information.
To access this dataset, please contact transport@metoffice.gov.uk.
Autumn Slides (Greater Anglia)
This Greater Anglia dataset records the significant slide data (slides greater than 5 seconds) for Autumns 2016 and 2017.
R84 – Seasonal Site Treatments (Network Rail)
This dataset shows which Network Rail treatments were applied and which were missed. It is organised around the sites on a given Line and/or Engineer’s Line Reference (ELR) location.
R83 – Missed Site Report (Network Rail)
This dataset shows which planned Network Rail treatment sites were missed (but not which were treated).
R82 – Circuit Trains Report (Network Rail)
This dataset shows which Network Rail treatment trains are running which circuits and what their planned treatments are.
R81 – Autumn – Base Plan Report (Network Rail)
This dataset shows the base circuits and details the type of treatment applied by Network Rail trains.
R08 – Seasonal Contracts Report (Network Rail)
This dataset shows the treatment trains that Network Rail ran with details about missed treatment sites.
Acoustic footage of a Pendolino train (Angel Trains)
This file is in CHL format. It is quite large, so please download it using the following WeTransfer link: https://we.tl/t57PsDIWiP
Sample of networker fleet RCM system (Angel Trains)
Sample of Angel Trains' networker fleet RCM system
Vibration monitors from inside the saloon of East Coast High Speed Trains (Angel Trains)
Vibration monitors from inside the saloon of East Coast High Speed Trains (Note: the East Coast franchise no longer operates)
EC4T data - Part 2 (Angel Trains)
One month's worth of electric current for traction (EC4T) data from London Midland's Class 350 fleet
EC4T data - Part 1 (Angel Trains)
One month's worth of electric current for traction (EC4T) data from London Midland's Class 350 fleet
Great Western Railway data
GWR can make a range of data feeds (including PPM and RT data) available to interested academic partners for the purpose of this research competition. The academic organisation will have to sign a non-disclosure agreement (NDA) in order to obtain such data.
Please get in touch via researchcompetitions@rssb.co.uk if you are interested in collaborating with GWR.
South Western Railway data
SWR can make a range of data feeds available to interested academic partners for the purpose of this research competition. The academic organisation will have to sign a non-disclosure agreement (NDA) in order to obtain such data.
Please get in touch via researchcompetitions@rssb.co.uk if you are interested in collaborating with SWR.
Weather data (Met Office)
We will be able to make sample weather data (temperature; wind speed/direction; humidity; precipitation; snow depth; dew point etc.) available to individual projects upon request. Users will be asked to agree and sign specific T&Cs.
Please get in touch via researchcompetitions@rssb.co.uk if you are interested in this data.
Tonnage data (Network Rail)
Network Rail can make this data available to individual projects upon request. Please get in touch via researchcompetitions@rssb.co.uk if you are interested in this data.
Line speed data across the network (Network Rail)
Network Rail can make this data available to individual projects upon request. Please get in touch via researchcompetitions@rssb.co.uk if you are interested in this data.
Network Model (Network Rail)
The Network Model is a geospatial representation of the rail network. The model is managed in ArcGIS and is split into links and nodes. Each link is indicative of the centreline of the tracks, not the railheads. These links are split between intersections or to a buffer stop/end block.
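A links-and-nodes structure of this kind can be explored as a graph. The sketch below assumes a hypothetical export in which each link carries an identifier and the identifiers of its two end nodes; the field names and sample values are invented for illustration and do not reflect the actual ArcGIS schema.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Link:
    """One track-centreline link between two nodes (hypothetical fields)."""
    link_id: str
    start_node: str
    end_node: str


# Illustrative sample only: three links meeting at node N2.
links = [
    Link("L1", "N1", "N2"),
    Link("L2", "N2", "N3"),
    Link("L3", "N2", "N4"),
]


def build_adjacency(links):
    """Map each node to the list of links that touch it."""
    adjacency = defaultdict(list)
    for link in links:
        adjacency[link.start_node].append(link)
        adjacency[link.end_node].append(link)
    return adjacency


adjacency = build_adjacency(links)

# Nodes touched by a single link are candidate buffer stops / end blocks
# (or simply the edge of the sample); nodes with three or more links are
# candidate intersections.
end_nodes = sorted(n for n, ls in adjacency.items() if len(ls) == 1)
intersections = sorted(n for n, ls in adjacency.items() if len(ls) >= 3)

print(end_nodes)      # ['N1', 'N3', 'N4']
print(intersections)  # ['N2']
```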
Period to date (PTD) incidents data (Arriva Trains Wales)
This data feed captures all the incidents to date with information including the number of delay minutes, cancellations and PPM failures per incident.
Manager drop in (Arriva Trains Wales)
Manager drop in data relates to incidents solely attributable to ATW functions (i.e. drivers, conductors, stations) and allows each function's impact on overall performance to be calculated. The data can be filtered further within each function, for example by narrowing the results to a specific area.
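One way to express each function's impact is as its share of total delay minutes. The sketch below assumes hypothetical incident records with a `function` and a `delay_minutes` field; these names and the sample values are illustrative and are not taken from the ATW report layout.

```python
from collections import Counter

# Hypothetical incident records; field names and values are assumptions.
incidents = [
    {"function": "drivers", "delay_minutes": 12},
    {"function": "conductors", "delay_minutes": 8},
    {"function": "stations", "delay_minutes": 5},
    {"function": "drivers", "delay_minutes": 15},
]


def delay_share_by_function(incidents):
    """Return each function's fraction of the total delay minutes."""
    totals = Counter()
    for incident in incidents:
        totals[incident["function"]] += incident["delay_minutes"]
    grand_total = sum(totals.values())
    if grand_total == 0:
        return {}
    return {function: minutes / grand_total for function, minutes in totals.items()}


shares = delay_share_by_function(incidents)
for function, share in sorted(shares.items()):
    print(f"{function}: {share:.1%}")
```

Filtering to a specific area within a function would amount to selecting a subset of the records before calling the function.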
Incident drop in (Arriva Trains Wales)
The incident drop in report is a high level overview of all incidents that have occurred in the whole period to date.
Daily incident drop in (Arriva Trains Wales)
Daily incident drop in data relates to all performance incidents that occurred on a specific day.
Function Performance Reports (Arriva Trains Wales)
Function Performance Reports for the period 15 Oct - 11 Nov. It includes: delay minutes summary; cancellations summary; PPM report; reports for specific route areas; drivers/conductors/fleet performance reports; depot performance report; staffed and unstaffed stations performance reports.
Driver Compliance System Retrieved Data (Southeastern Railway)
Incidents where the doors were open for less than 15 seconds.
Unit movements data (Southeastern Railway)
Provides detailed station arrival and departure information.
TRUST data from Jan 2018 to May 2018 (Network Rail)
TRUST data from Jan 2018 to May 2018, covering train movements and cancellations across the entire network. It is generated through automatic processing of train movements from signalling and data entry by signallers and Train Operating Company (TOC) control rooms.
Haulage journeys (TS Catapult)
TS Catapult can provide haulage journeys data from detailed telematics fleet providers. The data differentiates between HGV and LCV and gives an overview of freight transit patterns together with port of entry and exit.
National roadworks data (TS Catapult)
TS Catapult can provide national roadworks data covering the UK, which gives the ability to observe the effects and impact on journey routes that roadworks in a specific area may have.
Mobile network data (TS Catapult)
TS Catapult can provide mobile network data for the UK in an aggregated anonymised form. Data can be separated into vehicle and train journeys.
Mapping Grids (TS Catapult)
These are detailed mapping grids with road direction markers, speed limits and vehicle flow, which would allow robust models of towns or regions to be built at a granular level.
DayOne timing data (Greater Anglia)
Timing data from TD.net (Train Describer), includes timings in seconds.
Stations data (Network Rail)
For the stations it manages, Network Rail can make CCTV footage and other passenger flow data (where available) accessible to individual projects upon specific request. Please get in touch via researchcompetitions@rssb.co.uk if you are interested in this data.
LENNON - Latest Earnings Networked Nationally Over Night (RDG)
National database for ticket revenue, journeys and miles. Includes apportioned and sales data.
Passenger numbers - train manager counts (Virgin Trains)
Passenger counts done by train managers where available.
Passenger numbers - airbag data (Virgin Trains)
Shows airbag pressures, which can be converted into estimates of passenger numbers.
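The conversion from airbag pressure to passenger numbers is not specified here. As a rough sketch, one could assume a linear relationship between the pressure above an empty-vehicle (tare) reading and passenger load; the function name, the parameters and the calibration figures below are all hypothetical and would need to be derived from the actual fleet data.

```python
def estimate_passengers(pressure_kpa, tare_pressure_kpa, kpa_per_passenger):
    """Estimate passenger count from suspension airbag pressure.

    Assumes (hypothetically) that pressure rises linearly with load:
    tare_pressure_kpa is the reading for an empty vehicle and
    kpa_per_passenger is the average rise per passenger.
    """
    if kpa_per_passenger <= 0:
        raise ValueError("kpa_per_passenger must be positive")
    # Readings below tare are treated as an empty vehicle.
    load_pressure = max(0.0, pressure_kpa - tare_pressure_kpa)
    return round(load_pressure / kpa_per_passenger)


# Illustrative calibration values only, not real fleet figures.
print(estimate_passengers(450.0, 400.0, 0.5))  # 100
```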
Online Journey Planner (OJP) (RDG)
Online Journey Planner is the engine used to plan routes, calculate fares and establish ticket availability on National Rail Enquiries digital channels. The OJP accesses real-time information directly from DARWIN, meaning all journey plans take account of all delays, schedule changes and last minute cancellations made by the train companies. [Open data]
Knowledgebase - KB (RDG)
Knowledgebase is the content engine and database of the National Rail Enquiries website. It contains a wealth of static and real-time information about travelling by train on the GB rail network, such as information about station facilities, service disruption, and engineering work. [Open access]
Darwin (RDG)
Darwin provides real-time arrival and departure predictions, platform numbers, delay estimates, as well as real-time schedule changes and cancellations. It powers all NRE and train operator customer facing real-time information tools, including websites, mobile apps and train station departure board screens. [Open access]
On Train Data Recorder - journey events pt 2 (Virgin Trains)
Showing select event data from a journey with times and distances (zipped .xpt files)
On Train Data Recorder - journey events pt 1 (Virgin Trains)
Showing select event data from a journey with times and distances (zipped .xpt files).
On Train Data Recorder - station dwells pt 2 (Virgin Trains)
Shows arrival, door opening/closing times.
On Train Data Recorder - station dwells pt 1 (Virgin Trains)
Shows arrival, door opening/closing times.
Performance Metrics for Period 13 2018/19 (Network Rail)
PPM and CaSL (Cancellation and significant lateness) data by train
Attributed delay data (Network Rail)
Historic delay attribution data and glossary (also see: https://www.networkrail.co.uk/who-we-are/transparency-and-ethics/transparency/datasets/)
List of datasets (by organisation)
Network Rail
- Attributed Delay Data
- Network Rail Open feeds, which include: SCHEDULE, MOVEMENT, TD (train positioning), TSR (Temporary Speed Restrictions), VSTP (Very Short Term Plan), RTPPM (Real-Time Public Performance Measure), SMART, Corpus, BPLAN, Train Planning Network Model
- TD (train describer) data from Dec 2016 - May 2017 (upon request)
- TRUST data from Dec 2016 - May 2017 (upon request)
- Station data (upon request)
- Network Model (upon request)
- Line Speed data (upon request)
- Tonnage data (upon request)
- GPS feeds
- Seasonal contracts report
- Autumn base plan report
- Circuit trains report
- Missed site report
- Seasonal site treatments
- Autumn performance
- CIF timetable data (national) Dec 2017, and May 2018 (May until Dec 2018) (upon request)
- Delay measures (upon request)
- VSTP (Very Short Term Plan) from Jan 2017 to May 2017 (upon request)
- On time measures, for Period 13 2018/19
- PPM, for Period 13 2018/19
Rail Delivery Group (RDG)
- Darwin
- Knowledgebase
- Online Journey Planner
- LENNON database
- National Rail Enquiries (NRE) data feeds
- Customer Heartbeat, plus a status update and quantified data
Virgin Trains
- Genius allocations data
- Genius diagram data
- Bugle train running data
- Bugle incident data
- Bugle list of delays
- On Train Data Recorder - station dwells
- On Train Data Recorder - journey events
- TMS
- Orbita
- Web Gemini
- Passenger numbers - airbag data
- Passenger numbers - TM counts
- Reservations/ ticket sales
- Leaf fall 221 fleet
- Leaf fall 390 fleet
- Leaf fall mapping
Greater Anglia
- Bugle data
- DayOne timing data from TD.net
- Nexala R2M screenshots
- Nexala R2M remote train monitoring (door opening/interlock times)
- Autumn slide data
- Track diagrams
- Route map
Merseyrail
- Train running data
- Passenger count data
Southeastern Railway
- Unit movements data
- Driver Compliance System Retrieved Data
- Warning Systems Data
Arriva Trains Wales
- Function Performance Reports
- Daily Performance Report
- Daily incident drop in
- Daily journey drop in
- Manager drop in
- Bugle incident report
- Period to date (PTD) incident data
- Wheel Slide Protection (WSP) Benefit
South Western Railway
- Wessex performance data
- Other data (upon request)
Great Western Railway
- PPM and RT data (upon request)
Northern
- Autumn heat map
- Route comparison 2016-17
- Station overruns 2013-17
- Unit tyre turning frequencies
- LNE autumn review
Angel Trains
- Fuel volume
- Acoustic footage of a Pendolino train
- Sample of networker fleet RCM system
- Vibration monitors from inside the saloon of East Coast High Speed Trains
- EC4T data - Part 1 & 2
- Saloon temperature data from class 166's
- Engine data from a class 166 train
Transport for London
- Open Data Portal
TS Catapult
- Mapping Grids (upon request)
- Mobile network data (upon request)
- National roadworks data (upon request)
- Haulage journeys data (upon request)
- Sentiment mapping data (upon request)
Met Office
- Weather data (upon request)
- Weather observation (upon request)
- Leaf fall (upon request)
- Adhesion forecasting (upon request)