The motivation for this project comes from the crucial lack of open data regarding public transportation in Italy: differently from what happens in other European countries, there is no public database curating these data nationally and releasing them in an aggregate form. A first effort in independently collecting a dataset of that kind has been carried on by TrainStats, but the data is currently presented in a form very close to raw.
The goal of this project is to fill this gap, by making data available to the general population in a simple and intuitive manner. On one side, this would be a service to the community, improving the information currently available and helping people in planning a more informed trip schedule. On the other, it would improve public accountability, pushing Trenitalia to improve its rail system to decrease delays and malfunctions.
*Ritardi is the Italian for Delays
The website is available at openritardi.org
We publish tutorials on how to use OpenRitardi data and replicate our analysis. We welcome contribution with new processing ideas and visualization. You can find more information on this page (here in pdf).
We believe this project can be impactful, fun, and a chance to learn new skills or meet interesting people. Our mission is to showcase how the use of open data can improve public services.
We have decided to launch an open-source community to support this project with the following two guidelines:
- everybody can contribute with anything that (choose any of the following):
- excites them
- believes is useful for the mission
- believes is useful for themselves (e.g. learn web-development or data science skills)
- nobody in the community should feel any pressure to do any activity
and with the following goals:
- expanding the current openritardi.org with new countries
- expanding the data science part of the project
- find a way to make it impactful
- build a community driven by the interest of open data and data science
🚀 You can join our community by signing up on our Slack!
Contribution is welcome! Please use the Roadmap to suggest new features, feel free to open new issues. We will credit you on the About page on the website.
The website is stored on GitHub pages. The data for stations and train delays is currently stored in this repository in zip archives in the data/dataset_generated
folder.
The GitHub action that prepares the website unzips all files and stores them in the website/data
folder.
If you develop locally, you can clone the folder and run the following Bash command
$ bash prepare_data.sh
to unzip all datasets locally.
The data folder contains all the procedures to download and process all data of Italian trains.
Once you have unzipped the data using the script above, you can run a local server to do your experiments. Feel free to use whatever local server.
A good possibilty is to use Visual Studio Code with the Live Server extention enabled. You can easily launch a live server which points to the website/dist
folder and you can access the preview of the website from localhost
and the port given by Live Server, in your favorite browser.
You can statically compile the website with the following commands
cd website
npm install
npm run build
The folder dist
is the entrypoint of the website.
Feel free to submit pull requests. Please document your code in English and write a PR description. Test your code before submitting a PR and after implementing PR comments as it will most likely not be tested by the reviewers!