This is a repository for tools and pipelines for importing data into Data Commons.
Data Commons is an Open Knowledge Graph that provides a unified view across multiple public data sets and statistics. It includes APIs and visual tools to easily explore and analyze data across different datasets without data cleaning or joining.
-
The tools are built using Apache Maven version 3.8.0.
For MacOS:
brew install maven
-
The tools use protobuf and require that
protoc
be installed.For MacOS:
brew install protoc
You can build and test the Java code from a Unix shell.
To build: ./build.sh
To run tests: ./test.sh
In https://github.com/datacommonsorg/import, click on "Fork" button to fork the repo.
Clone your forked repo to your desktop.
Add datacommonsorg/import repo as a remote:
git remote add dc https://github.com/datacommonsorg/import.git
Every time when you want to send a Pull Request, do the following steps:
git checkout master
git pull dc master
git checkout -b new_branch_name
# Make some code change
git add .
git commit -m "commit message"
git push -u origin new_branch_name
Then in your forked repo, you can send a Pull Request. If this is your first time contributing to a Google Open Source project, you may need to follow the steps in contributing.md.
Wait for approval of the Pull Request and merge the change.
Apache 2.0
For general questions or issues, please open an issue on our
issues page. For all other
questions, please send an email to support@datacommons.org
.
Note - This is not an officially supported Google product.