HEPcrawl

HEPcrawl is a harvesting library based on Scrapy (http://scrapy.org) for INSPIRE-HEP (http://inspirehep.net). It focuses on automatic and semi-automatic retrieval of new content from all the sources the site aggregates, in particular content from major and minor publishers in the field of High-Energy Physics.
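To make the idea of harvesting concrete, here is a minimal sketch of turning a publisher feed into records. It is not HEPcrawl's API: it uses only the Python standard library rather than Scrapy, and the feed layout (``articles``, ``article``, ``title``, ``doi``), the ``SAMPLE_FEED`` data and the ``parse_feed`` helper are all hypothetical names chosen for illustration. In a Scrapy-based crawler, comparable extraction logic would typically live in a spider's ``parse`` callback.

```python
import xml.etree.ElementTree as ET

# Hypothetical publisher feed; element names and DOIs are invented for illustration.
SAMPLE_FEED = """<articles>
  <article>
    <title>Search for new resonances</title>
    <doi>10.0000/example.1</doi>
  </article>
  <article>
    <title>Lattice QCD at finite temperature</title>
    <doi>10.0000/example.2</doi>
  </article>
</articles>"""


def parse_feed(xml_text):
    """Turn a publisher feed into a list of plain record dictionaries."""
    root = ET.fromstring(xml_text)
    records = []
    # Each <article> element becomes one record with the fields we care about.
    for node in root.iter("article"):
        records.append({
            "title": node.findtext("title", default="").strip(),
            "doi": node.findtext("doi", default="").strip(),
        })
    return records


if __name__ == "__main__":
    for record in parse_feed(SAMPLE_FEED):
        print(record["doi"], record["title"])
```

Keeping the extraction step as a pure function of the feed text makes it easy to test against saved sample responses without any network access.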

The project is currently in an early stage of development.

See full documentation at http://pythonhosted.org/hepcrawl

Repository contents (770 commits):

.github
docs
hepcrawl
tests
.coveragerc
.gitignore
INSTALL.rst
LICENSE
MANIFEST.in
README.rst
RELEASE-NOTES.rst
docker-compose.deps.py2.yml
docker-compose.test.py2.yml
pytest.ini
scrapy.cfg
setup.cfg
setup.py

Contributors 27

inspirehep/hepcrawl
