priyankkhanna/web_page_ranker_crawler crawls the web, collects page links, and ranks and visualizes pages based on the number of links. The scripts, in the order you run them:

- spider.py to crawl a web site, store the pages it finds in the database (spider.sqlite), and record the links between pages (see the crawl sketch after this list).
- spdump.py to dump the contents of the database and inspect the crawled data.
- sprank.py to rank the pages. It assigns a weight to each page based on the number of links pointing to it, then ranks the pages by those weights. Run it as many times as you want; each pass refines the ranks (a simplified pass is sketched below).
- spdump.py again to view the updated ranks.
- spreset.py to restart the page rank calculations without re-spidering the web pages.
- spjson.py to write the page rank data out in JSON form to spider.js (see the export sketch below).
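
For orientation, here is a minimal sketch of the kind of crawl spider.py performs. The Pages/Links schema, the column names, and the seed URL are illustrative assumptions, not the repo's actual code:

```python
# Minimal crawl sketch: fetch one page, record every outbound link in SQLite.
# Schema (Pages, Links) and column names are assumptions for illustration.
import sqlite3
import urllib.request
from urllib.parse import urljoin, urlparse

from bs4 import BeautifulSoup

conn = sqlite3.connect("spider.sqlite")
cur = conn.cursor()
cur.executescript("""
CREATE TABLE IF NOT EXISTS Pages (id INTEGER PRIMARY KEY, url TEXT UNIQUE);
CREATE TABLE IF NOT EXISTS Links (from_id INTEGER, to_id INTEGER, UNIQUE(from_id, to_id));
""")

def page_id(url):
    """Insert the URL if it is new and return its row id."""
    cur.execute("INSERT OR IGNORE INTO Pages (url) VALUES (?)", (url,))
    cur.execute("SELECT id FROM Pages WHERE url = ?", (url,))
    return cur.fetchone()[0]

def crawl(url):
    """Fetch one page and record a Links row for every anchor found on it."""
    from_id = page_id(url)
    html = urllib.request.urlopen(url).read()
    soup = BeautifulSoup(html, "html.parser")
    for tag in soup.find_all("a", href=True):
        href = urljoin(url, tag["href"])          # resolve relative links
        if urlparse(href).scheme in ("http", "https"):
            cur.execute("INSERT OR IGNORE INTO Links (from_id, to_id) VALUES (?, ?)",
                        (from_id, page_id(href)))
    conn.commit()

crawl("https://www.example.com/")  # seed URL (assumed); spider.py presumably loops over unvisited pages
```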
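And here is a sketch of one simplified page rank pass of the sort sprank.py iterates. sprank.py presumably stores ranks back in spider.sqlite; this sketch keeps them in memory for brevity and reuses the assumed Pages/Links tables above:

```python
# Simplified page rank sketch: each pass moves a page's rank to the pages it
# links to, so heavily linked pages accumulate weight. Schema is assumed.
import sqlite3

conn = sqlite3.connect("spider.sqlite")
cur = conn.cursor()

cur.execute("SELECT DISTINCT from_id FROM Links")
nodes = [row[0] for row in cur.fetchall()]
ranks = {node: 1.0 for node in nodes}  # every page starts with an equal rank

for _ in range(10):  # iterate as many times as you want; the values settle
    next_ranks = {node: 0.0 for node in nodes}
    for node in nodes:
        cur.execute("SELECT to_id FROM Links WHERE from_id = ?", (node,))
        targets = [row[0] for row in cur.fetchall() if row[0] in ranks]
        if not targets:
            continue
        share = ranks[node] / len(targets)  # split this page's rank among its outbound links
        for target in targets:
            next_ranks[target] += share
    # redistribute rank that leaked out of the graph so the total stays constant
    lost = sum(ranks.values()) - sum(next_ranks.values())
    ranks = {n: r + lost / len(nodes) for n, r in next_ranks.items()}

# show the five highest-ranked pages
for node, rank in sorted(ranks.items(), key=lambda kv: kv[1], reverse=True)[:5]:
    cur.execute("SELECT url FROM Pages WHERE id = ?", (node,))
    print(round(rank, 3), cur.fetchone()[0])
```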
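Finally, a sketch of the kind of export spjson.py could do to produce spider.js for a browser-side visualization. The spiderJson variable name and the node/link layout are assumptions, not the repo's actual output format:

```python
# Export sketch: dump the link graph from spider.sqlite into spider.js as a
# JavaScript assignment a visualization page can load with a <script> tag.
import json
import sqlite3

conn = sqlite3.connect("spider.sqlite")
cur = conn.cursor()

cur.execute("SELECT id, url FROM Pages")
nodes = [{"id": pid, "url": url} for pid, url in cur.fetchall()]
cur.execute("SELECT from_id, to_id FROM Links")
links = [{"source": f, "target": t} for f, t in cur.fetchall()]

with open("spider.js", "w") as fh:
    fh.write("spiderJson = " + json.dumps({"nodes": nodes, "links": links}, indent=2) + ";\n")
```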