Weibo Crawler (WeiboSpider)
==
**Author:** [gpxlcj][1]
**Last updated:** 2015.05.07
Description
----------
A spider that collects microblog data from [weibo][2].
Data can be crawled by time, by keyword, and by other criteria. The fields to collect can be customized.
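To make the time, keyword, and field options concrete, here is a minimal sketch that applies the same three criteria to records already held in memory. It does not call Weibo and it is not the project's implementation: `filter_posts`, `SAMPLE_POSTS`, and the field names `id`, `text`, `created_at`, and `user` are all hypothetical names chosen for illustration.

```python
from datetime import datetime

# Hypothetical sample records; the field names are assumptions,
# not the schema used by this project.
SAMPLE_POSTS = [
    {"id": 1, "text": "Sunny day in Beijing",
     "created_at": datetime(2015, 5, 1, 9, 0), "user": "alice"},
    {"id": 2, "text": "Traffic jam in Beijing",
     "created_at": datetime(2015, 5, 3, 18, 30), "user": "bob"},
    {"id": 3, "text": "Lunch in Shanghai",
     "created_at": datetime(2015, 5, 4, 12, 0), "user": "carol"},
]


def filter_posts(posts, start=None, end=None, keyword=None, fields=None):
    """Keep posts inside [start, end] that contain keyword, projected to fields."""
    selected = []
    for post in posts:
        if start is not None and post["created_at"] < start:
            continue  # posted before the requested time window
        if end is not None and post["created_at"] > end:
            continue  # posted after the requested time window
        if keyword is not None and keyword not in post["text"]:
            continue  # keyword not found in the post text
        if fields is None:
            selected.append(dict(post))  # no projection: keep every field
        else:
            selected.append({name: post[name] for name in fields})
    return selected


if __name__ == "__main__":
    result = filter_posts(
        SAMPLE_POSTS,
        start=datetime(2015, 5, 2),
        keyword="Beijing",
        fields=["id", "text"],
    )
    print(result)
```

Every criterion is optional, so the same function covers a time-only query, a keyword-only query, or both, and `fields` limits each record to the columns you want to keep.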
**Summary of each script:**
- `main.py`: start script
- `settings.py`: configuration file
- `weibo_api.py`: gets information via the Weibo API
- `save.py`: saves data
- `login.py`: account login
- `base.py`: base functions
- `spider.py`: crawls Weibo data
- `convert_shp.py`: converts data to shapefile (shp) format
ENVIRONMENT
--
- Python 2.7.6
- rsa 3.1.4
- requests 2.5.3
- openpyxl 2.2.0
- pyshp 1.2.1
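The version pins above can be turned into pip requirement lines. The sketch below is a convenience, not part of the project: `PINNED` and `requirement_lines` are hypothetical names, and it assumes the PyPI package names match the names in the list.

```python
# Versions copied from the ENVIRONMENT list; the PyPI names are assumed
# to be identical to the names shown there.
PINNED = [
    ("rsa", "3.1.4"),
    ("requests", "2.5.3"),
    ("openpyxl", "2.2.0"),
    ("pyshp", "1.2.1"),
]


def requirement_lines(pinned):
    """Format (name, version) pairs as 'name==version' requirement lines."""
    return ["{0}=={1}".format(name, version) for name, version in pinned]


if __name__ == "__main__":
    print("\n".join(requirement_lines(PINNED)))
```

Saving the printed lines to a file such as `requirements.txt` (a file name assumed here) lets you install them under Python 2.7 with `pip install -r requirements.txt`.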
EXAMPLE
--
You can customize `main.py` to design your own spider. For more details, see the [Introduction][3].
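```python
# -*- coding: utf-8 -*-
import json

from base import init_env
from login import wblogin

# Placeholder credentials (not part of the project): use your own Weibo account.
USERNAME = "your_username"
PASSWORD = "your_password"

if __name__ == "__main__":
    init_env()  # initialize the environment (see base.py)
    # Log in and serialize the result as JSON, keeping non-ASCII text readable.
    log = json.dumps(wblogin(USERNAME, PASSWORD), ensure_ascii=False)
```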
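The `ensure_ascii=False` argument in the example matters because Weibo content is mostly Chinese text. The sketch below contrasts the two settings of the standard `json.dumps` call. The `record` dictionary is a made-up value for illustration; what `wblogin` actually returns is not documented here.

```python
# -*- coding: utf-8 -*-
import json

# Hypothetical record; it stands in for any data containing Chinese text.
record = {"user": u"微博", "status": "ok"}

# Default behaviour: non-ASCII characters are written as \uXXXX escapes.
escaped = json.dumps(record, sort_keys=True)

# With ensure_ascii=False the original characters are kept as-is.
readable = json.dumps(record, sort_keys=True, ensure_ascii=False)

if __name__ == "__main__":
    print(escaped)
    print(readable)
    # Both strings decode back to the same data.
    print(json.loads(escaped) == json.loads(readable))
```

Both forms are valid JSON. The readable form is easier to inspect in logs, but it needs to be written out with an encoding that supports Chinese characters, such as UTF-8.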
[1]:http://github.com/gpxlcj/
[2]:http://weibo.com
[3]:http://weibospider.gpxlcj.com