A monitor for craigslist searches.
*Does not work on Windows.
The craigslist monitor uses a command line interface to create web scrapers which monitor craigslist searches for new listings. A scraper monitoring a search will check periodically for new listings and output their details into a text file.
First, create a manager which will oversee active scrapers:
cmonitor manager
If cmonitor is not installed as a script, try re-installing with
sudo pip install craigslist-monitor
Next, to add a scraper, three things must be specified in the following order:
- The name of the scraper (used for stopping the scraper).
- The name of the item to be searched and monitored __or__ the url of a craigslist search.
- The file that the scraper will output details of new items to.
cmonitor add computer_scraper computer output
The above example of a scraper would monitor
this
craigslist search for new listings. There are other options to further
customize the scraper listed under cmonitor add --help
.
To stop the scraper:
cmonitor manager -s computer_scraper
Alternatively, to quit both, the manager and any active scrapers:
cmonitor manager -q
- To list the names of all active scrapers:
cmonitor manager -l
- To quit the manager and all active scrapers:
cmonitor manager -q
- To stop an active scraper:
cmonitor manager -s <scraper_name>
- If a listing contains words specified by –exclude-words, the scraper
will not output the details of that listing:
cmonitor add <scraper details> -e [words...]
- If –renewals is used, the scraper will output the details of listings
that have been renewed (off by default):
cmonitor add <scraper details> -r
- The two positive digits following –time-refresh are used with
random.randrange to determine how long (in seconds) until the scraper
checks for new listing (default is 60 and 180):
cmonitor add <scraper details> -t 200 300
usage: cmonitor add [-h] [-r] [-e word [word ...]] [-t pos_int pos_int] name monitor output Add an item (craigslist search) to be monitored. positional arguments: name The name of the process. monitor The url of a craigslist search or the item name to be monitored. output The file to which items found by the monitor will be written to optional arguments: -h, --help show this help message and exit -r, --renewals Include old posts that are renewed. -e word [word ...], --exclude-words word [word ...] Exclude posts that contain these words (not case sensitive). -t pos_int pos_int, --time-refresh pos_int pos_int The two positive integers used with randrange(lower, upper) to determine how long (in seconds) until the scraper checks for new items (first integer must be lower than the second). The default is 60 and 180.
usage: cmonitor manager [-h] [-q | -l | -s scraper_name] Starts the manager if issued with no commands. optional arguments: -h, --help show this help message and exit -q, --quit Quits the manager and all scrapers -l, --list List all the currently running scrapers -s scraper_name, --stop scraper_name Stop the named scraper