Releases: simonw/shot-scraper
Releases · simonw/shot-scraper
0.15
- New --log-requests option for logging out newline-delimited JSON showing all requests made while rendering a page, including image and asset loads. #88
- New shot-scraper pdf formatting options:
--format
,--width
,--height
,--scale
and--print-background
. Thanks, Eddie Chapman. #87 - Removed
-h
shortcut for help - use--help
instead.-h
was clashing with the shorter version of--height
. Thanks, Matthew Bafford. #84
0.14.3
0.14.2
- Documentation improvements. 0.14.1...0.14.2
0.14.1
0.14
- Documentation is now hosted at shot-scraper.datasette.io and has been expanded to multiple pages. #73
shot-scraper $URL --wait-for EXPRESSION
can be used to take the screenshot only once the provided JavaScript expression returnstrue
. See Waiting until a specific condition. #72- The
wait_for:
key in the YAML format used byshot-scraper multi
provides equivalent functionality for scripted multiple screenshots. - New documentation on Using shot-scraper with GitHub Actions. #74
- New example: Extracting page content with Readability.js
0.13
0.12
--selector-all
option to take a screenshot that encompasses every element matching the specified CSS selector - complements--selector
which takes a screenshot of just the first element matching that selector. See Specifying elements using JavaScript filters. #64selector_all:
andselectors_all:
keys in theshot-scraper multi
YAML format.--js-selector
and--js-selector-all
options for specifying elements to screenshot using a JavaScript expression, for cases which cannot be handled using CSS selectors #43. The following example takes a screenshot of the first paragraph on the page that mentionsshot-scraper
:shot-scraper https://github.com/simonw/shot-scraper \ --js-selector 'el.tagName == "P" && el.innerText.includes("shot-scraper")'
js_selector:
,js_selectors:
,js_selector_all:
andjs_selectors_all:
equivalent keys in theshot-scraper multi
YAML format.--user-agent
option for setting a custom user agent header. #63--browser webkit
option for running WebKit - thanks, Ryan Murphy. #56
0.11
- New
shot-scraper accessibility --timeout
option, thanks Ben Welsh. #59 shot-scraper auth --browser
option for authentication using a browser other than Chromium. #61- Using
--quality
now results in a JPEG file with the correct.jpg
extension. Thanks, Ian Wootten. #58 - New
--reduced-motion
flag for emulating the "prefers-reduced-motion" media feature. Thanks, Ryan Murphy. #49
0.10
- New
-b/--browser
option for theshot-scraper install
,shot
,multi
andjavascript
commands. This can be used to install and run alternative browsersfirefox
,chrome
orchrome-beta
. Thanks, Ben Welsh. #53 - New
--timeout
option forshot-scraper shot
andshot-scraper multi
. Thanks, Ben Welsh. #47 shot-scraper multi
now continues to create other shots despite a timeout error, unless--fail-on-error
is passed. Thanks, Ryan Cheley. #50- README now describes the
async () => { ... }
pattern forshot-scraper javascript
, as discussed in Extracting web page content using Readability.js and shot-scraper. - New demo: shot-scraper-template, which lets you setup shot-scraper running in a GitHub repository in a single click. See also Instantly create a GitHub repository to take screenshots of a web page.
- New scraping demo: scrape-hacker-news-by-domain uses
shot-scraper javascript
to scrape a web page. See Scraping web pages from the command-line with shot-scraper.
0.9
-
New shot-scraper javascript command for executing JavaScript against a web page and returning the result to the console as JSON: #38
% shot-scraper javascript datasette.io document.title "Datasette: An open source multi-tool for exploring and publishing data"
This can be used for web scraping and data extraction. Any JavaScript errors will cause the command to return an exit code of 1, so this can also be used to run tests against a website from within a continuous integration environment such as GitHub Actions.
-
The
shot-scraper pdf
andshot-scraper accessibility
commands can both now be used with local files in addition to URLs. #37 -
The
output:
key is no longer required in YAML shot configuration: if omitted, an automatic filename will be used instead. #40 -
An empty YAML file no longer produces an error. #41