Releases: yujiosaka/headless-chrome-crawler
1.8.0
Added
- Set `previousUrl` to `onSuccess` argument.
- Set `options`, `depth`, `previousUrl` to errors.
- Support `customCrawl` for HCCrawler.connect() and HCCrawler.launch()'s options.
- Add Dockerfile and tips for using Docker.
Changed
- Drop `newpage` event.
- Update Puppeteer version to 1.5.0.
Fixed
- Fix a bug of not marking skipped requests correctly.
- Fix `requestfinished` event's argument as described in the API reference.
1.7.0
1.6.0 - 2018-04-21
Added
- Support `viewport` and `skipRequestedRedirect` for crawler.queue()'s options.
- Emit `requestdisallowed` event.
- Make `onSuccess` pass `redirectChain` in the response.
Changed
- Bump Node.js version up to 8.10.0.
- Update Puppeteer version to 1.3.0.
- Move node_redis to the peer dependencies.
- Make crawler.queue() return a Promise.
Fixed
- Fix a bug of silently failing to insert jQuery due to CSP.
1.5.0
Added
- Support `waitFor` for crawler.queue()'s options.
- Support `slowMo` for HCCrawler.connect()'s options.
Fixed
- Fix a bug that prevented setting the `timeout` option per request.
- Fix a bug of crawling the same page twice when one URL has a trailing slash on the root folder and the other does not.
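The trailing-slash fix above concerns URLs such as `https://example.com` and `https://example.com/`, which point to the same page. The following is a minimal sketch of one way to treat them as equal, assuming the standard WHATWG `URL` class available in Node.js. `normalizeUrl` and `dedupeUrls` are hypothetical helpers written for illustration; they are not part of headless-chrome-crawler and do not claim to reproduce its actual fix.

```javascript
// Hypothetical helper (not from the library): canonicalize a URL string.
function normalizeUrl(rawUrl) {
  // The WHATWG URL parser serializes an empty path as "/", so
  // "https://example.com" and "https://example.com/" yield the same href.
  return new URL(rawUrl).href;
}

// Hypothetical helper: drop URLs that are duplicates after normalization,
// keeping the first occurrence of each.
function dedupeUrls(urls) {
  const seen = new Set();
  const unique = [];
  for (const url of urls) {
    const key = normalizeUrl(url);
    if (!seen.has(key)) {
      seen.add(key);
      unique.push(key);
    }
  }
  return unique;
}
```

Only the root path is unified by this approach: `https://example.com/docs` and `https://example.com/docs/` remain distinct, which matches the scope described in the changelog entry.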
1.4.0
Added
- Support `browserCache` for crawler.queue()'s options.
- Support `depthPriority` option again.
1.3.4
Changed
- Drop `depthPriority` for crawler.queue()'s options.
1.3.3
Added
- Emit `newpage` event.
- Support `deniedDomains` and `depthPriority` for crawler.queue()'s options.
Changed
- Allow `allowedDomains` option to accept a list of regular expressions.
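To illustrate what a domain list mixing plain strings and regular expressions can look like, here is a rough sketch of a matcher. `isDomainAllowed` is a hypothetical function written for this example, and treating string entries as exact hostname matches is an assumption; the library's own matching rules may differ.

```javascript
// Hypothetical matcher (not the library's code): returns true when the
// hostname matches at least one entry of the allow list.
function isDomainAllowed(hostname, allowedDomains) {
  return allowedDomains.some((domain) => {
    // Regular expressions are tested against the hostname.
    if (domain instanceof RegExp) return domain.test(hostname);
    // Assumption: plain strings must match the hostname exactly.
    return domain === hostname;
  });
}

// Example allow list: one exact hostname and one pattern for subdomains.
const allowed = ['example.com', /\.example\.org$/];
```

With this list, `isDomainAllowed('docs.example.org', allowed)` is true because of the pattern, while `isDomainAllowed('example.net', allowed)` is false. Avoid the `g` flag on such patterns, since `RegExp.prototype.test` then keeps state between calls.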
1.3.2
Added
- Support `followSitemapXml` for crawler.queue()'s options.
Fixed
- Fix a bug of not showing console messages properly.
1.3.1
1.3.0
Added
- Add HCCrawler.defaultArgs() method.
- Emit `requestretried` event.
Changed
- Use `cache` option not only for remembering already requested URLs but also as the request queue for distributed environments.
- Move `onSuccess`, `onError` and `maxDepth` options from HCCrawler.connect() and HCCrawler.launch() to crawler.queue().