02-02- scrapy shell fetch

01

After installing Scrapy and creating the project, change into the spiders directory to create a spider.

This step is optional: I don't think you need ipython, since the Scrapy shell works without it.

pip install ipython
ipython

02

Make a spider in the spiders directory with the command:

scrapy genspider bookspider https://books.toscrape.com/

Provide a name for the spider along with the base URL.

This is the page the spider will start from: https://books.toscrape.com/

03

The allowed_domains list limits where the spider is allowed to crawl. Requests to URLs outside these domains are filtered out, so a link that leads to another website is not followed.
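
To make the filtering idea concrete, here is a minimal sketch in plain Python of the kind of check this implies. It is not Scrapy's own code: the helper name is_allowed is made up for this note, and the rule it applies (the host must equal an allowed domain or be a subdomain of one) is my understanding of how the check behaves.

```python
from urllib.parse import urlparse


def is_allowed(url, allowed_domains):
    """Hypothetical helper: True if the URL's host is an allowed domain
    or a subdomain of one. Illustration only, not Scrapy's implementation."""
    host = (urlparse(url).hostname or "").lower()
    return any(host == d or host.endswith("." + d) for d in allowed_domains)


allowed_domains = ["books.toscrape.com"]

# A page on the allowed site passes the check
print(is_allowed("https://books.toscrape.com/catalogue/page-2.html", allowed_domains))  # True

# A link to a different site is rejected
print(is_allowed("https://example.com/", allowed_domains))  # False
```

In a real spider you never call anything like this yourself; setting allowed_domains on the spider class is enough.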

The start_urls list holds the URLs the spider requests first.

04

The response passed to the parse method is the same kind of object that the fetch command makes available in the Scrapy shell, seen in the next lesson.