Scrapy (ReconSpider)
Last updated
Last updated
We will leverage Scrapy and a custom spider tailored for reconnaissance on inlanefreight.com
. If you are interested in more information on crawling/spidering techniques, refer to the "" module, as it forms part of CBBH as well.
Before we begin, ensure you have Scrapy installed on your system. If you don't, you can easily install it using pip, the Python package installer:
In the location where you create the virtual environment use the following command
This command will download and install Scrapy along with its dependencies, preparing your environment for building our spider.
First, run this command in your terminal to download the custom scrapy spider, ReconSpider
, and extract it to the current working directory.
With the files extracted, you can run ReconSpider.py
using the following command:
Replace inlanefreight.com
with the domain you want to spider. The spider will crawl the target and collect valuable information.
After running ReconSpider.py
, the data will be saved in a JSON file, results.json
.