Import Files - Adding Paths to the Crawler
You can add import files to your target to guide the crawler, specifying paths for the crawler to add to the scan even if none of the other pages in the target link to the paths listed in the import file.
If you ENABLE the option labelled "Restrict scans to import files", then the crawler will add to the scan ONLY the paths listed in the import file, ignoring all other parts of the target.
If you DISABLE the option labelled "Restrict scans to import files", then the crawler will crawl the target as usual, and use the import file to add other paths listed in the import file EVEN if no other part of the target links to them (orphaned folders/files).
For example, if you create a target with URL http://www.example.com, and use a text import file for with the following:
http://www.example.com/main/sub1/ http://www.example.com/extra/sub3/ |
...then, depending on whether the option "Restrict scans to import files" is enabled or disabled, we get the following behaviour:
Restrict Option | Will crawl and scan | Will NOT crawl and scan |
Enabled | http://www.example.com/extra/sub3/ http://www.example.com/main/sub1/ | http://www.example.com/main/sub2/ http://www.example.com/extra/sub1 http://www.example.com/new/ http://www.example.com/ |
Disabled | http://www.example.com/ http://www.example.com/extra/sub1 http://www.example.com/extra/sub3/ http://www.example.com/main/sub1/ http://www.example.com/main/sub2/ http://www.example.com/new/ |