Broken Link Checker

A web crawling tool that checks for broken links

Broken Link Checker

Last Updated: 2/3/2018

Description

 A web crawling tool that checks for broken links. Built in PHP, this tool recursively scrapes the HTML of a webpage to find all of its hyperlinks. The tool then follows each link, recursively checking the hyperlinks on each destination page in return, until all links have been explored or a user-specified depth is reached.

 The results populate a color-coded table whose rows show which links were successfully followed (green) and which links were found to be broken or caused an error (red).

 The tool is quite effective at locating broken links, although at this time there are still a few cases which cause false positives for broken links. These include certain relative paths and subdomains. These issues will be addressed in upcoming updates to the tool.

Future Plans

  • Better handling of subdomains
  • Better handling of relative paths
  • Construct a graphical representation of crawl paths

Back to Projects