Skip to content

Use Python to explore a website's internal links. Then apply D3 to visualize those connections as an interactive network graph with scorecards.

Notifications You must be signed in to change notification settings

devbret/website-internal-links

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Mapping A Website's Internal Links

Preview Of Resulting Visualization

Use Python to explore a website's internal links. Then apply D3 to visualize those connections as an interactive network graph with scorecards.

Set Up

Programs Needed

Steps

  1. Install the above programs.
  2. Open a shell window (For Windows open PowerShell, for MacOS open Terminal & for Linux open your distro's terminal emulator).
  3. Clone this repository using git by running the following command: git clone git@github.com:devbret/website-internal-links.git.
  4. Navigate to the repo's directory by running: cd website-internal-links.
  5. Install the needed dependencies for running the script by running: pip install -r requirements.txt.
  6. Edit the app.py file on line 115, to include the website you would like to visualize.
  7. Run the script with the command python3 app.py.
  8. To view the website's connections using the index.html file you will need to run a local web server. To do this run: python3 -m http.server.
  9. Once the network map has been launched, hover over any given node for more information about the particular web page. By clicking on a node, you will be sent to the related URL address.

Please Also Consider

Generating visualizations for this app takes an unexpectedly large amount of processing power. It is thus advisable to initially experiment with mapping less than one hundred pages per launch.

About

Use Python to explore a website's internal links. Then apply D3 to visualize those connections as an interactive network graph with scorecards.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published