Skip to content

hpitawela/cincinnati_crawler

Repository files navigation

University of Cincinnati Web Crawler App

This app was designed to extract information from professor profile sites of University of Cincinnati: [https://researchdirectory.uc.edu/] . Extracted information is written across 15 tables in a PostGreSQL database.

Developer written LOC(lines of code) ~ 2800

Technologies Used:

  • Java language
  • jsoup library
  • regular expressions

How to run from Command Line Client

  • Go to src folder of WebCrawlApp
  • Enter following to compile
     javac -cp ".;../lib/jsoup-1.11.2.jar" WebCrawlApp.java
    
  • Enter following to run
     java -cp ".;../lib/jsoup-1.11.2.jar" WebCrawlApp
    

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages