The first (and currently only) spider just scrapes every page of every topic upder Health Conditions in the WebMD Message Boards. It yields the topic, title, body, responses, and tags of each thread. To be used eventually to train a conversational agent as a part of my High Performace Computing Club (HPCC) project.
-
Notifications
You must be signed in to change notification settings - Fork 0
tasiah/Spiders
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Scraping web pages
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published