-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
german news media #23
Comments
Excellent! The for developers vignette is unfortunatley outdated. This is the approach I currently use: # get some data to inspect and test
rss_url <- "http://www.bild.de/rssfeeds/rss3-20745882,feed=alles.bild.html"
test_df <- pb_collect(rss_url,
timeout = 60,
ignore_fails = TRUE)
# set up new parser
use_new_parser("https://www.bild.de/",
author = "[@schochastics](https://github.com/schochastics)",
issue = "[#23](https://github.com/JBGruber/paperboy/issues/23)",
rss = rss_url,
test_data = test_df)
# see what has been collected in your browser
pb_inspect(test_df)
# when the parser looks roughly done, you can run the same function again to run
# the tests
use_new_parser("https://www.bild.de/",
author = "[@schochastics](https://github.com/schochastics)",
issue = "[#23](https://github.com/JBGruber/paperboy/issues/23)",
rss = rss_url,
test_data = test_df) |
welt.de seems to timeout users quickly when the rss feed is accessed too many times |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
I'd be willing to contribute with some german news outlets (will add more over time to the list):
The text was updated successfully, but these errors were encountered: