-
-
Notifications
You must be signed in to change notification settings - Fork 19
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Full Content support for Phoronix.com #623
Comments
Thanks @privacyadmin. The update for phoronix.com is available as of 2024.12.1086.
|
Hi. I would like to reopen this as it doesn't seem to fix the site. I have already updated to the latest version as mentioned. e.g. https://www.phoronix.com/review/amd-epyc-9005-hpc-tuning |
hm, looks like they use a different format for these articles. I'll take a look! |
Thank you for taking the time to relook into this. Much appreciated and Merry Xmas! |
@privacyadmin I added a fix for this in 2024.12.1087. Feel free to reopen this issue if you're still seeing problems after that! |
Just received another Phoronix article with the same issue. https://www.phoronix.com/news/Deadline-DRM-Scheduler-RFC Hope this helps. Thanks and Happy New Year |
@privacyadmin the unfilled full content icon makes it look like the article isn't being parsed at all. ![]() I added some error handling in 2025.01.1089 that will at least make this more visible if it's a fetch error. |
Yes it seems that when I refresh my Photonix feeds, it says failed to fetch article. The site is definitely reachable since I can access it via tapping the article title or using another RSS reader. Any idea why it happens this way and if it's fixable? |
The "failed to fetch article" comes back when there's no response from the site. The exact reason should be in the crash log file under Settings > General > Share crash logs. The log entries that have |
This are the error messages from the latest CapyReader. 01-04 14:08:03.919 20167 20357 I cr.full_content: error_type=MissingBodyError error_message=403 01-04 14:08:05.368 20167 20357 I cr.full_content: error_type=MissingBodyError error_message=403 01-04 14:08:10.112 20167 20167 W WindowOnBackDispatcher: sendCancelIfRunning: isInProgress=false callback=c.y@ed9912a 01-04 14:08:18.240 20407 20407 W libc : Access denied finding property "ro.debuggable" 01-04 14:08:18.239 20407 20407 W logcat : type=1400 audit(0.0:37955): avc: denied { read } for name="u:object_r:userdebug_or_eng_prop:s0" dev="tmpfs" ino=456 scontext=u:r:untrusted_app:s0:c193,c256,c512,c768 tcontext=u:object_r:userdebug_or_eng_prop:s0 tclass=file permissive=0 app=com.capyreader.app |
Thanks @privacyadmin. Those are 403 forbidden errors then. There's not much I can do in the app when a website comes back with that. Based on some chatter in Phoronix's forums, it looks like this might be a problem that happens semi-regularly with that site. |
Hmm...I just tested a Phoronix article with another RSS reader with some findings.
Is it possible that my setting for Phoronix site was corrupted or something? Any chance to reset it for Phoronix while leaving the other sites preferences intact? |
Short answerThese are all good points. The full content mode is just an on/off switch so there's no specific site data for Phoronix compared to any other site. I published another version just now, 2025.01.1092-dev, that I hope will solve this. Let me know! Long answerAs to your other points, I mentioned 403 because the HTTP status code is logged alongside the
In my experience failures like 403 errors and 4xx errors in general will always be much faster because there's less data to transmit, and no article data to parse, making it feel instant. To your third point, I think some sites may block Capy based on its user agent which is why opening it in the article header (which uses the browser's user agent) or FeedMe render different results. To my understanding, headers, location and IP all act as indicators for Cloudflare's bot scores. These indicators don't work the same across all regions. As an example, I tried accessing another feed while testing hardwarezone.com.sg (#622). Full content works on a different RSS reader called Reeder for Mac, but not for Capy. The following returns a 200 response. curl 'https://www.hardwarezone.com.sg/review-microsoft-surface-laptop-7-snapdragon-x-series-mobile-processors' \
-H 'Host: www.hardwarezone.com.sg' \
-H 'Accept: */*' \
-H 'Accept-Language: en-US,en;q=0.9' \
-H 'Connection: keep-alive' \
-H 'User-Agent: Reeder/5050001 CFNetwork/1568.200.51 Darwin/24.1.0' But if I change the user agent to Capy's user agent, curl 'https://www.hardwarezone.com.sg/review-microsoft-surface-laptop-7-snapdragon-x-series-mobile-processors' \
-H 'Host: www.hardwarezone.com.sg' \
-H 'Accept: */*' \
-H 'Accept-Language: en-US,en;q=0.9' \
-H 'Connection: keep-alive' \
-H 'User-Agent: CapyReader (RSS Reader https://capyreader.com/) okhttp/4.12.0' Partial output, 403 response code:
However, if I exclude curl 'https://www.hardwarezone.com.sg/review-microsoft-surface-laptop-7-snapdragon-x-series-mobile-processors' \
-H 'Host: www.hardwarezone.com.sg' \
-H 'Accept: */*' \
-H 'Accept-Language: en-US,en;q=0.9' \
-H 'Connection: keep-alive' \
-H 'User-Agent: CapyReader (RSS Reader; https://capyreader.com/)' Partial output, 200 response code:
|
Thank you for the investigation into Phoronix. I just updated to the latest 2025.01.1092-dev and confirmed that #622 is working and the user agent fixed the issue. I tested again with Phoronix again and still no joy. What I find particular interesting is that your end seems to be working with Phoronix (based on your response from 5 days ago) but not for me. At this point, do let me reset my Capy Reader installation and try it again. Will report back on this. |
Short update:
At this point, I can only assume that some combinations of factors (IP, location, useragent etc) will result in the full content error on Capy Reader since you are unable to replicate it on your end. I'm not sure if you are ok to leave this ticket opened for now or to close it. I am fine either way for now. |
Sorry that didn't fix this feed for you. Depending on whether or not you want to continue trying, you could see if the Phoronix forum mods are willing to unblock the user-agent |
Feed URL
https://www.phoronix.com/news/Raspberry-Pi-HEVC-H265-Decode
Add any details, links, or screenshots about the article layout that's missing or wrong
Text and images are missing/incomplete
The text was updated successfully, but these errors were encountered: