-
Notifications
You must be signed in to change notification settings - Fork 818
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Net Scan: 72TB BAB&anOptions, 210 TB app_vars #4368
Comments
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
@gwarser The Internet is much much bigger than 1 million websites, it's close to 2 billion. I use Common Crawl, which (I think) crawls top 40 million websites or something. Maybe it's time for generic scriptlet? |
|
Whitelisted, and whitelisted. |
Can you test |
It's up to you, if you think it's not worth fixing, I'll just whitelist it. |
Yes, I meant nothing until BAB kicks in 😆. |
Alright, whitelisted. I'm still working on my Chromium as a Service, it refuses to load extensions in headless mode and I can't find a working tutorial to make EC2 headful... |
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
Main type (FAB, anOptions, etc.) is from the scanner output, which may not be accurate. |
We should probably add a warning that some of the listed domains may be NSFW.
Yes, I fixed |
OK, updated opening. |
|
Second "hard batch" domains are all valid. |
Well yea, I tested those ones. I'll be testing everything from now on, either manually or semi-automated with Puppeteer. |
didn't want to open an issue since it's related to this.
|
I'm not sure why but I can't manage to keep 10 worker children alive, so I'm downscaling my cluster to 4 worker servers (8 children), I finished processing 72 TB out of 210 TB of data.
In order to prevent maintainers from burning out, I will now add a daily cap on how many links I post:
In addition, I will not post more than 500 links combined per day.
Note that some domains may be NSFW.
The text was updated successfully, but these errors were encountered: