Replies: 2 comments 13 replies
-
|
I would like to avoid a situation where an runaway automated crawling process causes undue load on our service. We have tools in place to mitigate some of the problems and, indeed, our site has been live for over a decade and undoubtedly crawled by manage search index crawlers. What data are you interested in collecting? |
Beta Was this translation helpful? Give feedback.
13 replies
-
|
@joelverhagen Can you help me check why there is issue #10427? |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
As discussed in #10221, I'm going to write a spider program to crawl the full site data of nuget.org. The goal is to recreate a new nuget-trend website with advanced statistics charts. Although https://nugettrends.com/ is there, it's a half-done project. A few features are not implemented. I did check the code 2 years ago.
Nuget team is very slow in working on advanced statistics tickets. A few statistics tickets have been there for years (some are open since 2017).
There are 43 open issues under Area:Statistics category. Check this
I'd like to know if there is any concern from nuget team if I wanna crawl the data from nuget.org. And I know that you cannot share your data because there are some PII information in your database.
Beta Was this translation helpful? Give feedback.
All reactions