Currently repoman does not do online network checks, this though causes some trouble since we have no way to report moved or gone pages, and the same goes for distfiles.
Having a further scan level for repoman to check HOMEPAGE and SRC_URI for validity (and reporting eventually where they result moved, which would allow to find eventual spam links that packages.gentoo.org is currently providing) would definitely help.
I proposed (a few months back) the name "repoman anal" but Zac turned it down ;)
Yesterday, I opened bug #295194. Should that one be closed in favor of this one? Or should it depend on this one?
Ah I couldn't find it, sorry… but I guess a blocker is good, since we're going to need something else as well, like a way to check for mirrors ;)
Do you expect developers to fix:
- Homepages that have moved (300-level redirect)
- Homepages that are gone (400-level error page)
- Homepages that are broken (500-level error page)
- Homepages that are spam
- Distfiles that are missing.
I am not sure repoman is really the best place for these checks; although having devs do it is appealing as a distributed effort (as opposed to one or two lonely qa guys trying valiantly to fix stuff).
I guess here I'd rather see a third-party tool doing this stuff first; with people actually fixing stuff; as opposed to adding yet another option to repoman that only two lonely qa devs will ever use.
and I sincerely don't want to write a tool to parse an ebuild file to be able to do something that repoman does already.
Checking for spam links is quite difficult so I don't pretend that repoman be able to do that, but the rest should be pretty trivial to do.
I never said you shouldn't use the portage API (no ebuild parsing required.)
Well, I'm already working on support for limiting the amount of checks performed by repoman to a specific group. When this is done, I may introduce additional check group for SRC_URI, disabled by default.
*** Bug 315243 has been marked as a duplicate of this bug. ***