Hi folks, jkt@slon ~ $ curl -v http://archives.gentoo.org/gentoo-doc-cvs/msg_01182.xml 2>&1 >/dev/null | grep Content-Length < Content-Length: 1 (Message number randomly picked up, haven't tried any other messages nor the other lists yet.)
So, I can get the source XML, that's good. There are some errors, though: a) Root element name doesn't match the DTD identifier (ie. your root element is <mainpage> while the DTD definition talks about <guide>) b) id="listarchive" isn't allowed (see /usr/portage/metadata/dtd/guide.dtd) c) Attribute "plain" for <pre> element isn't defined d) Using <foo bar="true"> isn't a standard approach, it's recommended to use <foo bar="bar"> to indicate that the option "bar" is present (otherwise just omit the entire attribute), iirc. e) You fail to properly escape all the entities (""" isn't a valid entity reference, """ is) - /gentoo-admin/msg_00216.xml, line 108 f) The generated XML isn't valid (missing "caption" attribute for the <pre> tag, <uri> isn't allowed inside <pre> and a lot of other errors as shown by `xmllint --noout --valid $file`. If you're looking for our XSLT wizzard, it's neysx. Some help might be provided at #gentoo-doc as we're probably the "mainstream producers" of XML content for webnodes. HTH
solar: archives is yours, could you please look at this old bug?
chizu is the owner of that code. He is better suited to review his code vs me. I've never touched it before. He can usually be reached via irc #osuosl if not responding to email.
The content length problem is worked around locally, waiting for bug 161855 or upstream Mhonarc to fix it in the package. Some of the XML errors have been fixed, too (been a while, I don't remember exactly which ones are still an issue).
*** Bug 207273 has been marked as a duplicate of this bug. ***
*** Bug 211194 has been marked as a duplicate of this bug. ***
*** Bug 262788 has been marked as a duplicate of this bug. ***
This needs to be retested after I regenerate all the XML again.
I'm no longer hitting this problem -> closing.