Gentoo Websites Logo
Go to: Gentoo Home Documentation Forums Lists Bugs Planet Store Wiki Get Gentoo!
Bug 7744 - HTML-SimpleParse-0.10.ebuild (New Package)
Summary: HTML-SimpleParse-0.10.ebuild (New Package)
Status: RESOLVED FIXED
Alias: None
Product: Gentoo Linux
Classification: Unclassified
Component: New packages (show other bugs)
Hardware: x86 Linux
: High enhancement (vote)
Assignee: Michael Cummings (RETIRED)
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2002-09-10 05:53 UTC by Todd
Modified: 2003-02-04 19:42 UTC (History)
0 users

See Also:
Package list:
Runtime testing required: ---


Attachments
The ebuild. (HTML-SimpleParse-0.10.ebuild,461 bytes, text/plain)
2002-09-10 05:56 UTC, Todd
Details

Note You need to log in before you can comment on or make changes to this bug.
Description Todd 2002-09-10 05:53:26 UTC
 
Comment 1 Todd 2002-09-10 05:55:07 UTC
This module is a simple HTML parser. It is similar in concept to HTML::Parser,
but it differs in a couple of important ways.

Final requirement for mimerdesk, I hope.

Belongs in
/dev-perl/HTML-SimpleParse

Comment 2 Todd 2002-09-10 05:56:32 UTC
Created attachment 3787 [details]
The ebuild.
Comment 3 Seemant Kulleen (RETIRED) gentoo-dev 2002-11-01 16:23:29 UTC
todd: in what ways does it differ?
Comment 4 Todd 2002-11-01 17:58:41 UTC
I do not believe that this package will be required in the next version of
mimerdesk, due out in a few days.  So I don't know whether you still want to
evaluate it for inclusion.

*Ripped from readme*
First, HTML::Parser knows which tags can contain other tags, which start
tags have corresponding end tags, which tags can exist only in the <HEAD>
portion of the document, and so forth.  HTML::SimpleParse does not know any
of these things.  It just finds tags and text in the HTML you give it, it
does not care about the specific content of these tags (though it does
distiguish between different _types_ of tags, such as comments, starting
tags like <b>, ending tags like </b>, and so on).

Second, HTML::SimpleParse does not create a hierarchical tree of HTML
content, but rather a simple linear list.  It does not pay any attention to
balancing start tags with corresponding end tags, or which pairs of tags are
inside other pairs of tags.
Comment 5 Michael Cummings (RETIRED) gentoo-dev 2002-12-02 11:00:50 UTC
Todd,

Just let me know if this is still relevant. Ready to add if so.
Comment 6 Michael Cummings (RETIRED) gentoo-dev 2002-12-18 12:37:17 UTC
in portage. enjoy.