7744 – HTML-SimpleParse-0.10.ebuild (New Package)

Bug 7744 - HTML-SimpleParse-0.10.ebuild (New Package)

Summary: HTML-SimpleParse-0.10.ebuild (New Package)

Status:	RESOLVED FIXED

Alias:	None

Product:	Gentoo Linux
Classification:	Unclassified
Component:	New packages (show other bugs)
Hardware:	x86 Linux

Importance:	High enhancement
Assignee:	Michael Cummings (RETIRED)

URL:
Whiteboard:
Keywords:

Depends on:
Blocks:

Reported:	2002-09-10 05:53 UTC by Todd
Modified:	2003-02-04 19:42 UTC (History)
CC List:	0 users

See Also:
Package list:
Runtime testing required:	---

Attachments
The ebuild. (HTML-SimpleParse-0.10.ebuild,461 bytes, text/plain) 2002-09-10 05:56 UTC, Todd	Details
View All Add an attachment (proposed patch, testcase, etc.)

Note You need to log in before you can comment on or make changes to this bug.

Description Todd 2002-09-10 05:53:26 UTC

Comment 1 Todd 2002-09-10 05:55:07 UTC

This module is a simple HTML parser. It is similar in concept to HTML::Parser,
but it differs in a couple of important ways.

Final requirement for mimerdesk, I hope.

Belongs in
/dev-perl/HTML-SimpleParse

Comment 2 Todd 2002-09-10 05:56:32 UTC

Created attachment 3787 [details]
The ebuild.

Comment 3 Seemant Kulleen (RETIRED) gentoo-dev

2002-11-01 16:23:29 UTC

todd: in what ways does it differ?

Comment 4 Todd 2002-11-01 17:58:41 UTC

I do not believe that this package will be required in the next version of
mimerdesk, due out in a few days.  So I don't know whether you still want to
evaluate it for inclusion.

*Ripped from readme*
First, HTML::Parser knows which tags can contain other tags, which start
tags have corresponding end tags, which tags can exist only in the <HEAD>
portion of the document, and so forth.  HTML::SimpleParse does not know any
of these things.  It just finds tags and text in the HTML you give it, it
does not care about the specific content of these tags (though it does
distiguish between different _types_ of tags, such as comments, starting
tags like <b>, ending tags like </b>, and so on).

Second, HTML::SimpleParse does not create a hierarchical tree of HTML
content, but rather a simple linear list.  It does not pay any attention to
balancing start tags with corresponding end tags, or which pairs of tags are
inside other pairs of tags.

Comment 5 Michael Cummings (RETIRED) gentoo-dev

2002-12-02 11:00:50 UTC

Todd,

Just let me know if this is still relevant. Ready to add if so.

Comment 6 Michael Cummings (RETIRED) gentoo-dev

2002-12-18 12:37:17 UTC

in portage. enjoy.