Jericho HTML Parser 3.0

Jericho HTML Parser 3.0 Ranking & Summary

File size: 2 MB
Platform: Mac OS X
License: GPL
Price: FREE
Downloads: 18
Date added: 2009-04-11
Publisher: M Jericho

Jericho HTML Parser 3.0 description

Free and open source HTML parser for your Mac

Jericho HTML Parser is an open source Java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any invalid or unrecognized HTML.
Jericho HTML Parser also provides high-level HTML form manipulation functions.

Main features:
- Badly formatted HTML does not interfere with the parsing of the rest of the document, which makes the library ideal for use with "real-world" HTML that chokes other parsers.
- PHP, JSP, ASP, PSP and Mason server tags are explicitly recognized by the parser. This means that normal HTML is still parsed properly even if there are server tags inside it, which is common, for example, when dynamically setting element attributes.
- It is neither an event-based nor a tree-based parser, but rather uses a combination of simple text search, efficient tag recognition and a tag position cache. The text of the whole source document is first loaded into memory, and then only the relevant segments are searched for the relevant characters of each search operation.
- Compared to a tree-based parser such as DOM, the memory and resource requirements can be far lower if only small sections of the document need to be parsed or modified. Incorrect or badly formatted HTML can easily be ignored, unlike with tree-based parsers, which must identify every node in the document from top to bottom.
- Compared to an event-based parser such as SAX, the interface is on a much higher level and more intuitive, and a tree representation of the document element hierarchy is easily created if required.
- The begin and end positions in the source document of all parsed segments are accessible, allowing modification of only selected segments of the document without having to reconstruct the entire document from a tree.
- The row and column number of each position in the source document are easily accessible.
- Provides a simple but comprehensive interface for the analysis and manipulation of HTML form controls, including the extraction and population of initial values, and conversion to read-only or data display modes. Analysis of the form controls also allows data received from the form to be stored and presented in an appropriate manner.
- Custom tag types can be easily defined and registered for recognition by the parser.
- Built-in functionality to extract all text from HTML markup, suitable for feeding into a text search engine such as Apache Lucene.
- Built-in functionality to render HTML markup with simple text formatting.
- Built-in functionality to format HTML source code, indenting elements according to their depth in the document element hierarchy.
- Built-in functionality to compact HTML source code by removing all unnecessary white space.
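
The position-based approach described in the feature list can be illustrated with a small, standard-library-only Java sketch. This is not Jericho's implementation or its API: the class `SegmentSketch` and its methods `findStartTags`, `rowColumn` and `replace` are hypothetical names invented for this illustration. It only shows the general idea of locating tags by plain text search, reporting the row and column of a position, and replacing one segment by its begin and end offsets while the rest of the text, including badly formed markup, is left verbatim.

```java
import java.util.ArrayList;
import java.util.List;

/** Illustrative sketch only; not Jericho's implementation or API. */
final class SegmentSketch {

    /** A half-open character range [begin, end) within the source text. */
    static final class Span {
        final int begin;
        final int end;
        Span(int begin, int end) { this.begin = begin; this.end = end; }
    }

    /**
     * Finds start tags with the given name by simple text search, without
     * building a tree. Simplification: a '>' inside a quoted attribute value
     * or a server tag would end the span early; a real parser handles that.
     */
    static List<Span> findStartTags(String source, String tagName) {
        List<Span> spans = new ArrayList<>();
        int from = 0;
        int lt;
        while ((lt = source.indexOf('<', from)) >= 0) {
            from = lt + 1;
            // Case-insensitive comparison of the characters after '<'.
            if (!source.regionMatches(true, lt + 1, tagName, 0, tagName.length())) continue;
            int afterName = lt + 1 + tagName.length();
            if (afterName >= source.length()) break;
            // Require a delimiter so that "a" does not match "<abbr>".
            char next = source.charAt(afterName);
            if (next != '>' && next != '/' && !Character.isWhitespace(next)) continue;
            int gt = source.indexOf('>', afterName);
            if (gt < 0) break; // unterminated tag: nothing more to find
            spans.add(new Span(lt, gt + 1));
            from = gt + 1;
        }
        return spans;
    }

    /** Returns {row, column}, both 1-based, for a character offset. */
    static int[] rowColumn(String source, int pos) {
        int row = 1;
        int lineStart = 0;
        for (int i = 0; i < pos; i++) {
            if (source.charAt(i) == '\n') {
                row++;
                lineStart = i + 1;
            }
        }
        return new int[] { row, pos - lineStart + 1 };
    }

    /** Replaces one span; everything outside it is copied unchanged. */
    static String replace(String source, Span span, String replacement) {
        return source.substring(0, span.begin) + replacement + source.substring(span.end);
    }

    public static void main(String[] args) {
        // The <p> and <b> elements are never closed; the sketch does not care.
        String html = "<p>Broken <b>markup\n<a href=\"old.html\">link</a>";
        Span a = findStartTags(html, "a").get(0);
        int[] rc = rowColumn(html, a.begin);
        System.out.println("row " + rc[0] + ", column " + rc[1]); // row 2, column 1
        System.out.println(replace(html, a, "<a href=\"new.html\">"));
    }
}
```

Because the edit is a splice on the original text rather than a re-serialization of a tree, the unclosed `<p>` and `<b>` tags in the sample come out exactly as they went in.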

Related Software
- TEXT to HTML Parser - Converts styled text into HTML/CSS using Tex-Edit Plus
- A free and open source Verilog 2005 synthesizable subset parser
- HTML & PHP text editor
- HTML Face - Small, fast HTML editor
- Convert documents into web-ready HTML format
- Remove HTML E - Translate HTML coded message into plain text
- Unicode HTML Generator - Generate static web pages that contain Unicode characters
- Strip HTML - Strip HTML from one or more messages