HTML Parser 1.6

Sponsored Links

HTML Parser 1.6 Ranking & Summary

RankingClick at the star to rank
Ranking Level
User Review: 10 (1 times)
File size: 4.14 MB
Platform: Windows All
License: GPL
Price:
Downloads: 586
Date added: 2007-08-24
Publisher: Derrick Oswald

HTML Parser 1.6 description

A Java library used to parse HTML in either a linear or nested fashion Primarily used for transformation or extraction, it features filters, custom tags, visitors, and easy to use JavaBeans. HTML Parser is a robust, fast, and well tested package.
Welcome to the homepage of HTMLParser - a super-fast real-time parser for real-world HTML. What has attracted most developers to HTMLParser has been its simplicity in design, speed and ability to handle streaming real-world html.
The two fundamental use-cases that are handled by the parser are extraction and transformation (the syntheses use-case, where HTML pages are created from scratch, is better handled by other tools closer to the source of data).
In general, to use the HTMLParser you will need to be able to write code in the Java programming language. Although some example programs are provided that may be useful as they stand, its more than likely you will need (or want) to create your own programs or modify the ones provided to match your intended application.
To use the library, you will need to add either the htmllexer.jar or htmlparser.jar to your classpath when compiling and running. The htmllexer.jar provides low level access to generic string, remark and tag nodes on the page in a linear, flat, sequential manner.
The htmlparser.jar, which includes the classes found in htmllexer.jar, provides access to a page as a sequence of nested differentiated tags containing string, remark and other tag nodes.
Extraction
Extraction encompasses all the information retrieval programs that are not meant to preserve the source page.
This covers uses like:
- text extraction, for use as input for text search engine databases for example
- link extraction, for crawling through web pages or harvesting email addresses
- screen scraping, for programmatic data input from web pages
- resource extraction, collecting images or sound
- a browser front end, the preliminary stage of page display
- link checking, ensuring links are valid
- site monitoring, checking for page differences beyond simplistic diffs
There are several facilities in the HTMLParser codebase to help with extraction, including filters, visitors and JavaBeans.
Transformation
Transformation includes all processing where the input and the output are HTML pages.
Some examples are:
- URL rewriting, modifying some or all links on a page
- site capture, moving content from the web to local disk
- censorship, removing offending words and phrases from pages
- HTML cleanup, correcting erroneous pages
- ad removal, excising URLs referencing advertising
- conversion to XML, moving existing web pages to XML
During or after reading in a page, operations on the nodes can accomplish many transformation tasks "in place", which can then be output with the toHtml() method. Depending on the purpose of your application, you will probably want to look into node decorators, visitors, or custom tags in conjunction with the PrototypicalNodeFactory.

HTML Parser 1.6 Screenshot

Advertisements

HTML Parser 1.6 Keywords

Bookmark HTML Parser 1.6

Hyperlink code:
Link for forum:

HTML Parser 1.6 Copyright

WareSeeker periodically updates pricing and software information of HTML Parser 1.6 full version from the publisher, so some information may be slightly out-of-date. You should confirm all information before relying on it. Software piracy is theft, Using crack, password, serial numbers, registration codes, key generators is illegal and prevent future development of HTML Parser 1.6 Edition. Download links are directly from our publisher sites, torrent files or links from rapidshare.com, yousendit.com or megaupload.com are not allowed

Allok Video Splitter 2.2.0 Review:

Name (Required)
Email(Required)
Captcha
Featured Software

Want to place your software product here?
Please contact us for consideration.

Contact WareSeeker.com
Related Software
Will read and analyze HTML files Free Download
HTML parse unit allows you to parse HTML code, extract HTML elements and NAME=Value pairs Free Download
HTML Analyser is a powerful IE plug-in that analyses HTML files loaded in IE, providing richer information about the pages, and shows you what IE does to the code. It provides information about the co Free Download
A simple but powerful java HTML parser library allowing analysis and manipulation of parts of an HTML document Free Download
Library for parsing and manipulating real world malformed HTML Free Download
A package to deal with the HTML code. Currently contains advanced HTML parser Free Download
eConn Virtcert Parser. - Download Virtcert.com inventory into CSV file at your desktop. It can extract the data and store in database. It takes inputs in form of doc, docx, html, rtf, and text format Free Download
Takes naked HTML and parses it into a VBScript Response. Version 2 adds the ability to retain your HTML formatting or compress it into a single line for smaller page sizes. Also added install and unin Free Download