Another library for the common problem of parsing arbitrary text data.
A small, easy-to-use Java library
LR parser without lookahead limitation.
Semantic actions are defined in classes, not in the grammar definition file.
Piccolo is the fastest SAX parser for Java, supporting SAX1, SAX2, and JAXP (SAX only). Piccolo is different from other parsers in that it was developed using parser generators. It weighs 160K including XML APIs. See http://piccolo.sf.net for more...
MozillaParser is a Java HTML parser based on Mozilla's HTML parser. It acts as a bridge from Java classes to Mozilla's classes and outputs a Java Document object from raw (and dirty) HTML input.
The Java Sitemap Parser can parse a website's Sitemap (http://www.sitemaps.org/). This is useful for web crawlers that want to discover URLs from a website that is using the Sitemap Protocol.
Java Micro XML Parser provides an XML writer and a pull/DOM-like XML parser. The small 21 KB jar is easily embedded and requires only J2ME. Very clean and simple API.
Small footprint, fast Java XML parser. Parses to an easily manipulable nested class structure which can be converted back to formatted or unformatted XML with a single call. Uses and creates straight XML, no DTDs necessary or used.
SMIng parser and API in Java for the SMIng proposal, available at http://www.ietf.org/html.charters/sming-charter.html
JEPLite is a lightweight (re)implementation of the Java Expression Parser (jep.sourceforge.net). The intention is to strip some of its less frequently used features, thereby speeding up the rest. Includes an expression optimizer.
A fully-featured, extensible Java command-line parser conforming to the POSIX/GNU command-line utility recommendations. Its carefully designed API allows parsing of complex command-lines with just a few lines of code.
The FFPOJO Project is a POJO-based flat-file parser library for Java applications. IMPORTANT: the source code of this project has been migrated to GitHub: https://github.com/gibaholms/ffpojo
eBook parser library: a Java library designed for parsing many electronic book formats.
Jericho HTML Parser is a Java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
Simple Java delimited and fixed-width file parser. Handles CSV, Excel CSV, tab, and pipe delimiters, to name a few. Maps column positions in the file to user-friendly names via XML. See "FlatPack Feature List" under News for complete...
A project to make a simple but complete BibTeX parser using regexes in Java. It is meant as a library for a larger program, and so is not interactive. Testers, contributors, comments and suggestions are more than welcome.
Matra is a Java-based XML DTD parser utility. It allows you to parse a DTD, generate a DTD tree for it, and merge the DTD and its included modules into a single DTD. http://matra.sourceforge.net
The LALR parser generator (LPG) is a tool for developing scanners and parsers written in Java, C++ or C. Input is specified by BNF rules. LPG supports backtracking (to resolve ambiguity), automatic AST generation and grammar inheritance.
This project aims to provide a Java bytecode compiler for the fairly new programming language WRL and for others such as PHP4 and Postfix, based on research on the SableCC parser generator and common W3C standards.
BNF for Java is a BNF Compiler-Compiler, or Parser-Generator. It implements ISO Standard Backus-Naur Form, using Java. BNF allows you to create a syntax, or a complete language, to parse your data source. Your custom Java extensions generate...
A tool that transcribes Parsing Expressions into a parser written in Java. Facilities for specifying semantics (also in Java) are an integral part of the tool. The tool does not generate a "packrat parser", but a set of transparent...
Native Java parser framework, no (pre-)compilers needed. Closely follows BNF syntax. Optimized for performance and low overhead. Highly flexible and extensible through a streamlined API. Integrates semantics, tracing and even context-sensitive parsers.
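
For readers comparing the XML entries above, here is a minimal sketch of what SAX-style parsing through JAXP looks like, applied to a Sitemap Protocol document. It uses only the JDK's built-in JAXP/SAX classes and is not the API of Piccolo, the Java Sitemap Parser, or any other project listed here. The class name SitemapSketch, the method extractUrls, and the example.com URLs are hypothetical names chosen for illustration.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.SAXParser;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical illustration class; not part of any library listed above.
class SitemapSketch {

    // Collects the text of every <loc> element in a sitemap document.
    public static List<String> extractUrls(String sitemapXml) throws Exception {
        SAXParserFactory factory = SAXParserFactory.newInstance();
        factory.setNamespaceAware(true); // so localName is "loc" regardless of prefix
        SAXParser parser = factory.newSAXParser();

        List<String> urls = new ArrayList<>();
        parser.parse(new InputSource(new StringReader(sitemapXml)), new DefaultHandler() {
            private StringBuilder text; // non-null only while inside a <loc> element

            @Override
            public void startElement(String uri, String localName, String qName, Attributes atts) {
                if ("loc".equals(localName)) {
                    text = new StringBuilder();
                }
            }

            @Override
            public void characters(char[] ch, int start, int length) {
                // SAX may deliver element text in several chunks, so accumulate it.
                if (text != null) {
                    text.append(ch, start, length);
                }
            }

            @Override
            public void endElement(String uri, String localName, String qName) {
                if ("loc".equals(localName) && text != null) {
                    urls.add(text.toString().trim());
                    text = null;
                }
            }
        });
        return urls;
    }

    public static void main(String[] args) throws Exception {
        String xml =
            "<urlset xmlns=\"http://www.sitemaps.org/schemas/sitemap/0.9\">"
          + "<url><loc>https://example.com/</loc></url>"
          + "<url><loc>https://example.com/about</loc></url>"
          + "</urlset>";
        System.out.println(extractUrls(xml));
    }
}
```

Because the sketch goes through SAXParserFactory, a JAXP-compliant SAX parser could in principle be swapped in without changing the handler code; how a particular parser registers itself with JAXP is specific to that project and is not shown here.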