Validating parsers

The CPUinfo reports that it tests at 232.65 Bogomips. The C compiler is GCC, the one that came with Red Hat 5.2.

May 5, 1999
Clark Cooper

This article compares the performance of six implementations of a single program that processes XML.


So you're welcome to download what I've got and try it out for yourself.

Also, since I'm more experienced in Perl and C than Java and Python, gurus of those two languages may want to comb through the implementations written in them, checking for newbie mistakes.

The reason is that XML software should be small, fast, and compatible.

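HTML browsers are allowed to display HTML documents with errors (like missing end tags). An XML parser, by contrast, must reject a document that is not well formed. A "well formed" XML document is not the same as a "valid" XML document: a valid document must be well formed and, in addition, it must conform to a document type definition (DTD).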
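To make the distinction concrete, here is a minimal sketch in Python using the standard library's expat binding, which is a non-validating parser. The helper name is_well_formed is made up for this illustration and is not taken from the benchmark code. It accepts any well-formed document, including one that breaks the rules of its own DTD, and rejects a document with a missing end tag.

```python
import xml.parsers.expat


def is_well_formed(text):
    """Return True if text parses as well-formed XML, False otherwise.

    Hypothetical helper for illustration. expat checks well-formedness
    only; it does not validate the document against its DTD.
    """
    parser = xml.parsers.expat.ParserCreate()
    try:
        parser.Parse(text, True)  # True marks this as the final chunk
    except xml.parsers.expat.ExpatError:
        return False
    return True


if __name__ == "__main__":
    # Every start tag has a matching end tag.
    print(is_well_formed("<doc><p>one</p></doc>"))    # True
    # Missing end tags, HTML style.
    print(is_well_formed("<doc><p>one<p>two</doc>"))  # False
    # Well formed but not valid: the DTD allows only <a> inside <doc>.
    invalid = ("<!DOCTYPE doc [<!ELEMENT doc (a)><!ELEMENT a EMPTY>]>"
               "<doc><b/></doc>")
    print(is_well_formed(invalid))                    # True
```

Catching the third case requires a validating parser, which checks the document's content against the declarations in the DTD as well.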

This section describes in detail how I went about testing parser performance in C, Java, Perl, and Python on a Linux system.

If you are so inclined, run the test yourself on your own system.

I described this program in my September article on this site.

It produces a top-down statistical report on the elements in an XML document: the number of occurrences; a breakdown of the number of children, parents, and attributes; and character count.
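The kind of bookkeeping such a report needs can be sketched as follows. This is not the benchmarked program itself: the names ElementStats and collect_stats, and the choice of counters, are assumptions made for illustration, and the sketch collects the numbers without formatting the report. It uses Python's standard expat binding and keeps a stack of open elements so that each element can be credited to its parent.

```python
import xml.parsers.expat
from collections import Counter, defaultdict


class ElementStats:
    """Per-element-name tallies (hypothetical structure for illustration)."""

    def __init__(self):
        self.count = 0               # number of occurrences
        self.chars = 0               # characters directly inside the element
        self.children = Counter()    # child element name -> occurrences
        self.parents = Counter()     # parent element name -> occurrences
        self.attributes = Counter()  # attribute name -> occurrences


def collect_stats(text):
    """Parse text and return a dict mapping element name to ElementStats."""
    stats = defaultdict(ElementStats)
    stack = []  # names of the currently open elements, outermost first

    def start(name, attrs):
        info = stats[name]
        info.count += 1
        info.attributes.update(attrs.keys())
        if stack:
            parent = stack[-1]
            info.parents[parent] += 1
            stats[parent].children[name] += 1
        stack.append(name)

    def end(name):
        stack.pop()

    def chars(data):
        # expat may deliver one run of text in several pieces; summing
        # the lengths still gives the right total.
        if stack:
            stats[stack[-1]].chars += len(data)

    parser = xml.parsers.expat.ParserCreate()
    parser.StartElementHandler = start
    parser.EndElementHandler = end
    parser.CharacterDataHandler = chars
    parser.Parse(text, True)
    return stats


if __name__ == "__main__":
    result = collect_stats('<doc><p id="1">hello</p><p>hi</p></doc>')
    for name, info in result.items():
        print(name, info.count, info.chars, dict(info.children),
              dict(info.parents), dict(info.attributes))
```

Note that whitespace between elements counts as character data of the enclosing element, so an indented document will report nonzero character counts for container elements.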

When I say that a single program was implemented six times, I mean that each implementation produces (or should produce) exactly the same output for a given input document.

