interface.html revision 43d3f61ad5c142c8c17e45c8c954432916ffceab
1<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/1999/REC-html401-19991224/loose.dtd"> 2<html> 3<head> 4<meta content="text/html; charset=ISO-8859-1" http-equiv="Content-Type"> 5<style type="text/css"><!-- 6TD {font-size: 10pt; font-family: Verdana,Arial,Helvetica} 7BODY {font-size: 10pt; font-family: Verdana,Arial,Helvetica; margin-top: 5pt; margin-left: 0pt; margin-right: 0pt} 8H1 {font-size: 16pt; font-family: Verdana,Arial,Helvetica} 9H2 {font-size: 14pt; font-family: Verdana,Arial,Helvetica} 10H3 {font-size: 12pt; font-family: Verdana,Arial,Helvetica} 11A:link, A:visited, A:active { text-decoration: underline } 12--></style> 13<title>The SAX interface</title> 14</head> 15<body bgcolor="#8b7765" text="#000000" link="#000000" vlink="#000000"> 16<table border="0" width="100%" cellpadding="5" cellspacing="0" align="center"><tr> 17<td width="180"> 18<a href="http://www.gnome.org/"><img src="smallfootonly.gif" alt="Gnome Logo"></a><a href="http://www.w3.org/Status"><img src="w3c.png" alt="W3C Logo"></a><a href="http://www.redhat.com/"><img src="redhat.gif" alt="Red Hat Logo"></a> 19</td> 20<td><table border="0" width="90%" cellpadding="2" cellspacing="0" align="center" bgcolor="#000000"><tr><td><table width="100%" border="0" cellspacing="1" cellpadding="3" bgcolor="#fffacd"><tr><td align="center"> 21<h1>The XML C library for Gnome</h1> 22<h2>The SAX interface</h2> 23</td></tr></table></td></tr></table></td> 24</tr></table> 25<table border="0" cellpadding="4" cellspacing="0" width="100%" align="center"><tr><td bgcolor="#8b7765"><table border="0" cellspacing="0" cellpadding="2" width="100%"><tr> 26<td valign="top" width="200" bgcolor="#8b7765"><table border="0" cellspacing="0" cellpadding="1" width="100%" bgcolor="#000000"><tr><td> 27<table width="100%" border="0" cellspacing="1" cellpadding="3"> 28<tr><td colspan="1" bgcolor="#eecfa1" align="center"><center><b>Main Menu</b></center></td></tr> 29<tr><td bgcolor="#fffacd"><ul style="margin-left: -2pt"> 30<li><a href="index.html">Home</a></li> 31<li><a href="intro.html">Introduction</a></li> 32<li><a href="FAQ.html">FAQ</a></li> 33<li><a href="docs.html">Documentation</a></li> 34<li><a href="bugs.html">Reporting bugs and getting help</a></li> 35<li><a href="help.html">How to help</a></li> 36<li><a href="downloads.html">Downloads</a></li> 37<li><a href="news.html">News</a></li> 38<li><a href="XML.html">XML</a></li> 39<li><a href="XSLT.html">XSLT</a></li> 40<li><a href="architecture.html">libxml architecture</a></li> 41<li><a href="tree.html">The tree output</a></li> 42<li><a href="interface.html">The SAX interface</a></li> 43<li><a href="xmldtd.html">Validation & DTDs</a></li> 44<li><a href="xmlmem.html">Memory Management</a></li> 45<li><a href="encoding.html">Encodings support</a></li> 46<li><a href="xmlio.html">I/O Interfaces</a></li> 47<li><a href="catalog.html">Catalog support</a></li> 48<li><a href="library.html">The parser interfaces</a></li> 49<li><a href="entities.html">Entities or no entities</a></li> 50<li><a href="namespaces.html">Namespaces</a></li> 51<li><a href="upgrade.html">Upgrading 1.x code</a></li> 52<li><a href="threads.html">Thread safety</a></li> 53<li><a href="DOM.html">DOM Principles</a></li> 54<li><a href="example.html">A real example</a></li> 55<li><a href="contribs.html">Contributions</a></li> 56<li> 57<a href="xml.html">flat page</a>, <a href="site.xsl">stylesheet</a> 58</li> 59</ul></td></tr> 60</table> 61<table width="100%" border="0" cellspacing="1" cellpadding="3"> 62<tr><td colspan="1" bgcolor="#eecfa1" align="center"><center><b>Related links</b></center></td></tr> 63<tr><td bgcolor="#fffacd"><ul style="margin-left: -2pt"> 64<li><a href="http://mail.gnome.org/archives/xml/">Mail archive</a></li> 65<li><a href="http://xmlsoft.org/XSLT/">XSLT libxslt</a></li> 66<li><a href="http://www.cs.unibo.it/~casarini/gdome2/">DOM gdome2</a></li> 67<li><a href="ftp://xmlsoft.org/">FTP</a></li> 68<li><a href="http://www.fh-frankfurt.de/~igor/projects/libxml/">Windows binaries</a></li> 69<li><a href="http://pages.eidosnet.co.uk/~garypen/libxml/">Solaris binaries</a></li> 70<li><a href="http://bugzilla.gnome.org/buglist.cgi?product=libxml">Bug Tracker</a></li> 71</ul></td></tr> 72</table> 73</td></tr></table></td> 74<td valign="top" bgcolor="#8b7765"><table border="0" cellspacing="0" cellpadding="1" width="100%"><tr><td><table border="0" cellspacing="0" cellpadding="1" width="100%" bgcolor="#000000"><tr><td><table border="0" cellpadding="3" cellspacing="1" width="100%"><tr><td bgcolor="#fffacd"> 75<p>Sometimes the DOM tree output is just too large to fit reasonably into 76memory. In that case (and if you don't expect to save back the XML document 77loaded using libxml), it's better to use the SAX interface of libxml. SAX is 78a <strong>callback-based interface</strong> to the parser. Before parsing, 79the application layer registers a customized set of callbacks which are 80called by the library as it progresses through the XML input.</p> 81<p>To get more detailed step-by-step guidance on using the SAX interface of 82libxml, see the <a href="http://www.daa.com.au/~james/gnome/xml-sax/xml-sax.html">nice 83documentation</a>.written by <a href="mailto:james@daa.com.au">James 84Henstridge</a>.</p> 85<p>You can debug the SAX behaviour by using the <strong>testSAX</strong> 86program located in the gnome-xml module (it's usually not shipped in the 87binary packages of libxml, but you can find it in the tar source 88distribution). Here is the sequence of callbacks that would be reported by 89testSAX when parsing the example XML document shown earlier:</p> 90<pre>SAX.setDocumentLocator() 91SAX.startDocument() 92SAX.getEntity(amp) 93SAX.startElement(EXAMPLE, prop1='gnome is great', prop2='&amp; linux too') 94SAX.characters( , 3) 95SAX.startElement(head) 96SAX.characters( , 4) 97SAX.startElement(title) 98SAX.characters(Welcome to Gnome, 16) 99SAX.endElement(title) 100SAX.characters( , 3) 101SAX.endElement(head) 102SAX.characters( , 3) 103SAX.startElement(chapter) 104SAX.characters( , 4) 105SAX.startElement(title) 106SAX.characters(The Linux adventure, 19) 107SAX.endElement(title) 108SAX.characters( , 4) 109SAX.startElement(p) 110SAX.characters(bla bla bla ..., 15) 111SAX.endElement(p) 112SAX.characters( , 4) 113SAX.startElement(image, href='linus.gif') 114SAX.endElement(image) 115SAX.characters( , 4) 116SAX.startElement(p) 117SAX.characters(..., 3) 118SAX.endElement(p) 119SAX.characters( , 3) 120SAX.endElement(chapter) 121SAX.characters( , 1) 122SAX.endElement(EXAMPLE) 123SAX.endDocument()</pre> 124<p>Most of the other interfaces of libxml are based on the DOM tree-building 125facility, so nearly everything up to the end of this document presupposes the 126use of the standard DOM tree build. Note that the DOM tree itself is built by 127a set of registered default callbacks, without internal specific 128interface.</p> 129<p><a href="mailto:daniel@veillard.com">Daniel Veillard</a></p> 130</td></tr></table></td></tr></table></td></tr></table></td> 131</tr></table></td></tr></table> 132</body> 133</html> 134