The built-in Expat parser makes it possible to process XML documents in PHP.
What is XML?
XML is used to describe data and to focus on what data is. An XML file describes the structure of the data.
In XML, no tags are predefined. You must define your own tags.
If you want to learn more about XML, please visit our XML tutorial.
What is Expat?
To read and update – create and manipulate – an XML document, you will need an XML parser.
There are two basic types of XML parsers:
- Tree-based parser: This parser transforms an XML document into a tree structure. It analyzes the whole document, and provides access to the tree elements. e.g. the Document Object Model (DOM)
- Event-based parser: Views an XML document as a series of events. When a specific event occurs, it calls a function to handle it
The Expat parser is an event-based parser.
Event-based parsers focus on the content of the XML documents, not their structure. Because of this, event-based parsers can access data faster than tree-based parsers.
Look at the following XML fraction:
An event-based parser reports the XML above as a series of three events:
- Start element: from
- Start CDATA section, value: Jani
- Close element: from
The XML example above contains well-formed XML. However, the example is not valid XML, because there is no Document Type Definition (DTD) associated with it.
However, this makes no difference when using the Expat parser. Expat is a non-validating parser, and ignores any DTDs.
As an event-based, non-validating XML parser, Expat is fast and small, and a perfect match for PHP web applications.
Note: XML documents must be well-formed or Expat will generate an error.
The XML Expat parser functions are part of the PHP core. There is no installation needed to use these functions.
An XML File
The XML file below will be used in our example:
|<?xml version=”1.0″ encoding=”ISO-8859-1″?>|
<body>Don’t forget me this weekend!</body>
Initializing the XML Parser
We want to initialize the XML parser in PHP, define some handlers for different XML events, and then parse the XML file.
//Initialize the XML parser
//Function to use at the start of an element
//Function to use at the end of an element
//Function to use when finding character data
//Specify element handler
//Specify data handler
//Open XML file
//Free the XML parser
The output of the code above will be:
|— Note —|
Message: Don’t forget me this weekend!
How it works:
- Initialize the XML parser with the xml_parser_create() function
- Create functions to use with the different event handlers
- Add the xml_set_element_handler() function to specify which function will be executed when the parser encounters the opening and closing tags
- Add the xml_set_character_data_handler() function to specify which function will execute when the parser encounters character data
- Parse the file “test.xml” with the xml_parse() function
- In case of an error, add xml_error_string() function to convert an XML error to a textual description
- Call the xml_parser_free() function to release the memory allocated with the xml_parser_create() function
More PHP Expat Parser
For more information about the PHP Expat functions, visit our PHP XML Parser Reference.