WordProcessingML

From Just Solve the File Format Problem
Revision as of 01:09, 19 April 2014 by Dan Tobias (Talk | contribs)

Jump to: navigation, search
File Format
Name WordProcessingML
Ontology
Extension(s) .xml

WordProcessingML or Word 2003 XML Document is an XML-based format which was introduced in Microsoft Office 2003 as one of the formats which could be chosen in the "Save As" feature to save Word documents, though not the default format (which was DOC, a proprietary binary format). This is a different format from the DOCX format introduced in Office 2007, which consists of a ZIP archive of various files including XML. In contrast, WordProcessingML is a single XML file, uncompressed, and is unable to store all features which can be present in an Word document. The 2007 versions are still capable of loading and saving WordProcessingML, even if a different XML-based format is the default format.

Contents

Identification

A WordProcessingML file has the following header:

<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<?mso-application progid="Word.Document"?>
<w:wordDocument
  xmlns:w="http://schemas.microsoft.com/office/word/2003/wordml"
  xmlns:wx="http://schemas.microsoft.com/office/word/2003/auxHint"
  xmlns:o="urn:schemas-microsoft-com:office:office"
  w:macrosPresent="no"
  w:embeddedObjPresent="no"
  w:ocxPresent="no"
  xml:space="preserve">

(which can vary somewhat depending on what version of Word it is saved from; from Word 2007 it lacks line breaks and uses different namespace URIs: http://schemas.microsoft.com/aml/2001/core, uuid:C2F41010-65B3-11d1-A29F-00AA00C14882, http://schemas.openxmlformats.org/markup-compatibility/2006)

Related formats

Sample files

Links

Personal tools
Namespaces

Variants
Actions
Navigation
Toolbox