WordProcessingML

From Just Solve the File Format Problem
(Difference between revisions)
Jump to: navigation, search
(Sample file)
(Identification)
Line 21: Line 21:
 
   xml:space="preserve">
 
   xml:space="preserve">
  
(which can vary somewhat depending on what version of Word it is saved from; from Word 2007 it lacks line breaks and uses different namespace URIs: <code>http://schemas.microsoft.com/aml/2001/core</code>, <code>uuid:C2F41010-65B3-11d1-A29F-00AA00C14882</code>, <code>http://schemas.openxmlformats.org/markup-compatibility/2006</code>)
+
(which can vary somewhat depending on what version of Word it is saved from; from Word 2007 it lacks line breaks and uses different namespace URIs, varying depending on whether you save it in 2003 or 2006 format: 2003 format has <code>http://schemas.microsoft.com/aml/2001/core</code>, <code>uuid:C2F41010-65B3-11d1-A29F-00AA00C14882</code>, <code>http://schemas.openxmlformats.org/markup-compatibility/2006</code>; 2006 format has <code>http://schemas.microsoft.com/office/2006/xmlPackage</code>, <code>http://schemas.openxmlformats.org/package/2006/relationships</code>, <code>http://schemas.microsoft.com/office/word/2006/wordml</code>)
  
 
== Related formats ==
 
== Related formats ==

Revision as of 01:15, 19 April 2014

File Format
Name WordProcessingML
Ontology
Extension(s) .xml

WordProcessingML or Word 2003 XML Document is an XML-based format which was introduced in Microsoft Office 2003 as one of the formats which could be chosen in the "Save As" feature to save Word documents, though not the default format (which was DOC, a proprietary binary format). This is a different format from the DOCX format introduced in Office 2007, which consists of a ZIP archive of various files including XML. In contrast, WordProcessingML is a single XML file, uncompressed, and is unable to store all features which can be present in an Word document. The 2007 versions are still capable of loading and saving WordProcessingML, even if a different XML-based format is the default format.

Contents

Identification

A WordProcessingML file has the following header:

<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<?mso-application progid="Word.Document"?>
<w:wordDocument
  xmlns:w="http://schemas.microsoft.com/office/word/2003/wordml"
  xmlns:wx="http://schemas.microsoft.com/office/word/2003/auxHint"
  xmlns:o="urn:schemas-microsoft-com:office:office"
  w:macrosPresent="no"
  w:embeddedObjPresent="no"
  w:ocxPresent="no"
  xml:space="preserve">

(which can vary somewhat depending on what version of Word it is saved from; from Word 2007 it lacks line breaks and uses different namespace URIs, varying depending on whether you save it in 2003 or 2006 format: 2003 format has http://schemas.microsoft.com/aml/2001/core, uuid:C2F41010-65B3-11d1-A29F-00AA00C14882, http://schemas.openxmlformats.org/markup-compatibility/2006; 2006 format has http://schemas.microsoft.com/office/2006/xmlPackage, http://schemas.openxmlformats.org/package/2006/relationships, http://schemas.microsoft.com/office/word/2006/wordml)

Related formats

Sample files

Links

Personal tools
Namespaces

Variants
Actions
Navigation
Toolbox