WordProcessingML

From Just Solve the File Format Problem
|formattype=electronic
|subcat=Document
|subcat2=Word Processor
|subcat3=Microsoft Word
|extensions={{ext|xml}}
}}
== Sample files ==
* [https://www.dan.info/sampledata/Word_2003_XML_Document.xml Sample WordProcessingML document (saved from Word 2007)]
* [https://www.dan.info/sampledata/msword/testing.xml Document saved from Windows Word 2007 in XML 2006 format]
* [https://www.dan.info/sampledata/msword/testing-2003.xml Document saved from Windows Word 2007 in XML 2003 format]
  
 

Latest revision as of 00:06, 12 February 2020

File Format
Name WordProcessingML
Ontology
Extension(s) .xml

WordProcessingML, or Word 2003 XML Document, is an XML-based format introduced in Microsoft Office 2003. It was one of the formats offered by the "Save As" feature for Word documents, though not the default (which was DOC, a proprietary binary format). It is a different format from DOCX, introduced in Office 2007, which consists of a ZIP archive of various files including XML. In contrast, WordProcessingML is a single uncompressed XML file, and it cannot store every feature that can be present in a Word document. The 2007 version of Word can still load and save WordProcessingML, even though a different XML-based format is its default. (Its "Save As" feature offers two different XML formats: a "2003" version and an undated one that is based on a 2006 schema.)

The term "WordProcessingML" has also sometimes been [http://officeopenxml.com/anatomyofOOXML.php used] to describe the newer DOCX format.

== Identification ==

A WordProcessingML file has the following header:

<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<?mso-application progid="Word.Document"?>
<w:wordDocument
  xmlns:w="http://schemas.microsoft.com/office/word/2003/wordml"
  xmlns:wx="http://schemas.microsoft.com/office/word/2003/auxHint"
  xmlns:o="urn:schemas-microsoft-com:office:office"
  w:macrosPresent="no"
  w:embeddedObjPresent="no"
  w:ocxPresent="no"
  xml:space="preserve">

The header varies somewhat depending on which version of Word saved the file. When saved from Word 2007, it lacks line breaks and uses different namespace URIs, which in turn depend on whether the file is saved in the 2003 or the 2006 format:

* 2003 format: http://schemas.microsoft.com/aml/2001/core, uuid:C2F41010-65B3-11d1-A29F-00AA00C14882, http://schemas.openxmlformats.org/markup-compatibility/2006
* 2006 format: http://schemas.microsoft.com/office/2006/xmlPackage, http://schemas.openxmlformats.org/package/2006/relationships, http://schemas.microsoft.com/office/word/2006/wordml
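The header check described above could be automated roughly as follows. This is only a heuristic sketch, not an official detection method: the function name `sniff_word_xml` and the return labels are made up for illustration, and it assumes that both flavors carry the `mso-application` processing instruction, that a 2003-format file still declares the 2003 `wordml` namespace, and that a 2006-format file declares the `xmlPackage` namespace.

```python
import re
from typing import Optional

# Namespace URIs taken from the header description in this article.
WORDML_2003_NS = "http://schemas.microsoft.com/office/word/2003/wordml"
XML_PACKAGE_2006_NS = "http://schemas.microsoft.com/office/2006/xmlPackage"

# Processing instruction that marks the file as a Word document.
PROGID_RE = re.compile(r"<\?mso-application\s+progid=[\"']Word\.Document[\"']\s*\?>")


def sniff_word_xml(head: str) -> Optional[str]:
    """Guess the Word XML flavor from the first few KB of a file.

    Returns "2003", "2006", "unknown" (Word PI found, but no known
    namespace), or None if the Word processing instruction is absent.
    """
    if not PROGID_RE.search(head):
        return None
    # Plain substring checks, so the header may or may not contain line breaks.
    if WORDML_2003_NS in head:
        return "2003"
    if XML_PACKAGE_2006_NS in head:
        return "2006"
    return "unknown"
```

In practice one would read only the start of the file, for example `sniff_word_xml(open(path, encoding="utf-8").read(4096))`, since the root element and its namespace declarations appear near the top.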

