Warning: Unknown: Unable to allocate memory for pool. in Unknown on line 0

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/index.php on line 54

Warning: Cannot modify header information - headers already sent in /usr/local/www/mediawiki/includes/WebStart.php on line 63

Warning: require_once(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/WebStart.php on line 94

Warning: require_once(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/WebStart.php on line 97

Warning: require_once(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/WebStart.php on line 100

Warning: require_once(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/WebStart.php on line 103

Warning: require_once(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/Defines.php on line 187

Warning: require_once(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/WebStart.php on line 115

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require_once(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/WebStart.php on line 134

Warning: require_once(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/LocalSettings.php on line 137

Warning: require_once(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/LocalSettings.php on line 139

Warning: require_once(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/LocalSettings.php on line 144

Warning: require_once(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/LocalSettings.php on line 145

Warning: require_once(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/LocalSettings.php on line 153

Warning: require_once(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/WebStart.php on line 150

Warning: require_once(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/WebStart.php on line 157

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require_once(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/Setup.php on line 381

Warning: require_once(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/Setup.php on line 382

Warning: require_once(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/Setup.php on line 383

Warning: require_once(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/Setup.php on line 384

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require_once(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/languages/Language.php on line 20

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007

Warning: Cannot modify header information - headers already sent in /usr/local/www/mediawiki/includes/WebResponse.php on line 38

Warning: Cannot modify header information - headers already sent in /usr/local/www/mediawiki/includes/WebResponse.php on line 38
DOCX - Just Solve the File Format Problem

DOCX

From Just Solve the File Format Problem
(Difference between revisions)
Jump to: navigation, search

Warning: require(): Unable to allocate memory for pool. in /usr/local/www/mediawiki/includes/AutoLoader.php on line 1007
(Clarified a few statements.)
m (Change telparia.com samples link to template)
 
(37 intermediate revisions by 9 users not shown)
Line 1: Line 1:
{|
+
{{FormatInfo
|[[File Formats]]
+
|formattype=electronic
| >
+
|subcat=Document
|[[Electronic File Formats]]
+
|subcat2=Word Processor
| >
+
|subcat3=Microsoft Word
|[[Document]]
+
|extensions={{ext|docx}}
| >
+
|mimetypes={{mimetype|application/vnd.openxmlformats-officedocument.wordprocessingml.document}}
| DOCX
+
}}
|}
+
[[Office Open XML]] (OOXML) representation (.DOCX) is the default file format for documents created by Microsoft Word as of Word 2007.  The format is based on [[XML]] component files in a container based on the [[ZIP]] format.  It replaced the binary [[DOC]] format used in earlier Word versions, and comes in two flavours, 'strict' and 'transitional' (see below).
  
Office Open XML Presentation (.DOCX) is the default file format for documents used by Microsoft Word as of Word 2007.  
+
This format (or the XML components of it) has been referred to as [[WordProcessingML]], a name also used for the standalone XML files the earlier Word 2003 was able to generate.
 +
 
 +
Graphical inserted elements may be stored in the form of [[DrawingML]], embedded in the XML.
  
 
== History ==
 
== History ==
  
This (along with the other Office Open XML items [[PPTX]] and [[XLSX]]) was initially standardized as ECMA-376 in 2006. Three formats of this standard have been produced; the second version also corresponds to ISO/IEC 29500.
+
This (along with the other Office Open XML document types [[PPTX]] and [[XLSX]]) was initially standardized as ECMA-376 in 2006. Three versions of ECMA-376 have been produced; the second version corresponds to ISO/IEC 29500:2008, approved as an ISO/IEC standard in April 2008. Changes to the standard between 2008 and 2012 were primarily corrections based on individual defects reported as implementation of the standard proceeded and required to ensure functional interoperability with existing applications.  They do not introduce new functionality.
 +
 
 +
== Format ==
 +
 
 +
=== High-level structure ===
 +
Like the other "Open XML" formats, this file format actually consists of various files (mostly [[XML]]) compressed into a [[ZIP]] archive, with this fact obscured from the end user by the use of a different file extension.
 +
 
 +
=== Strict versus Transitional ===
 +
The OOXML standard actually defines two different format variations: 'strict' and 'transitional' OOXML. The transitional form is not fully specified within the standard documentation, as it is very closely bound to the specific behaviour of Microsoft Office and the older binary formats. The strict form is the fully standardised form, but Microsoft have been slow to fully support OOXML-Strict as the default format for Office documents, leading to interoperability problems. See [http://blog.gardeviance.org/2013/12/once-more-unto-breach-dear-friends-once.html this blog post for a more detailed look at the interoperability issues], and here [https://twitter.com/swardley/status/436463566410244097 for some context from 2014 concerning government support for open formats]. Some more commentary is [http://www.robweir.com/blog/2009/11/asking-right-questions-about-office.html here].
 +
 
 +
== Specs ==
 +
* [https://www.ecma-international.org/publications-and-standards/standards/ecma-376/ ECMA-376 specification ]
 +
* [http://standards.iso.org/ittf/PubliclyAvailableStandards/index.html ISO publicly available standards, including the latest ISO/IEC 29500 specification] (as of November 2012, this is ISO/IEC 29500:2012)
 +
* [http://www.digitalpreservation.gov/formats/fdd/fdd000395.shtml?loclr=blogsig OOXML Format Family -- ISO/IEC 29500 and ECMA 376 (Library of Congress)]
 +
* [http://www.digitalpreservation.gov/formats/fdd/fdd000397.shtml?loclr=blogsig DOCX Transitional (Office Open XML), Library of Congress]
 +
* [http://www.digitalpreservation.gov/formats/fdd/fdd000400.shtml?loclr=blogsig DOCX Strict (Office Open XML), Library of Congress]
 +
* [http://www.digitalpreservation.gov/formats/fdd/fdd000396.shtml?loclr=blogsig Markup Compatibility and Extensibility (Office Open XML), Library of Congress]
 +
 
 +
== Sample files ==
 +
* [https://www.dan.info/sampledata/msword/testing.docx Windows Word 2007 sample file]
 +
* {{DexvertSamples|document/docx}}
 +
 
 +
== Software ==
 +
* [http://johnmacfarlane.net/pandoc/ Pandoc: Document format conversion swiss-army knife]
 +
* [https://github.com/jkr/docx2pandoc docx2pandoc: translate DOCX to Pandoc output formats]
 +
* [http://textract.readthedocs.org/en/latest/ Textract: extract text from various document formats including DOCX]
 +
* [https://pypi.python.org/pypi/Python-OOXML/0.12 Python library for parsing Office Open XML files]
 +
* [https://keyj.emphy.de/projects/tools-scripts/ docx_unattach_template] - repair docx created with a template whose original file name is no longer valid
 +
 
 +
== Other links and references ==
 +
* [http://en.wikipedia.org/wiki/Office_Open_XML Office Open XML : Wikipedia]
 +
* [http://support.microsoft.com/kb/924074 How to open new file formats in earlier versions of Microsoft Office]
 +
* [{{ForensicsWikiURL|word_document_%28docx%29}} Forensics Wiki article]
 +
* [http://www.afr.com/p/technology/why_it_might_be_time_to_dump_word_lQriIFyzmyoStP3nijq8bO Why it might be time to dump Word]
 +
* [https://joinup.ec.europa.eu/elibrary/case/complex-singularity-versus-openness Complex singularity versus openness]
  
== References ==
+
[[Category:XML based file formats]]
* [http://www.ecma-international.org/publications/standards/Ecma-376.htm ECMA-376 specification ]
+
[[Category:ZIP based file formats]]
* [http://standards.iso.org/ittf/PubliclyAvailableStandards/index.html ISO/IEC 29500 specification]
+
[[Category:Microsoft]]

Latest revision as of 04:07, 28 December 2023

File Format
Name DOCX
Ontology
Extension(s) .docx
MIME Type(s) application/vnd.openxmlformats-officedocument.wordprocessingml.document

Office Open XML (OOXML) representation (.DOCX) is the default file format for documents created by Microsoft Word as of Word 2007. The format is based on XML component files in a container based on the ZIP format. It replaced the binary DOC format used in earlier Word versions, and comes in two flavours, 'strict' and 'transitional' (see below).

This format (or the XML components of it) has been referred to as WordProcessingML, a name also used for the standalone XML files the earlier Word 2003 was able to generate.

Graphical inserted elements may be stored in the form of DrawingML, embedded in the XML.

Contents

[edit] History

This (along with the other Office Open XML document types PPTX and XLSX) was initially standardized as ECMA-376 in 2006. Three versions of ECMA-376 have been produced; the second version corresponds to ISO/IEC 29500:2008, approved as an ISO/IEC standard in April 2008. Changes to the standard between 2008 and 2012 were primarily corrections based on individual defects reported as implementation of the standard proceeded and required to ensure functional interoperability with existing applications. They do not introduce new functionality.

[edit] Format

[edit] High-level structure

Like the other "Open XML" formats, this file format actually consists of various files (mostly XML) compressed into a ZIP archive, with this fact obscured from the end user by the use of a different file extension.

[edit] Strict versus Transitional

The OOXML standard actually defines two different format variations: 'strict' and 'transitional' OOXML. The transitional form is not fully specified within the standard documentation, as it is very closely bound to the specific behaviour of Microsoft Office and the older binary formats. The strict form is the fully standardised form, but Microsoft have been slow to fully support OOXML-Strict as the default format for Office documents, leading to interoperability problems. See this blog post for a more detailed look at the interoperability issues, and here for some context from 2014 concerning government support for open formats. Some more commentary is here.

[edit] Specs

[edit] Sample files

[edit] Software

[edit] Other links and references

Personal tools
Namespaces

Variants
Actions
Navigation
Toolbox

Warning: Cannot modify header information - headers already sent in /usr/local/www/mediawiki/includes/WebResponse.php on line 38

Warning: Cannot modify header information - headers already sent in /usr/local/www/mediawiki/includes/WebResponse.php on line 38

Warning: Cannot modify header information - headers already sent in /usr/local/www/mediawiki/includes/WebResponse.php on line 38