XLS
Dan Tobias (Talk | contribs) (→References) |
Dexvertbot (Talk | contribs) m (Change telparia.com samples link to template) |
||
(19 intermediate revisions by 7 users not shown) | |||
Line 2: | Line 2: | ||
|formattype=electronic | |formattype=electronic | ||
|subcat=Document | |subcat=Document | ||
+ | |subcat2=Spreadsheet | ||
|extensions={{ext|xls}} | |extensions={{ext|xls}} | ||
|mimetypes={{mimetype|application/vnd.ms-excel}} | |mimetypes={{mimetype|application/vnd.ms-excel}} | ||
}} | }} | ||
+ | [[XLS]] is a family of spreadsheet formats native to [[Microsoft Excel]]. | ||
− | + | ==Microsoft Office EXCEL 97-2007== | |
The Excel Binary File Format (.xls) Structure is the binary file format used by Microsoft Excel 97, 2000, 2002, and Office Excel 2003. It is also supported by Microsoft Office Excel 2007. | The Excel Binary File Format (.xls) Structure is the binary file format used by Microsoft Excel 97, 2000, 2002, and Office Excel 2003. It is also supported by Microsoft Office Excel 2007. | ||
+ | |||
+ | Later Excel versions use [[XLSX]] as their native format, though they still support the older format as well. | ||
+ | |||
+ | ===Versions=== | ||
+ | {| class="wikitable" | ||
+ | ! Version !! BIFF Version !! Document type !! PRONOM !!Year | ||
+ | |- | ||
+ | |Excel 2.x || BIFF2 || Worksheet || {{PRONOM|fmt/55}} || 1987 | ||
+ | |- | ||
+ | |Excel 3.0 || BIFF3 || Worksheet || {{PRONOM|fmt/56}} || 1990 | ||
+ | |- | ||
+ | |Excel 4.0 || BIFF4 || Worksheet || {{PRONOM|fmt/57}} || 1992 | ||
+ | |- | ||
+ | |Excel 5.0 || BIFF5 || Workbook || {{PRONOM|fmt/59}} || 1993 | ||
+ | |- | ||
+ | |Excel 7.0 || BIFF5 || Workbook || {{PRONOM|fmt/59}} || 1995 | ||
+ | |- | ||
+ | |Excel 8.0 || BIFF8 || Workbook || {{PRONOM|fmt/61}} || 1997 | ||
+ | |- | ||
+ | |Excel 9.0 || BIFF8 || Workbook || {{PRONOM|fmt/62}} || 1999 | ||
+ | |- | ||
+ | |Excel 10.0 || BIFF8 || Workbook || {{PRONOM|fmt/62}} || 2001 | ||
+ | |- | ||
+ | |Excel 11.0 || BIFF8 || Workbook || {{PRONOM|fmt/62}} || 2003 | ||
+ | |} | ||
+ | |||
+ | ==Handling of date values== | ||
+ | Excel stores date values as floating point numbers that represent the number of days since a given start date. According to [http://datapub.cdlib.org/2014/04/10/abandon-all-hope-ye-who-enter-dates-in-excel/ this piece], the default start dates are different for Excel for Windows (which uses January 1, 1900 and) Excel for Mac (which uses January 1, 1904). On top of this, the 1900 date system also erroneously assumes that 1900 was a leap year. This assumption was introduced on purpose in order to ensure compatibility with a bug in Lotus 1-2-3. In practice this may lead to dates that are off by 4 years and 1 day, depending on the software that is used to read/process the files. | ||
+ | |||
+ | == Software == | ||
+ | * [http://libxls.sourceforge.net/ libxls] - Library for reading XLS | ||
+ | * [http://sourceforge.net/projects/xlslib/ xlsLib] - Library for writing XLS | ||
+ | * [http://www.lexicon.net/sjmachin/xlrd.html The xlrd Module, a Python module for extracting data from MS Excel spreadsheet files] - contains detailed information about Excel's date handling | ||
+ | |||
+ | == Sample files == | ||
+ | * [http://labs.nationalarchives.gov.uk/wordpress/index.php/2014/01/datasets/ National Archives (UK) datasets (includes some XLS files)] | ||
+ | * {{DexvertSamples|document/xls}} | ||
==References== | ==References== | ||
Line 14: | Line 53: | ||
* [http://www.joelonsoftware.com/items/2008/02/19.html Why are the Microsoft Office file formats so complicated? (And some workarounds)] | * [http://www.joelonsoftware.com/items/2008/02/19.html Why are the Microsoft Office file formats so complicated? (And some workarounds)] | ||
* [http://www.advisorone.com/2013/04/16/oops-reinhart-rogoff-research-tripped-up-by-typo?t=theory-strategy Typo in Excel spreadsheet apparently led to erroneous result in economic paper that was influential on government policy] | * [http://www.advisorone.com/2013/04/16/oops-reinhart-rogoff-research-tripped-up-by-typo?t=theory-strategy Typo in Excel spreadsheet apparently led to erroneous result in economic paper that was influential on government policy] | ||
+ | * [https://uc3.cdlib.org/2014/04/09/abandon-all-hope-ye-who-enter-dates-in-excel/ Abandon all hope, ye who enter dates in Excel] | ||
+ | * [http://support.microsoft.com/kb/180162 XL: The 1900 Date System vs. the 1904 Date System] | ||
+ | * [http://decalage.info/file_formats_security/office MS Office 97-2003 legacy/binary formats security] - article with lots of resources on MS Office formats, including analysis techniques, tools and parsing libraries | ||
+ | * [https://www.openoffice.org/sc/excelfileformat.pdf Open Office - Excel Versions 2, 3, 4, 5, 95, 97, 2000, XP, 2003] | ||
+ | |||
+ | == External links == | ||
+ | * [{{ForensicsWikiURL|excel_spreadsheet_%28xls%29}} ForensicsWiki entry] (need to copy over whatever relevant details are available) | ||
[[Category:Microsoft]] | [[Category:Microsoft]] | ||
+ | [[Category:Microsoft Compound File]] |
Latest revision as of 04:07, 28 December 2023
XLS is a family of spreadsheet formats native to Microsoft Excel.
Contents |
[edit] Microsoft Office EXCEL 97-2007
The Excel Binary File Format (.xls) Structure is the binary file format used by Microsoft Excel 97, 2000, 2002, and Office Excel 2003. It is also supported by Microsoft Office Excel 2007.
Later Excel versions use XLSX as their native format, though they still support the older format as well.
[edit] Versions
Version | BIFF Version | Document type | PRONOM | Year |
---|---|---|---|---|
Excel 2.x | BIFF2 | Worksheet | fmt/55 | 1987 |
Excel 3.0 | BIFF3 | Worksheet | fmt/56 | 1990 |
Excel 4.0 | BIFF4 | Worksheet | fmt/57 | 1992 |
Excel 5.0 | BIFF5 | Workbook | fmt/59 | 1993 |
Excel 7.0 | BIFF5 | Workbook | fmt/59 | 1995 |
Excel 8.0 | BIFF8 | Workbook | fmt/61 | 1997 |
Excel 9.0 | BIFF8 | Workbook | fmt/62 | 1999 |
Excel 10.0 | BIFF8 | Workbook | fmt/62 | 2001 |
Excel 11.0 | BIFF8 | Workbook | fmt/62 | 2003 |
[edit] Handling of date values
Excel stores date values as floating point numbers that represent the number of days since a given start date. According to this piece, the default start dates are different for Excel for Windows (which uses January 1, 1900 and) Excel for Mac (which uses January 1, 1904). On top of this, the 1900 date system also erroneously assumes that 1900 was a leap year. This assumption was introduced on purpose in order to ensure compatibility with a bug in Lotus 1-2-3. In practice this may lead to dates that are off by 4 years and 1 day, depending on the software that is used to read/process the files.
[edit] Software
- libxls - Library for reading XLS
- xlsLib - Library for writing XLS
- The xlrd Module, a Python module for extracting data from MS Excel spreadsheet files - contains detailed information about Excel's date handling
[edit] Sample files
[edit] References
- Binary file format specification *.xls (97-2007) format
- Why are the Microsoft Office file formats so complicated? (And some workarounds)
- Typo in Excel spreadsheet apparently led to erroneous result in economic paper that was influential on government policy
- Abandon all hope, ye who enter dates in Excel
- XL: The 1900 Date System vs. the 1904 Date System
- MS Office 97-2003 legacy/binary formats security - article with lots of resources on MS Office formats, including analysis techniques, tools and parsing libraries
- Open Office - Excel Versions 2, 3, 4, 5, 95, 97, 2000, XP, 2003
[edit] External links
- ForensicsWiki entry (need to copy over whatever relevant details are available)