DIET (compression)
|  (→See also) |  (→Identification) | ||
| Line 15: | Line 15: | ||
| == Identification == | == Identification == | ||
| − | + | For what it's worth, the newer versions of DIET detect compressed files by searching for the byte sequence {{magic|0x9d 0x89}}, and ASCII "{{magic|dlz}}", in the first 126 bytes of the file. Both must appear, in that order. This works for the newer formats, but not for all of the older ones. | |
| − | + | === Identification of EXE files === | |
| + | Below are some version-specific characteristics of DIET-compressed EXE files. | ||
| − | + | Some DIET-compressed EXE files have {{magic|9d 89}} in the EXE checksum field at offset 18 (refer to [[MS-DOS EXE#Header structure]]), and some have ASCII "{{magic|diet}}" in the unused bytes at offset 28. These signatures might be less reliable than other means of identifying DIET format, as they could be modified. | |
| − | + | Also, be aware of [[LGLZ]] format, which can be mistaken for DIET. | |
| − | + | Let "<code>8e db 8e...</code>" be the byte sequence {{magic|8e db 8e c0 33 f6 33 ff b9 08 00 f3 a5 4b 48 4a}}. | |
| + | |||
| + | v1.00-1.00d: | ||
| + | * <code>03 00</code> at offset 20 (the IP register) | ||
| + | * <code>8e db 8e...</code> at offset 55 | ||
| + | |||
| + | v1.02b-1.20 | ||
| + | * {{magic|9d 89}} at offset 18 | ||
| + | * <code>8e db 8e...</code> at offset 52 | ||
| + | * "{{magic|dlz}}" at offset 87 | ||
| + | |||
| + | v1.44 | ||
| + | * {{magic|9d 89}} at offset 18 | ||
| + | * "{{magic|diet}}" at offset 28 | ||
| + | * <code>8e db 8e...</code> at offset 72 | ||
| + | * "{{magic|dlz}}" at offset 107 | ||
| + | |||
| + | v1.45f | ||
| + | * {{magic|9d 89}} at offset 18 | ||
| + | * "{{magic|diet}}" at offset 28 | ||
| + | * <code>8e db 8e...</code> at offset 77 | ||
| + | * "{{magic|dlz}}" at offset 108 | ||
| + | |||
| + | === Identification of COM files === | ||
| + | v1.00-1.00d: Files start with {{magic|bf}}, and have {{magic|fd f3 a5 fc 8b f7 bf 00}} at offset 17. Note: The CRC field is at offset 35, and the compressed data starts at offset 37. | ||
| + | |||
| + | v1.02b-1.20: Files start with {{magic|be}}, have {{magic|fd f3 a5 fc 8b f7 bf 00}} at offset 17, and {{magic|'d' 'l' 'z'}} at offset 35. | ||
| + | |||
| + | v1.44-1.45f: Files start with {{magic|f9}}, have {{magic|9d 89}} at offset 10, and {{magic|'d' 'l' 'z'}} at offset 65. | ||
| + | |||
| + | === Identification of data files === | ||
| + | v1.00-1.00d: Files start with bytes {{magic|b4 4c cd 21 9d 89}}. Note: The CRC field is at offset 6, and the compressed data starts at offset 8. | ||
| + | |||
| + | v1.02b-1.20: Files start with bytes {{magic|9d 89 'd' 'l' 'z'}}. | ||
| + | |||
| + | v1.44-1.45f: Files start with bytes {{magic|b4 4c cd 21 9d 89 'd' 'l' 'z'}}. | ||
| == See also == | == See also == | ||
Revision as of 15:30, 19 August 2023
DIET is an executable compression and file compression utility for DOS, developed by Teddy Matsumoto. It does executable compression of EXE files (to EXE) and COM files (to EXE or COM).
It can also compress arbitrary data files. Such files can be transparently decompressed by DIET's TSR utility.
Both types of files can be decompressed using the -RA option.
| Contents | 
Technical notes
Researchers should note that DIET's behavior depends on the cluster size of the relevant filesystem. Use the -B option (introduced in v1.10a) to turn off this feature, or else DIET will probably decide not to compress most of your files.
Identification
For what it's worth, the newer versions of DIET detect compressed files by searching for the byte sequence 0x9d 0x89, and ASCII "dlz", in the first 126 bytes of the file. Both must appear, in that order. This works for the newer formats, but not for all of the older ones.
Identification of EXE files
Below are some version-specific characteristics of DIET-compressed EXE files.
Some DIET-compressed EXE files have 9d 89 in the EXE checksum field at offset 18 (refer to MS-DOS EXE#Header structure), and some have ASCII "diet" in the unused bytes at offset 28. These signatures might be less reliable than other means of identifying DIET format, as they could be modified.
Also, be aware of LGLZ format, which can be mistaken for DIET.
Let "8e db 8e..." be the byte sequence 8e db 8e c0 33 f6 33 ff b9 08 00 f3 a5 4b 48 4a.
v1.00-1.00d:
-  03 00at offset 20 (the IP register)
-  8e db 8e...at offset 55
v1.02b-1.20
-  9d 89at offset 18
-  8e db 8e...at offset 52
-  "dlz" at offset 87
v1.44
-  9d 89at offset 18
-  "diet" at offset 28
-  8e db 8e...at offset 72
-  "dlz" at offset 107
v1.45f
-  9d 89at offset 18
-  "diet" at offset 28
-  8e db 8e...at offset 77
-  "dlz" at offset 108
Identification of COM files
v1.00-1.00d: Files start with bf, and have fd f3 a5 fc 8b f7 bf 00 at offset 17. Note: The CRC field is at offset 35, and the compressed data starts at offset 37.
v1.02b-1.20: Files start with be, have fd f3 a5 fc 8b f7 bf 00 at offset 17, and 'd' 'l' 'z' at offset 35.
v1.44-1.45f: Files start with f9, have 9d 89 at offset 10, and 'd' 'l' 'z' at offset 65.
Identification of data files
v1.00-1.00d: Files start with bytes b4 4c cd 21 9d 89. Note: The CRC field is at offset 6, and the compressed data starts at offset 8.
v1.02b-1.20: Files start with bytes 9d 89 'd' 'l' 'z'.
v1.44-1.45f: Files start with bytes b4 4c cd 21 9d 89 'd' 'l' 'z'.
See also
Specifications
- DIET v1.02b → DIETTECH.DOC [Possibly an unfinished draft -- lots of errors.]
Software
-  DIET
- v1.00
- v1.00d: TEKNO 6-1998 → DEMOER/GRAVITY3/PC_4KB/SPARK.ZIP → SRC_NORM.ZIP → DIET.EXE
- v1.02b
- v1.10a
- v1.20
- v1.44
- v1.45f
- ANORMAL's DOSEXE collections → Executable Tools Pack → packers/diet.*
- Various versions at old-dos.ru
 
For other utilities that may decompress DIET-compressed executables, see Executable compression#Decompression software.

