Yahoo Groups

From Just Solve the File Format Problem

Latest revision as of 23:05, 2 November 2019

File Format
Name Yahoo Groups
Ontology

Yahoo Groups is an email list service run by Yahoo!. Until 2019 it also included web-readable forums and file areas, but these were discontinued that year, leaving only the email-based features. When those features (and the online archives of the messages) were discontinued, users were given a limited time to download an archive of their data.

ArchiveTeam is attempting to archive parts of its content, though much of it is marked as private and hence inaccessible to outside users.

== Downloaded archive ==

When you use the Get My Data feature, you are told to wait for an email notification that the archive is complete. When this arrives (possibly weeks later), you can download the file, which has the following format:

The file is a ZIP archive with a long, cryptic name made up of seemingly random characters (probably hexadecimal, since the letters a-f appear in it).

Within it, the first layer of subdirectories is named after the groups being archived. The archive covers every group you are a member of, whether or not you are a group owner.

Beneath each group directory there are more ZIP archives, one for each category of archived content, such as files.zip, links.zip, and messages.zip.
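
The nested layout described above can be explored without unpacking anything to disk. The following is a minimal sketch in Python, assuming that member paths inside the outer archive look like <group name>/<category>.zip; the function name list_group_archives is made up for illustration, and real downloads may differ in detail.

```python
import io
import zipfile
from collections import defaultdict


def list_group_archives(outer_zip):
    """Map each group name to the inner ZIP archives found beneath it.

    `outer_zip` is a path or a binary file object for the downloaded archive.
    Assumes the layout <group name>/<category>.zip (an assumption, not a spec).
    """
    groups = defaultdict(dict)
    with zipfile.ZipFile(outer_zip) as outer:
        for info in outer.infolist():
            parts = info.filename.split("/")
            # Only consider ZIP files sitting directly inside a group directory.
            if len(parts) != 2 or not parts[1].lower().endswith(".zip"):
                continue
            group, inner_name = parts
            # Read the nested archive into memory and list its members.
            with zipfile.ZipFile(io.BytesIO(outer.read(info))) as inner:
                groups[group][inner_name] = inner.namelist()
    return dict(groups)
```

Calling list_group_archives with the path of the downloaded file returns a dictionary keyed by group name, which gives a quick overview of what each group's files.zip, links.zip, and messages.zip contain. Reading each inner archive into memory is fine for a quick look but may be impractical for very large groups.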

The files.zip archive contains all the files from the file area, including subdirectory structures when the files are organized in folders.

The links.zip archive has the web links from the links section of a group, in Internet Shortcut format.
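
Internet Shortcut files are small INI-style text files in which the target address is stored under a URL key in an [InternetShortcut] section. As a rough sketch, assuming the files in links.zip follow that common layout (their exact names and text encoding are not documented here), the address can be pulled out with Python's configparser; shortcut_url is a hypothetical helper name.

```python
import configparser


def shortcut_url(text):
    """Return the URL stored in the text of an Internet Shortcut file.

    Returns None if there is no [InternetShortcut] section or no URL key.
    """
    # Disable interpolation: URLs often contain '%' escapes.
    parser = configparser.ConfigParser(interpolation=None)
    parser.read_string(text)
    return parser.get("InternetShortcut", "URL", fallback=None)
```

The function takes the decoded text of one shortcut file, so it can be combined with any method of reading members out of links.zip. Text with no section header at all makes configparser raise an error rather than return None.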

The messages.zip archive has one or more files in mbox format containing the messages from the group in chronological order, with names ending in .00001, .00002, etc. The messages are split across files of approximately 2.3 megabytes each. Older archives can usually be read by opening them in a text editor. More recent ones are harder to read because of the prevalence of HTML-format messages and the failure of many email users to trim quoted material, which leaves the archives full of raw markup and repetitive quoted text.
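
When a text editor is not enough, an mbox-aware tool can step through the messages one at a time. Below is a minimal sketch using Python's standard mailbox module, assuming the numbered files have already been extracted from messages.zip to disk; part_number and iter_subjects are illustrative names, not part of any Yahoo tooling.

```python
import mailbox
import re


def part_number(filename):
    """Return the numeric suffix of a name like 'something.00002', else -1."""
    match = re.search(r"\.(\d+)$", str(filename))
    return int(match.group(1)) if match else -1


def iter_subjects(mbox_paths):
    """Yield (date, subject) header pairs from mbox parts in suffix order."""
    for path in sorted(mbox_paths, key=part_number):
        # create=False: fail loudly instead of creating an empty mailbox.
        box = mailbox.mbox(path, create=False)
        try:
            for message in box:
                yield message["Date"], message["Subject"]
        finally:
            box.close()
```

Sorting by the numeric suffix keeps the parts in the same chronological order as the archive. Only headers are read here; getting readable text out of HTML-format bodies would need further processing.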

Sometimes there's also a medias.json file giving some sort of information in JSON format.

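== Links ==

* [https://groups.yahoo.com/neo Yahoo Groups site]
* [https://groups.yahoo.com/neo/getmydata Get My Data]
* [https://archiveteam.org/index.php?title=Yahoo!_Groups ArchiveTeam project page]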