word -> text

Matthew Gillen me-5yx05kfkO/aqeI1yJSURBw at public.gmane.org
Sun Sep 19 14:13:32 EDT 2010


On 09/19/2010 12:19 PM, Englander, Irvin wrote:
> I had some Word docs that got severely corrupted a year or so ago. Surprisingly, I was able to get much of the text back using gunzip from a command line prompt. Nothing else I tried worked. So there must be some similarity in their compression algorithms.

The default Office 2007 file format is XML, but the XML files are then
packed into a plain zip archive. Run 'file' on a docx/pptx and it will report
"Zip archive data, at least v2.0 to extract".
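
Since a docx is just a zip archive of XML, pulling the text back out can be sketched in a few lines of Python using only the standard library. This is a rough illustration, not a robust converter: the function name docx_to_text is made up for this example, and it assumes the body text lives in word/document.xml, which is the usual location but not guaranteed for every file.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used by the main document part of a .docx.
W_NS = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def docx_to_text(path):
    """Return the text of a .docx file, one line per paragraph.

    Hypothetical helper for illustration. Assumes the body is stored at
    word/document.xml inside the zip archive.
    """
    # A .docx opens like any other zip archive.
    with zipfile.ZipFile(path) as archive:
        xml_bytes = archive.read("word/document.xml")

    root = ET.fromstring(xml_bytes)
    lines = []
    for para in root.iter(W_NS + "p"):  # w:p elements are paragraphs
        # w:t elements hold the actual text of each run in the paragraph.
        pieces = [t.text or "" for t in para.iter(W_NS + "t")]
        lines.append("".join(pieces))
    return "\n".join(lines)
```

Calling print(docx_to_text("report.docx")) on an intact file (the file name here is only a placeholder) should print the body text. zipfile needs a readable archive, so a badly damaged file may still fail to open this way.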

Matt




