Good Word doc -> plain text conversion

John Abreau abreauj-Re5JQEeQqe8AvxtiuMwx3w at public.gmane.org
Mon Sep 20 09:40:26 EDT 2010


Did you try the -w option? From the man page:


       -w width
              In text mode this is the line width in characters.  A  value
 of
              zero puts an entire paragraph on a line, useful when the text
is
              to used as  input  for  another  wordprocessor.  This  value
 is
              ignored in PostScript mode.


On Mon, Sep 20, 2010 at 9:30 AM, Ian Stokes-Rees <
ijstokes-/2FeUQLD3jedFdvTe/nMLpVzexx5G7lz at public.gmane.org> wrote:

>
>
> On 9/20/10 12:01 AM, jc-8FIgwK2HfyJMuWfdjsoA/w at public.gmane.org wrote:
> > Dan Ritter wrote:
> > | antiword is the usual candidate. Every one of Google's first ten
> > | results for that are relevant.
> >
> > Yeah, I thought of that, too, but I was hoping there might be something
>  that
> > does  a  better  job.   In  one of my current sample .doc files, for
> example,
> > antiword produces the curious table entry:
>
>
> Use antiword and recompile it yourself with no line length limit.  I'm
> sure you'll easily find some hard-coded value of 138 in there.
>
> "antiword" is the standard and will, I suspect, support more than
> anything else you find.
>
> Ian
> _______________________________________________
> Discuss mailing list
> Discuss-mNDKBlG2WHs at public.gmane.org
> http://lists.blu.org/mailman/listinfo/discuss
>



-- 
John Abreau / Executive Director, Boston Linux & Unix
GnuPG KeyID: 0xD5C7B5D9 / Email: abreauj-Re5JQEeQqe8AvxtiuMwx3w at public.gmane.org
GnuPG FP: 72 FB 39 4F 3C 3B D6 5B E0 C8 5A 6E F1 2C BE 99





More information about the Discuss mailing list