Boston Linux & UNIX was originally founded in 1994 as part of The Boston Computer Society. We meet on the third Wednesday of each month, online, via Jitsi Meet.

BLU Discuss list archive


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Discuss] Now you see it now you don't



On November 27, 2023, John Abreau wrote:
>The command-line tool "pdftotext" will extract text from a PDF file, and
>"pdfimages" will extract images from a PDF file. Both tools are in the rpm
>package "poppler-utils".

And if the text happens to be in images, the package ocrmypdf can
convert the images to text. Do this before running pdftotext.

Dan



Valid HTML 4.01! Valid CSS!



Boston Linux & Unix / webmaster@blu.org