BLU Discuss list archive


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Discuss] Now you see it now you don't



On November 27, 2023, John Abreau wrote:
>The command-line tool "pdftotext" will extract text from a PDF file, and
>"pdfimages" will extract images from a PDF file. Both tools are in the rpm
>package "poppler-utils".

And if the text happens to be in images, the package ocrmypdf can
convert the images to text. Do this before running pdftotext.

Dan