[NFBCS] searching for text in PDF documents

Michael McQuaid mickmcquaid at gmail.com
Sat Jan 9 19:33:35 UTC 2021


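When extracting text from PDFs, I usually use the command-line tool
pdftotext with the -layout option, which keeps the original physical layout
of the text as closely as it can. This extracts all the readable text to a
plain text file that you can then search with any editor or search tool.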

pdftotext is part of the Poppler utilities, available from
http://poppler.freedesktop.org
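
If you would rather script the search than open the text file by hand,
here is a minimal sketch in Python. It assumes pdftotext is installed and on
your PATH. The function names pdf_to_text and find_lines and the file name
contract.pdf are made up for illustration, not part of any tool.

```python
import subprocess
import sys
from pathlib import Path


def pdf_to_text(pdf_path):
    """Run `pdftotext -layout` and return the extracted text.

    Assumes the pdftotext executable is on PATH. The text is written
    next to the PDF with a .txt extension, then read back.
    """
    txt_path = Path(pdf_path).with_suffix(".txt")
    subprocess.run(
        ["pdftotext", "-layout", str(pdf_path), str(txt_path)],
        check=True,  # raise an error if pdftotext fails
    )
    return txt_path.read_text(encoding="utf-8", errors="replace")


def find_lines(text, phrase):
    """Return (line_number, line) pairs that contain phrase, ignoring case."""
    needle = phrase.lower()
    matches = []
    for number, line in enumerate(text.split("\n"), start=1):
        if needle in line.lower():
            matches.append((number, line.strip()))
    return matches


if __name__ == "__main__" and len(sys.argv) == 3:
    # Hypothetical usage: python search_pdf.py contract.pdf overtime
    for number, line in find_lines(pdf_to_text(sys.argv[1]), sys.argv[2]):
        print(f"{number}: {line}")
```

The line numbers it prints refer to the extracted text file, so you can
jump straight to the surrounding paragraph in any editor. A scanned PDF
with no text layer will come out empty and would need OCR first.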

- Mick

On Sat, Jan 9, 2021 at 1:54 PM Elizabeth Campbell via NFBCS <
nfbcs at nfbnet.org> wrote:

> Good afternoon all,
>
> I serve on the bargaining committee for our newsroom union, and we were
> asked to research various contracts for information about overtime,
> vacations, etc. as we put together our proposals for management. These
> contracts are PDF documents, and I am having a tough time finding specific
> information in these documents. I am using the Control F command for
> finding words and phrases, but I don't think that approach is working. Any
> suggestions are welcome.
>
> Thanks!!
>
> --
> Elizabeth Campbell

