Redacting PDF

4/9/2023

pdf-redactor is a general-purpose PDF text-layer redaction tool, in pure Python, by Joshua Tauberer and Antoine McGrath. It uses pdfrw under the hood to parse and write out the PDF. To use it, get the module and then install its dependencies.

This Python module is a general tool to help you automatically redact text from PDFs. It operates on the text layer of the document's pages (content stream text), on the Document Information Dictionary (the document's metadata fields), and on embedded XML (XMP) metadata. Graphical elements, images, and other embedded resources are not touched.

Limitations: not all content may be redacted. The PDF format is an incredibly complex data standard that has hundreds, if not thousands, of exotic capabilities used rarely or in specialized circumstances. Besides a document's text layer, metadata, and the other components that this tool scans and can redact text from, there are many other components of PDF documents that this tool does not look at.

You can use regular expressions to perform text substitution on the text layer (e.g. replace social security numbers with "XXX-XX-XXXX"). You can rewrite, remove, or add new metadata fields on a field-by-field basis (e.g. wipe out all metadata except for certain fields). You can also rewrite, remove, or add XML metadata using functions that operate on the parsed XMP DOM.
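The two simplest ideas mentioned in this post, regular-expression substitution on text and field-by-field metadata filtering, can be sketched with the Python standard library alone. This is only an illustration: it works on plain strings and dictionaries, it does not call pdf-redactor or pdfrw, and the names CONTENT_FILTERS, redact_text, and filter_metadata are made up for the example. The social security number pattern is deliberately simplistic.

```python
import re

# Hypothetical filter list: (compiled pattern, replacement function) pairs.
# The pattern matches digits shaped like 123-45-6789 that are not part of
# a longer run of digits. Real-world SSN matching needs a stricter pattern.
CONTENT_FILTERS = [
    (re.compile(r"(?<!\d)\d{3}-\d{2}-\d{4}(?!\d)"), lambda match: "XXX-XX-XXXX"),
]


def redact_text(text, filters=CONTENT_FILTERS):
    """Apply every (pattern, replacement) pair to the text in order."""
    for pattern, replacement in filters:
        text = pattern.sub(replacement, text)
    return text


def filter_metadata(metadata, keep=("Title",)):
    """Return a copy of the metadata with every field not in `keep` dropped."""
    return {key: value for key, value in metadata.items() if key in keep}


if __name__ == "__main__":
    print(redact_text("Applicant SSN: 123-45-6789."))
    print(filter_metadata({"Title": "Report", "Author": "Jane Doe"}))
```

In a real PDF the text layer is stored in content streams and may be split across several drawing operations, so a tool has to reassemble the text before a pattern like this can match. That is the part a library such as pdf-redactor takes care of.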
