vokimon/visualpdfdiff


Visual PDF diff


Visual side-by-side comparison of rendered PDF documents.

You can use this in several ways:

  • a command-line tool to obtain the diff of two PDFs
  • a Python function to do the same from other Python programs
  • an equality assertion to be used in Python unittest
  • a back-to-back assertion to be used in Python unittest
  • an extension for the back2back tool, to run back-to-back tests on the PDF outputs of your commands

The generated diff looks like this:

[Image: diff output example]

Installation

sudo apt install imagemagick # Or the equivalent if not debian based
pip install visualpdfdiff
pip install b2btest # if you want to use the back2back command

NOTE: visualpdfdiff requires ImageMagick to be allowed to handle PDF files. This is disabled by default for security reasons. If you run a web server that accepts PDF files from outside, please consider the security implications.

Edit /etc/ImageMagick-*/policy.xml and uncomment the line:

<policy domain="coder" rights="read | write" pattern="PDF" />

And comment out the line that denies PDF access, so that it reads:

<!-- <policy domain="coder" rights="none" pattern="PDF" /> -->
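To double-check the policy edit above, the following sketch lists the rights granted to the PDF coder in a policy file. It is only an illustration: the helper name `pdf_coder_rights` is hypothetical, and it assumes the file uses a `policymap` root element and the exact pattern `PDF` shown above (some ImageMagick versions group PDF with other formats under a different pattern).

```python
import xml.etree.ElementTree as ET


def pdf_coder_rights(policy_xml):
    """Return the 'rights' value of every active coder policy for PDF.

    Commented-out <policy> lines are skipped, because the XML parser
    discards comments.
    """
    root = ET.fromstring(policy_xml)
    return [
        policy.get("rights", "")
        for policy in root.iter("policy")
        if policy.get("domain") == "coder" and policy.get("pattern") == "PDF"
    ]
```

With the edit applied, the only entry should be `read | write`. To inspect a real installation, read the file matched by /etc/ImageMagick-*/policy.xml and pass its text to the function.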

Command line diff tool

visualpdfdiff a.pdf b.pdf [output-diff.pdf]

Returns 0 if both PDFs are raster-equal, -1 if they are not.

If an output is provided, the side-by-side diff PDF is generated. Omitting the output is faster when differences exist, though. So when you expect the files to be equal most of the time, it is faster to check first without an output and generate the diff only when the check fails.
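The check-then-generate approach can be scripted. This is a minimal sketch, not part of the package: the helper name `check_then_diff` and its `run` parameter are hypothetical, and it treats any nonzero exit status as "different" (a -1 exit status is reported as 255 on POSIX systems).

```python
import subprocess


def check_then_diff(a, b, output, run=subprocess.call):
    """Compare two PDFs, producing the diff PDF only when they differ.

    Returns True when the PDFs are raster-equal. `run` must take a
    command list and return its exit status; it is a parameter only so
    the function can be exercised without the tool installed.
    """
    # Fast path: no output file, so no diff document is generated.
    if run(["visualpdfdiff", a, b]) == 0:
        return True
    # Slow path: run again, this time asking for the side-by-side diff.
    run(["visualpdfdiff", a, b, output])
    return False
```

Calling `check_then_diff('a.pdf', 'b.pdf', 'output-diff.pdf')` runs the tool once when the files match and twice when they do not.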

Python diff function

from visualpdfdiff import diff

haveDifferences = diff('a.pdf', 'b.pdf', 'out.pdf')
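As the variable name in the call above suggests, the result is truthy when the documents differ. One possible way to turn that into a hard failure in a script is sketched below. The helper `assert_same_pdf` is hypothetical and not provided by the package; the diff function is passed in as an argument so the helper does not depend on anything beyond the three-argument call shown above.

```python
def assert_same_pdf(expected, result, diff_output, diff_func):
    """Raise AssertionError when diff_func reports visual differences.

    diff_func is expected to behave like the three-argument call shown
    above: it returns a truthy value when the two PDFs differ.
    """
    if diff_func(expected, result, diff_output):
        raise AssertionError(
            f"{result} differs from {expected}; see {diff_output}"
        )
```

A typical call would be `assert_same_pdf('a.pdf', 'b.pdf', 'out.pdf', diff)` after `from visualpdfdiff import diff`.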

Unittest back-to-back assertion

Compares against the last validated output. If the extension is PDF, visualpdfdiff will be chosen to detect and output the differences.

import unittest

class MyClass_Test(unittest.TestCase):

	from b2btest import assertB2BEqual

	def test_otherMethod_conditions(self):
		...
		self.assertB2BEqual('b.pdf')

Command back2back tests

Define your tests for the back2back command as in the example below. Once this package is installed, back2back compares PDF outputs using visualpdfdiff.

myTest:
  command: ./myreportscript.py -o output.pdf
  outputs:
  - output.pdf

Similar tools

  • pdfdiff: Extracts the text and diffs that, then draws an outline on the text. Better for text diffing, but not as good for layout diffing.
  • qtrac's diffpdf: A quite nice (Qt-based) graphical tool that does both text and visual diffing. It is no longer maintained since the authors moved to a closed-source license.
  • vslavik's diff-pdf:
  • diff-pdf-visually: Quite similar to this one, and not just in name, but it does not generate an output PDF.

CHANGES

1.0 (Unreleased)

  • First version as independent module
  • Previous versions were part of somenergia-oomakotest, a test suite to compare outputs of Mako reports generated by Odoo

TODO

  • Raster resolution configurable by keywords
  • Diff metadata as well
  • Make an overlay fully transparent within the diff zone, and translucent gray in the matching zone
