This is a project to remove duplicated slides by similarity of pixels The python-requirements are PyPDF2, wand and numpy. To see intermediate similar images, you also need ipython. Also you need imagemagick and under arch, you may have to change your config. To get a preview, go to öä.eu/remove_duplicates.html