RESOURCE: Watermarking and OCR-ing Your Images

Ammon Shepherd (University of Virginia) has shared the code and instructions for batch watermarking and OCR-ing images. During the course of archival research, Shepherd needed a way to extract text from images of book pages and to add watermarks indicating the source of each image. As a result, he wrote a script, available in both bash and Ruby versions, that uses ImageMagick to add the watermark and Tesseract to OCR the images.
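
For readers who want a sense of what such a workflow involves, here is a minimal Python sketch of the same two steps. It is not Shepherd's script: the function names, the watermark placement, the font size, and the `*.jpg` pattern are all assumptions chosen for illustration. It assumes the ImageMagick `convert` command (named `magick` in ImageMagick 7) and the `tesseract` command are installed and on the PATH.

```python
from pathlib import Path
import subprocess


def watermark_command(src: Path, dst: Path, text: str) -> list[str]:
    """Build an ImageMagick command that writes `text` in the lower-right corner."""
    return [
        "convert", str(src),
        "-gravity", "SouthEast",      # anchor the text to the bottom-right corner
        "-pointsize", "24",           # assumed size; adjust to the image resolution
        "-fill", "white",
        "-annotate", "+10+10", text,  # 10px offset from the corner
        str(dst),
    ]


def ocr_command(src: Path, out_base: Path) -> list[str]:
    """Build a Tesseract command; Tesseract itself appends .txt to out_base."""
    return ["tesseract", str(src), str(out_base)]


def process_folder(folder: Path, out_dir: Path, text: str,
                   dry_run: bool = True) -> list[list[str]]:
    """Collect (and optionally run) watermark and OCR commands for each JPEG."""
    commands = []
    for src in sorted(folder.glob("*.jpg")):
        commands.append(watermark_command(src, out_dir / src.name, text))
        commands.append(ocr_command(src, out_dir / src.stem))
    if not dry_run:
        out_dir.mkdir(parents=True, exist_ok=True)
        for cmd in commands:
            subprocess.run(cmd, check=True)  # raises if a tool exits with an error
    return commands
```

Calling `process_folder(Path("scans"), Path("out"), "Source: Example Archive")` returns the list of commands without running them; passing `dry_run=False` executes them. The OCR step reads the original image rather than the watermarked copy, so the watermark text does not end up in the extracted text.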

Author: Zach Coble

Zach is the Digital Scholarship Specialist at New York University.