Skip to content

DIY Page Turner Project Roundup

One of the best things about my community is that it is just chock full of awesome people.

One of the worst things about being the “DIY BOOK SCANNER guy” is that people always ask “DOES IT TURN PAGES?”.

Well, my friends, it turns pages.

jck57/Monson’s Servo Auto Scanner.

DIY Page Turner from Berlin Hackerspace C-Base:

dtic was among the page-turning pioneers of our forum:
Revision 1:

Revision 2:

jck57/Monson is actually building a full-auto scanner, check out his other impressive engineering:

As you can see, page turning is not an easy problem to solve.

Houston Area Meetup

hg1027 is proposing a DIYBookScanner meetup in Houston at TxRx – Houston members and interested parties, check it out!

Book Scan Wizard Gives DIY Book Scanners An Upload Button

Steve Devore’s Book Scan Wizard is the most automated, complete DIY Book Scanning software ever written. If you’re doing any kind of bulk or batch scanning, you owe it to yourself to check it out.

And now, through a new partnership with the Internet Archive, Book Scan Wizard and DIYBookScanner offer an upload button – every DIY Book Scanner can share his or her work with the world, or get an OCR’d copy of their scans for free. I can’t over-write the significance of this upload button in Book Scan Wizard – it closes the loop. We now have a complete path from your bookshelf to your Kindle, free OCR for all scanners, conversion to DAISY for the blind, and a world-class web book interface, courtesy the Archive.

Unfortunately I’m traveling at the moment, so I’m really not giving the new upload button the post it deserves. And it’s important to say here that this service could go away at any moment – we are really just dipping our toes into the water here. So get Book Scan Wizard while it’s hot and fresh, and get scanning — and uploading.

Book Scan Wizard hits RC1 Milestone!

Book Scan Wizard is the latest scan post-processor from Steve DeVore. Book Scan Wizard will take the images straight from your cameras, interleave them, crop them, scale them, adjust them, and otherwise process them into something suitable for binding. On top of all that raw functionality, Steve is a fantastic developer, super-active – and also proactively created good documentation. And best of all, he is using his own software to do his own work – reflecting our collective experience that when we make tools to do our own work, they work much better than off-the-shelf stuff. I am personally quite grateful for all the work done on Book Scan Wizard and plan to use it almost exclusively for my next scanner.

So all that said, you could imagine my excitement when Steve made the announcement that BSW has hit Release Candidate status.

That’s big news – and it’s only the beginning for this excellent post-processor. Soon, Book Scan Wizard is going to be a part of a newly formed collaboration with the Internet Archive. But that’s a post for another day.

A Couple Of Great Hacks From The Forum.

The community here is really creative. Since our work revolves around cheap cameras, a lot of great camera modifications and hacks show up.

One such hack was just featured on Hack A Day – a battery replacement hack by Marcos Kusnick and friends:

Using a syringe to trigger any camera, by jck57:

Cut a slot in the camera to let the SD card out without taking it off the tripod by will1384:

A script that makes the shutter button “smart” so it can delete or do other tasks by You1 (Edvin):

Book Scanner design has come a long way, too – page turning with air:

T-slot scanners by Translucent1:

Laser-based book scanning! By Everyone!

Daniel Reetz in the New York Times/Broad Updates

So, this project made it into the print edition of the New York Times today, February 10th. Page C1 – Arts:

https://www.nytimes.com/2011/02/10/arts/10innovative.html?src=twrhp

I am proud to be called part of the “dark matter of innovation”.

I must correct one crucial fact – more than 250 people have built DIY Book Scanners – and I’m also aware of a whole shadow world of scanner builders who never join the forum and share (they show up on Craigslist and in other forums and websites every other day or so). My estimate on the total number of builds, documented and undocumented, completed and left unfinished, is between 350 and 500. DIYBookScanner.org (the site where the action happens) has been around for ~600 days, so that’s quite a few builds.

This book scanner project is going gangbusters. We have not one, but two of the best open source book
scanning applications out there (Scan Tailor and Book Scan Wizard, both GPL) which are being actively developed, improved, and community supported. Scan Tailor, in particular, has the best text-based dewarping ever implemented in free software. We have three community-developed open source ways to bind digital books (Bindery, DJVUBind, and PDFmaker), and we have a ton of hardware innovation going on. On the hardware front, we decided to try to see how little hardware we can use – so we’re co-developing hardware and software to use laser beams to “dewarp” (flatten) the images of book pages. Our first results came within hours of trying things out… and things look great. If this method reaches anything near the potential we’re seeing now, in the near future we could go from a $300 high speed book scanner made from trash to a $30 high speed book scanner made in the
USA – that fits in your handbag.

In the last year my community has helped out in Haiti, Africa, Canada, Indonesia, and all over the States. By “helped out” I mean offering advice, money, hardware, software, ongoing support, and even in-person meetups in Portland, OR (and elsewhere). We have substantial representation all over the world – particularly in Brazil (you should see these guys innovate), Germany (builders of auto page turners!), and Russia (a place where camera-based book scanning has been going on for over 40 years – the original DIY’ers). You can’t search for book scanning without finding us, because we’ve tried or seen almost everything out there, shared our experiences, and improved it as a collective effort. That said, sometimes finding what you want in our forums is a pain in the ass. I’m working on it.

As the founder and steward of this project, I often wonder how long book scanning will seem as important as it does now, in this pivotal moment of books gone bits. While once, I thought these thoughts cynically, I now think the same thoughts — differently. We are creating the future of personal document digitization, making it easy, free, and powerful — as it should be — and using things and skills that people already have. Someday, I hope, it will not be a big deal or seem so important, because it will be 1. freely available to anyone and 2. just a baseline expectation, like the free and simple use of printers and phone-cams is today.

Maybe I’m wrong. It’s hard to avoid such grandiose thoughts when you see a thousand people working together toward a common goal, each in their own way, on their own terms, in their own time, and according to their ever-rising ability. Maybe another technology will come along and disrupt our innovation, making things even easier, cheaper, faster, and more accessible.

I’d be glad to see that happen.

Thanks for your time.
Daniel

A Poll About Cameras.

Patrick Hall is conducting a poll to figure out which cameras people are using. Please drop by and make your mark! The more the merrier, as such things go.

Also, Pat is awesome and if you missed it, he made a fileslip scanner from a cardboard box. We’re now working together on a somewhat sturdier model.

New Scan Postprocessor: Book Scan Wizard

Steve Devore, or steve1066d as he is known here, has produced a new Java-based scan post-processor. It’s almost ready for release, and could use some testing. It’s pretty exciting work for those of us with very stable scanners, because it allows the same post-processing parameters to be applied to every page (an oft-requested feature for Scan Tailor).

In his own words:

I decided to develop a new application designed for Book scanning post-processing. It’s not quite ready for a beta, but its getting close.

What I wanted was a tool that I could define certain actions to be performed, like deskewing, correcting keystone distortion, and cropping, and have them automatically apply to the entire batch. I wanted a tool that could be run interactively to set up the job, then having the option to run the actual full processing without user intervention. So I decided to come up with a new tool, which I’m calling BSW (Book Scan Wizard). I am releasing it as open source, under the GPL license.

Its a bit different animal than Scan Tailor in that you define just what you want done to the pages. So while Scan Tailor will try to figure out the margins by examining the pages, with BSW you click on the image corners and add a crop operation.

This works on the premise that the book scanner keeps the pages more or less in the same position from one scan to the next, so that once that operations are defined (with some separate configuration of the left and right pages), it can be applied to the remainder of pages. The goal is to be able to set up the configuration of an entire book in less than 5 minutes, and be able to set up a bunch of books and convert them all without any user intervention.

Steve wrote a very informative post about Book Scan Wizard. Here are a few screenshots to whet your appetite:

Steve, thank you for your excellent work and thank you for sharing!

New Version of Scan Tailor, Including Automatic Dewarping.

Tulon has again released a new version of the venerable post-processing application Scan Tailor, this time including dewarping functionality! You can download it here.

Please try out the software and look for any bugs. Please remember that feature suggestions go here and are not welcome in the development and announcement threads.

Before dewarping:

After dewarping:

This Month in DIY Book Scanner News

dansheffler finds a great free utility for batch cropping in OSX. He also found a great way to bind PDFs using Automator.

clemd973, who has been doing lots of cool stuff lately, started a thread on Acrobat tips and tricks. May it grow into the web’s greatest resource on the topic.

ceanne1 has been working on a bunch of community resources. For example, here’s her “DIY Book Scanner Glossary” thread, which turned out excellent, and here’s her thread on cameras that support PTP. Don’t miss her origami scanner design thread.

blender’s OCR Page Namer got a release here.

Likewise, Misty got another version of PDFMaker out for people to experiment with.

will1384 put up a great post on how to make a trigger.

There was plenty more, but for that, you’ll have to read the forums yourself.