Scan Tailor "Enhanced"

pejuko
Posts: 30
Joined: 17 Feb 2011, 17:06

Scan Tailor "Enhanced"

Post by pejuko »

Hi,

I've been working on some new ScanTailor features that are in the git repository but are not yet included in any ScanTailor release. I've prepared this development release so that you can test it and use it if you find it useful.

This release includes:

The Tiff compression option lets you disable compression in TIFF files or select one of the other standard compression methods (lzw, deflate, packbits, jpeg).

The Deviation patch enables highlighting of pages that differ from the rest. Pages are highlighted in red as follows: in the Deskew filter, pages with too high a skew; in the Select Content filter, pages whose content size differs; and in the Margins filter, pages that do not match the others.

The Automargins patch allows keeping the page content in its original place. In the Margins step you can choose from Auto, Manual (default) and Original mode.
Manual mode is the original behaviour. Auto mode tries to decide whether it is better to align the page to the top, bottom or center. Original mode keeps each page at its original vertical position.

The Pagedetect patch allows detecting the page within black margins, and enables keeping the original page layout or switching off page content detection. To detect and keep the original page layout from the command line, try:

Code:

scantailor-cli --disable-content-detection --enable-page-detection --enable-fine-tuning --margins=0 images outdir/
Or you can leave content detection enabled and also enable auto margins:

Code:

scantailor-cli --enable-page-detection --enable-fine-tuning --enable-auto-margins images outdir/
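For batch use, the two invocations above could be wrapped in a small script. The following is only a sketch: the function name st_cmd, the mode names "layout" and "auto", and the directory names are made up for illustration, and only the flags come from the commands above. It prints the command instead of running it, so you can check it first.

```shell
#!/bin/sh
# Hypothetical helper (not part of ScanTailor): print the scantailor-cli
# command line for one of the two modes shown in the post.
#   layout - detect the page and keep the original page layout
#   auto   - keep content detection enabled and enable auto margins
st_cmd() {
    mode=$1
    in_dir=$2
    out_dir=$3
    case "$mode" in
        layout)
            echo "scantailor-cli --disable-content-detection --enable-page-detection --enable-fine-tuning --margins=0 $in_dir $out_dir/"
            ;;
        auto)
            echo "scantailor-cli --enable-page-detection --enable-fine-tuning --enable-auto-margins $in_dir $out_dir/"
            ;;
        *)
            echo "unknown mode: $mode" >&2
            return 1
            ;;
    esac
}

# Dry run: show the command for the "layout" mode.
st_cmd layout images outdir
```

Once the printed command looks right, it can be run by piping it to the shell, for example: st_cmd auto images outdir | sh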
The Matchlayout patch adds --match-layout options to the cli version, which detect atypical pages and do not automatically match their size with the others.

The Pictureshape patch adds an option for mixed pages to choose between free-shape and rectangular-shape images. This patch does not improve the original algorithm; it creates rectangular shapes from the detected "blobs" and joins rectangles that intersect into one.
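The joining step can be pictured roughly as follows. This is only an illustration of the idea described above, not the code from the patch: the function name merge_rects, the "x1 y1 x2 y2" input format, and the choice to treat touching rectangles as intersecting are all assumptions made for the example.

```shell
#!/bin/sh
# merge_rects: hypothetical helper, not taken from ScanTailor.
# Reads one rectangle per line as "x1 y1 x2 y2" (x1 <= x2, y1 <= y2) and
# keeps replacing any two intersecting rectangles with their common
# bounding box until no two rectangles intersect.
merge_rects() {
    awk '
    BEGIN { n = 0 }
    NF == 4 {
        # Store the corners as numbers so comparisons are numeric.
        x1[n] = $1 + 0; y1[n] = $2 + 0; x2[n] = $3 + 0; y2[n] = $4 + 0
        n++
    }
    END {
        do {
            merged = 0
            for (i = 0; i < n && !merged; i++)
                for (j = i + 1; j < n && !merged; j++)
                    if (x1[i] <= x2[j] && x1[j] <= x2[i] &&
                        y1[i] <= y2[j] && y1[j] <= y2[i]) {
                        # Grow rectangle i to the bounding box of both.
                        if (x1[j] < x1[i]) x1[i] = x1[j]
                        if (y1[j] < y1[i]) y1[i] = y1[j]
                        if (x2[j] > x2[i]) x2[i] = x2[j]
                        if (y2[j] > y2[i]) y2[i] = y2[j]
                        # Drop rectangle j by moving the last one into its slot.
                        n--
                        x1[j] = x1[n]; y1[j] = y1[n]
                        x2[j] = x2[n]; y2[j] = y2[n]
                        merged = 1
                    }
        } while (merged)
        for (i = 0; i < n; i++) print x1[i], y1[i], x2[i], y2[i]
    }'
}

# Two overlapping blobs and one separate blob.
printf '0 0 10 10\n5 5 20 20\n30 30 40 40\n' | merge_rects
# prints:
# 0 0 20 20
# 30 30 40 40
```

The loop restarts after every join because a rectangle that has grown may now intersect one it did not touch before.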

For Windows I've compiled an executable installation package. Users on other platforms have to compile from the prepared source code, which you can download from the same place, or merge the patches yourselves from git.

You can download the files from https://sourceforge.net/projects/scanta ... /enhanced/

Latest version: 20120503
This post updated 5/3/2012

Petr Kovar
Last edited by pejuko on 14 Dec 2011, 07:17, edited 2 times in total.
Misty
Posts: 481
Joined: 06 Nov 2009, 12:20
Number of books owned: 0
Location: Frozen Wasteland

Re: Scan Tailor "Enhanced"

Post by Misty »

Hi, Petr,

Is this release stable enough for general use? Is there a git tag I can use to pull the code in this release? If so, I was going to add this as a build option in Homebrew so Mac users can easily try out the development release.
The opinions expressed in this post are my own and do not necessarily represent those of the Canadian Museum for Human Rights.
Tulon
Posts: 687
Joined: 03 Oct 2009, 06:13
Number of books owned: 0
Location: London, UK

Re: Scan Tailor "Enhanced"

Post by Tulon »

Hi Petr,

I'm glad you're still around. Your post reminded me that I promised to review the automargins code, which I never did. Sorry about that. I rarely work on ST these days, as I got tired of it. I hope you understand.

I do support people releasing their own versions of ST. After all, the Free Software / Open Source thing is all about being able to do just that. Keep up the good work!
Scan Tailor experimental doesn't output 96 DPI images. It's just what your software shows when DPI information is missing. Usually what you get is input DPI times the resolution enhancement factor.
daniel_reetz
Posts: 2812
Joined: 03 Jun 2009, 13:56
E-book readers owned: Used to have a PRS-500
Number of books owned: 600
Country: United States

Re: Scan Tailor "Enhanced"

Post by daniel_reetz »

Petr,

I recognize you from the Scan Tailor dev-list. Thanks for setting this up and sharing it with the community here. Hopefully we can help you build for all platforms.
eL_PuSHeR
Posts: 125
Joined: 28 Jun 2010, 15:25

Re: Scan Tailor "Enhanced"

Post by eL_PuSHeR »

Thank you very much for this release. Your hard work is appreciated.
jimboh
Posts: 82
Joined: 06 Jan 2011, 00:55

Re: Scan Tailor "Enhanced"

Post by jimboh »

Would you post a screenshot of Pictureshape in action?
pejuko
Posts: 30
Joined: 17 Feb 2011, 17:06

Re: Scan Tailor "Enhanced"

Post by pejuko »

jimboh wrote: Would you post a screenshot of Pictureshape in action?
Here is an example:

The original "free shape" mode:
[Image: scantailor-freeshape.cut.400.jpg]
and "rectangular shape" mode:
[Image: scantailor-rectangularshape.cut.400.jpg]
Both screenshots were taken with Picture Zones enabled.
pejuko
Posts: 30
Joined: 17 Feb 2011, 17:06

Re: Scan Tailor "Enhanced"

Post by pejuko »

Tulon wrote: I do support people releasing their own versions of ST. After all, the Free Software / Open Source thing is all about being able to do just that. Keep up the good work!
Would it be possible to create a new folder on SF for such releases?
Tulon
Posts: 687
Joined: 03 Oct 2009, 06:13
Number of books owned: 0
Location: London, UK

Re: Scan Tailor "Enhanced"

Post by Tulon »

Yes, sure. I just gave you file release permissions. Now you should be able to create folders and upload files.
pejuko
Posts: 30
Joined: 17 Feb 2011, 17:06

Re: Scan Tailor "Enhanced"

Post by pejuko »

Misty wrote: Is this release stable enough for general use? Is there a git tag I can use to pull the code in this release? If so, I was going to add this as a build option in Homebrew so Mac users can easily try out the development release.
Misty,

I use ScanTailor mostly on Linux, and with the cli version I've automatically processed thousands of pages with very good results.

The files are now on SourceForge in a subdirectory:
https://sourceforge.net/projects/scanta ... /enhanced/
so it should be more convenient for packagers to download them now.

Instead of pulling from git, I recommend using the .tar.gz package. It's merged from four branches, and there are conflicts.