B/W source - how to keep format?
Moderator: peterZ
Hello.
This is the first time I am using Scan Tailor with a source that is pure black & white (600 dpi). How can I make the output the same as the original? If I choose black & white, the output appears either too black or too washed out. If I choose color/grayscale, the output files get too big. Help.
Re: B/W source - how to keep format?
Can you post an example page?
Scan Tailor experimental doesn't output 96 DPI images. It's just what your software shows when DPI information is missing. Usually what you get is input DPI times the resolution enhancement factor.
Re: B/W source - how to keep format?
What? That page isn't even processed and has bad dpi.
- daniel_reetz
Re: B/W source - how to keep format?
Umm, that is definitely a processed image and the "DPI" is fine/large - are you referring to the moire effects that are happening because it was scanned on a flatbed?
Thanks for sharing this, dingodog!
- dingodog
Re: B/W source - how to keep format?
eL_PuSHeR wrote: What? That page isn't even processed and has bad dpi.
Hmm? So you believe Adobe? You should not.
The ppi is the same; the dpi changed, but dpi is not ppi.
# pdfinfo el-factor-humano.pdf
Tagged: no
Pages: 1
Encrypted: no
Page size: 6578 x 4424 pts
File size: 458678 bytes
Optimized: no
PDF version: 1.4
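To make the dpi/ppi point concrete, here is a rough sketch of the arithmetic behind a page size reported in points (1 pt = 1/72 inch). It assumes the embedded image is 6578 x 4424 pixels, which the pdfinfo output does not actually state; the function names are made up for illustration.

```python
POINTS_PER_INCH = 72.0  # PDF user-space unit: 1 point = 1/72 inch


def page_size_inches(width_pts, height_pts):
    """Convert a PDF page size from points to inches."""
    return width_pts / POINTS_PER_INCH, height_pts / POINTS_PER_INCH


def effective_ppi(pixels, points):
    """Pixels per inch of an image stretched across `points` of page."""
    return pixels / (points / POINTS_PER_INCH)


def points_for_dpi(pixels, dpi):
    """Page extent in points if `pixels` were placed at `dpi`."""
    return pixels * POINTS_PER_INCH / dpi


if __name__ == "__main__":
    # ASSUMPTION: pixel dimensions equal the reported point dimensions.
    width_px, height_px = 6578, 4424
    print(page_size_inches(6578, 4424))       # physical page size in inches
    print(effective_ppi(width_px, 6578))      # resolution implied by the page box
    print(points_for_dpi(width_px, 600))      # page width if tagged as 600 dpi
```

Under that assumption the page box only says the pixels are laid out at 72 per inch. The pixel count itself is unchanged, so no image detail is lost when the dpi tag is stripped or changed.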
I rotated the image with mtPaint, and mtPaint stripped the dpi info (the image still has the same ppi); no quality loss occurred.
Do you want the PROCESSED IMAGE alone?
I extracted it from the jbig2enc output:
- http://imageshack.us/photo/my-images/40 ... aspng.png/
The image shows dithering, since jbig2enc cannot BLANK non-text areas the way Scan Tailor can.
Images of this type need to be pre-processed, adjusting the contrast and brightness values, before being encoded with jbig2enc (this also helps with Scan Tailor).
The original color image is needed.
Re: B/W source - how to keep format?
Forgive me but I do not see what your point is. My issue is with Scan Tailor's BW output.
- dingodog
Re: B/W source - how to keep format?
The same task can be performed with jbig2enc.
I'm experimenting with various blur values in order to remove the dithering under the text areas.
Blurring an already dithered (1-bit) image before processing it with jbig2enc helps to clear the dithering.
I blurred the image you provided with a Gaussian blur (radius=4) and then encoded it with jbig2enc:
Code:
jbig2 -s -p -v -T 125 out.png && pdf.py output>out.pdf
I obtain a clearer result:
- http://ifile.it/ko5u714
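For anyone wondering why blurring before thresholding clears the dither, here is a small pure-Python sketch of the idea. It is not what the image editor or jbig2enc does internally: the sigma choice (radius / 2), the edge clamping, the toy image and all function names are assumptions for illustration, and the threshold of 125 simply mirrors the -T 125 above, assuming that option sets the black/white cutoff.

```python
import math


def gaussian_kernel(radius):
    """Normalised 1-D Gaussian kernel of length 2*radius + 1 (sigma = radius/2, an assumption)."""
    sigma = max(radius / 2.0, 0.5)
    weights = [math.exp(-(i * i) / (2.0 * sigma * sigma))
               for i in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]


def blur_1d(values, kernel):
    """Convolve one row or column with the kernel, clamping at the edges."""
    radius = len(kernel) // 2
    n = len(values)
    out = []
    for x in range(n):
        acc = 0.0
        for k, w in enumerate(kernel):
            i = min(max(x + k - radius, 0), n - 1)
            acc += w * values[i]
        out.append(acc)
    return out


def gaussian_blur(img, radius):
    """Separable Gaussian blur: rows first, then columns."""
    kernel = gaussian_kernel(radius)
    rows = [blur_1d(r, kernel) for r in img]
    cols = [blur_1d(list(c), kernel) for c in zip(*rows)]
    return [list(r) for r in zip(*cols)]


def threshold(img, t):
    """Pixels darker than t become black (0), the rest white (255)."""
    return [[0 if v < t else 255 for v in row] for row in img]


def make_demo_image(width=32, height=16):
    """Toy 1-bit page: sparse dither dots plus a solid 8-pixel-wide stroke."""
    img = []
    for y in range(height):
        row = []
        for x in range(width):
            dither_dot = (x % 2 == 0 and y % 2 == 0)
            stroke = 20 <= x <= 27
            row.append(0 if (dither_dot or stroke) else 255)
        img.append(row)
    return img


if __name__ == "__main__":
    page = make_demo_image()
    cleaned = threshold(gaussian_blur(page, 4), 125)
    for row in cleaned:
        print("".join("#" if v == 0 else "." for v in row))
```

The isolated dither dots average out to a light gray that lands above the cutoff, while the solid stroke stays dark at its centre. Near the image border and right next to the stroke the result is less clean, which matches the need to experiment with blur values.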
Re: B/W source - how to keep format?
Try mixed mode. If file size is a problem, you can reduce the number of gray tones using ImageMagick like this:
convert out/page.tif -depth 2 -quality 100 page.png
This will create page.png, which should have only 4 gray tones but usually looks good.
I get this result from Scan Tailor + convert:
http://www.4shared.com/photo/y6-7IKB8/out.html
Here I marked the text area below the picture as non-image, so the background disappeared, and I set the thickness to -30.
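As a rough illustration of what going down to a depth of 2 means, the sketch below maps 8-bit gray values onto 4 evenly spaced tones and compares raw (uncompressed) sizes. The function names are hypothetical, and ImageMagick's exact rounding and the PNG compression on top of it may give somewhat different numbers.

```python
def quantize(value, bits=2):
    """Map an 8-bit gray value (0-255) to the nearest of 2**bits evenly spaced tones."""
    steps = (1 << bits) - 1               # 3 steps between 4 tones when bits=2
    index = round(value * steps / 255)    # nearest tone index
    return index * 255 // steps           # back to the 0-255 scale


def raw_size_bytes(width, height, bits):
    """Uncompressed size of a width x height image at `bits` per pixel."""
    return (width * height * bits + 7) // 8


if __name__ == "__main__":
    tones = sorted({quantize(v) for v in range(256)})
    print(tones)                          # the gray tones that survive
    # Hypothetical page of 4800 x 6600 pixels, 8-bit versus 2-bit.
    print(raw_size_bytes(4800, 6600, 8), raw_size_bytes(4800, 6600, 2))
```

Before compression, the 2-bit version needs a quarter of the bytes of the 8-bit grayscale one, which is where most of the size saving comes from.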