<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: Scanning to PDF in Linux</title>
	<atom:link href="http://quoderat.megginson.com/2006/01/26/scanning-to-pdf-in-linux/feed/" rel="self" type="application/rss+xml" />
	<link>http://quoderat.megginson.com/2006/01/26/scanning-to-pdf-in-linux/</link>
	<description>Open information and technology.</description>
	<lastBuildDate>Mon, 05 Dec 2011 09:40:58 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: david</title>
		<link>http://quoderat.megginson.com/2006/01/26/scanning-to-pdf-in-linux/#comment-394</link>
		<dc:creator><![CDATA[david]]></dc:creator>
		<pubDate>Thu, 05 Apr 2007 15:23:54 +0000</pubDate>
		<guid isPermaLink="false">http://www.megginson.com/blogs/quoderat/archives/2006/01/26/scanning-to-pdf-in-linux/#comment-394</guid>
		<description><![CDATA[I scanned a one-page document in greyscale at 300 dpi using xsane&#039;s multipage support and ended up with a 2.6 MB PDF file.  The second time, I saved from xsane in Postscript format, then used ps2pdf to produce a PDF, and ended up just with a 0.5 MB file.  I&#039;ll try gscan2pdf.]]></description>
		<content:encoded><![CDATA[<p>I scanned a one-page document in greyscale at 300 dpi using xsane&#8217;s multipage support and ended up with a 2.6 MB PDF file.  The second time, I saved from xsane in Postscript format, then used ps2pdf to produce a PDF, and ended up just with a 0.5 MB file.  I&#8217;ll try gscan2pdf.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: A. Pater</title>
		<link>http://quoderat.megginson.com/2006/01/26/scanning-to-pdf-in-linux/#comment-393</link>
		<dc:creator><![CDATA[A. Pater]]></dc:creator>
		<pubDate>Tue, 20 Mar 2007 19:33:59 +0000</pubDate>
		<guid isPermaLink="false">http://www.megginson.com/blogs/quoderat/archives/2006/01/26/scanning-to-pdf-in-linux/#comment-393</guid>
		<description><![CDATA[I second Luke O&#039;Connell.
It&#039;s a little strange that there&#039;s no Open Source or Freeware program under POSIX that produces PDF&#039;s with a hidden (OCR&#039;ed) text layer over the scanned images. It can&#039;t be that hard to program such a thing. But that is only my uneducated guess. Since it is possible (e.g. with the outstanding Jarnal program - a JAVA based app) to add a visible text layer to a PDF under POSIX systems.
The author of gscan2pdf says he wants to add such a function to his app in the future. Hope he will be able to do just that!]]></description>
		<content:encoded><![CDATA[<p>I second Luke O&#8217;Connell.<br />
It&#8217;s a little strange that there&#8217;s no Open Source or Freeware program under POSIX that produces PDF&#8217;s with a hidden (OCR&#8217;ed) text layer over the scanned images. It can&#8217;t be that hard to program such a thing. But that is only my uneducated guess. Since it is possible (e.g. with the outstanding Jarnal program &#8211; a JAVA based app) to add a visible text layer to a PDF under POSIX systems.<br />
The author of gscan2pdf says he wants to add such a function to his app in the future. Hope he will be able to do just that!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Gary Thompson</title>
		<link>http://quoderat.megginson.com/2006/01/26/scanning-to-pdf-in-linux/#comment-392</link>
		<dc:creator><![CDATA[Gary Thompson]]></dc:creator>
		<pubDate>Fri, 23 Feb 2007 12:43:46 +0000</pubDate>
		<guid isPermaLink="false">http://www.megginson.com/blogs/quoderat/archives/2006/01/26/scanning-to-pdf-in-linux/#comment-392</guid>
		<description><![CDATA[I&#039;ve just discovered gscan2pdf and am very pleased with it.  I recommend it if you&#039;re tired of the bloated zlib compressed PDF&#039;s created by Xsane.]]></description>
		<content:encoded><![CDATA[<p>I&#8217;ve just discovered gscan2pdf and am very pleased with it.  I recommend it if you&#8217;re tired of the bloated zlib compressed PDF&#8217;s created by Xsane.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Luke O'Connell</title>
		<link>http://quoderat.megginson.com/2006/01/26/scanning-to-pdf-in-linux/#comment-391</link>
		<dc:creator><![CDATA[Luke O'Connell]]></dc:creator>
		<pubDate>Tue, 02 Jan 2007 14:05:13 +0000</pubDate>
		<guid isPermaLink="false">http://www.megginson.com/blogs/quoderat/archives/2006/01/26/scanning-to-pdf-in-linux/#comment-391</guid>
		<description><![CDATA[Some interesting information on this page. I am STILL trying to find a way to create searchable PDFs under Linux. Like the author, I scan all my paperwork and rely on that scan being searchable. I have looked high and low but to no avail, there just does not seem to be a way... unless you are ready to pay hundreds for a corporate solution (there are a few out there). Pretty much the last dependency between Redmond and I.]]></description>
		<content:encoded><![CDATA[<p>Some interesting information on this page. I am STILL trying to find a way to create searchable PDFs under Linux. Like the author, I scan all my paperwork and rely on that scan being searchable. I have looked high and low but to no avail, there just does not seem to be a way&#8230; unless you are ready to pay hundreds for a corporate solution (there are a few out there). Pretty much the last dependency between Redmond and I.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Felix Lechner</title>
		<link>http://quoderat.megginson.com/2006/01/26/scanning-to-pdf-in-linux/#comment-390</link>
		<dc:creator><![CDATA[Felix Lechner]]></dc:creator>
		<pubDate>Sat, 02 Dec 2006 21:26:15 +0000</pubDate>
		<guid isPermaLink="false">http://www.megginson.com/blogs/quoderat/archives/2006/01/26/scanning-to-pdf-in-linux/#comment-390</guid>
		<description><![CDATA[Hi there, just thought i would post my script here, too.  It works well for me.  I keep no paper records anymore at home or work.  Have a second script &quot;scanpdf-a4&quot; with the scan area adjusted for A4.  Make sure to change your SANE device in the script.  Sincerely, Felix

#!/bin/sh

device=&quot;genesys:libusb:004:005&quot;
tmpdir=&quot;/tmp&quot;
basename=&quot;scanpdf&quot;
counter=0
color=0
scanarea=&quot;-l 0 -t 0 -x 216 -y 279&quot;     # US letter
#scanarea=&quot;-l 0 -t 0 -x 210 -y 297&quot;     # A4

if [ ! -d &quot;$tmpdir&quot; ]
then
        echo &quot;Need existing tempdir at $tmpdir.&quot;
        exit
fi

until [ $counter -eq -1 ]
do
        echo -n &quot;Black &amp; White/Color/Done [B/c/d]? &quot;
        read answer
        if [ -z &quot;$answer&quot; -o &quot;$answer&quot; = &quot;b&quot; -o &quot;$answer&quot; = &quot;B&quot; ]
        then
                color=0
        elif [ &quot;$answer&quot; = &quot;c&quot; -o &quot;$answer&quot; = &quot;C&quot; ]
        then
                color=1
        elif [ &quot;$answer&quot; = &quot;d&quot; -o &quot;$answer&quot; = &quot;D&quot; ]
        then
                break
        else
                continue
        fi
        counter=`expr $counter + 1`
        echo &quot;Scanning page  $counter&quot;
        filename=&quot;$basename`echo $counter &#124; awk &#039;{ printf &quot;-%03d&quot;, $1 }&#039;`&quot;
        if [ $color -eq 1 ]
        then
                mode=&quot;--mode Color&quot;
                resolution=&quot;--resolution 150&quot;
        else
                mode=&quot;--mode Gray&quot;
                resolution=&quot;--resolution 150&quot;
        fi
        scanimage $mode $resolution $scanarea &gt; &quot;$tmpdir&quot;/&quot;$filename&quot;.pnb
        convert -quality 50 &quot;$tmpdir&quot;/&quot;$filename&quot;.pnb &quot;$tmpdir&quot;/&quot;$filename&quot;.jpg
        rm &quot;$tmpdir&quot;/&quot;$filename&quot;.pnb
done

if [ $counter -eq 0 ]
then
        echo &quot;Nothing scanned.&quot;
        exit
fi

savein=&quot;&quot;
while [ -z &quot;$savein&quot; ]
do
        echo -n &quot;Please name the PDF file to save in:  &quot;
        read savein
done

convert &quot;$tmpdir&quot;/$basename*.jpg &quot;$savein&quot;.pdf
rm &quot;$tmpdir&quot;/$basename*.jpg
acroread &quot;$savein&quot;.pdf &gt;&amp; /dev/null &amp;
echo Successfully scanned $counter pages and created &quot;$savein&quot;.pdf.]]></description>
		<content:encoded><![CDATA[<p>Hi there, just thought i would post my script here, too.  It works well for me.  I keep no paper records anymore at home or work.  Have a second script &#8220;scanpdf-a4&#8243; with the scan area adjusted for A4.  Make sure to change your SANE device in the script.  Sincerely, Felix</p>
<p>#!/bin/sh</p>
<p>device=&#8221;genesys:libusb:004:005&#8243;<br />
tmpdir=&#8221;/tmp&#8221;<br />
basename=&#8221;scanpdf&#8221;<br />
counter=0<br />
color=0<br />
scanarea=&#8221;-l 0 -t 0 -x 216 -y 279&#8243;     # US letter<br />
#scanarea=&#8221;-l 0 -t 0 -x 210 -y 297&#8243;     # A4</p>
<p>if [ ! -d "$tmpdir" ]<br />
then<br />
        echo &#8220;Need existing tempdir at $tmpdir.&#8221;<br />
        exit<br />
fi</p>
<p>until [ $counter -eq -1 ]<br />
do<br />
        echo -n &#8220;Black &amp; White/Color/Done [B/c/d]? &#8221;<br />
        read answer<br />
        if [ -z "$answer" -o "$answer" = "b" -o "$answer" = "B" ]<br />
        then<br />
                color=0<br />
        elif [ "$answer" = "c" -o "$answer" = "C" ]<br />
        then<br />
                color=1<br />
        elif [ "$answer" = "d" -o "$answer" = "D" ]<br />
        then<br />
                break<br />
        else<br />
                continue<br />
        fi<br />
        counter=`expr $counter + 1`<br />
        echo &#8220;Scanning page  $counter&#8221;<br />
        filename=&#8221;$basename`echo $counter | awk &#8216;{ printf &#8220;-%03d&#8221;, $1 }&#8217;`&#8221;<br />
        if [ $color -eq 1 ]<br />
        then<br />
                mode=&#8221;&#8211;mode Color&#8221;<br />
                resolution=&#8221;&#8211;resolution 150&#8243;<br />
        else<br />
                mode=&#8221;&#8211;mode Gray&#8221;<br />
                resolution=&#8221;&#8211;resolution 150&#8243;<br />
        fi<br />
        scanimage $mode $resolution $scanarea &gt; &#8220;$tmpdir&#8221;/&#8221;$filename&#8221;.pnb<br />
        convert -quality 50 &#8220;$tmpdir&#8221;/&#8221;$filename&#8221;.pnb &#8220;$tmpdir&#8221;/&#8221;$filename&#8221;.jpg<br />
        rm &#8220;$tmpdir&#8221;/&#8221;$filename&#8221;.pnb<br />
done</p>
<p>if [ $counter -eq 0 ]<br />
then<br />
        echo &#8220;Nothing scanned.&#8221;<br />
        exit<br />
fi</p>
<p>savein=&#8221;"<br />
while [ -z "$savein" ]<br />
do<br />
        echo -n &#8220;Please name the PDF file to save in:  &#8221;<br />
        read savein<br />
done</p>
<p>convert &#8220;$tmpdir&#8221;/$basename*.jpg &#8220;$savein&#8221;.pdf<br />
rm &#8220;$tmpdir&#8221;/$basename*.jpg<br />
acroread &#8220;$savein&#8221;.pdf &gt;&amp; /dev/null &amp;<br />
echo Successfully scanned $counter pages and created &#8220;$savein&#8221;.pdf.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Gabriel Richards</title>
		<link>http://quoderat.megginson.com/2006/01/26/scanning-to-pdf-in-linux/#comment-389</link>
		<dc:creator><![CDATA[Gabriel Richards]]></dc:creator>
		<pubDate>Sat, 28 Oct 2006 04:30:18 +0000</pubDate>
		<guid isPermaLink="false">http://www.megginson.com/blogs/quoderat/archives/2006/01/26/scanning-to-pdf-in-linux/#comment-389</guid>
		<description><![CDATA[Hey y&#039;all, googling &quot;linux scanimage to pdf&quot; here.

Problem, # scanimage --format tiff &gt; scan.tiff // is yielding

scanimage: unrecognized option `--format&#039;

Even though it&#039;s documented in the man page.

Running RHEL v.4.

scanimage -format tiff &gt; scan.tiff yields a file named scan.tiff with &quot;ormat&quot; in it.

Any ideas?

TY
G]]></description>
		<content:encoded><![CDATA[<p>Hey y&#8217;all, googling &#8220;linux scanimage to pdf&#8221; here.</p>
<p>Problem, # scanimage &#8211;format tiff &gt; scan.tiff // is yielding</p>
<p>scanimage: unrecognized option `&#8211;format&#8217;</p>
<p>Even though it&#8217;s documented in the man page.</p>
<p>Running RHEL v.4.</p>
<p>scanimage -format tiff &gt; scan.tiff yields a file named scan.tiff with &#8220;ormat&#8221; in it.</p>
<p>Any ideas?</p>
<p>TY<br />
G</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jeffrey Ratcliffe</title>
		<link>http://quoderat.megginson.com/2006/01/26/scanning-to-pdf-in-linux/#comment-388</link>
		<dc:creator><![CDATA[Jeffrey Ratcliffe]]></dc:creator>
		<pubDate>Sun, 03 Sep 2006 19:10:09 +0000</pubDate>
		<guid isPermaLink="false">http://www.megginson.com/blogs/quoderat/archives/2006/01/26/scanning-to-pdf-in-linux/#comment-388</guid>
		<description><![CDATA[Try sourceforge.net/projects/gscan2pdf]]></description>
		<content:encoded><![CDATA[<p>Try sourceforge.net/projects/gscan2pdf</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Gaurav Goyal</title>
		<link>http://quoderat.megginson.com/2006/01/26/scanning-to-pdf-in-linux/#comment-387</link>
		<dc:creator><![CDATA[Gaurav Goyal]]></dc:creator>
		<pubDate>Sat, 02 Sep 2006 14:25:05 +0000</pubDate>
		<guid isPermaLink="false">http://www.megginson.com/blogs/quoderat/archives/2006/01/26/scanning-to-pdf-in-linux/#comment-387</guid>
		<description><![CDATA[You can look at PDF Studio (http://www.qoppa.com/psindex.html). Its a commercial Java based solution.]]></description>
		<content:encoded><![CDATA[<p>You can look at PDF Studio (<a href="http://www.qoppa.com/psindex.html" rel="nofollow">http://www.qoppa.com/psindex.html</a>). Its a commercial Java based solution.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: gordon</title>
		<link>http://quoderat.megginson.com/2006/01/26/scanning-to-pdf-in-linux/#comment-386</link>
		<dc:creator><![CDATA[gordon]]></dc:creator>
		<pubDate>Fri, 01 Sep 2006 07:01:19 +0000</pubDate>
		<guid isPermaLink="false">http://www.megginson.com/blogs/quoderat/archives/2006/01/26/scanning-to-pdf-in-linux/#comment-386</guid>
		<description><![CDATA[While I am grateful to the contributors for this assistance, I am also struck by the absurd nature of the process involved.  One can do this in a single step for any large document (without having to fiddle with ADF settings) using a number of programs on either Windows or OS X.  The obvious route is to pay for a copy of Acrobat, which is a lot cheaper (for a business) than going through the experimentation etc necessary to automate a procedure of this nature.  There are other cheaper programs available that will do the same thing without requiring all of the activation nonsense that Paperport and other programs based on the Zeon pdf printer driver require.

Further, I have found that editing pdf files in anything more than the most primitive way is also difficult on Linux system. On top of that, there is the problem of getting programs to print directly to pdf files outside the kdeprinter system, which is trivial in Windows or OS X.  It can be done but the messing around with cups-pdf required is obscure and unfriendly for any but experienced users (and the resulting installation gets messed up on every upgrade).

It is worth bearing in mind that pdf files are becoming the standard document format for lots of business use because they don&#039;t contain the kind of compromising information that gets retained with Word and other files.  Sadly, the very primitive support for scanning, creating and manipulating pdf files is a very large barrier to the adoption of Linux as a desktop system for serious business use - and I have tried and use 4-5 different distributions including SuSE which is the best for this purpose in my experience.]]></description>
		<content:encoded><![CDATA[<p>While I am grateful to the contributors for this assistance, I am also struck by the absurd nature of the process involved.  One can do this in a single step for any large document (without having to fiddle with ADF settings) using a number of programs on either Windows or OS X.  The obvious route is to pay for a copy of Acrobat, which is a lot cheaper (for a business) than going through the experimentation etc necessary to automate a procedure of this nature.  There are other cheaper programs available that will do the same thing without requiring all of the activation nonsense that Paperport and other programs based on the Zeon pdf printer driver require.</p>
<p>Further, I have found that editing pdf files in anything more than the most primitive way is also difficult on Linux system. On top of that, there is the problem of getting programs to print directly to pdf files outside the kdeprinter system, which is trivial in Windows or OS X.  It can be done but the messing around with cups-pdf required is obscure and unfriendly for any but experienced users (and the resulting installation gets messed up on every upgrade).</p>
<p>It is worth bearing in mind that pdf files are becoming the standard document format for lots of business use because they don&#8217;t contain the kind of compromising information that gets retained with Word and other files.  Sadly, the very primitive support for scanning, creating and manipulating pdf files is a very large barrier to the adoption of Linux as a desktop system for serious business use &#8211; and I have tried and use 4-5 different distributions including SuSE which is the best for this purpose in my experience.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: matt avila</title>
		<link>http://quoderat.megginson.com/2006/01/26/scanning-to-pdf-in-linux/#comment-385</link>
		<dc:creator><![CDATA[matt avila]]></dc:creator>
		<pubDate>Sat, 26 Aug 2006 23:49:49 +0000</pubDate>
		<guid isPermaLink="false">http://www.megginson.com/blogs/quoderat/archives/2006/01/26/scanning-to-pdf-in-linux/#comment-385</guid>
		<description><![CDATA[Thanks to all for the comments and suggestions. I&#039;m running SANE 0.991 on SUSE and need to scan, convert, and save &quot;small as practical&quot; pdf&#039;s. While SANE has the capability to scan multiple page docs into multiple page PDF&#039;s - the filesizes are really huge!

Running through convert (after scanning muilti pages into *.pnm files) and then running that output through ps2pdf yields some relatively small PDF files that are VERY usable. I dumped Redmonds bloatware about a month ago and am attempting to replace the functionality of PaperPort. I typically scan, OCR, and save PDF&#039;s of all the paperwork that comes through the mail on a weekly basis - this can grow quite large (in paper) and doing this allows me to quickly search, retrieve, and also have a continuity plan should the house burn to the ground (make a second backup of the files). Worst case is I loose a month or so of documentation - the most recent is always easiest to replace.

So from what my testing shows is scan with SANE, use the project files to run through convert, then run that output through ps2pdf. This cuts the filesize by about 80% or better... very handy when your working with 50+ page documents.]]></description>
		<content:encoded><![CDATA[<p>Thanks to all for the comments and suggestions. I&#8217;m running SANE 0.991 on SUSE and need to scan, convert, and save &#8220;small as practical&#8221; pdf&#8217;s. While SANE has the capability to scan multiple page docs into multiple page PDF&#8217;s &#8211; the filesizes are really huge!</p>
<p>Running through convert (after scanning muilti pages into *.pnm files) and then running that output through ps2pdf yields some relatively small PDF files that are VERY usable. I dumped Redmonds bloatware about a month ago and am attempting to replace the functionality of PaperPort. I typically scan, OCR, and save PDF&#8217;s of all the paperwork that comes through the mail on a weekly basis &#8211; this can grow quite large (in paper) and doing this allows me to quickly search, retrieve, and also have a continuity plan should the house burn to the ground (make a second backup of the files). Worst case is I loose a month or so of documentation &#8211; the most recent is always easiest to replace.</p>
<p>So from what my testing shows is scan with SANE, use the project files to run through convert, then run that output through ps2pdf. This cuts the filesize by about 80% or better&#8230; very handy when your working with 50+ page documents.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mac Jones</title>
		<link>http://quoderat.megginson.com/2006/01/26/scanning-to-pdf-in-linux/#comment-384</link>
		<dc:creator><![CDATA[Mac Jones]]></dc:creator>
		<pubDate>Mon, 29 May 2006 11:01:34 +0000</pubDate>
		<guid isPermaLink="false">http://www.megginson.com/blogs/quoderat/archives/2006/01/26/scanning-to-pdf-in-linux/#comment-384</guid>
		<description><![CDATA[Here&#039;s my graphical script that&#039;s grown over time, the best thing about it is that tools like beagle can find the pdf&#039;s from the meta data you enter ! Needs pdftk and zenity apps.

Cheers
Mac
New Zealand


#!/bin/sh

#scan a batch
#mode Color or Gray on end of next line if needed
#echo &quot;Starting !&quot;

colour=`zenity  --list  --radiolist  --column &quot;-&quot; --column &quot;Scan&quot; TRUE Gray FALS E Color`

LIMIT=10
a=0
cont=1

until [ $cont -eq &quot;0&quot; ]
do
  echo -n &quot;$a &quot;
  let &quot;a+=1&quot;
 if zenity --question --text &quot;OK to scan a page, Cancel to finish, Page=$a&quot;
 then
        cont=1
        scanimage --format pnm --resolution 150 --mode $colour &gt; &quot;$a.pnm&quot;
else
          cont=0
fi

done           # No surprises, so far.


echo &quot;.&quot;
#convert the raw file to postscript
convert -density 150 *.pnm out.ps &#124; zenity --progress --auto-close

#convert the postscript to pdf
ps2pdf out.ps out.pdf &#124; zenity --progress --auto-close

#remove raw scan files
rm *.pnm

#remove old ps files
rm out.ps

#add the metadata
echo -e &quot;\a&quot;
#echo &quot;Please enter a name for the PDF file (** no .pdf on end)&quot;
nm=`zenity --entry --text &quot;Enter file name, (no .pdf on the end)&quot;`
#echo &quot;Please enter Metadata for searching&quot;
meta=`zenity --entry --text &quot;Meta data for searching&quot; --entry-text=$nm`

echo &quot;InfoKey: Producer&quot; &gt; tmp
echo &quot;InfoValue: $meta&quot; &gt;&gt; tmp
echo &quot;InfoKey: Keywords&quot; &gt;&gt; tmp
echo &quot;InfoValue: $meta&quot; &gt;&gt; tmp
echo &quot;InfoKey: Title&quot; &gt;&gt; tmp
echo &quot;InfoValue: $nm&quot; &gt;&gt; tmp

#update the metadata
pdftk out.pdf update_info tmp output &quot;$nm.pdf&quot;

#rm metadata file and pdf
rm tmp
rm out.pdf

zenity --info --text=&quot;All done, $nm.pdf is ready!&quot;]]></description>
		<content:encoded><![CDATA[<p>Here&#8217;s my graphical script that&#8217;s grown over time, the best thing about it is that tools like beagle can find the pdf&#8217;s from the meta data you enter ! Needs pdftk and zenity apps.</p>
<p>Cheers<br />
Mac<br />
New Zealand</p>
<p>#!/bin/sh</p>
<p>#scan a batch<br />
#mode Color or Gray on end of next line if needed<br />
#echo &#8220;Starting !&#8221;</p>
<p>colour=`zenity  &#8211;list  &#8211;radiolist  &#8211;column &#8220;-&#8221; &#8211;column &#8220;Scan&#8221; TRUE Gray FALS E Color`</p>
<p>LIMIT=10<br />
a=0<br />
cont=1</p>
<p>until [ $cont -eq "0" ]<br />
do<br />
  echo -n &#8220;$a &#8221;<br />
  let &#8220;a+=1&#8243;<br />
 if zenity &#8211;question &#8211;text &#8220;OK to scan a page, Cancel to finish, Page=$a&#8221;<br />
 then<br />
        cont=1<br />
        scanimage &#8211;format pnm &#8211;resolution 150 &#8211;mode $colour &gt; &#8220;$a.pnm&#8221;<br />
else<br />
          cont=0<br />
fi</p>
<p>done           # No surprises, so far.</p>
<p>echo &#8220;.&#8221;<br />
#convert the raw file to postscript<br />
convert -density 150 *.pnm out.ps | zenity &#8211;progress &#8211;auto-close</p>
<p>#convert the postscript to pdf<br />
ps2pdf out.ps out.pdf | zenity &#8211;progress &#8211;auto-close</p>
<p>#remove raw scan files<br />
rm *.pnm</p>
<p>#remove old ps files<br />
rm out.ps</p>
<p>#add the metadata<br />
echo -e &#8220;\a&#8221;<br />
#echo &#8220;Please enter a name for the PDF file (** no .pdf on end)&#8221;<br />
nm=`zenity &#8211;entry &#8211;text &#8220;Enter file name, (no .pdf on the end)&#8221;`<br />
#echo &#8220;Please enter Metadata for searching&#8221;<br />
meta=`zenity &#8211;entry &#8211;text &#8220;Meta data for searching&#8221; &#8211;entry-text=$nm`</p>
<p>echo &#8220;InfoKey: Producer&#8221; &gt; tmp<br />
echo &#8220;InfoValue: $meta&#8221; &gt;&gt; tmp<br />
echo &#8220;InfoKey: Keywords&#8221; &gt;&gt; tmp<br />
echo &#8220;InfoValue: $meta&#8221; &gt;&gt; tmp<br />
echo &#8220;InfoKey: Title&#8221; &gt;&gt; tmp<br />
echo &#8220;InfoValue: $nm&#8221; &gt;&gt; tmp</p>
<p>#update the metadata<br />
pdftk out.pdf update_info tmp output &#8220;$nm.pdf&#8221;</p>
<p>#rm metadata file and pdf<br />
rm tmp<br />
rm out.pdf</p>
<p>zenity &#8211;info &#8211;text=&#8221;All done, $nm.pdf is ready!&#8221;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Christoph Lange</title>
		<link>http://quoderat.megginson.com/2006/01/26/scanning-to-pdf-in-linux/#comment-383</link>
		<dc:creator><![CDATA[Christoph Lange]]></dc:creator>
		<pubDate>Thu, 11 May 2006 09:02:08 +0000</pubDate>
		<guid isPermaLink="false">http://www.megginson.com/blogs/quoderat/archives/2006/01/26/scanning-to-pdf-in-linux/#comment-383</guid>
		<description><![CDATA[Here&#039;s a nice German tutorial on how to automate the whole thing (with scanimage, as mentioned above): http://www.pro-linux.de/t_office/dokumente-in-pdf-scannen.html -- The script is in English :-)

I got, however, the best results (much work but small files) by scanning the pages (in my case: from a magazine) to Gimp in a high resolution (e.g. 360 dpi), then scaling them down so that the text was still readable, then converting them to an indexed format (instead of RGB) and saving them as PNG (best format for images with a small number of different colors). Of course, you have to experiment with convert -density after scaling the images...]]></description>
		<content:encoded><![CDATA[<p>Here&#8217;s a nice German tutorial on how to automate the whole thing (with scanimage, as mentioned above): <a href="http://www.pro-linux.de/t_office/dokumente-in-pdf-scannen.html" rel="nofollow">http://www.pro-linux.de/t_office/dokumente-in-pdf-scannen.html</a> &#8212; The script is in English <img src='http://s0.wp.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
<p>I got, however, the best results (much work but small files) by scanning the pages (in my case: from a magazine) to Gimp in a high resolution (e.g. 360 dpi), then scaling them down so that the text was still readable, then converting them to an indexed format (instead of RGB) and saving them as PNG (best format for images with a small number of different colors). Of course, you have to experiment with convert -density after scaling the images&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Naranek</title>
		<link>http://quoderat.megginson.com/2006/01/26/scanning-to-pdf-in-linux/#comment-382</link>
		<dc:creator><![CDATA[Naranek]]></dc:creator>
		<pubDate>Tue, 11 Apr 2006 17:23:13 +0000</pubDate>
		<guid isPermaLink="false">http://www.megginson.com/blogs/quoderat/archives/2006/01/26/scanning-to-pdf-in-linux/#comment-382</guid>
		<description><![CDATA[Thanks for a great article. This is just what I needed. I changed --batch-prompt to --batch-count=1 so it automatically scans only one page at a time. Had some problems with the ps2pdf cropping my A4 scan, but that was solved with -sPAPERSIZE=a4 switch.]]></description>
		<content:encoded><![CDATA[<p>Thanks for a great article. This is just what I needed. I changed &#8211;batch-prompt to &#8211;batch-count=1 so it automatically scans only one page at a time. Had some problems with the ps2pdf cropping my A4 scan, but that was solved with -sPAPERSIZE=a4 switch.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: mac jones</title>
		<link>http://quoderat.megginson.com/2006/01/26/scanning-to-pdf-in-linux/#comment-381</link>
		<dc:creator><![CDATA[mac jones]]></dc:creator>
		<pubDate>Fri, 24 Mar 2006 03:51:55 +0000</pubDate>
		<guid isPermaLink="false">http://www.megginson.com/blogs/quoderat/archives/2006/01/26/scanning-to-pdf-in-linux/#comment-381</guid>
		<description><![CDATA[This script worked better for me.....
note the &quot;--&quot; or double hyphen for the scanimage options

#!/bin/sh

#scan a batch
scanimage --format pnm --batch --batch-prompt --resolution 150 --mode Gray

#convert the raw file to postscript
convert -density 150 *.pnm out.ps

#convert the postscript to pdf
ps2pdf out.ps out.pdf

#remove raw scan files
rm *.pnm

#remove old ps files
rm out.ps]]></description>
		<content:encoded><![CDATA[<p>This script worked better for me&#8230;..<br />
note the &#8220;&#8211;&#8221; or double hyphen for the scanimage options</p>
<p>#!/bin/sh</p>
<p>#scan a batch<br />
scanimage &#8211;format pnm &#8211;batch &#8211;batch-prompt &#8211;resolution 150 &#8211;mode Gray</p>
<p>#convert the raw file to postscript<br />
convert -density 150 *.pnm out.ps</p>
<p>#convert the postscript to pdf<br />
ps2pdf out.ps out.pdf</p>
<p>#remove raw scan files<br />
rm *.pnm</p>
<p>#remove old ps files<br />
rm out.ps</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Harout Hedeshian</title>
		<link>http://quoderat.megginson.com/2006/01/26/scanning-to-pdf-in-linux/#comment-380</link>
		<dc:creator><![CDATA[Harout Hedeshian]]></dc:creator>
		<pubDate>Tue, 21 Mar 2006 03:15:16 +0000</pubDate>
		<guid isPermaLink="false">http://www.megginson.com/blogs/quoderat/archives/2006/01/26/scanning-to-pdf-in-linux/#comment-380</guid>
		<description><![CDATA[I find that using the imagescan command for the first step works the best. It automatically outputs into PDF format.
A typical scan would look like this:

scanimage --format tiff --batch --batch-double --resolution 200 --source ADF --mode Grayscale --batch-start=3
convert -density 200 *.tif output.ps
ps2pdf output.ps output.pdf

One could potentially write a script that would automate the whole thing]]></description>
		<content:encoded><![CDATA[<p>I find that using the imagescan command for the first step works the best. It automatically outputs into PDF format.<br />
A typical scan would look like this:</p>
<p>scanimage &#8211;format tiff &#8211;batch &#8211;batch-double &#8211;resolution 200 &#8211;source ADF &#8211;mode Grayscale &#8211;batch-start=3<br />
convert -density 200 *.tif output.ps<br />
ps2pdf output.ps output.pdf</p>
<p>One could potentially write a script that would automate the whole thing</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nils Halvard Lunde</title>
		<link>http://quoderat.megginson.com/2006/01/26/scanning-to-pdf-in-linux/#comment-379</link>
		<dc:creator><![CDATA[Nils Halvard Lunde]]></dc:creator>
		<pubDate>Wed, 15 Mar 2006 20:33:07 +0000</pubDate>
		<guid isPermaLink="false">http://www.megginson.com/blogs/quoderat/archives/2006/01/26/scanning-to-pdf-in-linux/#comment-379</guid>
		<description><![CDATA[thankx, folks. googling for &quot;linux scanning pdf&quot; (without the quotes, of course)
brought up this article, and it was just what i needed!
i used xsane under fedora5 (just typing xsane at the command line)
with an old hp scsi scanner, and it worked with almost no problems.

and because i scanned line arts (b/w text only pages), it also scanned blazingly fast.

actually, i worked around one minor problem:
after scanning 6 pages, i pressed view image in the project dialog for one of the images,
and xsane stopped abruptpy. however, i just restarted xsane, selected multipage agan, wrote the
exact same project name in the project name input dialog, and the image file list showed up again.
i could not create the pdf file immediately, but then i scanned one page,
then deleted it, and now i could save all pages as a pdf file.
nice to know if xsane freaks out after you have scanned several pages.

you made my day, people!

-- nils halvard smallsoft com]]></description>
		<content:encoded><![CDATA[<p>thankx, folks. googling for &#8220;linux scanning pdf&#8221; (without the quotes, of course)<br />
brought up this article, and it was just what i needed!<br />
i used xsane under fedora5 (just typing xsane at the command line)<br />
with an old hp scsi scanner, and it worked with almost no problems.</p>
<p>and because i scanned line arts (b/w text only pages), it also scanned blazingly fast.</p>
<p>actually, i worked around one minor problem:<br />
after scanning 6 pages, i pressed view image in the project dialog for one of the images,<br />
and xsane stopped abruptpy. however, i just restarted xsane, selected multipage agan, wrote the<br />
exact same project name in the project name input dialog, and the image file list showed up again.<br />
i could not create the pdf file immediately, but then i scanned one page,<br />
then deleted it, and now i could save all pages as a pdf file.<br />
nice to know if xsane freaks out after you have scanned several pages.</p>
<p>you made my day, people!</p>
<p>&#8211; nils halvard smallsoft com</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: david</title>
		<link>http://quoderat.megginson.com/2006/01/26/scanning-to-pdf-in-linux/#comment-378</link>
		<dc:creator><![CDATA[david]]></dc:creator>
		<pubDate>Tue, 14 Mar 2006 02:50:23 +0000</pubDate>
		<guid isPermaLink="false">http://www.megginson.com/blogs/quoderat/archives/2006/01/26/scanning-to-pdf-in-linux/#comment-378</guid>
		<description><![CDATA[Thanks, Benjamin -- I&#039;m going to try that.]]></description>
		<content:encoded><![CDATA[<p>Thanks, Benjamin &#8212; I&#8217;m going to try that.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Benjamin Kay</title>
		<link>http://quoderat.megginson.com/2006/01/26/scanning-to-pdf-in-linux/#comment-377</link>
		<dc:creator><![CDATA[Benjamin Kay]]></dc:creator>
		<pubDate>Tue, 14 Mar 2006 01:21:15 +0000</pubDate>
		<guid isPermaLink="false">http://www.megginson.com/blogs/quoderat/archives/2006/01/26/scanning-to-pdf-in-linux/#comment-377</guid>
		<description><![CDATA[I&#039;m using XSane 0.991 on Gentoo Linux. In the main XSane dialog there is a &quot;target&quot; icon that looks like a crosshairs, and there is a drop-down menu just to the right of that. Click on the drop down menu and chose &quot;Multipage&quot;.

A new dialog called &quot;xsane multipage project&quot; will appear. You may chose any name for your project; xsane will create a folder with that name to store temporary files in and eventually output a pdf file with that name.

Chose &quot;PDF&quot; under the &quot;Multipage document filetype&quot; drop-down menu. Then click on the &quot;Create project button&quot;. Now you can use the main XSane dialog to scan pages into the project one at a time by clicking scan (optionally, chose the paper size in the preview window). The multipage project dialog has buttons to edit and remove images from the project and to change what order they go in.

When you are satisfied with your project, click the &quot;Save multipage file&quot; button. This will produce a multipage PDF with the project&#039;s name. You can then click &quot;Delete project&quot; to get rid of the project within XSane and remove the temporary directory created by XSane. This will NOT delete the multipage PDF file. You can click the &quot;Create project&quot; button again to start over.

This essentially does the same thing as the process David outlined, only it does it entirely within the XSane GUI and probably doesn&#039;t need as many external applications.]]></description>
		<content:encoded><![CDATA[<p>I&#8217;m using XSane 0.991 on Gentoo Linux. In the main XSane dialog there is a &#8220;target&#8221; icon that looks like a crosshairs, and there is a drop-down menu just to the right of that. Click on the drop down menu and chose &#8220;Multipage&#8221;.</p>
<p>A new dialog called &#8220;xsane multipage project&#8221; will appear. You may chose any name for your project; xsane will create a folder with that name to store temporary files in and eventually output a pdf file with that name.</p>
<p>Chose &#8220;PDF&#8221; under the &#8220;Multipage document filetype&#8221; drop-down menu. Then click on the &#8220;Create project button&#8221;. Now you can use the main XSane dialog to scan pages into the project one at a time by clicking scan (optionally, chose the paper size in the preview window). The multipage project dialog has buttons to edit and remove images from the project and to change what order they go in.</p>
<p>When you are satisfied with your project, click the &#8220;Save multipage file&#8221; button. This will produce a multipage PDF with the project&#8217;s name. You can then click &#8220;Delete project&#8221; to get rid of the project within XSane and remove the temporary directory created by XSane. This will NOT delete the multipage PDF file. You can click the &#8220;Create project&#8221; button again to start over.</p>
<p>This essentially does the same thing as the process David outlined, only it does it entirely within the XSane GUI and probably doesn&#8217;t need as many external applications.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: david</title>
		<link>http://quoderat.megginson.com/2006/01/26/scanning-to-pdf-in-linux/#comment-376</link>
		<dc:creator><![CDATA[david]]></dc:creator>
		<pubDate>Tue, 28 Feb 2006 00:44:06 +0000</pubDate>
		<guid isPermaLink="false">http://www.megginson.com/blogs/quoderat/archives/2006/01/26/scanning-to-pdf-in-linux/#comment-376</guid>
		<description><![CDATA[When I skipped the Postscript step, I ended up with very large PDF files; going to PostScript, then using ps2pdf, created much smaller output (better compression?).]]></description>
		<content:encoded><![CDATA[<p>When I skipped the Postscript step, I ended up with very large PDF files; going to PostScript, then using ps2pdf, created much smaller output (better compression?).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: dave</title>
		<link>http://quoderat.megginson.com/2006/01/26/scanning-to-pdf-in-linux/#comment-375</link>
		<dc:creator><![CDATA[dave]]></dc:creator>
		<pubDate>Mon, 27 Feb 2006 23:24:48 +0000</pubDate>
		<guid isPermaLink="false">http://www.megginson.com/blogs/quoderat/archives/2006/01/26/scanning-to-pdf-in-linux/#comment-375</guid>
		<description><![CDATA[You don&#039;t actually need to perform the postscript step - ImageMagick&#039;s convert will convert directly to pdf and can allow choice of paper size, for example (using international paper sizes not the strange US ones):
convert -page A4 *.jpg out.pdf

One gotcha with convert though is its memory allocation - the defaults for memory expect that it has a whole Gig of pure memory to chew up. If it fails to allocate memory it will just die on you with no error message. To get around this, use the -limit flags:
convert -page A4 -limit memory 256 -limit map 512]]></description>
		<content:encoded><![CDATA[<p>You don&#8217;t actually need to perform the postscript step &#8211; ImageMagick&#8217;s convert will convert directly to pdf and can allow choice of paper size, for example (using international paper sizes not the strange US ones):<br />
convert -page A4 *.jpg out.pdf</p>
<p>One gotcha with convert though is its memory allocation &#8211; the defaults for memory expect that it has a whole Gig of pure memory to chew up. If it fails to allocate memory it will just die on you with no error message. To get around this, use the -limit flags:<br />
convert -page A4 -limit memory 256 -limit map 512</p>
]]></content:encoded>
	</item>
</channel>
</rss>

