[Web4lib] Scanner Recommendations?

Matt Russell matt at farrwood.com
Mon May 17 16:50:19 EDT 2010


One trick with using a digital camera for 'scanning' documents is 
getting the lighting consistent over the page.  This isn't hard with 
multiple lights (at least 2) positioned at least a few feet away and 
using an umbrella diffuser.  2 of these on tripods is less than $200  
(http://www.bhphotovideo.com/c/product/308806-REG/Impact_401495_Three_Floodlight_Kit_.html)

The internet archive uses prosumer-grade Canon DSLRs to digitize books 
in their 'scribe' system.  They shoot very fast.  Page-turning time is 
the limiting step in the process. 

Matt




Vernell Ward wrote:
> Matt,
>  
> Thanks for the explanation and clarification. I have made images of 
> individual pages for personal use a few years ago, but never thought 
> of using them for document transfer until now. We are looking at 
> options for doc delivery on our campus and your idea is interesting.
>  
> Vernell
>  
>  
> Vernell Ward
> Serials/Gov.Doc. Librarian
> OBU Box 61309
> 500 W. University
> Shawnee, OK 74804
> 405-878-2284
> vernell.ward at okbu.edu <mailto:vernell.ward at okbu.edu>
> "To ask why we need libraries at all, when there is so much 
> information available elsewhere,
> is about as sensible as asking if roadmaps are necessary now that 
> there are so very many roads."
>  
> ---Jon Bing, 2009. /American Libraries /
>
>
> >>> On Monday, May 17, 2010 at 3:32 PM, in message 
> <4BF1A7D6.90304 at farrwood.com>, Matt Russell <matt at farrwood.com> wrote:
> Hi Vernell,
>
> Whether you use a scanner or a digital camera the result is a bitmap 
> image of the page.  If you want to group them into a single multi-page 
> PDF then you just use a pdf creator (like adobe or  and tell it which 
> jpg (or tif) files you want to combine and in which order.  Scanner 
> software which automatically creates PDFs is just doing this combining 
> step for you.
>
> Of course, this results in a fat file, but only a little bit fatter 
> than the original image files.  And, the result isn't a searchable PDF 
> since the file only contains bitmaps.    In order to turn the scan or 
> digital camera image into a smaller, searchable PDF you'd have to 
> perform OCR on the image to convert it to text, then (usually) spend 
> time cleaning it up and restoring formatting information.  This is 
> normally a very tedious process, but it's the same whether you scan or 
> shoot with a digital camera. 
>
> Cheers,
> Matt
>
>
> Vernell Ward wrote:
>> Great idea, but I am puzzled by the details. How do you group the 
>> pages together for an article in a magazine or chapter in a book? Do 
>> you use Adobe or ...?
>>  
>>  
>> Vernell Ward
>> Serials/Gov.Doc. Librarian
>> OBU Box 61309
>> 500 W. University
>> Shawnee, OK 74804
>> 405-878-2284
>> vernell.ward at okbu.edu <mailto:vernell.ward at okbu.edu>
>> "To ask why we need libraries at all, when there is so much 
>> information available elsewhere,
>> is about as sensible as asking if roadmaps are necessary now that 
>> there are so very many roads."
>>  
>> ---Jon Bing, 2009. /American Libraries /
>>
>>
>> >>> On Friday, May 14, 2010 at 10:37 AM, in message 
>> <4BED6E2E.3080902 at farrwood.com>, Matt Russell <matt at farrwood.com> wrote:
>> We have found that, rather than paying the high price for an 11x17
>> scanner, we use a good digital camera for images more than 8.5x14. 
>> There are several advantages to this:
>>
>> * Cost - a reasonably good digital camera (like a Canon T2i for $900 
>> http://www.bhphotovideo.com/c/product/675618-REG/Canon_4462B003_EOS_Rebel_T2i_Digital.html) 
>>
>> has 18MP, which is over 5000 pixels in the long direction.  That
>> resolves a 17" image into over 300 dpi, which is just fine for all but
>> the most demanding archival work.  The eye cannot resolve 300dpi.    For
>> $300-$500 you could get a 12MP camera.  If you pay more than $500 you
>> will be able to capture in RAW format, which avoids compression and
>> color balance artifacts.  For really good quality and a flat field (eg.
>> no barrel distortion at the edges) you could pay $250 more for a good
>> 50mm macro lens.
>>
>> * Depth of field - A DSLR (digital single-lens-reflex) will allows a
>> much wider depth of field than a scanner, so that bound books can be
>> shot with the text in focus as it curves into the spine.  Also, textured
>> objects can be better captured.
>>
>> * Speed - Scanning 11x17 takes a LONG time.  Shooting it is a snap. 
>> Shooting a book is literally click, click, click.  Shooting artwork,
>> maps, etc. is so much quicker than scanning.
>>
>> * Versatility - Having a nice camera around can be handing for lots more
>> than just capturing large images.  The T2 also lets you shoot high def
>> video.
>>
>> For a scanner, we use the Epson V700, which is exceptionally good, works
>> well with Linux and XSANE and costs only $500.
>>
>>
>> /Matt Russell/
>> Lawrence History Center
>> www.lawrencehistory.org
>>
>>
>> Bob Rasmussen wrote:
>> > Apparently you haven't received a flood of responses, so I'll jump in.
>> > Here's what I know of the scanner market:
>> >
>> > 1) Almost all lower-cost scanners are limited to 8.5" width; 
>> usually 14"
>> > height. If you want 11x17" you're in a much more rarified market.
>> >
>> > 2) For 8.5x14", there are some low-cost auto-feed units, even with 
>> duplex
>> > (2-sided scanning). I like the HP Scanjet 5590.
>> >
>> > 3) If you want 11x17" and auto-feed, good luck!
>> >
>> > 4) Most if not all scanners will turn the light off.
>> >
>> > 5) Whether it scans to PDF depends entirely on the software that's
>> > included. Note that some software produces only image PDFs, which 
>> are not
>> > searchable. To search a PDF, something has to have performed Optical
>> > Character Recognition on it.
>> >
>> > I hope that helps.
>> >
>> > On Tue, 11 May 2010, Pruntel,Alison wrote:
>> >
>> >  
>> >> Hi - Apologies in advance to my posting to several listservs.
>> >>
>> >> Our public information coordinator is looking for a new flatbed 
>> scanner. My familiarity with scanners is basically zilch. She 
>> currently uses an HP Scanjet 8250 that is on its last legs. She scans 
>> images, etc., as well as written correspondence/documents (i.e., 
>> letters received by our director who wants an ecopy, etc.) . In an 
>> ideal world, she'd like a scanner that:
>> >>
>> >>
>> >> *         Scans black and white as well as color;
>> >>
>> >> *         Can scan to PDF;
>> >>
>> >> *         Creates clear/crisp photo scans;
>> >>
>> >> *         Accomodates documents up to 11 X 17;
>> >>
>> >> *         Autoloads documents (her current scanner is manual);
>> >>
>> >> *         Relatively fast and quiet;
>> >>
>> >> *         Comes with software, but don't necessarily need photo 
>> editing software;
>> >>
>> >> *         Doesn't keep lamp on all the time/is energy efficient;
>> >>
>> >> And did I mention that she also wants something cheap? I believe 
>> her budget is under $2K, but we'd look at the best option for us, 
>> would consider various price ranges.
>> >>
>> >> Thanks for any/all responses,
>> >>
>> >>
>> >> Alison Pruntel
>> >> Electronic Resources Librarian
>> >> Fauquier County Public Library
>> >> 11 Winchester Street
>> >> Warrenton, VA 20186
>> >> 540-349-2770 (voice)
>> >> 540-349-3278 (fax)
>> >> FCPL Web Site: http://library.fauquiercounty.gov
>> >> My Blog: http://fcpleresources.blogspot.com
>> >> New Stuff at FCPL: http://www.wowbrary.org/nu.aspx?p=2786
>> >>
>> >> _______________________________________________
>> >> Web4lib mailing list
>> >> Web4lib at webjunction.org
>> >> http://lists.webjunction.org/web4lib/
>> >>
>> >>
>> >>
>> >>    
>> >
>> > Regards,
>> > ....Bob Rasmussen,   President,   Rasmussen Software, Inc.
>> >
>> > personal e-mail: ras at anzio.com
>> >  company e-mail: rsi at anzio.com
>> >           voice: (US) 503-624-0360 (9:00-6:00 Pacific Time)
>> >             fax: (US) 503-624-0760
>> >             web: http://www.anzio.com
>> >  street address: Rasmussen Software, Inc.
>> >                  10240 SW Nimbus, Suite L9
>> >                  Portland, OR  97223  USA
>> >
>> >
>> > _______________________________________________
>> > Web4lib mailing list
>> > Web4lib at webjunction.org
>> > http://lists.webjunction.org/web4lib/
>> >
>> >  
>> _______________________________________________
>> Web4lib mailing list
>> Web4lib at webjunction.org
>> http://lists.webjunction.org/web4lib/
>>


More information about the Web4lib mailing list