Originally published Saturday, April 26, 2008 at 12:00 AM
Scanning world's every book means turning many, many pages
In a dimly lit backroom on the second level of the University of Michigan library's book-shelving department, Courtney Mitchel helped a...
The Associated Press
ANN ARBOR, Mich. — In a dimly lit backroom on the second level of the University of Michigan library's book-shelving department, Courtney Mitchel helped a giant desktop machine digest a rare, centuries-old Bible.
Mitchel is among hundreds of librarians from Minnesota to England making digital versions of the most fragile of the books to be included in Google's Book Search, a portal that will eventually lead users to all the estimated 50 million to 100 million books in the world.
The manual scanning — at up to 600 pages a day — is much slower than Google's regular process.
"It's monotonous," the 24-year-old said.
Then she knit her career hopes into the work.
"But it's still something that I'm learning about — how to interact with really old materials and working with digital imaging, which is relevant to art history."
The unusually tight binding on the early 16th-century polyglot Bible made it hard to expose the portions toward the book's middle as Mitchel spread each pair of pages for the scanner.
Google, the Internet's leader in search and advertising, says the process for scanning the majority of the books in Book Search is proprietary. Employees will not discuss it except to say it is much faster than what Mitchel is doing and it's not destructive.
"It took us quite a while to develop it so we do keep that confidential," said a library manager for Book Search, Ben Bunnell, who declined even to say where Google does the scanning.
Cutting costs
Funding from Google allows the 28 libraries it's working with to cut their digitizing costs because they don't have to pay for scanning the books Google wants to include in Book Search.
Through Book Search, users can track down a book on any topic and read a small portion. If the book's not protected by copyright, users can download the whole thing. If it is, or if they just want to read an original, they can use Book Search to find copies to buy or borrow.
![]()
More than 1 million rare or fragile books have been digitized through the Google-Michigan partnership since it began in 2004, with an estimated 6 million to go.
Book Search has the support of many publishers, authors and librarians. But some publishers and authors have sued, claiming the service violates their copyrights. Google says Book Search is aboveboard because Web surfers can retrieve only snippets of copyrighted material through the service.
Brewster Kahle, founder and digital librarian of the Internet Archive at the Open Content Alliance, said Google may be trying to "lock up the public domain" by making proprietary copies of works whose copyrights have expired — which includes the vast majority of the world's books.
Kahle said there's a core value in the project, but he questioned whether Google will share the works it digitizes with other search engines.
"We believe there should be many libraries, many publishers, many search engines, many types of users from different points of view," Kahle said.
John Price Wilkin, Michigan's associate university librarian, called Kahle's stance "theoretical."
"Our volumes are entirely open in the sense that people can find them, read them, use them, do all the things that they would do in scholarship or pleasure," Wilkin said.
In the room where Mitchel and colleague Chava Israel, an artist, work, the temperature is always in the 60s.
Each technician has a slightly angled table with a flexible middle that cradles books and holds them still while two overhead cameras photograph the pages. Sometimes the women play music or listen to news online, but they often work in silence, save the clicks of their computers and scanners.
Scanning software
Mitchel glides in a rolling chair forth and back between scanner and computer, computer and scanner, turning page upon page and clicking her mouse to shoot each pair. Once the images reach the computer, the women use the book-scanning software Omniscan from Germany's Zeutschel GmbH to clean them up.
A final click of the mouse sends each digitized book to Google for optical character-recognition processing, which makes the text searchable. Google then returns a copy of the images and data to the library and posts another to the Web.
Israel, 44, who has been scanning books for three years, takes a philosophical view of the project.
"My favorite part is working with older books and being able to preserve a lot of the knowledge and help bring more people access," Israel said. "I turn pages. It's kind of meditative."
Copyright © 2008 The Seattle Times Company
UPDATE - 09:46 AM
Exxon Mobil wins ruling in Alaska oil spill case
UPDATE - 09:32 AM
Bank stocks push indexes higher; oil prices dip
UPDATE - 08:04 AM
Ford CEO Mulally gets $56.5M in stock award
UPDATE - 07:54 AM
Underwater mortgages rise as home prices fall
NEW - 09:43 AM
Warner Bros. to offer movie rentals on Facebook

general classifieds
Garage & estate salesFurniture & home furnishings
Electronics
just listed
Adorable Bull Terrier puppies for good home...
AKC Great Dane Puppies Ready
AKC PAL/ILP Registered Labs
More listings
POST A FREE LISTING
- Lakewood cop accused of embezzling $150K meant for slain officers' families
- 3 big health insurers stockpile $2.4 billion as rates keep rising
- Council members get briefing on arena proposal, minus details
- Social worker recounts minutes before Powell fire
- Agency set to investigate handling of 911 call about Josh Powell
- Quick decisions: How Washington hired its new football staff
- Historic day for gay marriage as another fight looms
- Justin Wilcox's versatile defensive style is the right fit for Huskies | Jerry Brewer
- Washington men walloped by Oregon, 82-57
- It's Terrence Time: Enigmatic Ross leads Huskies
- Gay-marriage bill passes House, awaits Gregoire's signature
507 - Wanted in Seattle classrooms: more teachers of color
407 - AP Source: Obama to change birth control rule
386 - Council members get briefing on arena proposal, minus details
368 - Oregon live game thread
155 - Worker: Josh Powell told son he had 'surprise'
115 - Rough road again
108 - A few late-night notes
96 - USA Today further spells out how Mariners, handful of clubs next in line for huge cash windfall
76 - Marijuana legalization initiative set to go on Nov. ballot
74
- Wanted in Seattle classrooms: more teachers of color
- State Medicaid program to stop paying for unneeded ER visits
- 3 big health insurers stockpile $2.4 billion as rates keep rising
- Economy, blogs give survivalists new reason to look to Northwest
- Bellevue College adds a third bachelor's degree program
- State's share of mortgage settlement: $648 million
- Darren Berg gets 18-year sentence for Ponzi scheme
- One man's audacious pursuit of sailing history
- $25B settlement reached over foreclosure abuses
- 'Gauguin and Polynesia': dazzling mix-and-match | Art review




