Home | About | RSS Feed | Contact and Publicity Guidelines | Comment Policy the Law, the Universe, and Everything 

advertise-here4


Slip Opinions


Whatever happened to Henry Simons? (fp)

Wow -- that's some very scary poll results (kw)

The scarlet ankle bracelet. (fp)

Every good article should have one idea. (fp)

Family values in market turnover culture. (fp)

Banks really create value: probably $58 billion in overdraft fees & credit card penalties in 2009. (fp)

A Citizens United dream: Exxon could have deployed 10% of its 2008 profits to outspend every presidential and senatorial candidate that year. (fp)

Eternal Earth-Bound Pets promises to adopt your pet if you are raptured. (fp)

Habermas doesn't tweet, but does interview well. (fp)

Lessig on Google, copyright, orphans, and the future of access to information. (kw)

Our Podcast

Subscribe to Law Talk

law-rev-contents2.jpg


  • Posts by Author

  • Categories

  • Archives


  • Recent Comments

    • Kristina on Spring 2010: Is the Window Open?

    • PrometheeFeu on The Advantages and Disadvantages of Rewards

    • PoNyman on Very scary poll results

    • Civ Pro King on Privacy Rights in Death Photos: Catsuouras Case Decided

    • ParatrooperJJ on Privacy Rights in Death Photos: Catsuouras Case Decided

    • Lotta on The Take Away About Take Home Exams

    • Alan on Constitutional Rorschach Test (or Zen Koan)

    • Colin Crowe on The Take Away About Take Home Exams

    • Glomarization on Links and short thoughts on Amazonfail

    • Vinca on Book Review: Divergent Opinions: Why Community Matters — A Review of Sunstein’s Going to Extremes

    • A.J. Sutter on My Letter to the Economist on Climate Change

    • Keri Brooks on Spring 2010: Is the Window Open?

    • Illinois on Spring 2010: Is the Window Open?

    • Ken Rhodes on Constitutional Rorschach Test (or Zen Koan)

    • Ken Rhodes on My Letter to the Economist on Climate Change

  •  

    Site Meter

Conditions for the Digital Library of Alexandria

posted by Frank Pasquale

librarywall.jpgI have been in the middle of a major rethink of search engines’ efforts to digitize books. As it started I enthusiastically celebrated their potential to tame information overload. But major research librarians are now questioning search engines’ practices here:

Several major research libraries have rebuffed offers from Google and Microsoft to scan their books into computer databases, saying they are put off by restrictions these companies want to place on the new digital collections. The research libraries, including a large consortium in the Boston area, are instead signing on with the Open Content Alliance [OCA], a nonprofit effort aimed at making their materials broadly available.

As the article notes, “many in the academic and nonprofit world are intent on pursuing a vision of the Web as a global repository of knowledge that is free of business interests or restrictions.”

As noble as I think this project is, I doubt it can ultimately compete with the monetary brawn of a Google. And why should delicate old books get scanned 3 or 4 times by duplicative efforts of Google, Microsoft, the OCA, and who knows what other private competitor? I also worry that a fragmented archiving system might create a library of Babel. So what is to be done?


My new position is: leverage current copyright challenges to Google’s book search program to guarantee that it serves the public interest. Here’s how that might work:

Google’s plans to scan and index hundreds of thousands of copyrighted books have provoked extraordinary public controversy and private litigation. This project aims to archive and provide text-based indexing for an enormous number of books. Google’s scanning of copyrighted books is prima facie infringement, but Google is presently asserting a fair use defense. The debate has largely centered on the rival property rights of Google and the owners of the copyrights of the books it would scan and edit.

Given Google’s alliance with some of the leading libraries in the world, journalistic narratives have largely portrayed the Google Book Search project as an untrammeled advance in public access to knowledge. However, other libraries are beginning to question the restrictive terms of the contracts that Google strikes when it agrees to scan and create a digital database of a library’s books. While each library is guaranteed access to the books it agrees to have scanned, it is not guaranteed access to the entire index of scanned works.

Those restrictive terms foreshadow potential future restrictions on and tiering of their book search services. Well-funded libraries may pay a premium to gain access to all sources; lesser institutions may be left to scrounge among digital scraps. If permitted to become prevalent, such tiered access to information would threaten to rigidify and reinforce existing inequalities in access to knowledge, and life chances. Such tiering divides society into two groups–those who can afford to access the information, and those who cannot. To the extent that the latter group’s relative poverty is not its own fault, information tiering inequitably subjects it to yet another disadvantage, whereby others’ wealth can be leveraged into status, educational, or occupational advantage.

Given the diciness of the fair use case for projects like Google Book Search, courts should condition the legality of such archiving of copyrighted content on universal access to the contents of the resulting database. Landmark cases like Sony v. Universal have set a precedent for taking such broad public interests into account in the course of copyright litigation. Given the importance of “commerciality” in the first of the four fair use factors, suspicion of tiered access could also be figured into that prong of the test. A more ambitious (if less likely) solution would require Congress to set such terms in a legislative settlement of the issue.

However the matter is ultimately settled, any outcome in favor of dominant categorizers should be conditioned on their maintaining open access to search results. Such a condition would help assure that the type of “tiered access” common for legal resources would not further pervade the networked world. If Google’s proposed extension of the fair use defense succeeds, such a holding should be limited to current versions of the services that conduce to a common informational infrastructure. To the extent it or other search engines limit access to parts of their index, their public-spirited defenses of their archiving and indexing projects are suspect.

PS: For more thoughts on the future of digital archiving, see Diane Leenheer Zimmerman’s Can Our Culture Be Saved?

PPS: This post is part of a series, which starts here.

Photo Credit: ekornblut, Wall of Library of Alexandria.


 November 24, 2007 at 8:11 pm   Posted in: Cyberlaw, Economic Analysis of Law, Google & Search Engines, Intellectual Property, Media Law   Print This Post Print This Post

Responses (3)

  1. James Grimmelmann - November 25, 2007 at 12:51 am

    Your position is exactly backwards. Far and away the biggest threat to the universally accessible library is hold-up from copyright holders. They can kill the library. A dominant search engine can only delay or hamper it.

    Duplication of scanning efforts is not a problem. If we spend 10x as much as we should making 5 scanned copies of everything, so what? It’s still money well spent, and good search engines will still iron out any inconsistencies. And overspending on the digitization project is a far far easier way to prevent a tiered access future than trying to scan once and set exactly the right conditions on it. We should all be pushing as hard as we can for scanning and searching to be an unambiguous fair use, so that lots of institutions get into the scan+search business. (Keep in mind, too, that the costs of scanning are continually falling, whereas the stock of things that need to be scanned is not growing at anywhere near the same rate.)

    In general, you’re too eager to see search markets as natural monopolies. They aren’t. Universal search does have high costs and thus high barriers to entry, but smaller pieces of the search market are still cheap to play in. Especially given mobility of users, Google’s dominance today isn’t a function of unique market factors forcing concentration; it’s a result of a Schumpeterian breakthrough in search technology that Google spearheaded and is still milking. Another paradigm shift in how things are done could dethrone it in the space of a few years — and it’s quite possible that that shift could be to something not under the control of any one company.

    There are serious issues that large search engines present. We should face and address those issues. But for a lot of the deeper problems you worry about in search, there are more pressing present dangers from other powerful entities. That’s the case with neutrality (the incumbent broadband ISPs) and it’s the case wit book scanning (the publishers).

  2. Frank - November 25, 2007 at 4:29 pm

    James,

    Some responses:

    1. You say: “[T]here are more pressing present dangers from other powerful entities. That’s the case with neutrality (the incumbent broadband ISPs) and it’s the case with book scanning (the publishers).”

    Agreed. Google book search without the conditions I’ve advanced above would be better than a Google book search contingent on a million licensing deals.

    The real question is whether conditions like mine would scuttle the project. And I don’t think they are that burdensome. They can also be pared down; for example, there is a much greater societal need to have scholarship indices being free and open access than, say, indices of Danielle Steele books.

    2. You say “smaller pieces of the search market are still cheap to play in.” I agree, but I think a) any one of those people is still pretty reliant on Google to route it customers and b) Google does not need to monopolize that space to still be a dominant force that deserves scrutiny.

    Do you really think someone else is poised to make a “Schumpeterian breakthrough” on general-purpose search?

    3. The falling costs of scanning are a good argument for multiple scanning enterprises. And yes, the risk of one inaccurate or incomplete scan should be factored in against the risk of harming a book via multiple scans. But I still think that this project is such a small part of the overall Google business plan that even if the conditions I’ve proposed were applied, they would not significantly deter the project.

    Nobody is saying “Google can’t advertise on the index”–that’s their core business plan. I’m just saying, don’t try to make money off tiered access–just as Google says to the carriers when it lobbies for net neutrality.

  3. James Grimmelmann - November 26, 2007 at 7:43 am

    And my meta-responses:

    The real question is whether conditions like mine would scuttle the project. And I don’t think they are that burdensome.

    Perhaps not as an end-result, but your means of getting there — through current copyright challenges to doing the project at all — is playing with fire.

    [A]ny one of these people is still pretty reliant on Google to route it customers

    Maybe, maybe not. Dopplr mostly isn’t, Abebooks mostly isn’t, Altlaw mostly isn’t. The principal Google searches they care about are navigational queries on their own names.

    Do you really think someone else is poised to make a “Schumpeterian breakthrough” on general-purpose search?

    Yes. The key will be to redefine the problem; my best guess is that whatever comes next will be significantly user-generated.

Leave a Reply

*
To prove you're a person (not a spam script), type the security word shown in the picture. Click on the picture to hear an audio file of the word.
Click to hear an audio file of the anti-spam word


  • « Previous post
  • Next post »

Authors

Daniel J. Solove
Kaimipono Wenger
Dave Hoffman
Nate Oman
Frank Pasquale
Deven Desai
Danielle Citron
Lawrence Cunningham
Sarah Waldeck
Jaya Ramji-Nogales
Solangel Maldonado
Gerard Magliocca

Guests

Adam Benforado
Mark Edwards
Michelle Harner
Kristin Johnson
Jeffrey Kahn
Alex Kreit
Viva Moffat
Adam Steinman










Previous Guests

Michael Abramowicz
Michelle Adams
Robert Ahdieh
Michelle Anderson
Laura Appleman
Ann Bartow
Francesca Bignami
Jeremy Blumenthal
Kathleen Boozang
Bruce Boyden
Donald Braman
Al Brophy
Neil H. Buchanan
Bill Burke-White
Scott Burris
Paul Butler
Naomi Cahn
Anupam Chander
Miriam Cherry
Jack Chin
Jennifer Collins
Thomas Crocker
Allison Danner
Brannon Denning
Deven Desai
Mike Dimino
Mark Edwards
David Fagundes
Christine Haight Farley
Kim Ferzan
Dan Filler
Michael Froomkin
Amanda Frost
Timothy Glynn
Rachel Godsil
Eric Goldman
David Gray
Craig Green
Tristin Green
Jeffrey Harrison
Erica Hashimoto
Carissa Hessick
Laura Heymann
Robert Hillman
Christine Hurt
Darian Ibrahim
John Ip
Kevin Johnson
Dan Kahan
Brian Kalt
Sam Kamin
Michael Kang
Chimène Keitner
Orin Kerr
Nancy Kim
Heidi Kitrosser
Adam Kolber
Russell Korobkin
Anita S. Krishnakumar
Susan Kuo
Greg Lastowka
Sarah Lawsky
Erik Lillquist
Jeff Lipshaw
Jonathan Lipson
Jacqueline Lipton
Joseph Liu
Michael Madison
Solangel Maldonado
Jason Mazzone
Linda McClain
William McGeveran
Salil Mehra
Carrie Menkel-Meadow
Max Minzner
Scott Moss
Eric Muller
Jaya Ramji-Nogales
Helen Norton
Elizabeth Nowicki
Paul Ohm
Michael O'Shea
David Opderback
Kristen Osenga
Rafael Pardo
Marcy Peek
Eduardo Peñalver
Robert Percival
David Post
Shruti Rana
Geoffrey Rapp
Neil Richards
Lori Ringhand
Alice Ristroph
Susan Scafidi
Paul Secunda
Jonathan Siegel
Jessica Silbey
Peter Smith
Charles Sullivan
Rick Swedloff
Steph Tai
Andrew Taslitz
Robert Tsai
Jenia Turner
Steve Vladeck
Spencer Weber Waller
Howard Wasserman
Melissa Waters
Frank Wu
Alfred Yen
Corey Yung
David Zaring
Timothy Zick
Michael Zimmer
Jonathan Zittrain

Ownership

Concurring Opinions is a
general-interest legal blog
operated by Concurring
Opinions LLC, a Pennsylvania
Limited Liability Corporation.

Blogroll

Above the Law
ACS Blog
Althouse
Balkinization
Becker-Posner Blog
BlackProf
BoingBoing
Chicago Law Faculty Blog
Conglomerate
CrimLaw
Crime & Federalism
CrimProf Blog
Crooked Timber
Discourse.net
Dorf on Law
Election Law
Emergent Chaos
The Faculty Lounge
Feminist Law Profs
43(B)log
Freakonomics Blog
Freedom to Tinker
Google Blogoscoped
How Appealing
Ideoblog
Info/Law
Instapundit.com
Juris Novus
Jurisdynamics
Law and Humanities Blog
Law and Letters
Law Librarian Blog
Legal Profession Blog
Legal Theory Blog
Legal Times Blog
Leiter Reports
Brian Leiter's Law School Reports
Lessig Blog
Madisonian Theory
Media Law Blog
Mirror of Justice
The Moderate Voice
National Security Advisors
Opinio Juris
Point of Law
PrawfsBlawg
ProfessorBainbridge.com
Property Prof Blog
Red Tape Chronicles
The Right Coast
Schneier on Security
SCOTUSBlog
Security Dilemmas
Sentencing Law and Policy
Simple Justice
Sivacracy.net
The Situationist
Susan Crawford
TalkLeft
Talking Points Memo
TaxProf Blog
Tech & Marketing Law
Truth on the Market
Volokh Conspiracy
WorkPlace Prof Blog
WSJ Law Blog
Wonkette
The Yin Blog


© Concurring Opinions

Powered by WordPress