the Law, the Universe, and Everything 

Search

Concurring Opinions is a
general-interest legal blog
operated by Concurring
Opinions LLC, a Pennsylvania
Limited Liability Corporation.

Yale University Press

ad-logo5.jpg

Our Podcast

Subscribe to Law Talk

Law-Rev-Forum-2.jpg

law-rev-contents2.jpg

Law-Prof-Blog-Census.jpg

Categories

Administrative Announcements
Administrative Law
Admiralty
Advertising
Agricultural Law
Anonymity
Antitrust
Architecture
Articles and Books
Bankruptcy
Behavioral Law and Economics
Bioethics
Blogging
Book Reviews
Capital Punishment
Civil Procedure
Civil Rights
Conferences
Constitutional Law
Consumer Protection Law
Contract Law & Beyond
Corporate Law
Criminal Law
Criminal Procedure
Culture
Current Events
Cyberlaw
DRM
Economic Analysis of Law
Education
Empirical Analysis of Law
Employment Law
Environmental Law
Family Law
Feminism and Gender
First Amendment
Food
Google & Search Engines
Health Law
History of Law
Humor
Immigration
Insurance Law
Intellectual Property
International & Comparative Law
Interviews
Jurisprudence
Law and Humanities
Law and Inequality
Law and Psychology
Law Practice
Law Professor Blogger Census
Law Rev (Boston College)
Law Rev (Boston University)
Law Rev (California)
Law Rev (Chicago)
Law Rev (Columbia)
Law Rev (Cornell)
Law Rev (Duke)
Law Rev (Emory)
Law Rev (Fordham)
Law Rev (Georgetown)
Law Rev (GW)
Law Rev (Harvard)
Law Rev (Illinois)
Law Rev (Indiana)
Law Rev (Michigan)
Law Rev (Minnesota)
Law Rev (Northwestern)
Law Rev (Notre Dame)
Law Rev (NYU)
Law Rev (Penn)
Law Rev (S Cal)
Law Rev (Stanford)
Law Rev (Texas)
Law Rev (UCLA)
Law Rev (Vanderbilt)
Law Rev (Virginia)
Law Rev (Yale)
Law Rev Contents
Law Rev Forum
Law School
Law School (Hiring & Laterals)
Law School (Law Reviews)
Law School (Rankings)
Law School (Scholarship)
Law School (Teaching)
Law Student Discussions
Law Talk
Legal Ethics
Legal Theory
Media Law
Movies & Television
Philosophy of Social Science
Politics
Privacy
Privacy (Consumer Privacy)
Privacy (Electronic Surveillance)
Privacy (Gossip & Shaming)
Privacy (ID Theft)
Privacy (Law Enforcement)
Privacy (Medical)
Privacy (National Security)
Property Law
Race
Religion
Reparations
Science Fiction
Securities
Social Network Websites
Sociology of Law
Supreme Court
Tax
Teaching
Technology
Tort Law
Web 2.0
Weird
Wiki
Wills, Trusts, and Estates

Recent Comments

oldfox on Google's Empire, Privacy, and Government Access to Personal Data

E on Google's Empire, Privacy, and Government Access to Personal Data

Dave! on Google's Empire, Privacy, and Government Access to Personal Data

Mike on Google's Empire, Privacy, and Government Access to Personal Data

Bruce on Google's Empire, Privacy, and Government Access to Personal Data

E on Google's Empire, Privacy, and Government Access to Personal Data

Archives

April 2008
March 2008
February 2008
January 2008
December 2007
November 2007
October 2007
September 2007
August 2007
July 2007
June 2007
May 2007
April 2007
March 2007
February 2007
January 2007
December 2006
November 2006
October 2006
September 2006
August 2006
July 2006
June 2006
May 2006
April 2006
March 2006
February 2006
January 2006
December 2005
November 2005
October 2005
August 2005
July 2005
June 2005

 

« The End of Shame | Main | On Becoming a Supreme Court Clerk »

November 29, 2005

Google's Empire, Privacy, and Government Access to Personal Data

posted by Daniel J. Solove

google-priv.jpgA New York Times editorial observes:

At a North Carolina strangulation-murder trial this month, prosecutors announced an unusual piece of evidence: Google searches allegedly done by the defendant that included the words "neck" and "snap." The data were taken from the defendant's computer, prosecutors say. But it might have come directly from Google, which - unbeknownst to many users - keeps records of every search on its site, in ways that can be traced back to individuals.

This is an interesting fact -- Google keeps records of every search in a way that can be traceable to individuals. The op-ed goes on to say:

Google has been aggressive about collecting information about its users' activities online. It stores their search data, possibly forever, and puts "cookies" on their computers that make it possible to track those searches in a personally identifiable way - cookies that do not expire until 2038. Its e-mail system, Gmail, scans the content of e-mail messages so relevant ads can be posted. Google's written privacy policy reserves the right to pool what it learns about users from their searches with what it learns from their e-mail messages, though Google says it won't do so. . . .

The government can gain access to Google's data storehouse simply by presenting a valid warrant or subpoena. . . .

This is an important point. No matter what Google's privacy policy says, the fact that it maintains information about people's search activity enables the government to gather that data, often with a mere subpoena, which provides virtually no protection to privacy -- and sometimes without even a subpoena. In my book, The Digital Person, and in an earlier paper, Digital Dossiers and the Dissipation of Fourth Amendment Privacy, 75 S. Cal. L. Rev. 1083 (2002), I argued that today an increasing amount of detailed personal data is being maintained by various companies, merchants, and organizations. The Supreme Court has held that the Fourth Amendment does not protect against the government accessing records maintained by third parties. In United States v. Miller, 425 U.S. 435 (1976), for example, the Supreme Court held that people lack a reasonable expectation of privacy in their bank records because "[a]ll of the documents obtained, including financial statements and deposit slips, contain only information voluntarily conveyed to banks and exposed to their employees in the ordinary course of business."

The New York Times op-ed goes on to criticize Google for not being a leader in protecting privacy:

It is hard to believe most Google users know they have a cookie that expires in 2038, or have thought much about the government's ability to read their search history and stored e-mail messages without them knowing it. . . .

Google should develop an overarching privacy theory that is as bold as its mission to make the world's information accessible - one that can become a model for the online world. Google is not necessarily worse than other Internet companies when it comes to privacy. But it should be doing better.

I agree with the op-ed, but I also think that businesses should use their power to push for greater legislative protections of personal information from government access. It is here were Google's interests and the privacy interests of its users coincide. Right now, the government is inadequately regulated when it comes to accessing personal data maintained by third parties. If the businesses maintaining the data lobbied Congress for greater protections, this would help to address one of the major privacy threats that their maintaining the information poses. It wouldn't solve all of the problems, but it would address a big one.

Related Posts:
1. Solove, When Google Is King
2. Solove, Searching the Internet: It’s the Hip Thing to Do

Thanks to Chris Hoofnagle and Steve Charnovitz for pointing me to this op-ed.

Posted by Daniel J. Solove at November 29, 2005 12:01 AM

Trackback Pings

TrackBack URL for this entry:
http://www.concurringopinions.com/movabletype/mt-tb.cgi/262.

Listed below are links to weblogs that reference Google's Empire, Privacy, and Government Access to Personal Data:

» Google and Privacy from Schneier on Security
Daniel Solove on Google and privacy: A New York Times editorial observes: At a North Carolina strangulation-murder trial this month, prosecutors announced an unusual piece of evidence: Google searches allegedly done by the defendant that included the w... [Read More]

Tracked on November 30, 2005 04:26 PM

» Google Privacy, and Government Access to Personal Data from Dark Side Programming
In a strangulation-murder trial this month, prosecutors announced an unusual piece of evidence: Google searches allegedly done by the defendant that included the words "neck" and "snap." The data were taken from the defendant's computer, prosecutors sa... [Read More]

Tracked on November 30, 2005 08:17 PM

» Google and Privacy from SearchBuzz.com
<!--adsense--> Google is quickly missing the opportunity to become the leader in privacy handling on the Internet. Concurring Opinions currently has a nice writeup of the current Google happenings. The pressure is mounting for an official sta... [Read More]

Tracked on November 30, 2005 10:01 PM

» Leven in de 21e eeuw from Thijs' weblog
In de 21e eeuw volgen nieuwe technologische ontwikkelingen elkaar in hoog tempo op. De termen vliegen je om de oren: podcasting, wiki, rss, iPod, Skype, voip, etc., etc. Er is veel te ontdekken, maar is het ook allemaal te bewonderen? De on-line avant... [Read More]

Tracked on December 1, 2005 10:31 AM

» Back at it from Alt Text
She will becme more powerful than you can possibly imagine. Oprah is finding an audience among young Sauid Arabians. I... [Read More]

Tracked on December 1, 2005 11:27 AM

» Back at it from Alt Text
She will becme more powerful than you can possibly imagine. Oprah is finding an audience among young Sauid Arabians. I... [Read More]

Tracked on December 1, 2005 11:29 AM

» Google and Privacy from Schneier on Security
Daniel Solove on Google and privacy: A New York Times editorial observes: At a North Carolina strangulation-murder trial this month, prosecutors announced an unusual piece of evidence: Google searches allegedly done by the defendant that included the w... [Read More]

Tracked on December 3, 2005 09:23 AM

» Clearing my browser from JD on MX
Clearing my browser: I've got a lot of tabbed windows open, with interesting material for which I lack significant personal additions, but can write a short summary for each... you may find something of interest in the following webpages this... [Read More]

Tracked on December 3, 2005 11:19 PM

» Googles Datenhunger from Smoking Gun
Daniel Solove und Bruce Schneier weisen auf einen interessanten Kommentar der New York Times hin: What Google Should Roll Out Next: A Privacy Upgrade. Zwei Zitate: At a North Carolina strangulation-murder trial this month, prosecutors announced an un... [Read More]

Tracked on December 5, 2005 12:55 PM

» Government vs. Google from Concurring Opinions
According to the AP: Google Inc. is rebuffing the Bush administration's demand for a peek at what millions of people have been looking up on the Internet's leading search engine — a request that underscores the potential for online databases... [Read More]

Tracked on January 20, 2006 12:38 PM

» Do No Evil and Perhaps Do Some Good: Google, Privacy, and Business Records from Concurring Opinions
I just blogged about the case where the goverment is seeking search query records from Google. I am very pleased that Google is opposing the goverment's suboena. According to the AP artice: Google — whose motto when it went public... [Read More]

Tracked on January 20, 2006 12:40 PM

Comments

Actually, the scope of the information that Google collects goes much further than just one's searches through Google and using other Google services such as GMail. The now ubiquitous Google Ads and the new Google Analytics service in use by a number of websites allow Google to track most of one's visits, queries, and other activities throughout the Web. Additionally (though I am not aware of any such current activities), aggressive use of web browser technologies such as Javascript further enable fairly advanced analysis of what you are looking at, clicking, etc. to be relayed back home - all performed by your own web browser without your knowledge.

Given the scope of this activity, it is pretty much impossible to "opt out" of Google's data collection practices, or simply to surf anonymously. Instead, one has to take fairly aggressive actions not to participate. One might attempt various rudimentary actions, such as cookie blocking, to avoid such tracking, but it is fairly simple to correlate one's IP address with one's identity. Furthermore, more aggressive techniques such as anonymous proxies, URL blocking, etc. raise issues for websites dependant upon advertising. If I somehow manage to block Google Ads, is it fair to a website that counts on Google Ad revenue in order to operate? Additionally, one still has to be fairly vigilant to remain an anonymous surfer.

This is not to say that Google is acting with any malicious purpose, or even is not making good use of the data it is collecting to provide better services. But the corpus of personal data amassed by Google, coupled with information found in various private databases, raises huge concerns. What one looks for and looks at on the web, activity which is only increasing over time, seems to already have become available for access for law enforcement and national security purposes. It seems that discovery in civil suits is not far behind. If I have a well-placed source working at Google, how much of this information might I be able to offer up for sale? Private investigators have done this for some time through the DMV, etc. What if someone manages to buy Google? This information is a significant asset for all kinds of purposes. What if Google decides to offer new services for employers to screen employees? Coupled with all kinds of other databases they could give you a complete picture of all aspects of a prospective employee's life. There a lot of "what ifs" about these huge databases that are not currently being adddressed, or even fully understood what it means when this data persists indefinitely.

Posted by: E at November 29, 2005 12:01 PM


A couple of things:

I think it's important not to get carried away here and figure out what Google is actually doing, and how that differs from what anyone else is doing.

Google's ability to identify your searches has nothing to do with cookies and is simply a function of their properly employing two very useful technologies: using GET for their search results and logging IP addresses. Using GET means that you can link directly to Google searches, which is very useful. Logging IP addresses of visitors, and what pages they view, is common practice on the web, for both security and web development reasons. Since Google pages contain the search terms right in the URL of the page, that means that if you know someone's IP address, and you have access to Google's server logs for the time period in question, then you can figure out what they searched for. So far, Google isn't really any different from any other business (including your ISP) that has personal information about you.

The cookies are a different matter, but I think it's important to keep in mind that cookies are not inherently nefarious and are used for all sorts of legitimate purposes. I checked my own cookies folder and there are 2 cookies written by Google on there -- one is labelled PREF, which sounds like it might be used to track my personal preferences, e.g. for Google News. The other seems to have something to do with Blogspot -- my guess would be it allows automatic logins for comments on blogs. Of course either cookie ID could be matched to my IP address and page searches, in which case Google would be able to match my preference selections to my IP address and page views -- but so what? I don't think that cookie can be matched to my name and address unless I tell them who I am at the same time they can read the cookie, and I haven't done that to my knowledge. And the 2038 expiration just means that the cookie will remain until deleted or you buy a new computer, which for most people happens every few years or so. If it was shorter -- say, one month -- that means I would have to reset my Google News preferences once a month, which would be a pain. It doesn't mean that Google will retain their server logs for that long -- I seriously doubt they have the money for or the interest in that amount of storage space.

Posted by: Bruce at November 29, 2005 02:41 PM


Dan, is there anything to prevent the government from issuing a subpoena to Google requiring them to provide information on every person who, say, searches for [child pr0n]? I ask because after the Candyman case, it seems like the government would have a strong argument supporting probable cause to search the computers of everyone Googling such terms. So, essentially, the government would be saying the following: "Google has in its possession information regarding people whose computers are being used to commit a crime. Therefore, Google has to turn this information over to us."

Thoughts?

Posted by: Mike at November 29, 2005 05:04 PM


Additionally, you can clear your cookies... and a cookie that doesn't expire until 2038 is really just for convenience... what are the odds you'll be using the same browser or the same computer until 2038??

The logging of searches by IPs is also dubious. For example, 1) I don't have a static IP at home, so there's no way to prove 100% that any search linked to an IP was/is my search; 2) IPs can be shared/proxied so they are not 100% conclusive to link to a single person (or even entity, necessarily).

Although I agree, the idea of the government being able to subpoena Google's records to determine what you've been searching for is frightening.

It leaves me with a neophyte (read: law student) question though... given that an IP != person, wouldn't it be really difficult to meet the requirements for authentication to enter this kind of evidence? (Supposing the evidence resided on Google's servers, not on your own personal computer.)

Posted by: Dave! at November 29, 2005 05:31 PM


The big problem with the Google cookie is that it attaches a universal ID to anything done by your browser with any Google property - whether you choose to or not. Whatever the cookie may have been named is at best an indication of what it might have done at some point, and says nothing as to what additional uses the information it represents may currently serve or be put to later. Also, note that the cookie is for the entire google.com domain, which, as I stated before, includes Google services that people do not choose to engage in, are very pervasive across the web, and end up recording many of the things that they do online. The only thing one receives in exchange is (possibly) better targeted ads.

As far as Google's data retention policies, they have already demonstrated their intention to retain the data they collect. One of their key technologies is aggregating inexpensive hardware to produce massive storage systems. As I understand it, they are currently putting about 3.5 petabytes of storage in a 40-foot container. At an estimate of 1 billion logged transactions a day (which may be low given 250M searches/day in early 2003), that represents about 2.5 years of logs in one trailer. Also, Google was the first to offer 1+ GB email, not simply out of hopes that people would not use the space, but because they simply could do it. In my experience, Google has taken the view that storage is cheap, getting cheaper by the week, and they might as well retain everything - probably an easy mindset to adopt once you're doing things like caching significant amounts of the Internet's content. We have already seen snapshots of what Google has retained thus far through last April's introduction of their Personal History Feature. A number of people were surprised that suddenly they had access to a significant chunk of their past search history - they didn't really consider that Google could & might choose to keep all of this information around. One article noted that the "service is designed to store years of each individual's search activity."

As far as linking this cookie value to your actual identity, I would assert that in many cases doing so is not only possible, but simple. For those who have GMail accounts, the exercise is trivial. Accessing a GMail account from your laptop, desktop, and office connects them all. I am not suggesting that Google is doing so today with any nefarious purpose - what do they care about the details of Joe Schmoe #5334's life? I am simply pointing out that it can be done by anyone with access to the pool of information assembled by Google.

You point out that some or most of these practices are common in the industry - many websites perform IP logging and use cookies. If nothing else, I would suggest that the sheer scale of Google's activities invites closer scrutiny. This is beyond the idea that one theoretically could correlate a super shopper card database with banking records and credit card records to figure out pretty much everything you've purchased. Google's retained information pretty much points to everywhere one has gone online. Recently there was concern over states tracking cell phones for the purpose of monitoring traffic congestion, and promises were made that the collected data would be aggregated and anonymized for privacy purposes. Why should people be any less concerned about the much more thorough tracking of their movements in cyberspace performed by Google? Shouldn't we figure out just what will be done when everything is not only logged, but also sits around indefinitely? What if Google is already at that point - or will be in the very near future? Should a party to civil litigation be able to obtain this information through discovery? It would be very helpful to have this mountain of information to impugn someone's character for places visited online.

Such possibilities remind me of the old email mantra: never send anything by email that you wouldn't want to see on the front page of the newspaper. Should it be the same for what we do online? As it stands now, Google sees a lot of what people do online - it's just a matter of whether they choose to remember it.

Posted by: E at November 29, 2005 06:33 PM


Does the NYTimes recommend that Google be categorized as a news outlet like itself so it can then raise the veil of "newsman privilege" before cooperating with law enforcement agencies?


The NYTimes has no such privilege (nor any obligation to restrict it's own information collection activities) and why should Google be any different?


The Times has lost it's power of reason in a muddle of partisanship. They really have.

Posted by: oldfox at December 7, 2005 10:48 AM


Authors

Daniel J. Solove

Website
The Future of Reputation

Kaimipono Wenger

Website
SSRN Page

Dave Hoffman

Website
SSRN Page

Nate Oman

Website
SSRN Page

Frank Pasquale

Website
SSRN Page

Melissa Waters

Website
SSRN Page

Deven Desai

Website
SSRN Page


Guests

Elaine Chiu
Dan Kahan
Sam Kamin
Michael O'Shea
Alice Ristroph






ad-logo3.jpg

blawg100_winner2.jpg

Previous Guests

Michael Abramowicz
Michelle Adams
Robert Ahdieh
Michelle Anderson
Laura Appleman
Francesca Bignami
Jeremy Blumenthal
Bruce Boyden
Donald Braman
Al Brophy
Bill Burke-White
Scott Burris
Anupam Chander
Miriam Cherry
Jack Chin
Jennifer Collins
Allison Danner
Brannon Denning
Deven Desai
Mike Dimino
Christine Haight Farley
Kim Ferzan
Dan Filler
Amanda Frost
Timothy Glynn
Rachel Godsil
Eric Goldman
Craig Green
Jeffrey Harrison
Erica Hashimoto
Laura Heymann
Christine Hurt
Heidi Kitrosser
Adam Kolber
Russell Korobkin
Anita S. Krishnakumar
Greg Lastowka
Joseph Liu
Solangel Maldonado
Jason Mazzone
William McGeveran
Salil Mehra
Carrie Menkel-Meadow
Scott Moss
Eric Muller
Jaya Ramji-Nogales
Elizabeth Nowicki
Paul Ohm
Michael O'Shea
Rafael Pardo
Marcy Peek
Eduardo Peñalver
Neil RIchards
Lori Ringhand
Alice Ristroph
Paul Secunda
Peter Smith
Charles Sullivan
Rick Swedloff
Steph Tai
Robert Tsai
Steve Vladeck
Sarah Waldeck
Alfred Yen
David Zaring
Timothy Zick
Jonathan Zittrain

Blogroll

Above the Law
ACS Blog
Althouse
Balkinization
Becker-Posner Blog
Beltway Blogroll
BlackProf
BoingBoing
Chicago Law Faculty Blog
Conglomerate
CrimLaw
Crime & Federalism
CrimProf Blog
Crooked Timber
Discourse.net
Dorf on Law
Election Law
Emergent Chaos
Feminist Law Profs
43(B)log
Freakonomics Blog
Freedom to Tinker
Google Blogoscoped
How Appealing
Ideoblog
Info/Law
Instapundit.com
JD2B.com
Juris Novus
Jurisdynamics
Law and Letters
Legal Profession Blog
Legal Theory Blog
Legal Times Blog
Leiter Reports
Brian Leiter's Law School Reports
Lessig Blog
Madisonian
Mirror of Justice
National Security Advisors
Opinio Juris
Point of Law
Political Theory Daily Review
PrawfsBlawg
ProfessorBainbridge.com
Property Prof
Red Tape Chronicles
The Right Coast
Schneier on Security
SCOTUSBlog
Security Dilemmas
Sentencing Law and Policy
Simple Justice
Sivacracy.net
The Situationist
Susan Crawford
TalkLeft
Talking Points Memo
TaxProf Blog
Tech & Marketing Law
Truth on the Market
Volokh Conspiracy
WorkPlace Prof Blog
WSJ Law Blog
Wonkette
The Yin Blog

Pajamas Media BlogRoll Member