Computers

by Carlo Longino




Researchers Feeling Conflicted Over AOL Data

from the in-two-minds dept

The leak of a ton of search data at AOL has been nothing short of a mess, culminating Monday in the termination of some employees at the company. The privacy concerns overshadowed how interesting the data was, and AOL's mistake in not stripping out personally identifiable information undid its original good intention: to give researchers a look at a large amount of search data, something that's often difficult for them to get their hands on. Though AOL pulled the data, plenty of people downloaded it before it got yanked, and many search researchers have been examining it. Some, however, feel ethical pangs and can't bring themselves to look at it. It's nice to see these people have some ethical concerns, but as long as they're using the information responsibly, it doesn't seem like they have much to be worried about. As some researchers point out, though, the ongoing effect of the AOL gaffe will be to make search companies think twice about releasing any kind of data, even if they have anonymized it. That's really not an ideal outcome, as it limits the ability of people outside search companies to research and refine search technology. The answer is to release the data responsibly, taking users' privacy into consideration. In the meantime, these researchers should probably just carry on with the data, since it's the last they're likely to see for a while -- just try to avoid fingering individual users by their search habits.


Reader Comments

  1. Aug 23rd, 2006 @ 10:32am
    by Anonymous Coward

    I don't believe it's unethical to use that data for non-unethical purposes. It was only unethical for it to be released, as not only will it fall into the hands of those using it for unethical purposes, it also breaches the trust of the users. There is no such implicit trust relationship between researchers and AOL users, although there is a more primitive ethical imperative for them to do no evil with it. Keep right on using that data; the damage is already done, so let us gain as much from it as we can.


  2. Aug 23rd, 2006 @ 10:45am

    I am so glad...

    ...that I don't use AOL!


  3. Aug 23rd, 2006 @ 11:04am

    Re: I am so glad...

    by Chris

    AOL is gay. The End.


  4. Aug 23rd, 2006 @ 11:06am

    Spam King

    by AOL Grrrrr

    The big question is: did any AOL user search for "How to bury Gold in your grandfather's Garden"?


  5. Aug 23rd, 2006 @ 11:53am
    by Grandfather Time

    It takes something like this for thousands upon thousands to realize that the AOL service and security they have been paying for, for years, is all just a waste of time and money. Faster, Securer, More Reliable... more like Slower, More Expensive, and Easier to Hack...


  6. Aug 23rd, 2006 @ 12:00pm

    I don't need their help

    by techdirtReader

    I'm not a big fan of companies sharing my data, even anonymously, with other companies to "improve my user experience". Somehow, every time a company wants to "improve my user experience", the company ends up with more revenue and I end up with a Big Brother-esque experience (e.g. Amazon suggestions based on past history).

    Hence, I'm all for limiting the ability of people outside search companies to research and refine search technology.


  7. Aug 23rd, 2006 @ 1:32pm

    Track me not

    by aReader

    This has made the extension http://mrl.nyu.edu/~dhowe/TrackMeNot/ very popular. I don't really like the extension, as creating ghost traffic could backfire: the search engine servers will have to deal with ghost queries even if they do not intend to store and publish the search data.


  8. Aug 23rd, 2006 @ 2:31pm

    Other means of getting data for research

    Why don't researchers make more use of proxies? They could have subjects specify a proxy in their browser and just keep searching. That way they can control for the types of people generating the data, and get to see more than just what they did on the AOL site. (Although this data is quite rich, showing which sites they went to after searching.)


  9. Aug 23rd, 2006 @ 9:20pm

    Using the AOL data is not ethical

    I think the ethical problems with doing research on the AOL dataset are significant. See this post for a longer discussion: The Ethics of the AOL Search Data Disclosure

