Thursday, December 18, 2008

Are the big search engines data mining?

Yahoo makes the first move amid the potential that the government might someday soon come calling:



WASHINGTON - Yahoo Inc. said Wednesday that it will shorten the amount of time that it retains data about its users' online behavior — including Internet search records — to three months from 13 months and expand the range of data that it "anonymizes" after that period.

The company's new privacy policy comes amid mounting concerns among regulators and lawmakers from Washington to Europe about how much data big Internet companies are collecting on their users and how that information is being used. Yahoo's announcement also ratchets up the pressure on rivals Google Inc. and Microsoft Corp. to follow its lead.

In September, Google said it would "anonymize," or mask, the numeric Internet Protocol (IP) addresses on its server logs after nine months, down from a previous period of 18 months. And Microsoft, which keeps user data for 18 months, said last week it would support an industry standard of six months.

Under Yahoo's new policy, the company will strip out portions of users' IP addresses, alter small tracking files known as "cookies" and delete other potential personally identifiable information after 90 days in most cases. In cases involving fraud and data security, the company will anonymize the data after six months.

Sunnyvale, Calif.-based Yahoo also said it will expand the scope of data that it anonymizes to encompass not only search engine logs, but also page views, page clicks, ad views and ad clicks. That information is used to personalize online content and advertising.

Yahoo will begin implementing the new policy next month and says it will be effective across all the company's services by mid-2010.

Anne Toth, vice president of policy and head of privacy for Yahoo, said the company is adopting the new policy to build trust with users and differentiate it from its competitors. Yahoo also hopes to take the issue of data retention "off the table" by showing that Internet companies can regulate themselves, Toth said.

European Union regulators have pressured Yahoo, Google and Microsoft over the past year to shorten the amount of time that they hold onto user data. And Congress has begun asking questions about the extent to which Internet and telecommunications companies track where their users go online and use that information to target personalized advertising.

Edward Markey, D-Mass., chairman of the House Energy and Commerce Subcommittee on Telecommunications and the Internet, praised Yahoo for setting a new standard on privacy protection and said Google, Microsoft and other companies will now be compared against that standard.

Ari Schwartz, vice president of the Center for Democracy & Technology, a civil liberties group, agreed that Yahoo's new policy is "step in the right direction." He added, however, that he would like to see more clarity — and more standardization — from the industry about what it does with Internet users' data. He noted, for instance, that while some companies delete full IP addresses, other delete only parts of IP addresses or simply encrypt them.

Indeed, Redmond, Wash.-based Microsoft said in a statement that the method of anonymization is more important than how long records are logged. It called on the entire industry to adopt a "high standard."

For its part, Mountain View, Calif.-based Google said it takes privacy seriously and strives to strike "the appropriate balance between protecting our users' privacy and offering them benefits of data retention, such as better security measures and new innovations." Google did not address, however, whether it would be open to further reductions in the time it maintains user logs.

No comments: