Subscribe / Unsubscribe Enewsletters | Login | Register

Pencil Banner

Exploring Google Search's tenuous relationship with pirated content

Evan Dashevsky | Sept. 12, 2013
Mountain View says it filters copyright-infringing content from its autocomplete function, but not all search filters are created equal.

According to a recently released paper "How Google Fights Piracy" (available here as a Google Doc), the search giant claims that fans intent on pirating media use pirate sites rather than search. Despite Google's insistence that its search function is not the preferred doorway to copyrighted content, Mountain View says it "has taken steps to prevent terms closely associated with piracy from appearing in Autocomplete and Related Search."

The exact whats and hows of Google's search algorithm has long been shrouded in mystery, but I found it to be particularly foggy when it comes to playing around to searching specific terms related to the copyright-free lifestyle.

On a pirate hunt
Last week the British Music Industry trade group, BPI requested for Google to block a number of URLs from its search function, citing copyright infringement. Included in the list of rouge sites were several urls from the torrent-search site The Pirate Bay.

Of the 2056 URLs that BPI asked to have removed—including 171 from The Pirate Bay—only one had no action taken against it: The Pirate Bay's current homepage, www.piratebay.sx.

Why? Google doesn't say specifically, but it's probably because although the TPB homepage allows users to search for torrents that no doubt contain pirated property, The Pirate Bay's front page itself does not actually list or contain any copyright-protected information.

According to Google's compliance FAQ page:

It is our policy to respond to clear and specific notices of alleged copyright infringement. Upon review, we may discover that one or more URLs specified in a copyright removal request clearly did not infringe copyrights. In those cases we will decline to remove those URLs from Search.

This isn't so newsworthy, as BPI is one of the most active senders of DMCA requests, and The Pirate Bay is one of the most targeted DMCA takedown recipients. (According to Google, TPB has had nearly 460 thousand URLs requested to be removed.) However, despite the fact that Google has determined that The Pirate Bay's front page should remain indexable and available, Mountain View has constructed a strange maze of semantic filters regarding "The Pirate Bay" and other pirate sites within its autocomplete functions.

Autocomplete anarchy
In 2009, Google completely removed The Pirate Bay from its search index, only to eventually return TPB's homepage to searchability. The Pirate Bay is currently available through Google, but remains somewhat hidden via the site's autocomplete function. For example, on the US Google site, searching the term "The Pirate Bay" will not bring up the site's front page in Google's autocomplete results.

Meanwhile, the autocomplete for the conjoined url phrase "thepiratebay" will bring up some sketchy sites (that you should NOT click on) rather than TPB's official site. However, if you add in a period to your search ("thepiratebay."), the Pirate Bay's homepage will magically appear in the main search field below the bar.

 

1  2  Next Page 

Sign up for CIO Asia eNewsletters.