Scunthorpe is a small town in the north of England. Back in 1996, many of the town’s residents were having trouble signing up for AOL’s Internet service. It seems that the company’s profanity filters were rejecting the name of the city since it contains a slang term for female genitalia. Although residents of Penistone, South Yorkshire and Lightwater, Surrey experienced the same problem, the issue has become known as the “Scunthorpe Problem”.
There are two basic ways to filter content on the Internet. One is to have humans do it, and the other is to have machines do it.
The problem with having humans do it is obvious – there are billions of web pages out there, and it would be a Herculean task to have an actual human visit each page and judge the content therein. Plus, web site operators change web hosts, move their content around, and rename pages all the time; every web site – even a tiny site such as this one – would need to be visited on a very regular basis to make sure that any filters were up to date.
On the other hand, machines work 24 hours a day without a salary. Even a modest computer – such as a home PC from five years ago – could filter tens of thousands of pages every day. But the problem is, machines have no sense of nuance. A computer only looks for a string of letters organized in a certain way. It sees web sites like romansinsussex.co.uk (an educational site about English history) and arkansasextermination.com (a site for an Arkansas-based pest extermination company) and blocks them because of “sex” in their addresses – although those sites have nothing to do with sex!