Cookie jar

I remember as a small child that, when you had taken some cookies from the cookie jar you were not supposed to take, your mother somehow always knew. I suppose this must feel the same for the people that got caught by the Wikiscanner.
This is an application that relates the IP adresses from the people who did edits on the Wiki pages. Surpringly (or maybe not so surprising really) a lot of edits come from organisations that are involved in the wiki-article and their intentions are not always without self interest. The page of George W. Bush was edited from the home security office (removal of the fact that he had a drinking problem) and the best of all about the edit from the queen’s home address where information from her daughter-in-law Mabel was edited to make her look a little better.
This all is of course not surprising. When information can be edited there is no reason to think companies and people will not do so in order to make them look better. The interesting part is that it turns out that many of the un-truthful edits had been discovered very quickly and have been repaired withing hours and sometimes even minutes.
Also, the wikiscanner is a new step that “the crowd” can and will use to make the information more trustworthy. More transparency will lead to better information. Not perfect but still a little bit better and better to judge.
These are for example the edits from the Telematica Instituut (looks like we have a lot of mathematicians in the company or are these the edits of one person?). Interesting…

Smart and Stupid

LemmingsThe crowd can be smart  and the crowd can be stupid. Part of the discussion on Brittanica deals with what the advantage is of social software and iCrowds. And when results are stupid or irrelevant. IS there for example a thing like a citizen scientist?
Open source seems to work great: Linux, Samba, Apache are all examples where the software has reached tremendous stability because it is open. Everybody in the world can test it, can comment on features and can look at the code to try to make it better. I think most will agree that in these cases no group of “professional specialists” would have done any better.
On the other hand we have Wikipedia that sort of combines the best and worst of information. And it can be hard to distinguish between good and bad. Parroting each other through blogs and fora’s creates information that looks like the truth because it is amplified a thousand times (you can Google it and a hundred links say the same “truth” but in reality it may be just one person who is the source and who might be wrong. A thousand consistent lies remains a lie. One of the reasons we think information that is found many-fold is true is because in the old days much of this information was from different sources. Due to the Internet and the free copy and distribution, information is replicated with the speed of lightning.  This speed of replication leads to a loss in diversity of sources since it is much more convenient to copy based on what Google delivers. This process is of course self propelling. 
One of the reasons for this difference is the ease how we can distinguish right or wrong. When using the Apache server I know when it crashes or is vulnerable for attacks. And I know when it is fixed (when it does not crash or does like advertised). also, each person looking at the application or code does this on it’s own. With information in Wikipedia this is not so easy to determine without thorough research based on “peer reviewed” sources.
As for the citizen scientist: there may certainly be a place in science for the pro-am model. A much mentioned example is the use of amateur astronomists that see a lot a things that the professionals missed because you can’t look at the whole sky. But collection of data is only a part of science. Building theories and designing experiments is something altogether different. On a photograph it is easy to see if it was a supernova. Building and proving a theorie is much less clear and needs the input and critical review of many experts.
Combining diversity is the key to iCrowds, not replication of the same.