Wednesday, March 27, 2013

German "Leistungsschutzrecht"

German parliament just recently passed a law called the Leistungsschutzrecht that is, for all practical purposes, an attempt by the government to make Google pay German publishers for repeating a snippet of the text on their search result pages. Of course, Google drives a lot of traffic their way, but that is not enough for the publishers, they want more.

Enter the Leistungsschutzrecht. It is, of course, being celebrated as a wonderful way to "regulate" that nasty Internet, but the problem is, that it is also making life difficult for bloggers. We are putting ourselves at danger of steep legal fees for take-down orders if we use part of an article we refer to. And since the links are often the first sentence of what we are writing about, there is some concern that the links themselves might be problematic. The law is quite vague on the point of what exactly is allowed and what is not.

(c) netzpolitik.org
Oh, the publishers say, if you're not making money, you don't have to worry. I'm not so sure about that. And so I will refrain from referring to any of the publishers who are behind this law - and there are lots of them: http://opalkatze.wordpress.com/2013/03/01/diese-seiten-unterstutzen-das-leistungsschutzrecht/

Hope to be back to regular blogging - sans links to German press - after Easter.

Saturday, March 23, 2013

Lawyers instead of arguments

One of the Alpha-Bloggers in Germany, Johnny Haeusler from Spreeblick, wrote the other day about the mounting legal costs of operating a blog that reports on and comments the news (Geld ausgeben mit Blogs). He's apparently already spent 10,000 € on legal fees fighting take-down notices from lawyers, although at least he runs ads and has income. People and companies don't like it when you say anything negative about them. They try and root out all of the reports about problems with their products or the deeds that they did and try to silence people.

I am quite allergic to people who give themselves out to be scientists (for example because they wrote a doctorate or publish scientific papers that turn out to be plagiarisms or to have cooked their data or some such) and who react to criticism or statement of fact by sending out the lawyers. I much prefer to see arguments. Don't threaten me, convince me!

I myself just had a tussle about an article that I wrote on this blog in October 2011 about a conference on good scientific practice in medicine at the Charité.  I noted that the journalist Hubert Rehm had spoken about a few cases, amongst them B.S.  (for more information, see Laborjournal (link removed), they have reported in depth on the case). A few weeks ago I received a letter from a Swiss law firm with so many names on the masthead, it took almost half of the first page. Take down the reference to B.S. immediately or else.

I consulted a lawyer, who noted that I am in a very special circumstance: I do research on good scientific practice, and in Germany a researcher is permitted to publish as they please, as long as it's not illegal. And it is never illegal to state the truth. All the same, the lawyer suggested that I tone down my sentence. So I did, and had the lawyers write an (expensive) letter to that effect.

What I don't understand is what these lawyers and their customers think that they are going to achieve by trying to silence everyone. Have they not heard of Barbara Streisand?

Update: On 2013-04-18 I removed the link to the Laborjournal article.  The lawyers insist that even linking to the publication that they consider to be illegal is not okay. By linking I am not assuming the position of the items I link to - I am reporting on what what was said and giving my readers the opportunity to decide for themselves. This is covered by the laws governing freedom of speech and the freedom of the press. I am not removing information because I made an error - I am making the changes clear while removing the identifying information because I do not wish to fight a legal battle on this particular issue.

Friday, March 15, 2013

A Journalistic Quicky on Plagiarism

Time and again I get exasperated with journalists who are only out for a quick sensation instead of actually researching a question. Last Friday a journalist wrote me an email at 3 in the morning asking me to help him answer a question from a student. They are starting a new "service", answering real questions. The question was: How can I find out if I have plagiarized in my term paper? He wanted a very short answer, 4-5 lines, and it could be as "perky" (flott) as possible.

I was traveling, giving a workshop on plagiarism detection for professors and leaving for a week of vacation in the morning. Since this is not a question that can be answered in 4-5 lines, I just didn't reply.

So what was the result? I was contacted today by a friend with a question about a software system that I had "recommended" in some article. Checking my alert service I found the article and was aghast -- the software in question was one that came in last in my 2010 test of so-called plagiarism detection software. The entire article turns the results of my tests upside down.

I suppose it is because I am a computer scientist and I test software that journalists expect that I "recommend" the top-rated software, tell people what to buy and where the magic button is for quickly and painlessly finding plagiarism. In fact, there is no recommended software, the category at the top is "partially useful". It says at the top of the page:
Our recommendation: Only use these systems when suspicions of plagiarism arise that cannot be found with 3-5 words in a search machine. The focus should be on teaching students about plagiarism and how to avoid it instead of investing time in using software. Most of the work involved later is in preparing a plagiarism case and dealing with the plagiator, and for this good documentation is needed. Very few systems provide good documentation.
The journalist banged together three statements, the first and the last attributed to me and the second one not directly including my name, but people have read it as being from me. He writes (numbering by me):
(1) Die Berliner Medieninformatikerin Debora Weber-Wulff hat vor Kurzem verschiedene Softwares getestet. Sieger war das kostenpflichtige Programm turnitin, das mittlerweile an über 80 deutschen Universitäten und Forschungseinrichtungen genutzt wird. Sie können es möglicherweise über Ihre Hochschule kostenlos nutzen.
(2) Eine brauchbare kostenfreie Alternative ist Viper.
(3) Kurze Texte können Sie außerdem gratis bei PlagAware kontrollieren lassen, das in Weber-Wulffs Analyse ebenfalls solide abgeschnitten hat.
 ad (1): "vor Kurzem" means recently. Um, my last test was 2010, published in 2011. We are currently retesting the software, hoping to publish the results in September. These tests take a lot of time to conduct. "Sieger" means the winner. There were no winners. There were 5 systems that were given the grade of C-, as they did not even manage to reach 70 % of the points in the test. None of these can be considered winners, but they can be used as tools if one has a suspicion of plagiarism. They have far too many false negatives and false positives for general use, and all five are on about the same level. And the idea of letting students use them (for free even!) to run their papers through until they come out clean gives me shudders down the spine. The students will resort to swapping word orders and using unfitting synonyms until the numbers (which will differ for the same text from system to system and even in some cases from test to test) reach an acceptable level. There must be someone with experience in reading and interpreting the reports to go over this together with the students if it is to be a meaningful learning experience.

ad (2): Oh dear. I suppose the journalist was looking for a free system, as opposed to the expensive ones. I have no idea what terms he googled to find this system, but he didn't read this on my results page. Technically, he doesn't say that I find the system useful ("brauchbar"), this is his own judgement. But sandwiched in between two references to my work, people misread it as me recommending it. Nothing could be further from the truth. The street address is the same for the company offering the plagiarism detection service and for a paper mill / essay writing service. Their respective telephone numbers differ only in the last digit. And actually reading the "Terms and conditions" should scare the daylights out of anyone: If you submit a paper to Viper, you give the company "All Answers Limited" the irrevocable right to keep a copy of the paper forever and to use it for marketing purposes, either for Viper or any "associated website".  The email account I used for the test (we use a different email address for every test to see what kind of spam shows up later) now gets fun letters with subject lines such as "Enjoy the sun and get your dissertation done on time". Oh, and Viper has the distinction of being the worst system for detecting plagiarism in the test, coming in last with just 24% of the effectiveness points on all of our test cases in 2010.

ad (3): Another system is mentioned as being "solide", respectable, in my analysis. Again, "partially useful" is not even close to respectable.

I wonder why the journalist is in effect advertising the services of three companies, one of which is extremely questionable, instead of really trying to answer the question. At least the comments are good: Most of the comments note that one does not actually plagiarize by mistake. If you are taking careful notes and making the difference clear between what is your own words and ideas and what is from someone else, you are fine.

Now, how do I get journalists who just breeze through writing something like this without actually reading and understanding what hat been written on the topic to start doing real research themselves instead of just blowing off an email to me? I suppose I need to answer all journalists, as soon as possible, with short and snappy sayings. I do wish, however, that they would read at least the summaries before they ask.



Sunday, March 3, 2013

The Dark Alleys of Turkish Academia

I published a short note in September 2012 about the work of a group of academics in Turkey. A. Murat Eren has now organized a translation of their work into English so that a wider group of scientists can take a peek into the very dark alleys of Turkish academia.

http://subjektif.org/landscapes-from-turkish-academy/

Take some time for a long read, there are many pictures documenting the plagiarism. There are ten dissertations, followed by a discussion of the problems involved with dissertations not being published in Turkey. We are really very lucky in Germany that all theses have to be published, as it makes research about them so much easier. There is a long list of excuses given by the libraries for not being able to obtain theses. Istanbul University is my favorite one - you can obtain them, if you fill out all these forms and send money and the moon phase is correct ... [strike that last item].

There is an overview of how many theses are not available at the different libraries -- 40 % of the theses not available at the best library, 66 % at the worst one!

And then there is the list of academics in Turkey with the most retractions to their name -- and their current occupation. Let me quote these here, because it is so shocking:
Only one of the authors with multiple retracted papers is not affiliated with academia. Anyone who knows how difficult it is to get a paper retracted will understand the depth of concern here. How can these people teach at university and mentor doctoral students when they themselves have multiple retractions to their names?

The same chapter also reports on the Sezen case, one that I blogged about in June 2012.

Eren's conclusions:
Turkey’s bad academia is self-perpetuating.

People who have committed ethical violations in their dissertations and publications are allowed to become thesis supervisors. Students who are misguided by these create dissertations that equally violate ethics, publish insignificant or duplicated papers, and some of them become the new academic generation, in turn completing the cycle.

One of the major problems that perpetuates this cycle is the difficulty of access to dissertations. University libraries limit access with arbitrary reasons, and improvements in YÖK Thesis Archive are far from solving the problem in practice.

Even when a dissertation is accessed and plagiarism is seen, penalties are far from being deterrent, due to legal and executive roadblocks.

While advanced societies take science theft very seriously, actors of science theft in Turkey silently go on with their duties, thus deleteriously undermining the credibility of the field.

Even though today’s scientists in Turkey are not proactive, and they are mostly mute unless they have to defend themselves, I believe that self-criticism will become a way to reveal and eventually eradicate academical problems in Turkey in the future.
 I am indebted to the Turkish scientists who have worked on this. I have corresponded with them and did some proofreading on the English version. I hope that this will shine a bright light down the dark alleys. 

Saturday, March 2, 2013

Plagiarism definition from an accused plagiarist

For many years I have used this definition of plagiarism by Paul English in my German talks:
urn:nbn:de:s2w-5415
“Plagiat ist also die aus freier 
Entschließung eines Autors oder 
Künstlers betätigte Entnahme eines 
nicht unbeträchtlichen Gedanken
inhalts eines anderen für sein Werk in 
der Absicht, solche Zwangsanleihe nach ihrer Herkunft durch entsprechende Umgestaltung zu verwischen und den Anschein eigenen Schaffens damit beim Leser oder Beschauer zu erwecken.”  

[S. 81-82]
I didn't like the intentionality that he had in his definition, but it was good that he included the arts here, and I really liked the phrase "Anschein eigenen Schaffens" (the appearance of own work).

As it turns out, Paul English was himself accused of plagiarism. In 1908 a Viennese author, Dr. Stern-Szana, prepared a handwritten manuscript for a book. It was purchased in 1920 by a publisher in Berlin, Socher.  In 1922 Socher gave the manuscript to a typist to have it prepared for publication. The typist was Paul English. The book, however, was never published by Socher. But in 1928 Paul English published a book with the exact same title and extremely similar structure, as well as occasional word-for-word copies.

Socher took English to court in 1929 and there is a wonderful book (that has now been digitized) that Socher put together for the court case, a side-by-side of English and Stern-Szana's manuscript. English republished his book with a new title, the contents unchanged. The court case seems to have gone on to a higher court in 1932, the trail is lost there. I will be researching this during my sabbatical in the hopes that the result of the case has not been lost in the destruction wreaked by the Second World War in Germany. If anyone happens on a clue, please contact me.

A booklet with the definition by English was first published as "Plagiat!! Plagiat!!" in 1930, and then in an extended version as "Meister des Plagiats - oder die Kunst der Abschriftstellerei" in 1933. With the knowledge of the history of English and the context from page 81 of the latter publication, I now read his "definition" as an attempt to wash his hands of the taint of plagiarism. I have now removed the definition from my plagiarism portal and from my slides. I now use Teddi Fishman's definition. I apologize for any inconvenience.