May 2009 ArchiveTuesday, 26 May
Motes, beams &c.
A while back, Philip Davis over at The Scholarly Kitchen posted about a small but useful research project of his: All I did was ask five librarians at institutions administrating Open Access publication charges two simple questions: This is (to me) clearly information that such programs should be collating and reporting, and after two weeks Davis' results were not exactly stellar: Two weeks after asking my simple questions, I received just two short responses. No list, no numbers, but at least a few details: There was some confusion on the part of faculty of what an OA article publication charge really was. Some faculty requests were actually for page charges in conventional subscription journals; one faculty submitted a request for reprint charges; others submitted invoices to the library when they should have been directed to the external granting agency (like the HHMI). To date, no bonafide requests have been denied. That's useful information, as far as it goes, but it doesn't go very far. Davis plays the conspiracy theory card way too hard for my taste, with "dark secrets" in the post title and an opening paragraph that reeks of melodrama: You would have thought I was requesting a field manual for interrogating prisoners of war or a list of members on Dick Cheney's Energy Taskforce. At least in those instances, I would have received a response that answering my questions violated national security or "executive privilege." Whoa, cowboy, back up a minute. As commenter Amanda R pointed out, we don't know much about how Davis went about gathering the information: As a point of clarification, were you directly refused data, or did libraries simply not respond? Did you contact them back and ask why there was no response, or if there was a reason they weren't providing the full data you wanted? In a Friendfeed discussion, librarian Christina Pikas made a related point: the worst part of this is figuring out who you would send a request like that to. It takes me 10 e-mails and 3 phone calls to find the right person at my mothership main library. Almost seems that he's taking confusion for malicious intent as did commenter JQ Johnson: when I in March queried the same institutions that Davis did, I got lots of cooperation. For example, UNC pointed me to a public letter (2/20/2009) to their vice chancellor that summarized in some detail the 12 requests they had funded to date. I'm puzzled why Davis got the response he did. Did he ask the wrong people? Davis replied to both Amanda R and JQJ, but he gave non-answers containing no information about his methodology and insisted that what he had shown was a lack of transparency: Whether the lack of response was caused by human error, technological barriers or internal policy, the result is a lack of transparency in how these author-support programs are performing. I found this very frustrating and left a comment1 aimed at clarifying why that was so: JQJ's comments and questions do not seem to me to skirt the issue at all, but rather to speak directly to alternative explanations for the lack of response. Methodological concerns are not trivial here. Now, it's been almost two weeks since I left that comment, and it hasn't appeared or been answered. What dark secrets is Philip Davis hiding? What dim, Crotty-esque ambitions of being the famous naysayer, the Nicholas Carr of Open Access, are forming even now in the troubled subconscious of this --- Or, you know, I just got stuck in the spam queue. It happens. :-) Davis finishes up by saying something relatively unexceptionable if taken out of the context of his insistence on ignoring both Occam's and Hanlon's razors: Library Open Access policies cannot exist with secret budgets, ambiguous guidelines, and a practice of stonewalling requests for information. Exactly so -- everyone else, including bloggers who wish to hold librarian feet to the accountability fire. Thursday, 14 May
Death and pigs and penmanship.
Go read Digger. Thank me later.
Better than nothing? A bookmarklet for The Open Lab
A while back, I mentioned that the Open Lab could really use a bookmarklet to make submission easier and faster. Since for once the LazyWeb did not provide, I've had a crack at it. I've got a simple version working (though I haven't tested it anywhere but FireFox3); all it does is pop up a conveniently-sized window showing the submission form: OpenLab If you drag that to your toolbar, you can at least hit the bookmarklet while you're on the page you want to submit, and simply move the popup around in order to copy over the information. I find it a lot more convenient than having to open the submission form in a separate window and go back and forth. What would really make this useful is if it would auto-fill in the submitter's name, address and website and pull in the title and url of the post being submitted. I'm trying to add that functionality, but I'm a complete javascript n00b and so far cannot get it to work, no matter what existing bookmarklets I try to use as a model. So I hope it's useful to someone as-is -- and if you know your way around javascript feel free to upgrade it! -- but I wouldn't hold my breath waiting for the improved version. Tuesday, 12 May
Oprah and anti-vaccine propaganda
Unless you've been living under a rock, you'll be aware of the latest round of anti-vaccine inanity, but unless you've taken some time to look into it you might not be aware of quite how stupid, and how dangerous, the anti-vaccine "crusade" really is. In either case, do yourself and everyone else a favor and go read Shirley's open letter to Oprah. @Oprah, don't watch show but nice Duke speech. take own advice and make difficult decision to pull support from mccarthy, save lives. kthxbi It's a quick read, but contains all the facts you'd want at your fingertips and pointers to plenty more. It's also warm and calm and human, and strikes me as being far more likely to actually get Oprah to reconsider her stance than some of the angrier and/or more science-focused commentary out there. So go read it, blog about it, retweet it. See if we can get Oprah's attention. Monday, 11 May
The Semantic Web: a long and somewhat convoluted definition.
This1 is an attempt to define and explain the semantic web for a lay audience, though it should be remembered that I am a member of that audience myself... It is a commonplace that we are drowning in information, and nowhere is this "information overload" more apparent than in scientific research. The National Library of Medicine's literature database, PubMed, is searched more than 60 million times a month and contains almost 19 million records from more than 5300 journals -- still only a fraction of the approximately 15,000 active, refereed, scientific journals listed in Ulrich's Periodicals Directory2. GenBank, the world's foremost repository of nucleic acid sequence information, contains roughly 100 billion bases in 100 million sequence records, and is growing at an exponentially increasing rate that is currently in excess of 50,000 records per day. Unlike PubMed and GenBank, which are cross-disciplinary databases, the Nucleic Acids Research Molecular Biology Database Collection is a carefully curated list of high-value specialist resources; it currently lists 1170 distinct, largely non-overlapping databases. I could go on, but you get the point3. As things stand, researchers talk to researchers and use computers to facilitate that conversation; what we need is for computers to be able to talk to computers. To cope with (literally) inhuman volumes of data, we need that data to start making sense to machines, so that they can do something no human brain can do: process all of it. We need to make it possible for machines to transfer richly interconnected data among themselves, mix and remix it, generate new connections, filter it, process it, transform it, and output the results to formats and interfaces that make sense to human brains -- substrates on which we can carry out the sorts of synthetic, creative thinking that computers cannot do. We need a man-machine partnership in which both partners can do what they do best, and that means we need the semantic web. The semantic web is the outcome of processes and frameworks with which computers can manipulate data in a way that makes it accessible by human brains. It is built on the standards and metadata -- information about data -- that are required for automated data exchange and processing, which in turn is required to create machine-generated, human-scale summaries, skeletons, outlines and other representations of, and interfaces with, the entire knowledge corpus. Here's an example. Human brains have no trouble processing the following data: Another reason for opening access to research. Wilbanks J. BMJ. 333:1306-8 (2006). To you, that's a reference; but to a computer, it's just a string of text. What a computer needs is information (metatada) about each substring: Title: Another reason for opening access to research. Now the computer "knows" which letters identify John, which constitute the title of the article, and so on. If you set the standards up properly, it even "knows" that Wilbanks is the surname and J the first initial, and so on into ever finer grained properties. Now imagine you had, oh, say, about 19 million such records. A human brain cannot do anything useful with such a database, but a computer can -- which is exactly why we can ask PubMed human-scale questions like "how many papers did J Wilbanks publish between 2000 and 2009?", or "show me all the papers with "access to research" in the title". Now multiply that -- the ability to ask human-scale questions of a mass of information far too large for any human brain to absorb or process -- by thousands of different types of information (text, gene sequences, chemical formulae, microarray results, etc etc), millions of individual records within each data type, recorded in thousands of journals and databases, produced by hundreds of thousands of laboratories, libraries and garage hackers. Imagine what we could learn if we could query all of that information on a human scale. There: that's a glimpse of the potential power of the semantic web. ------------- 2 tickboxes = active, refereed, scholarly/academic; search = LC Classification Number for [Q* OR R* OR S* OR T* OR U* OR V*] 3In fact, I'm always on the lookout for more good examples of the "data deluge" and the rapid progress of science and tech; post 'em (in comments) if you got 'em. Saturday, 09 May
More on the "Australasian Journal of..." series.
On the basis of the evidence below, I believe the entire "Australasian journal of..." series from Excerpta Medica to be either nonexistent or fake, in the same sense of "fake" that Elsevier has already admitted applies to the following six titles from that series:
WorldCat lists a further thirteen titles in the apparent series:
I believe these all to be either nonexistent or fake because:
Australasian journal of bone & joint medicine.I've written to the library to ask for a copy or photograph of either journal.
Mellsop GW, Menkes DB, El-Badri S. Releasing Psychiatry from the Constraints of Categorical Diagnosis. Australasian Journal of Psychiatry. 2007;15:3-5. doi: 10.1080/10398560601083134That DOI resolves to an article of the same name and with the same page numbers in Australasian Psychiatry, which is published by Informa Healthcare for The Royal Australian and New Zealand College of Psychiatrists. I've written to the communicating author, Dr Mellsop, to ask for a reprint. Of the remaining three hits, two are citations to other articles in the Australasian Journal of Psychiatry and one I cannot decipher without paying a fee to see the references of an obscure paper. Of the two I can decipher, one resolves to a paper in Australasian Psychiatry from 2003; the same article is available from Informaworld. The other is to an "in press" citation from 2007 (which also appears in 4b below).
Excerpta Medica in action
The Elsevier fake journal scandal is expanding in two directions. First, it's now "fake journals", plural. Elsevier has admitted to publishing six of these things:
Only one, Bone & Joint Medicine, is on the list I posted yesterday of Excerpta Medica "Australasian journal of..." titles from WorldCat. That leaves thirteen titles in the same series, none of which are listed in PubMed, Science Direct, Ulrich's or (thanks to Peter Murray, see comments on that post) Scopus. Jonathan Rochkind has pointed out how to find the rest of their titles in WorldCat; there are around 50 all told. That's the tip; I await the rest of the iceberg. The second direction in which the scandal is expanding is towards ghostwriting: I think probably Laika was the first person to make this connection clear. This is a separate but related issue, and Excerpta Medica appears to be up to their armpits in this sleazy practice as well. There's quite a large literature on ghostwriting, so here are just a few quotes (mentioning Excerpta Medica) to whet your appetite (if indeed one could be said to have an 'appetite' for something so nauseating): Anna Wilde Mathews, At medical journals, paid writers play big role When articles are ghostwritten by someone paid by a company, the big question is whether the article gets slanted. That's what one former free-lance medical writer alleges she was told to do by a company hired by Johnson & Johnson. Carl Elliott, Pharma goes to the laundry: public relations and the business of medical education One of the most ingenious pieces of the Fen-Phen public relations strategy was its ghostwriting scheme. In 1996 Wyeth hired Excerpta Medica Inc, a New Jersey-based medical communications firm, to write ten articles for medical journals promoting obesity treatment. Wyeth paid Excerpta Medica $20,000 per article. In turn, Excerpta Medica paid prominent university researchers $1,000 to $1,500 to edit drafts of their articles and put their names on the published product. Wyeth kept each article under tight control, scrubbing drafts of any material that could damage sales. One draft article included sentences that read: "Individual case reports also suggest a link between dexfenfluramine and primary pulmonary hypertension." Wyeth had Excerpta delete it. (21) Sergio Sismondo, Ghost Management: How Much of the Medical Literature Is Shaped Behind the Scenes by the Pharmaceutical Industry? Several of the publication planning firms identified are owned by major publishing houses. For example, Excerpta Medica is "an Elsevier business" and writes that its "relationship with Elsevier allows... access to editors and editorial boards who provide professional advice and deep opinion leader networks" [40]. Wolters Kluwer Health draws attention to its publisher Lippincott Williams & Wilkins, with "nearly 275 periodicals and 1,500 books in more than 100 disciplines," and to Ovid and its other medical information providers, emphasizing the links it can make between its different arms [41]. Vertical integration is attractive in the industry as a whole: at least three of the world's largest advertising agencies own not only MECCs, but also CROs [contract research organizations] [13]. Wednesday, 06 May
No bottom to worse at Elsevier?
Like Dorothea, I haven't said anything about the slimy Merck/Elsevier fake publication deal, because I thought the blogosphere had plenty of coverage. Anyone who reads me would know all about the scandal. The latest development, though, strikes me as something that should be shouted from every available rooftop: Elsevier simply must answer the questions raised. Via Dorothea: Jonathan Rochkind has done a little "forensic librarianship" and raised astonishing questions about the entire imprint, Excerpta Medica, which published the fake journal that started all of this. Go read Jonathan, but the bottom line is this: Excerpta Medica does not provide a straightforward list of its own publications or make clear which are, ahem, "industry-sponsored". Jonathan says "WorldCat lists 50 publications by Excerpta Medica Communications"; I just tried a simple author search for that phrase and got only 21 results, including the recently-exposed-as-fake Australasian journal of bone & joint medicine; how many others are fake? How about the other
Why, for one thing, are none of them indexed by Science Direct? The PubMed journal limit field contains only Australasian journals of dermatology, pharmacy and optometry; the latter two seem to be defunct and the first is published by Wiley. Futher obvious questions arising:
Monday, 04 May
Alternative Connotea bookmarklets for OATP
Peter Suber launched the Open Access Tracking Project on April 16, and you can read a full description of it in this month's SPARC OA Newsletter. I encourage anyone interested in contributing to the OATP to read the full description so as to make your contributions maximally useful. Here are the basics:
If you are pressed for time, and we all are, then it may help to have a Connotea bookmarklet with the oa.new tag (or oa.unclassified, if the item is older than six months) already filled in. That way you can just hit the bookmarklet, hit "add to my library" and be done. It's better if you have time to put in further classifying tags and a description, but at least this way the page will be recorded. I guess the easiest way to do this would be to have three bookmarklets, the regular one and the "two click" bookmarklets I describe here. If you're using FireFox, here are the two-click versions; you can install them the same way as the regular one (drag to the toolbar) and, if you like, rename them using the "Organize Bookmarks" dialog box: This would obviously be better as a one-click than a two-click bookmarklet, but I failed dismally in my attempt to make it so because I don't actually know anything about javascript. I've previously suggested to the lazyweb that someone make a bookmarklet for another project, and nothing came of it; I'm hoping both that this little hack will be useful, and that it will inspire an actual programmer to improve it. Saturday, 02 May
Congratulations to Harvard.
Harvard has been fortunate enough to secure the services of Peter Suber, who has been appointed a Berkman Fellow. I cannot say it better so I will simply quote Stevan Harnad's comments accompanying the announcement: A brilliant choice, and eminently well-deserved. Peter -- whose historic contributions to the growth of OA have been spectacularly successful -- will continue his invaluable OA work, but this Fellowship will also make it possible for him to begin writing the books on OA and related matters that are welling up in him, and that the world scholarly and scientific research community (as well as the historians of knowledge) are eagerly waiting to read, digest and learn from for years to come. Friday, 01 May
Open Access, copyright transfer and NC licensing: caveat emptor!
When I was rummaging around in J Vis a while back, I noticed something that I've been meaning to blog about: why is an Open Access journal still requiring complete surrender of author copyright1? I happen to know one answer to that question, though I don't know whether this is the case at J Vis. The deal is this: Big Publishing sells paper reprints, and not just of their own articles -- they pay fees where necessary in order to provide a one-stop shop (e.g. through Excerpta Medica or Ovid), mainly to the pharmaceutical industry. In order to blanket existing and potential customers with research favorable to their causes, pharm companies spend a great deal of money on these reprints -- some of which trickles down to small publishers, some of whom depend on that revenue. Such publishers therefore cannot afford to give up such rights as force the reprint traders to pay for their wares. J Vis has a copyright notice which says, in part: Users may view, reproduce or store copies of articles comprising the journal provided that the articles are used only for their personal, non-commercial use. [...] Any uses and or copies of Journal of Vision articles, either in whole or in part, must include the customary bibliographic citation, including author attribution, date, article title, journal name, DOI and/or URL, and copyright notice. A closely related strategy is to use open(ish) licensing that contains a noncommercial (NC) clause. For instance, Springer Open Choice leaves copyright with authors, but uses their own license that is compatible with CC-BY-NC. That, like J Vis' copyright notice, puts their publications out of reach of the reprint traders, except for the little clause that says: No term or provision of this License shall be deemed waived and no breach consented to unless such waiver or consent shall be in writing and signed by the party to be charged with such waiver or consent. which allows the small publishers to waive the NC part for certain uses, in return for what amounts to royalties2. Why do I care about this? Because it's another instance of the old "Free is not Open" argument, and the problems discussed here and here. Since digital repositories -- as far as I know, all existing digital repositories -- carry no blanket license, but leave intact the licensing of each individual digital object they contain, the effect is that there are no OA repositories that remove both price and permission barriers (that is, provide "strong" or "libre" OA to their contents). The end result is the same problem that copyleft causes3: Reuse, Rework and Redistribute may not be powerfully affected, but Remix is killed outright. Consider, for instance, PubMed Central, all the papers in which are free to read. What else can you do with them? Textmining, datamining? As far as I can tell, the answer is no, you can't do any of that -- because whatever you want to do, some papers will be licensed to allow it and some won't. Barring some way to reach agreements with dozens or perhaps hundreds of publishers and pre-sort millions of papers on the basis of licensing, the entire PMC barrel is spoiled by the copyrighted, NC and similar apples -- though there is a much smaller uncontaminated barrel available4. Which brings me, at long last, to my title. Why "caveat emptor"? Well, if you're buying Open Access -- that is, publishing with a journal that charges author-side fees (remember, most don't), make sure you're getting value for your money! If the journal demands your copyright, or slaps a NC license on your work before distributing it, you should know that many possible downstream uses for your work are being pre-emptively eliminated. Are you sure that's what you want? ------------- ![]() 2 There's even a clause in the canonical definitions of OA that deals with this issue -- or at least I suspect that's what it's doing there. Budapest, which came first, says this: The only constraint on reproduction and distribution, and the only role for copyright in this domain, should be to give authors control over the integrity of their work and the right to be properly acknowledged and cited. But Bethesda and Berlin, both of which were written about two years later, include this in the definition of Open Access (emphasis mine): The author(s) and right holder(s) of [OA works] grant(s) to all users a free, irrevocable, worldwide, right of access to, and a license to copy, use, distribute, transmit and display the work publicly and to make and distribute derivative works, in any digital medium for any responsible purpose, subject to proper attribution of authorship (community standards, will continue to provide the mechanism for enforcement of proper attribution and responsible use of the published work, as they do now), as well as the right to make small numbers of printed copies for their personal use. I suspect, though of course I'm really just guessing, that the "small numbers" clause was inserted at least in part as a reaction to the gleeful scarfing-up of OA works for resale by reprint distributors, or to the threat of same. It needs the force of law to be any use for that purpose, though, which is where licensing comes in -- using a noncommercial clause like the one in CC NC licenses is a bit like swatting a gnat with a bulldozer, but I know of no licenses which deal specifically with the volume reprint trade but allow other commercial uses. Frankly, even if there were such a license, I wonder whether publishers who insist on NC now would switch to it. Springer's Open Choice, for example, charges $3000 per article. I would say they've already been paid and shouldn't much care if someone else, without restricting access to their content, makes further profit from it. The barrier (to such a view) seems to be a mindset that says "why shouldn't I get my cut?" -- and if any other downstream use should arise that starts to make serious money, they would want their cut of that too. To make sure they get it, just in case it ever comes into being, I expect that many publishers would be willing pre-emptively to kill off any smaller commercial innovations that might otherwise arise. (Someone will no doubt argue that these fledgelings could always negotiate via the waiver clause, as above. The main problem there is that such negotiations themselves cost money, and since much of the promise of OA is in remix across a wide range of sources, that means negotiating with every publisher. Let me know how that works out for ya.) 3 In fact, although NC clauses don't require a particular license for derivative or collective works, they do exert a kind of de facto copyleft, because they are only downstream-compatible with other NC licenses -- see footnote 1 here, or play this game for a while. 4 Two things of note here: firstly, the NIH apparently agrees with me that OA by definition removes both price and permission barriers, since they refer to the uncontaminated barrel as Open Access and explicitly say that the rest of their content is free, not OA. Secondly, following on from Egon's and Antony's questions, I wonder: by permitting the spoilage, can databases violate the licensing terms of the CC-BY papers they also contain? The question hinges on this wording: You may not offer or impose any terms on the Work that restrict the terms of this License or the ability of the recipient of the Work to exercise the rights granted to that recipient under the terms of the License. [...] When You Distribute or Publicly Perform the Work, You may not impose any effective technological measures on the Work that restrict the ability of a recipient of the Work from You to exercise the rights granted to that recipient under the terms of the License. Egon and Antony are asking more directly technological questions, but I do think it could be argued that if they do not do as PMC has done and make available a libre OA subset, databases can be seen to be imposing terms that restrict, etc. |
RSS Feed
Links: |