|
Fooling around with numbers, part 3; or, why would anyone pay for these journals?
Following on from part 2, I thought I'd ask a couple more questions about price-per-use, based on the online usage stats in the UCOSC dataset. I started on this because I noticed that in Fig 2 of part 2, I'd missed a point: there is an even-further-out outlier above the Elsevier set I pointed out: It's another Elsevier journal, Nuclear Physics B. In 2003, only 1001 online uses were reported to UC by the publisher, but the 2004 list price was $15,360. The companion journal Nuc Phys A is not much better, $10,121 for 1198 uses. Compare that with Nature, 286125 uses at just $1,280! It gets worse, too, because I'm led to believe that anything that appears in a physics journal these days is available ahead of time from the arXiv. I tried to confirm that for Nuc Phys B, but either I'm missing something or the arXiv search function is totally for shit, so I couldn't do it systematically. I did go through the latest table of contents (Vol 813 issue 3) on the Science Direct page, and was easily able to find every paper in the arXiv -- mostly just by searching on author names, though in a couple of cases I had to put titles into Google Scholar. Still, they were all there, which leads me to wonder why any library would buy Nuc Phys B (or Nuc Phys A, assuming it's also covered by the arXiv). Prices haven't improved in the intervening 5 years, either: [I had a table here but Movable Type keeps munging it. Piece of shit. Here's a jpg until I sort it.]
The curve fits are for the whole of each dataset, even though it's a zoomed view; the Nature set excludes British Journal of Pharmacology, the only NPG title that recorded 0 uses, and Nature itself. Colour coding by publisher is the same for each figure in this post. As in part 2, the correlation between price and use is weak at best and doesn't change much from publisher to publisher. Also, each publisher subset shows a stronger correlation than the entire pooled set -- score another one for Bob O'Hara's suggestion that finer-grained analyses of this kind of data are likely to produce more robust results. Since cutoffs improved the apparent correlation for the pooled set, I tried that with the publisher subsets:
Next, I broke the data out into intervals (for clarity the labels say 0-1, 1-2 etc, but the actual intervals used were 0-0.99, 1-1.99 etc):
So, are these reasonable prices -- $1 per use, $6 per use? I'm not sure I can, but I'll try to say something about that question, using the UCOSC dataset, in Part 4. Comments Problems with analysis: UCOSC most likely pays more than list price. Reasonable cost per article as compared to a subscription is more like $15-$18 from a librarian�s point of view. However, the real reasonable cost is what a user would pay out of their own pocket for an article, which is not static � depends on how important the article is to their work at that time, how close they are to a deadline, how flush they are that month, etc. Libraries paying for pay-per-view access are similar to agreeing to a la carte pricing to provide an all-you-can-eat buffet. Unimpeded access leads to indiscriminate use. When a user needs to take any action in addition to simply clicking on a link, they make a better determination of their actual need. Most likely the ILL office of UCOSC already uses pay-per-view to get some articles for their users. Requiring mediation by librarians to get pay-per-view articles is one way to control indiscriminate use. Post a comment |
RSS Feed
Links: |
Interesting, thanks for sharing this, Bill. Please note - without open access, usage-based pricing is a harm to scholarship. One of the reasons is that usage-based pricing inevitably discourages use.
However, with open access, this could be a catalyst for change.
This has inspired me to write a blogpost of my own, which can be found here.