[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: Does the arXiv lead to higher citations and reduced publisher downloads?
Kristin Antelman wrote:
Data I collected for philosophy, political science, engineering
and mathematics do not support this hypothesis that OA causes
more citations for better articles only (given that one uses
overall citations as a rough measure of quality).
The data that Kristin illustrates do not show causation, only
association. Articles available freely from repositories are
associated with a higher frequency of citations. I have no
contention with this claim, and everyone since Steve Lawrence
(Nature, 2001) has backed this observation. What I am arguing,
however, is that the likely (and primary) cause of the citation
advantage is not increased access, but some quality differential,
leading to better articles being deposited in the arXiv. In our
manuscript, we argue that if OA-as-cause is present, its scope is
severely limited to highly-cited articles. How can we say this?
If increased access was the cause of increased citations in our
data, we should see a significant and positive correlation
between fulltext article downloads from the arXiv and the number
of citations an article receives. The rationale is that article
repositories increases readership, some of which leads to
increased citations (this is the argument that SPARC, Harnad,
Suber and others use to justify the use of archives). Now please
take a look at Figure 3 in our paper
(http://arxiv.org/pdf/cs.DL/0603056). Notice that this positive
association only applies to highly-cited articles (note: the
inverse log of 2.5 is about 316 downloads).
In order to argue for causation one must be able to describe and
measure the mechanism by which the cause takes place. Antelman
(and others) demonstrate only the association between open access
and citations, and infer that open access must be the cause. In
our paper, we test the Open Access postulate, the Early View
postulate, and a Quality Differential postulate. Of these three
we feel that the Quality Differential is the strongest
explanation for the data. We do not rule out Open Access
completely, but the data do suggest that if access is responsible
for increased citations, this effect may only take place for
already highly-cited articles.
--Phil Davis