Transcript Document
Bibliometric research methods Faculty Brown Bag IUPUI Cassidy R. Sugimoto
Overview Vocabularly Citation analysis Citation indices Bibliometric laws Impact factor Applications
Vocabulary
Scholarly Communications Formal and information Scientometrics Scientific communication Infometrics Thinking beyond scholarly “texts” Webometrics web Bibliometrics Application of statistical and mathematical methods (formal channels)
Citation analysis Citing document A A references B Cited document B B is cited by A Why do people cite? Why are some articles not cited?
What does a citation mean?
Who’s on first?
Embedded citation index from
` En mishpat
: Babylonian Talmud (1546) (Weinberg, 1997) Shepard’s Citation Index (1873) Shapiro (1992)
Institute for Scientific Information (ISI)
Scopus
GoogleScholar
Comparison
Scopus
n=7,333 (86%)
Scopus 29% (2,441) Overlap 57% (4,892) Web of Science 14% (1,216)
Web of Science
n=6,108 (71%)
Distribution of unique and overlapping citations in Scopus and Web of Science (n=8,549)
Are you a citation index?
Bibliometric research OR “Why I love good indexes”
Citation analysis Citing document A A references B Cited document B B is cited by A
Citation analysis: methods Not just articles…
Variable:PRODUCERS
Variable:PRODUCERS
Variable:ARTIFACTS
Variable:CONCEPTS
Hybrid approaches Chaomei Chen: http://www.pages.drexel.edu/~cc345/citespace/figures/terrorism1990-2003-300dpi.png
h
-index Hirsch (2005)
A scientist has index h if h of [his/her] N p papers have at least h citations each, and the other (N p − h) papers have at most h citations each.
Bibliometric laws Lotka’s Law (1926) the number (of authors) making
n
contributions is about 1/
n
² of those making one; and the proportion of all contributors, that make a single contribution, is about 60 percent (60,15,7…6>10) Not statistically exact May be changing with the current model of scholarship
Bibliometric laws Bradford’s law (1934) Journals in a field can be divided into three parts: 1) Core: relatively few # of journals producing 1/3 of all articles 2) Zone 2: same # of articles, but > # of journals 3) Zone 3: same # of articles, but > # of journals The mathematical relationship of the number of journals in the core to the first zone is a constant
n
and to the second zone the relationship is
n
². 1:
n
:
n
² Not statistically exact General power law distribution (akin to Pareto’s law in economics)
Bibliometric laws Zipf’s Law (1935) relationship is:
r
frequency, and
k
x
f
=
k
where
r
is the rank of the word, is the constant
f
is the equals a constant that is approximately 26,500 Not statistically exact General power law probability distribution
Bibliometric laws Other power law probability distributions Pareto’s law (economics) 80-20 rule Law of the vital few Principle of factor sparsity PageRank (google) The Long Tail (markets)
Journal impact factors
As a research method… Reliability?
Validity?
Limitations?
Applications?
Finding and use Collection development Reference services Collection evaluation Use studies Information retrieval algorithms Diffusion of ideas Domain areas and interdisciplinarity Mapping science
Writing your paper…