Bibliometric Analysis with Sci2 - Methods in Information Science and


Bibliometric Analysis with Sci2:
Choose Your Own Adventure
Laura Ridenour
School of Library and Information Science, Indiana University
Overview
• Paper citation network using Sci2
• Gathering data
• Examining and understanding file format
• Adjusting program parameters for speed
• Using the Network Analysis Toolkit
• Visualizing a network
• Exporting data for use in other programs
Gathering Data
• ISI file format
– Text file with special formatting
– Downloadable from Web of Science (WoS), 500 records at a time
• Other sources include the Scholarly Database (http://sdb.cns.iu.edu/search/), but data from this source works slightly differently from what is shown in this tutorial
• Can also use any data with nodes and edges, as long as it’s a clean data set
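To make the "special formatting" concrete, here is a minimal Python sketch of reading ISI-style tagged text. It assumes the usual layout of such exports: each field starts with a two-letter tag, continuation lines are indented, `ER` closes a record, and `FN`/`VR`/`EF` are file-level lines. The function name `parse_isi` and the sample record are invented for illustration and are not part of Sci2.

```python
from typing import Dict, List


def parse_isi(text: str) -> List[Dict[str, List[str]]]:
    """Split ISI-style tagged text into records (tag -> list of values)."""
    records: List[Dict[str, List[str]]] = []
    current: Dict[str, List[str]] = {}
    tag = None
    # Drop a leading byte-order mark if the export has one (assumption).
    for line in text.lstrip("\ufeff").splitlines():
        if not line.strip():
            continue                      # blank separator line
        head, value = line[:2], line[3:].strip()
        if head == "ER":                  # end of one record
            records.append(current)
            current, tag = {}, None
        elif head in ("FN", "VR", "EF"):  # file-level header/footer lines
            continue
        elif head.strip():                # a new two-letter field tag
            tag = head
            current.setdefault(tag, []).append(value)
        elif tag is not None:             # indented continuation of the last tag
            current[tag].append(line.strip())
    return records


# Invented sample record, for illustration only.
SAMPLE = """FN ISI Export Format
VR 1.0
PT J
AU Doe, J
   Roe, R
TI An invented title
PY 2010
ER

EF
"""

records = parse_isi(SAMPLE)
```

Each record comes back as a dictionary of lists, so multi-line fields such as authors keep one entry per line.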
Required tools
• Sci2 https://sci2.cns.iu.edu/user/welcome.php
– Must create an account in order to download
• A text editor
• Helpful:
– Oxygen, Gephi
Allocating Additional Memory
• In the Sci2 directory, open the .ini file for Sci2
and change the settings to the following:
– -vmargs
– -Xms30m
– -Xmx1028m
• Save and close the file, then reopen Sci2
• Note: you may need to adjust the memory
parameters slightly depending on system
capabilities and settings
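If you would rather script the change than edit the file by hand, a sketch like the following could work. It assumes the .ini file holds one launcher argument per line, with JVM options listed after a `-vmargs` line; the function name `set_jvm_memory` is hypothetical. Keep a backup copy of the file before overwriting it.

```python
def set_jvm_memory(ini_text: str, xms: str = "30m", xmx: str = "1028m") -> str:
    """Return ini_text with -Xms/-Xmx set, placed right after -vmargs.

    Assumes one argument per line (an assumption about the file layout).
    """
    # Drop any existing heap settings so they are not duplicated.
    lines = [ln for ln in ini_text.splitlines()
             if not ln.strip().startswith(("-Xms", "-Xmx"))]
    stripped = [ln.strip() for ln in lines]
    if "-vmargs" not in stripped:
        lines.append("-vmargs")
        stripped.append("-vmargs")
    idx = stripped.index("-vmargs")
    # Insert the new values directly below the -vmargs line.
    lines[idx + 1:idx + 1] = [f"-Xms{xms}", f"-Xmx{xmx}"]
    return "\n".join(lines) + "\n"
```

The defaults mirror the values on this slide; pass smaller or larger values for `xmx` depending on how much memory your system has.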
http://www.libraries.iub.edu
Web of Knowledge
Download Settings
What an ISI File Looks Like
Using Multiple ISI Files
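• When combining several ISI files into one, strip out all headers except the beginning and end tags: keep the opening header of the first file and a single closing tag at the very end. This can be done manually or with a script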
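One way to script the merge is sketched below. It assumes each downloaded file opens with `FN` and `VR` header lines and closes with an `EF` line, as ISI exports typically do; the function names `merge_isi` and `merge_isi_files` and the UTF-8 encoding are assumptions for illustration.

```python
from pathlib import Path
from typing import Iterable, List


def merge_isi(texts: Iterable[str]) -> str:
    """Concatenate ISI texts, keeping one header block and one EF line."""
    header: List[str] = []
    body: List[str] = []
    for i, text in enumerate(texts):
        for line in text.lstrip("\ufeff").splitlines():
            if line.startswith(("FN ", "VR ")):
                if i == 0:               # keep the header of the first file only
                    header.append(line)
            elif line.strip() == "EF":   # drop every end-of-file tag...
                continue
            else:
                body.append(line)
    return "\n".join(header + body + ["EF"]) + "\n"   # ...and add one back


def merge_isi_files(paths: Iterable[str], out_path: str) -> None:
    """Read the given files (assumed UTF-8) and write the merged result."""
    texts = [Path(p).read_text(encoding="utf-8") for p in paths]
    Path(out_path).write_text(merge_isi(texts), encoding="utf-8")
```

Open the merged file in a text editor afterwards to confirm it has exactly one header at the top and one `EF` at the bottom before loading it into Sci2.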
Sci2
• File > Load > (select your file and click ‘Open’)
> choose ‘ISI flat format’ > click ‘Select’
In the Data Manager
• Right-click the loaded file, select “view” to check and/or modify the file
• If you modify the file, be sure to save the file to a permanent location and
reload it into the data manager
Extracting the Network
Data Manager
Using the Network Analysis Toolkit (NAT)
• Choose “Network Analysis Toolkit”
NAT Analysis is Displayed in the Console
Examining the Output
• Checking the NAT results before visualizing can save time. If the network is particularly densely connected, visualizing it on a faster system may save time
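As a rough guide to what "densely connected" means, the sketch below computes the textbook definition of network density from a node count and an edge count. This is the standard formula, not necessarily the exact calculation NAT performs (for example, NAT may treat self-loops differently), so use it only as a sanity check on the reported numbers.

```python
def density(num_nodes: int, num_edges: int, directed: bool = True) -> float:
    """Fraction of possible edges that are present (self-loops ignored)."""
    if num_nodes < 2:
        return 0.0                            # no edges are possible
    possible = num_nodes * (num_nodes - 1)    # ordered pairs of distinct nodes
    if not directed:
        possible //= 2                        # unordered pairs
    return num_edges / possible
```

A paper citation network is directed, so the default `directed=True` is the relevant case here; values closer to 1 mean more edges to lay out and draw.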
Visualization
• Select the extracted network to visualize from
the Data Manager
Visualization (cont’d)
• Visualization > Networks > GUESS
Visualization (GUESS)
Choosing a layout (GUESS)
• Layout > GEM (or whichever layout you would
like to use)
Initial GEM Layout (GUESS)
Graph Modifier Dashboard in GUESS
Play!!
• Take some time to play with different properties in the network. Feel free to use the interpreter, which uses Jython (examples on the next slide, from the CNS wiki)
Jython Examples
Exporting Data
• From the Data Manager, right-click the appropriate graph file and select “Save”
Exporting Data (cont’d)
• Choose GraphML for importing a graph into
Gephi
– After saving, change the file extension from .XML
to .graphml to make the file usable by Gephi
• Choose .net for Pajek
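The extension change for Gephi can also be scripted. The sketch below uses only Python's standard library; the function name `to_graphml_extension` is hypothetical, and the handling of a doubled `.graphml.xml` ending is an assumption about how the saved file might be named.

```python
from pathlib import Path


def to_graphml_extension(path: str) -> Path:
    """Rename a saved .xml file so it ends in .graphml; return the new path."""
    src = Path(path)
    if src.suffix.lower() != ".xml":
        raise ValueError(f"expected a .xml file, got {src.name}")
    if src.name.lower().endswith(".graphml.xml"):
        dst = src.with_name(src.name[:-len(".xml")])  # just drop the trailing .xml
    else:
        dst = src.with_suffix(".graphml")             # swap .xml for .graphml
    if dst.exists():
        raise FileExistsError(f"{dst} already exists")
    src.rename(dst)
    return dst
```

The function refuses to overwrite an existing file, so an earlier export is never silently replaced.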
Additional Resources
• http://wiki.cns.iu.edu/
• http://gephi.org/