Analysing Public Science Debates through Blogs and Online News Sources

Mike Thelwall Statistical Cybermetrics Research Group University of Wolverhampton, UK

Contents

Background    Blogs Oline news sources RSS Tracking public science debates Detecting public science debates

Background

Blogs, public opinion, online news, RSS

Background

There are millions of bloggers
Bloggers are almost normal human beings
Automatically tracking bloggers’ postings may give insights into public opinion

Blog tracking companies

IBM  WebFountain Intelliseek   BlogPulse “Monitor, measure and leverage consumer generated media” Others growing…

RSS Format

Rich Site Summary/Really Simple Syndication
  XML technology
  Used for frequently updated information sources (blogs, news, academic journals)
RSS Readers
  Users subscribe to the RSS feeds of favourite blogs/sites/journals/searches
  Notified when updates are available
  User-controlled ‘push’ technology
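As a rough illustration of why RSS suits automatic tracking, the sketch below reads the items out of a small RSS 2.0 feed with the Python standard library. The feed text, the example.org links and the `parse_items` helper are all made up for this example; a real monitor would download feeds on a schedule and cope with malformed XML.

```python
import xml.etree.ElementTree as ET
from email.utils import parsedate_to_datetime

# Hypothetical RSS 2.0 feed, inlined so the sketch runs without network access.
SAMPLE_FEED = """<?xml version="1.0"?>
<rss version="2.0">
  <channel>
    <title>Example science blog</title>
    <item>
      <title>Stem cell research update</title>
      <link>http://example.org/posts/1</link>
      <pubDate>Mon, 06 Mar 2006 09:00:00 GMT</pubDate>
      <description>New funding rules announced.</description>
    </item>
    <item>
      <title>GM food labelling</title>
      <link>http://example.org/posts/2</link>
      <pubDate>Tue, 07 Mar 2006 14:30:00 GMT</pubDate>
      <description>A supermarket changes its policy.</description>
    </item>
  </channel>
</rss>"""


def parse_items(feed_xml):
    """Return one dict per <item> element in an RSS 2.0 feed (hypothetical helper)."""
    root = ET.fromstring(feed_xml)
    items = []
    for item in root.iter("item"):
        pub_date = item.findtext("pubDate")
        items.append({
            "title": item.findtext("title", default=""),
            "link": item.findtext("link", default=""),
            "text": item.findtext("description", default=""),
            # RSS 2.0 dates use the RFC 822 style, which email.utils can parse.
            "published": parsedate_to_datetime(pub_date) if pub_date else None,
        })
    return items


for entry in parse_items(SAMPLE_FEED):
    print(entry["published"].date(), entry["title"])
```

Each item carries its own publication date, which is what makes date-stamped collections of postings possible.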

Tracking Public Science Debates

Blog keyword searches

Technorati
  “Searches weblogs by keyword and for links”
  Stem cell research
Blogdigger
  Stem cell research
IceRocket
  Allows advanced searches
  Allows genuine date range search (Google only allows “last updated” date range searches)

Track evolution over time

What is changing about interest in Stem cell research/GM food?

Are experts good at identifying changes in public interest?

How can experts be sure/can they be supported with quantitative information?

Can blogs be used to generate time series reflecting changes in “public interest”?
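One simple way to turn postings into such a time series is to compute, for each day, the fraction of collected postings that mention a keyword. The sketch below assumes postings are already available as `(date, text)` pairs; the `keyword_time_series` helper and the sample postings are invented for illustration and are not taken from any of the tools mentioned in this talk.

```python
from collections import Counter
from datetime import date


def keyword_time_series(posts, keyword):
    """Map each day to the fraction of that day's posts mentioning the keyword."""
    totals = Counter()
    matches = Counter()
    needle = keyword.lower()
    for day, text in posts:
        totals[day] += 1
        if needle in text.lower():
            matches[day] += 1
    # Normalising by the daily total stops busy days from looking like bursts.
    return {day: matches[day] / totals[day] for day in sorted(totals)}


# Invented sample postings.
posts = [
    (date(2006, 3, 6), "New stem cell research funding announced"),
    (date(2006, 3, 6), "My cat did something funny"),
    (date(2006, 3, 7), "Why stem cell research worries me"),
]

for day, share in keyword_time_series(posts, "stem cell").items():
    print(day, round(share, 2))
```

Using a proportion rather than a raw count is a design choice: the number of postings collected per day can vary a lot for reasons unrelated to the topic.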

Free science debate graphs

Solves the trend identification problem?

BlogPulse
  Offers free automatic blog searches and keyword-generated click search graphs
    Stem cell research
    GM food
    Mobile phone radiation

Research graphs

Time-consuming to collect data
Give control over the data source

Detecting Public Science Debates

How to detect a new debate?

Heuristic methods
  E.g. read papers, scan relevant blogs
Automatic methods
  E.g. look for sudden increase in usage of science-related words in blogs?
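The automatic idea can be sketched very simply: compare the share of postings containing each word in a recent period with its share in an earlier period, and list the words with the biggest rise. This is an illustrative stand-in, not the trend detection used by any particular tool; the `biggest_increases` helper and the sample postings are assumptions made for the example.

```python
import re
from collections import Counter


def word_counts(texts):
    """Count, for each word, how many postings contain it at least once."""
    counts = Counter()
    for text in texts:
        counts.update(set(re.findall(r"[a-z]+", text.lower())))
    return counts


def biggest_increases(earlier_posts, recent_posts, top_n=5):
    """Rank words by the rise in the share of postings that contain them."""
    early = word_counts(earlier_posts)
    recent = word_counts(recent_posts)
    n_early = max(len(earlier_posts), 1)    # avoid division by zero
    n_recent = max(len(recent_posts), 1)
    scores = {
        word: recent[word] / n_recent - early[word] / n_early
        for word in recent
    }
    # Largest increase first; ties broken alphabetically for stable output.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return ranked[:top_n]


# Invented sample postings.
earlier = ["the weather is nice", "the match was dull"]
recent = ["the cloning news is alarming", "cloning debate grows", "the weather is nice"]

for word, rise in biggest_increases(earlier, recent, top_n=3):
    print(word, round(rise, 2))
```

A list like this still needs a human to scan it, since many rising words will have nothing to do with science.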

Free hot topic searches

Blog keyword search (sort by date)
  Technorati
    “Searches weblogs by keyword and for links”
    Stem cell research
  Blogdigger blog search
Hot topic searches
  Blogdex – top contagious information
  Bloglines – today’s hot topics (most popular links)
Searches find the really big science debates?

Specialist research tools

Commercial software
  Intelliseek/IBM
Mozdeh RSS monitor
  Generates sub-collections
  Generates word time series
  Allows keyword searches
  Identifies hot topics

Mozdeh Science Concern Corpus

A collection of blog postings containing a fear word AND a science word
Trend detection used to identify hot “science fear” topics
Data cleaning to remove spam
Need manual scanning of list of words experiencing biggest usage increase
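The “fear word AND science word” selection rule can be sketched as a simple filter. The word lists below are short invented examples, not the lists used to build the actual corpus, and `is_science_concern` and `build_corpus` are hypothetical helper names.

```python
import re

# Hypothetical word lists; the real corpus used its own, longer lists.
FEAR_WORDS = {"fear", "afraid", "worried", "scared", "concern"}
SCIENCE_WORDS = {"science", "scientist", "research", "genetic", "cloning"}


def tokens(text):
    """Lower-case the text and return its set of alphabetic words."""
    return set(re.findall(r"[a-z]+", text.lower()))


def is_science_concern(text):
    """True if the posting has at least one fear word AND one science word."""
    words = tokens(text)
    return bool(words & FEAR_WORDS) and bool(words & SCIENCE_WORDS)


def build_corpus(posts):
    """Keep only the postings that satisfy the fear AND science rule."""
    return [post for post in posts if is_science_concern(post)]


# Invented sample postings.
posts = [
    "I am worried about genetic testing",
    "Great research paper today",
    "I fear spiders",
]
print(build_corpus(posts))
```

Exact word matching is deliberately crude here: it misses plurals and other word forms, and it cannot tell whether the fear is actually about the science.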

Science concern hot topics (7%)

[Figure: bar chart of hot science fear words by category (Random, Temporal, Descriptor, Duplicate, Other, Threat, Prediction, Progress, Information, Fear of Science); axis from 0 to 80.]

Unexpected results?

Social science research
  Sudden burst of discussion over fears of the economic theories of Karl Rove, an influential advisor to George Bush
Computer security
  Concern over spyware features in a software vendor’s products
  Research showing that consumers’ PIN numbers could be revealed by poor printing

Conclusions

Many free tools support exploration of Consumer Generated Media
Also room for specialist research tools

References

http://www.blogpulse.com/
http://www.blogpulse.com/www2006 workshop/
http://www.creen.org/
Thelwall, M., Prabowo, R. & Fairclough, R. (2006, to appear). Are raw RSS feeds suitable for broad issue scanning? A science concern case study. Journal of the American Society for Information Science and Technology.