Integrated Collaborative Information Systems Ahmet E. Topcu [email protected] Advisor: Prof Dr. Geoffrey Fox Outline Introduction  Motivation  Research Issues  Architecture  Measurements and Analysis  Conclusions   Contributions.

Download Report

Transcript Integrated Collaborative Information Systems Ahmet E. Topcu [email protected] Advisor: Prof Dr. Geoffrey Fox Outline Introduction  Motivation  Research Issues  Architecture  Measurements and Analysis  Conclusions   Contributions.

Integrated Collaborative Information Systems

Ahmet E. Topcu [email protected]

Advisor: Prof Dr. Geoffrey Fox

1

Outline

      Introduction Motivation Research Issues Architecture Measurements and Analysis Conclusions  Contributions 2

Introduction

 Efforts for collaboration and sharing between users and communities.

  Grid  Virtual Organizations Sakai   Collaboration and Learning Environment for Education Web 2.0

      Represents new web-based services.

Provides rich and lightweight online tools Provides reusable services and data Updates software and data often very rapidly Provides interactive user interfaces Provides an architecture for easy user contribution 3

Web 2.0 Examples

      Blogs (blogger.com, GoogleBlog) Wikis(Wikipedia, WikiWikiWeb) Social Networking Tools(MySpace ,LinkedIn) Social Bookmarking Tools(del.icio.us ,YouTube) Domain of scientific research (CiteULike , Connotea , and Bibsonomy) Domain specific academic search tools(CiteSeer, Google Scholar, Windows Live Academic) 4

Motivation

 Numerous annotation and search tools. Each of them has different capability and not completed defined metadata  Need for exploiting large set of data sources from various tools  Integration of major annotation and search tools in order to use them having additional functionalities for scientific research  Utilize the best capabilities of the tools 5

Motivation II

     Necessities for integration Need for common data format No easy way to find all publications  Example: A search in Google Scholar for the publications of our research lab (Community Grids Lab) will return only about 20% of the total CGL publications.

Wealth of information contained in numerous field remains largely outside the scope of tools What happens if tool you choose is not adopted or worse just disappears 6

Research Issues

Integration

 Building a model to  integrate community tools and adding value to existing systems    natural collection of related documents easily support more metadata support tagging 

Scalability

 Investigate system behavior for increased message rate per second 

Flexibility and Extensibility

  Easy to add and remove service mechanism Easy of integrating annotation and search tools 7

Architecture Principles

 Community-centric platform of services  Integration of dynamic publication, search tools into Cyberinfrastructure based scholarly research.

 Integration such scientific research defining metadata and using various URL, and map them.

 Services that aggregate information from a variety of sources (i.e., “mash-up” tools) and provide added value to communities of researchers  Do not build a new tagging and search systems. Reuse the tools and adding value to them  Easier to link together all relating information common Digital Entity (DE) 8

Integrated Collaborative Information Systems (ICIS) Architecture

HTTP/ SOAP Integration Manager

Gateway 1 Gateway 2 Gateway 3 Gateway N

Tool Gateway

Pull Service

HTTP/ SOAP Client

Push Service ……..

Information Service Client Gateway

Tool-1 Google Scholar Tool-2 Tool-N ……..

Windows CiteSeer Live Acad.

……..

Tool-3 Del.icio.us

Filter Handler Token checker Filter Processor Token builder Controller

Tools

Controller

Data Manager

Extracter Service Inserter Service

Filter Permission Handler Database Storage

9

Integrated Collaborative Information Systems (ICIS) Architecture Components       Tools external web tools to provide services to clients Integration Manager have information service and provide communication between tools, client, and responsible for integration operation in the system Filter operates two-way data filtering Permission Handler checks existing Digital Entity (DE)s permission or build a new permission token for new DEs Data Manager provides a mechanism to extract data from a repository and insert data into a repository Storage maintains user data and permissions in the database.

10

Web Tools Tool Gateway

HTTP

Service Point

WSDL

Pull Service Request Handler Information Handler Metadata Builder Presentation Service Information Service Integration Manager Client Gateway Service Point

HTTP WSDL

Client 11

DE Digital Entity DE DE DE DE DE DE

Filter

DE DE DE Filter Handler Filter Processor DE DE DE DE DE DE DE DE DE 12

Summary: Architecture

        Build integration architecture We do not reinvent existing tools Use existing features of tools Supports tagging services Provides common metadata Allows to use consistent data Provides common resolution of filters Supports authorization of users 13

Use Case:

Collection of Metadata from web pages     

Collect

 Digital Entities in web pages using HTTP methods.

Analyze

 Using heuristic methodology to extract metadata fields of the Digital Entities for publications

Build

  RSS objects using collected Digital Entities.

New tags using collected Digital Entities.

Compare

 Collected Digital Entities from web pages with the existing Digital Entities in ICIS repository.

 If they are: 

different:

same:

Store new Digital Entities in ICIS repository.

Option to update tags and other fields.

Share

 New Digital Entities with other tools using ICIS repository.

14

Security Model

   Security in web 2.0 can be limited.

We implemented a simple but more powerful security model around local tools that wrap Web 2.0 systems.

We used an access-control matrix model to provide security for our information system     Supports multiple groups and multiple users for each Digital Entity (DE).

Similar to UNIX file system  The Unix RWX bits corresponds to Read, Write, and Execute operation for each file and directory. In our system, DE correspond to the file element and folder corresponds to the directory element.

For each DE and folder, there are three types of access rights defined in the systems: Read, Write, and Delete. 15

Security Model II

 We have a security model that supports 

Level of Authorization

      Roles are defined as Super Administrator (SA) and Group Administrator (GA), User (U) The system allows having more than one SA. An existing SA can add other SAs to the system.

SA can assign any U to become GA, and remove GA from group. Each group should at least one GA. GA add/remove U from group U can allow other U and groups to share their resources 

User profile

 Share user profile between sites.

16

Benchmarks and Environments

 Message rate scalability investigation   Search operation  Using Database Access  Using Memory Utilization Test environments     Apache Axis version 1.2

Apache Tomcat Server version 5.0.28

Java 2 Runtime Environment, Standard Edition (build 1.5.0_12-b04) The maximum heap size of Java Virtual Machine(JVM) is 1024 MB 17

Integrated Collaboration Information System(ICIS) Framework

 Search local repository using database access with increasing Message rate # of Clients Thread W S D L W S D L Integrated Collaboration Information System(ICIS) Framework Thread W S D L Database 18

Message rate scalability result (Search using Database)

19

Integrated Collaboration Information System(ICIS) Framework II

 Search local repository using memory with increasing Message rate # of Clients Thread W S D L W S D L Integrated Collaboration Information System(ICIS) Framework Thread W S D L Memory 20

Message rate scalability result (Search using Memory Utilization)

21

Contribution

 

System Research

       Providing architecture and model for integration of collaborative systems Integration and interoperability of annotation, search tools, and web search tools Collaboration of the users sharing resources.

Increasing performance and scalability using memory utilization Providing flexibility allowing integration of different tools having common metadata.

Easy to add and extend service mechanism Supports authorization and event based mechanism

System Software

 An ICIS Infrastructure of Internet Documentation and Integration of Metadata (IDIOM) systems 22

Thanks!

Questions?

23