Model of Taxonomy Development

Download Report

Transcript Model of Taxonomy Development

Enterprise Information Architecture
A Platform for Integrating Your
Organization’s Information and Knowledge Activities
Tom Reamy
Chief Knowledge Architect
KAPS Group
Knowledge Architecture Professional Services
http://www.kapsgroup.com
Agenda
 Enterprise Information Architecture
–
Need for Integrated Semantic Solution
– Content, Technology, People, Activities
 Benefits of an Integrated Semantic Solution
–
Cost Savings, Business Value
 Implementation of Enterprise Information
Architecture
–
–
Enterprise Strategy – Integration
Where Search / Convera fits in
2
KAPS Group: General
 Knowledge Architecture Professional Services
 Started Three Years Ago
 Virtual Company: Network of consultants – 12-15
–
Top Taxonomy People, Partner with other consultants
– Partners – Convera, (and others), etc.
– First Convera Certified Taxonomy Developers
 Articles – KMWorld, EContent, Information Today, etc.
 Presentations – KMWorld, Information Today, Pharmaceutical,
Learning, Information Architecture
 Topics: Knowledge Architecture, Taxonomy Boot Camp,
Enterprise Search, Complexity Theory, Intranets
3
KAPS Group: Services




Consulting, Strategy recommendations
Knowledge architecture audit
Taxonomies: Enterprise, Marketing, Insurance, etc.
Services:
–
–
–
–
–
Taxonomy development, consulting, customization
Technology Consulting – Search, CMS, Portals, etc.
Metadata standards and implementation
Knowledge Management: Collaboration, Expertise, e-learning
Information Architecture, Web Development
4
Enterprise Information Architecture
Need for Integrated Semantic Solutions
 Integrated: “Effective IM starts at top.
Most organization’s IM
starts with grassroots approaches that only add to the problem of
information silos” (Forester)
 Semantics:
Taxonomy, metadata, controlled vocabularies,
Personas, Topic Maps, Natural Categories
– Taxonomies in top 10 technologies for 2006 (Gartner)
– “Through 2006, more than 70% of firms that invest in
unstructured information-management initiatives won’t
achieve their targeted return on investment, due to
underinvestment in taxonomy building (.7 probability)”
5
Enterprise Information Architecture
Need for Integrated Semantic Solutions
 Integrated Semantic Solutions:
–
–
–
Combination of technology and semantics
CMS/LMS – content creation is the right time to add metadata
– cheaper and better metadata
Portals and Search – contextual information and feedback
 Technological Integration – very expensive, no one solution to CMS,
LMS, Search, Portal
 Semantic Infrastructure - allows the meaningful integration of content
with a minimal technological element (XML)
– Cheaper, faster, less resources
– Deeper integration – knowledge, not just data
6
Enterprise Information Architecture
Four Essential Contexts
 Content and Content Structure
–
–
Data and Unstructured Information
Standards and Procedures
 People – Company Structure
–
Communities, Users, Central Team
 Activities – Business processes and procedures
–
Central team - establish standards, facilitate
 Technology
–
–
CMS, Search, portals, taxonomy tools
Applications – BI, CI, Text Mining
7
Enterprise Information Architecture
Content & Content Structures
 Content
–
–
Huge variety of types, sources, and uses
Structured data, unstructured documents, web pages, email
 Semantic Infrastructure – Foundation
–
–
Essential content structures – taxonomies, metadata,
vocabularies, synonyms, ontologies, best bets
Standards, publishing policies and procedures
• Metadata standards, common taxonomies
• Integration of metadata into publishing process
8
Enterprise Information Architecture
Taxonomies
 Taxonomies are an Infrastructure Resource
–
Indexing for search:
• Meaningful relevance ranking
• Categorization & related content
–
Browse
• Better user experience, buy more
–
Text mining, Alerts, Competitor Intelligence
 Metadata - Keywords – most difficult
–
–
–
Need to do it right and completely to get real value
Need Taxonomy, Controlled Vocabulary
Value from all fields – purpose, title, description, audience
9
Enterprise Information Architecture
People Structures
 Individual People
–
–
Tacit knowledge, information behaviors
Advanced personalization – category priority
• Sales – forms ---- New Account Form
• Accountant ---- New Accounts ---- Forms
 Communities
–
–
–
–
Variety of types – map of formal and informal
Variety of subject matter – customer experience,
demographics research, scuba
Variety of communication channels and information behaviors
Community-specific vocabularies, need for inter-community
communication
10
Enterprise Information Architecture
Infrastructure Team
 Semantic Infrastructure requires both an infrastructure
team and distributed expertise.
–
–
Software and SME’s is not the answer – keywords
Need local expert input, integration not rigid standardization
 Infrastructure Team
–
–
–
Variety of roles and skills, plus part time, partners
Facilitating author metadata, Research metadata theory
Creating, acquiring, evaluating taxonomies, metadata
standards, vocabularies
11
Enterprise Information Architecture
Technology & Processes
 Technology: Infrastructure and Applications
–
Enterprise Platforms: unstructured data management, CM with
categorization, Portals, Collaboration, Text Mining
 Organizational and Technology Context
–
–
When, who, how, and how much structure to add
Pre-creation, creation, retrieval, application
 Creation – Content Management, Innovation, CoP’s
–
Metadata, categorization
– Workflow with Meaning
– Central Team and distributed SME’s and authors
 Expertise locators – balance of structure and serendipity
12
Content Creation, Customer Services
Agency Activities
Text Mining, Alerts, Personalization
SEARCH / PORTAL / EAI /
Content Management
Data
Documents
Databases
Drives
Email
External
Internet
Subscriptions
Activity
Services
Technology
People
Tacit
Knowledge
Content
Layer
13
Content Creation, Customer Services
Agency Activities
Text Mining, Alerts, Personalization
SEARCH / PORTAL / EAI /
Content Management
Documents
Databases
Drives
Email
External
Internet
Subscriptions
Services
Technology
Data base schemas, Metadata,
Taxonomies, Vocabularies, Personas
Data
Activity
People
Tacit
Knowledge
Content
Structure
People
Policies
Tools
Content
Layer
14
Integrated Solutions - Business Case:
IDC White Paper
 Information Tasks
–
–
–
–
–
Email – 14.5 hours a week
Create documents – 13.3 hours a week
Search – 9.5 hours a week
Gather information for documents – 8.3 hours a week
Find and organize documents – 6.8 hours a week
 Gartner: “Business spend an estimated $750 Billion annually
seeking information necessary to do their job. 30-40% of a
knowledge worker’s time is spent managing documents.”
15
Integrated Solutions - Business Case:
IDC White Paper
 Time Wasted
–
–
–
Reformat information - $57 million per 10,000 per year
Not finding information - $53 million per 10,000
Recreating content - $45 Million per 10,000
 $150 million per 10,000 -- Small Percent Gain = large
savings
–
–
–
1% - $1.5 million per year per 10,000
5% - $7.5 million
10% - $15 million
16
Integrated Solutions - Business Case:
General ROI Issues
 Justification
–
–
–
–
Search Engine - $500K-$2Mil
Content Management - $500K-$2Mil
Portal - $500-$2Mil
Plus maintenance and employee costs
 Taxonomy
–
–
–
–
Small comparative cost – 1%
Needed to get full value from all the above
Search & Portal – deliver higher value
CMS – get value from investment
17
Integrated Solutions - Business Case:
Business Benefits
 Reduce development costs, cycle times
–
–
Increase employee efficiency
Less time looking, more time doing
 Enhance communication
–
–
Capture and reuse knowledge
Innovate better & faster
 Cost of not finding right information
–
–
Business – lost money, opportunities
Security – lost lives
18
Integrated Solutions - Business Case:
General ROI Issues
 Creates a platform for future projects
–
–
Support new types of analysis
Text mining, alerts, CI, BI, etc.
 ROI – the wrong question
–
What is ROI for organizing your agency?
 You wouldn’t run a government agency without organizing
your employees and computers, why think you can create
information access without organizing your information?
19
Enterprise Strategy
General Approach
 Think Big, Start Small, Scale Fast
 Think Big
–
First Step: knowledge architecture audit, K-Map
• Understand what you have, what you are, what you want
• Contextual interviews, content analysis, surveys, focus groups,
ethnographic studies, information behaviors
–
Natural level categories mapped to communities, activities
Category Modeling - “Intertwingledness”
Living, breathing, evolving foundation is the goal
–
–
• Turn over maintenance to enterprise architecture team
–
One outcome – map which areas to do more research
20
Enterprise Strategy
Sequence & integration
 Overall Sequences
–
–
–
Vision / Audit / Enterprise Team & Tools
Content structure / CMS / Search
Portal / New Applications / Integrate Applications
 Coordinate with IT and functional units
–
–
–
Maximize the impact of everyone
Allow for cheaper, smoother implementation
Avoid having to redo parts of either – or worse, buy new
technologies to support
21
Enterprise Strategy
Content Foundation
 Content Management
–
Create a metadata standard with implementation rules
– Controlled vocabulary
– Data and content integration
 Develop/Buy/Customize Enterprise Taxonomy
– Deep taxonomy – platform
 Metadata Repository
–
–
Develop Metadata Standards – Dublin Core+, Implementation
Common resource for search and CMS and?
22
Enterprise Strategy
Infrastructure & Application Technologies
 Unstructured Data Management: Entity and Fact Extraction
 Enterprise Search - Federated
–
support for taxonomies, browse, facets & variety of metadata
 Portal
–
Support for community personalization
 Advanced Applications
–
–
Text Mining, Alerts, Competitor Intelligence
Business Intelligence, internal activities
23
Enterprise Strategy
Search / Convera
 Dynamic Categorized search and browse is best
–
–
Can’t predict all the ways people think
Can’t predict all the questions and activities
• Advanced Cognitive Differences
• Panda, Monkey, Banana
 Complex Topics: intersection of facets, facets and subject
matter – Post coordination
–
–
What users are looking for and what documents are often
about – China and Biotech, Pharma and Farms
Power of fuzzy relationships
24
Enterprise Strategy
Search as Infrastructure
 Ontologies – modeling the world
–
–
–
Excalibur
From information to knowledge
From text mining to Fact Mining
 Knowledge Management
–
–
–
–
Expertise Location – people finding the right people
Communities of Practice – people working with people
Social Network analysis – understanding how people work
Smart applications – learn and adapt to users behaviors
25
Conclusions
 Importance of Integrated Semantic Solutions
–
Semantic Infrastructure
 Need to locate IM in 4 contexts
–
Deep Structure – models and team
 Business Case
–
–
Embarrassment of Riches – getting “realer”
Metrics and Real Stories
 Think Big, Start Small, Scale Fast
 Convera – An infrastructure Platform
26
Questions?
Tom Reamy
[email protected]
KAPS Group
Knowledge Architecture Professional Services
http://www.kapsgroup.com