FAST Search for sharepoint 2010 - Home

Download Report

Transcript FAST Search for sharepoint 2010 - Home

Presented by Jacob Wilson
http://sharepoint.jacobtwilson.com
1
• Jacob Wilson
• Principal Consultant @ Neudesic
• MCTS SharePoint, MCITP SharePoint, MCTS PerformancePoint
• Have worked with SharePoint for the past 4 years with field related work in
Architecture, Installation, Configuration, Administration, Custom Development, Branding,
and User Experience (UX).
• Have also been doing Planning and Architecture work with FAST Search for SharePoint
2010.
3
•
•
•
•
•
•
•
About FAST Search for SharePoint
FAST Search for SharePoint Services
Planning and Architecture (Demo)
Visual Search and User Context (Demo)
SharePoint Search Comparisons
Extending Search
Lesson Learned
4
5
• April 2008
• Microsoft acquires Norwegian Enterprise Search Solution Fast Search &
Transfer
• Overall Vision
• To provide a market leading enterprise search platform that integrates
with your existing infrastructure
• SharePoint Vision
• Best-in-class search, with the simplicity and low TCO of SharePoint
• Build upon a social computing platform for a more robust search
experience
6
• Processes each item by extracting the searchable text, detecting
the written language, and extracting the known properties, such
as company names, people names, locations, and dates.
• The system indexes the extracted text in an inverted index so
that it can handle search requests later.
• Uses the search index to return items that match a user query.
The items are returned in a query hit list that is sorted by the
relevancy to the specified query.
7
• The system interacts with the Active Directory and claims
infrastructure to resolve permissions and group memberships. It
then only returns items the current user is allowed to see,
according to the settings of the content source.
8
•
•
•
•
•
•
SharePoint Sites
• Indexing Connector: FAST Search Connector
• Configuration: Central Administration
File Shares
• Indexing Connector: FAST Search Connector
• Configuration: Central Administration
Web Sites
• Indexing Connector: FAST Search Connector
• Configuration: Central Administration
Line of Business Data
• Indexing Connector: FAST Search Connector
• Configuration: Central Administration
Database
• Indexing Connector: Indexing Connector for JDBC
• Configuration: XML File
Lotus Notes
• Indexing Connector: Indexing Connector for Lotus Notes
• Configuration: XML Files
9
10
• FAST Search Web Crawler
• The FAST Search Web crawler is an indexing connector that can be used
for complex Web crawl scenarios involving a mix of Internet and Intranet
sites.
• Fast Search Connector
• Retrieves content for indexing from SharePoint farms, Exchange folders,
internal/external Web sites, line of business data and file shares. This is
an indexing connector that crawls the content repositories and retrieves
content that can be indexed by FAST Search Server 2010 for SharePoint.
11
• Web Link Analysis (Web Analyzer, Scalable)
• It analyzes search click through logs and hyperlink structures.
• Items that show many clicks in the search click through log are popular and
therefore receive better rank scores than less-viewed items.
• Items that are linked to from many other items are also perceived to be
more relevant for the user and therefore receive better rank scores.
12
• Item Processing (Scalable)
• Receives items to be indexed from indexing connectors.
• Extracts content from source documents in various formats
• Discovers and sets managed properties
• Performs linguistic processing on the content
• Sends the processed items to the indexing service
• Key Features
• Mapping from crawled properties to managed properties
• Parsing of document formats such as Word, Excel and PDF
• Extracting properties from crawled content
• Linguistic processing of items before indexing (Stemming and word
breaking)
13
• Query Matching (Scalable)
• The query matching service uses the inverted indexes created by the
indexing service to retrieve the items that match a query and then return
these items as a query hit list
• The number of columns in the query matching service always equals the
number of columns in the indexer service. The reason is that the index
columns represent a partitioning of the index, and each query matching
node can handle only one such partition of the index.
14
• FAST Search Authorization
• The FAST Search Authorization (FSA) manager is a part of the
administration service that manages user authorization for indexed content
• Ensures that only items that a user is entitled to read appear in the query
results
• The FSA manager communicates with Active Directory services or other
LDAP based directory services to manage the authorization process
15
16
• CPU
• Quad 2.0 GHz or faster
• Memory
• 4GB or more is recommended for all configurations. For multi-core servers,
at least 2GB per core is recommended.
• Swap Space
• Should be twice the machine’s physical memory
• Disk Space
• 50GB initial available disk space
• RAID 5 recommended
• Plan for 1TB of disk space per 5 million indexed documents
17
Feature
Item
Processing
Indexin
g
Query
Matching
Query
Processing
RAM –
Query
Matching
Disk Access
Disk Space
Deep Refiners
-
L
L
L
H
-
-
Shallow Refiners
-
-
-
H
-
L
-
Property Extraction
M
-
-
-
-
-
L
Field Collapsing (Dups)
-
-
M
-
-
-
-
Full-text Sorting
-
-
L
L
H
-
-
Hit highlighted summary
-
-
M
-
-
-
L
Complex Queries
-
-
M
L
-
H
-
Substring Search
-
L
L
-
-
L
L
Stemming
L
L
-
L
-
-
L
Spell Check
-
-
-
L
-
-
-
Synonyms
-
-
-
L
-
-
-
High stop-word threshold
-
-
H
H
L
H
-
18
• Admin Server
• Administration Services
• Non-Admin Server
• Query Matching (Search) Service
• Indexing Service
• Document/Item Processing Service
19
• Admin and Non-Admin Services on single server
20
• Admin Server and 1+ Non-Admin Server(s)
21
• PowerPoint Service Application
• Viewing/Editing PPT documents in the browser
• Generating PPT thumbnails in FAST Search Results
• Word Viewing Service
• Viewing/Editing Word documents in the browser
• Generating Word thumbnails in FAST Search Results
22
• FAST Search Connector (SSA, App Server)
• Is a Search Service Application that retrieves content for indexing from
SharePoint farms and other content repositories.
• FAST Query (SSA, WFE)
• Is a Search Service Application that provides query results from the
content that is crawled by the Content SSA.
23
• Oh yeah, let’s not forget about the partitioned index.
• Spread the load for queries across multiple query servers
• Propagate index subsets to different query servers
• Hash of document ID’s to determine which partition the index entries for a
specific document should be stored
• Query results can come from different servers
24
25
26
• Refiners with Detailed Counts
• Counts are specific to FAST Search
• Thumbnails and Scrolling PPT Previews
• Specific to FAST Search
• Provided by Office Web Apps 2010
• Similarity Search
• Specific to FAST Search
• Sorting on any Property
• Visual Best Bets
• Specific to FAST Search
27
Sorting on any
property
Visual Best Bets
Refinement with
counts
on any property
Scrolling PowerPoint
Previews
Thumbnails
Similarity
Search
28
• Keyword
•
•
•
•
Best Bets
Visual Best Bets
Document Promotion
Document Demotion
• Site Promotion
• Site Demotion
29
30
31
End User perspective
Great OOB search experience
Most visual and dynamic
user experience
Excellent OOB relevance
Federated results
PC, phone, or browser access
Same capabilities, plus:
Broader, better language support
Deeper query syntax
Metadata based refiners
Related searches
View in Browser
Same capabilities, plus:
Deep refiners with counts
Similarity search
Sorting on any property
Phonetic & nickname matching
Relevance and refiners
Integrated presence
Same capabilities
Click-through drives relevance
Colleague & Expertise suggestions
Query suggestions
Same capabilities
Thumbnails & Scrolling Previews
Visual Best Bets
User context from user profile
Multiple relevance profiles
32
Feature
Basic search
Best Bets
SharePoint
Search Server
Foundation 2010 2010 Express
Y
Y
Y
Search
Server 2010
Y
SharePoint
Server 2010
Y
FAST Search Server
2010 for SharePoint
Y
Y
Y
Y
Visual Best Bets
Y
Similar Results
Y
Duplicate
Results
Y
Search Scopes
Y
Y
Y
Search
Enhancement
based on user
context
Y
Y
Crawled and
Managed
Properties
Y
Y
Y
Y*
Query
Federation
Y
Y
Y
Y
33
Feature
Query Suggestions
Sort Results on
Managed Properties or
Rank Profiles
SharePoint Search Server
Search
SharePoint
Foundation 2010 Express Server 2010 Server 2010
2010
Y
Y
Y
FAST Search Server 2010
for SharePoint
Y
Y
Relevancy Tuning by
Document or Site
Promotions
Y
Y
Y
Y*
Shallow Results
Refinement
Y
Y
Y
Y
Deep Results
Refinement
Y
Document Preview
Windows 7 Federation
People Search
Social Search
Taxonomy Integration
Multi-Tenant Hosting
Rich Web Indexing
Support
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
34
35
36
37
• Extend OOB Web Parts – or Create New Ones
• Location Awareness
• Geo-Search using SortFormula()
FQL (Fast Query Language)
with
• Query Language Expressiveness
• Soft boost (relevance control) using Xrank operator in FQL
• EX: Boost all documents that contain a selected persons name or phrase
38
39
•
FAST Query Service Location Port
• Use 13287 instead of the suggested 13285 for HTTP communication. Use 13286 for HTTPS.
•
FAST Search User Context Permissions
• Need to have User Profile Service Application Permissions
•
FAST Search Server Local Admin Group
• FAST Service Accounts need to be manually added to the FAST Server local group FASTSearchAdministrators
•
Service Account Access to FAST Search SQL DB’s
• Need to manually grant service accounts access to SQL DB’s
•
Office Web Applications Install and Post Configuration
• Install on the SharePoint server before installing SharePoint 2010
• Run PowerShell scripts for post-configuration
•
Make sure to activate the publishing infrastructure features on the site where the FAST Search Center
will live. You won’t be able to successfully add a FAST Search Center site to the Site Collection until this
is done.
40
•
FAST Search Server for SharePoint Deployment & Configuration Guide
• http://www.microsoft.com/downloads/details.aspx?FamilyID=a65aef6f-bc99-4b29-81bb37fce8ad6ea7&displaylang=en
•
FAST Search Server for SharePoint Known Issues
• http://www.microsoft.com/downloads/details.aspx?FamilyID=8101DA55-1AF2-41A1-89EF15536A8000F2&displaylang=en
•
FAST Search Server for SharePoint Cmdlets Overview
• http://www.microsoft.com/downloads/details.aspx?familyid=0A91AE47-8C6F-447E-80842EF7208837AE&displaylang=en
•
Deploy Office Web Apps (SharePoint 2010)
• http://technet.microsoft.com/en-us/library/ee695758(office.14).aspx#bkmk_activatingPS
•
Search TechCenter
• http://technet.microsoft.com/en-us/enterprisesearch/ee441234.aspx
41
42