CDH


      Transform Enterprise Search with
      FAST Search for SharePoint
Quick Facts

About Us
• 22nd Year
• Grand Rapids & Royal Oak
• 30 Staff

Approach
• Vendor Independent
• Non-reseller
• Professional Services Only

Partnerships
• Microsoft Gold
• VMware Enterprise
• Citrix Silver
• Novell Gold
• Cisco Premier
Expertise
CDH Talks Tech
About me


 David Tappan
 Consultant
 IOAp, MCITP, MCTS: SharePoint
 davidt@cdh.com
FAST Search: Better Insight

Agenda: Insight

      • How FAST increases insight

      • Insight into how FAST is used to solve
        specific business problems

      • Insight into what FAST Search high
        availability really requires
A question



      What is Search, really?
One answer



      “Search is the ability to find text strings in
      documents”
The Problem: Hidden meaning in the searcher’s intent




      “What should I know about selling ERP?”
      - Alan Brewer, Sales Lead

      “What should I know about implementing ERP?”
      - Renee Lo, Consultant
Another answer



      “Search is the ability to query any document
      property”
Recommended reading



 • http://www.well.com/~doctorow/metacrap.htm
A better answer


  Search is a service that matches what you
  mean with what documents mean.
Cool FAST solutions

  How FAST Search for SharePoint
  enables better meaning extraction
F4SP Architecture Basics

In the box: Dynamic rank algorithms at query time

  Context                        • Query terms in title vs. body
  Query term proximity           • «Bill Gates» vs. «Bill saw the gates»
  «Anchors» match query terms    • «...a page about Bill Gates...»
  Click history match            • Others clicked a hit for «Bill Gates»
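
To make the proximity signal concrete, here is a toy sketch. It is not FAST's ranking model, which the slides do not describe; it only measures the smallest token window that contains every query term. The function name `min_window` and the simple word tokenization are assumptions made for illustration.

```python
import re


def min_window(query: str, text: str):
    """Size in tokens of the smallest window of `text` that contains
    every query term, or None if some term never appears."""
    terms = set(re.findall(r"\w+", query.lower()))
    if not terms:
        return None
    tokens = re.findall(r"\w+", text.lower())
    best = None
    counts = {}  # query terms currently inside the window
    left = 0
    for right, tok in enumerate(tokens):
        if tok in terms:
            counts[tok] = counts.get(tok, 0) + 1
        # Shrink from the left while the window still covers all terms.
        while len(counts) == len(terms):
            size = right - left + 1
            if best is None or size < best:
                best = size
            dropped = tokens[left]
            if dropped in counts:
                counts[dropped] -= 1
                if counts[dropped] == 0:
                    del counts[dropped]
            left += 1
    return best


print(min_window("Bill Gates", "Bill Gates founded Microsoft"))  # 2
print(min_window("Bill Gates", "Bill saw the gates"))            # 4
```

A smaller window suggests the terms form a phrase, which is why the first text would reasonably rank above the second for this query.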
Customizable Query Processing
What is someone thinking about when they perform a query?
Search and the activity feed

Activity feed posts:
• Looking for a knowledge management solution?!?!?
• I love SharePoint
• It’s the best Knowledge Management Solution in the market
• Have you ever built an e-commerce solution on it?
• Our focus is knowledge management, and it just works!
• We use it as a web content management system, and we’re so happy with it
• Great for WCM, Great for KM!
• Just deployed for KM… so good, so far… will get back once the pilot is over!

Topic labels shown beside the feed: Knowledge Management, Web Content Management, E-Commerce
For the geeks…
fql = xrank(string("fast search"),
            or(department:or(string("services"),
                             string("engineering")),
               keywords:string("knowledge management")),
            boost=10000)
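
A query like this is usually assembled in code from what is known about the user. The sketch below builds the same FQL text from a list of departments and a keyword phrase. The helper names (`fql_string`, `or_`, `scoped`, `xrank`) are made up for this example and are not part of any SharePoint or FAST API; the helpers only produce a query string.

```python
def fql_string(value: str) -> str:
    """Wrap a phrase in FQL's string("...") operator."""
    if '"' in value:
        # Quote escaping is left out of this sketch on purpose.
        raise ValueError("double quotes are not handled in this sketch")
    return f'string("{value}")'


def or_(*operands: str) -> str:
    return "or(" + ", ".join(operands) + ")"


def scoped(prop: str, expr: str) -> str:
    """Restrict an expression to one managed property."""
    return f"{prop}:{expr}"


def xrank(main: str, rank_expr: str, boost: int) -> str:
    """Match on `main`; add `boost` rank points to hits that also match `rank_expr`."""
    return f"xrank({main}, {rank_expr}, boost={boost})"


departments = ["services", "engineering"]  # hypothetical user context
query = xrank(
    fql_string("fast search"),
    or_(
        scoped("department", or_(*(fql_string(d) for d in departments))),
        scoped("keywords", fql_string("knowledge management")),
    ),
    boost=10000,
)
print(query)
```

Because `xrank` only affects ranking, documents outside those departments still match "fast search"; they simply sort lower.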
In the box: Static rank algorithms at content processing time

  Landing pages    • Prefer shallow URLs
  Authority        • Links from other pages
  High quality     • Boost sites/documents
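
As an illustration of the "prefer shallow URLs" signal, the sketch below counts path segments so that landing pages sort ahead of deeply nested documents. This is only a way to picture the idea; the slides do not give the formula FAST uses, and `url_depth` is a hypothetical helper.

```python
from urllib.parse import urlparse


def url_depth(url: str) -> int:
    """Number of non-empty path segments; 0 for a site root."""
    path = urlparse(url).path
    return len([segment for segment in path.split("/") if segment])


urls = [
    "http://www.microsoft.com/a/b/c.aspx",
    "http://www.microsoft.com",
    "http://www.microsoft.com/products",
]
# Shallowest URLs first.
for url in sorted(urls, key=url_depth):
    print(url_depth(url), url)
```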
Customizable content processing
How to Index Content by Location?

•    Address, intersection, zip code, names, etc.
       – One Microsoft Way, Redmond, WA
•    Geodetic coordinates (latitude & longitude)
       – 47.639767, -122.129755
       – Degrees, minutes, seconds
            • 47° 38’ 23.16” N, 122° 7’ 47.1” W
•    Universal Transverse Mercator (UTM)
       – 10N 565367 5276630
•    Military Grid Reference System (MGRS)
       – 10T ET 65367 76630
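
Whatever notation the content uses, the index needs one numeric form. A minimal sketch of the degrees/minutes/seconds conversion, checked against the coordinates above (the function name `dms_to_decimal` is an assumption of this example):

```python
def dms_to_decimal(degrees: float, minutes: float, seconds: float,
                   hemisphere: str) -> float:
    """Convert degrees/minutes/seconds to signed decimal degrees.
    South and West become negative."""
    hemisphere = hemisphere.upper()
    if hemisphere not in {"N", "S", "E", "W"}:
        raise ValueError(f"unknown hemisphere: {hemisphere!r}")
    value = degrees + minutes / 60 + seconds / 3600
    return -value if hemisphere in {"S", "W"} else value


lat = dms_to_decimal(47, 38, 23.16, "N")
lon = dms_to_decimal(122, 7, 47.1, "W")
print(round(lat, 6), round(lon, 6))
```

The results agree with 47.639767, -122.129755 to within the rounding of the seconds value.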



    Index Schema ( Managed Properties)
Geographic entity extraction
•   Requirement
     – Parse elements from text
     – Tag documents with the individual values
•   Solution
     – Custom regular expression extraction
     – Call Bing Maps API
     – Return latitude and longitude and store as crawled property

{ name: 'Microsoft',
  address: 'One Microsoft Way, Redmond, WA 98052',
  phone: '1-800-Microsoft (642-7676)',
  path: 'http://www.microsoft.com',
  latitude: '47.639767',
  longitude: '-122.129755' }
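
The solution steps can be sketched as below, under heavy assumptions. The regular expression is a deliberately narrow pattern for US street addresses, not the one used in the project, and the geocoder is passed in as a plain function because the slides do not show the Bing Maps call. `ADDRESS_RE`, `tag_document`, and `fake_geocode` are all hypothetical names.

```python
import re

# Narrow illustrative pattern: "<Capitalized words> Way|Street|Avenue|Road, City, ST 12345"
ADDRESS_RE = re.compile(
    r"(?:\d+\s+)?(?:[A-Z][a-z]+\s)+(?:Way|Street|Avenue|Road),\s*"
    r"[A-Z][A-Za-z ]+?,\s*[A-Z]{2}\s+\d{5}"
)


def tag_document(text, geocode):
    """Extract the first address in `text` and attach coordinates.
    `geocode` is any callable mapping an address string to (lat, lon)."""
    match = ADDRESS_RE.search(text)
    if match is None:
        return None
    address = match.group(0)
    latitude, longitude = geocode(address)
    return {"address": address, "latitude": latitude, "longitude": longitude}


def fake_geocode(address):
    # Stand-in for the real geocoding service; returns the values from the slide.
    return (47.639767, -122.129755)


doc = "Visit us at One Microsoft Way, Redmond, WA 98052 for the event."
print(tag_document(doc, fake_geocode))
```

Injecting the geocoder keeps the extraction step testable without network access, and the returned values are what would be stored as crawled properties.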
How they did it

[Diagram: Geo-coding with Bing Maps API. Components shown: End Users, Search Center, Federation, OpenSearch Source, Query Processor, Indexer, Index Partition, Content Processor, Feeders, Data Sources. Content processing stages shown: Format Conversion, Language Detection, Lemmatization, Entity Extraction, Mapper.]
Geographic queries
and(
    YOUR_TERM(s)_HERE,
    maxlatitude:range(LOW_LAT,max),
    minlatitude:range(min,HIGH_LAT),
    maxlongitude:range(LOW_LON,max),
    minlongitude:range(min,HIGH_LON)
)
e.g.
and(football,maxlatitude:range(12,max),minlatitude:range(min,34),
maxlongitude:range(56,max),minlongitude:range(min,78))
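
Read as a bounding-box test, the pattern matches documents whose stored box overlaps the query box: the document's maximum must reach at least the query's low edge, and its minimum must not exceed the query's high edge. The sketch below generates the FQL text and mirrors the same test in plain Python. It assumes each document carries `minlatitude`, `maxlatitude`, `minlongitude`, and `maxlongitude` managed properties; the function names are made up for this example.

```python
def geo_query(term, low_lat, high_lat, low_lon, high_lon):
    """Build the bounding-box FQL text for a search term."""
    return (
        f"and({term},"
        f"maxlatitude:range({low_lat},max),"
        f"minlatitude:range(min,{high_lat}),"
        f"maxlongitude:range({low_lon},max),"
        f"minlongitude:range(min,{high_lon}))"
    )


def boxes_overlap(doc, low_lat, high_lat, low_lon, high_lon):
    """The same overlap test, applied to a dict of document properties."""
    return (
        doc["maxlatitude"] >= low_lat
        and doc["minlatitude"] <= high_lat
        and doc["maxlongitude"] >= low_lon
        and doc["minlongitude"] <= high_lon
    )


print(geo_query("football", 12, 34, 56, 78))

redmond = {"minlatitude": 47.6, "maxlatitude": 47.7,
           "minlongitude": -122.2, "maxlongitude": -122.1}
print(boxes_overlap(redmond, 47, 48, -123, -122))  # True
print(boxes_overlap(redmond, 12, 34, 56, 78))      # False
```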
Takeaways

  • Search ain’t beanbag
  • http://www.well.com/~doctorow/metacrap.htm
  • FAST Search for SharePoint provides tools
    to extract MEANING from content and
    queries
      Scaling FAST Search:
      What it takes

FAST Search for SharePoint scaleout

[Diagram: three layers. Query and Result Processing (scales with query volume); Search and Indexing (scales with content volume); Crawling and Content Processing.]

Scale-out multiple “dimensions”
    Query Volume
    Content Volume
    Indexing freshness

Redundancy options
    Search
    Indexing

Performance targets*
    15M Docs/node
    25 QPS/node
    50 docs/sec

No theoretical upper bounds!

*Depends on content and hardware specifics
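
The targets above allow a back-of-envelope sizing. The sketch below is rough arithmetic only: it treats 50 docs/sec as the overall feed rate (the slide does not say whether it is per node), ignores redundancy, and uses made-up function and constant names. Real sizing depends on content and hardware, as the footnote says.

```python
import math

DOCS_PER_NODE = 15_000_000  # target from the slide
QPS_PER_NODE = 25           # target from the slide
DOCS_PER_SEC = 50           # assumed here to be the total feed rate


def estimate(total_docs: int, peak_qps: float):
    """Return (nodes for content volume, nodes for query volume,
    days for a full crawl) from the slide's targets."""
    content_nodes = math.ceil(total_docs / DOCS_PER_NODE)
    query_nodes = math.ceil(peak_qps / QPS_PER_NODE)
    full_crawl_days = total_docs / DOCS_PER_SEC / 86_400
    return content_nodes, query_nodes, full_crawl_days


content_nodes, query_nodes, days = estimate(40_000_000, 60)
print(content_nodes, query_nodes, round(days, 2))  # 3 3 9.26
```

The crawl time is often the surprise: at this rate, 40 million documents take more than nine days to index from scratch.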
Don’t forget SharePoint!

[Diagram: SharePoint crawl flow feeding FAST Search. Labels shown: Request crawl; FAST Content SSA Admin DB; Poll request; Admin component; Log request; FAST Content SSA Crawl DB; Poll request; Master Crawl comp.; Distribute work; Crawl comp. (three shown); Web crawls; Content Web Service; Database; Crawl data, Crawl history, Crawl queue additions; Document batches; FAST Search.]
SharePoint Search components

[Diagram: single-server topology.
 SharePoint Server (all components on one server): Admin, Crawl, Query, Index P1.
 Database Server (all databases on one instance): Admin, Crawl, Props.]
Search deployment: Query layer build out

[Diagram: query components on multiple servers, index re-partitioned.
 SharePoint Servers: Admin, Crawl, and Query components holding Index P1 and P2.
 Database Server (all databases on one instance): Admin, Crawl, Props, with a second Props label shown.]
Search deployment: Crawl layer build out

[Diagram: crawl components on multiple servers.
 SharePoint Servers: four Query components over Index P1 and Index P2 (query components on multiple servers, index re-partitioned); Admin; Crawl components on multiple servers.
 Database Server (all databases on one instance): Admin, Crawl, Crawl, Props, Props.]
Thank You

 Royal Oak
 306 S. Washington Ave.
 Suite 212
 Royal Oak, MI 48067
 p: (248) 546-1800

 Grand Rapids
 15 Ionia SW
 Suite 270
 Grand Rapids, MI 49503
 p: (616) 776-1600




                 www.cdh.com
                   (c) C/D/H 2007. All rights reserved

How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 

Kürzlich hochgeladen (20)

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 

FAST Search for SharePoint

  • 1. CDH: Transform Enterprise Search with FAST Search for SharePoint
  • 2. Quick Facts
      • About Us: 22nd year; Grand Rapids & Royal Oak; 30 staff
      • Approach: vendor independent; non-reseller; professional services only
      • Partnerships: Microsoft Gold; VMware Enterprise; Citrix Silver; Novell Gold; Cisco Premier
  • 3. CDH Expertise
  • 4. CDH Talks Tech
  • 5. About me: David Tappan, Consultant. IOAp, MCITP, MCTS: SharePoint. davidt@cdh.com
  • 6. FAST Search: Better Insight
  • 7. Agenda: Insight
      • How FAST increases insight
      • Insight into how FAST is used to solve specific business problems
      • Insight into what FAST Search high availability really requires
  • 8. A question: What is Search, really?
  • 9. One answer: “Search is the ability to find text strings in documents”
  • 10. The Problem: Hidden meaning in the searcher’s intent
      • “What should I know about selling ERP?” - Alan Brewer, Sales Lead
      • “What should I know about implementing ERP?” - Renee Lo, Consultant
  • 11. Another answer: “Search is the ability to query any document property”
  • 12. CDH
  • 13. Recommended reading: http://www.well.com/~doctorow/metacrap.htm
  • 14. A better answer: Search is a service that matches what you mean with what documents mean.
  • 15. Cool FAST solutions: How FAST Search for SharePoint enables better meaning extraction
  • 16. F4SP Architecture Basics
  • 17. In the box: Dynamic rank algorithms at query time
      • Context: query terms in title vs. body
      • Query term proximity: «Bill Gates» vs. «Bill saw the gates»
      • «Anchors» match query terms: «...a page about Bill Gates...»
      • Click history match: others clicked a hit for «Bill Gates»
  • 18. Customizable Query Processing: What is someone thinking about when they perform a query?
  • 19. Search and the activity feed: Looking for a knowledge management solution?!?!?
      • Topics: Knowledge Management; Web Content Management; E-Commerce
      • Feed posts:
          – I love SharePoint. It’s the best Knowledge Management Solution in the market
          – Have you ever built an e-commerce solution on it?
          – Our focus is knowledge management, and it just works!
          – We use it as a web content management system, and we’re so happy with it
          – Great for WCM, Great for KM!
          – Just deployed for KM… so good, so far… will get back once the pilot is over!
  • 20. For the geeks… fql = xrank(string("fast search"), or(department:or(string("services"), string("engineering")), keywords:string("knowledge management")), boost=10000)
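The query on this slide can be assembled programmatically. The sketch below is only an illustration of how the nested FQL string might be built in Python; the helper names (`fql_string`, `fql_or`, `fql_scoped`, `fql_xrank`, `build_query`) are hypothetical and not part of any FAST or SharePoint API. It assumes `xrank` takes a match expression, a rank expression, and an integer `boost`, as the slide shows, and it rejects embedded double quotes rather than guessing at an escaping rule.

```python
def fql_string(text: str) -> str:
    """Wrap text in an FQL string("...") token (hypothetical helper)."""
    if '"' in text:
        # Escaping rules are not covered by the slide, so refuse instead of guessing.
        raise ValueError("double quotes are not handled in this sketch")
    return f'string("{text}")'


def fql_or(*operands: str) -> str:
    """Combine already-built FQL expressions with or(...)."""
    return "or(" + ", ".join(operands) + ")"


def fql_scoped(prop: str, expr: str) -> str:
    """Restrict an expression to one managed property, e.g. department:..."""
    return f"{prop}:{expr}"


def fql_xrank(match: str, rank: str, boost: int) -> str:
    """Items must satisfy `match`; items that also satisfy `rank` get `boost` added."""
    return f"xrank({match}, {rank}, boost={boost})"


def build_query(phrase: str, departments: list[str], keyword: str, boost: int = 10000) -> str:
    rank = fql_or(
        fql_scoped("department", fql_or(*(fql_string(d) for d in departments))),
        fql_scoped("keywords", fql_string(keyword)),
    )
    return fql_xrank(fql_string(phrase), rank, boost)


query = build_query("fast search", ["services", "engineering"], "knowledge management")
print(query)
```

The point of the design is that the searcher's context (department, keywords from the activity feed) only changes the ordering of results: the first operand alone decides which documents match.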
  • 21. In the box: Static rank algorithms at content processing time
      • Landing pages: prefer shallow URLs
      • Authority: links from other pages
      • Boost high quality sites/documents
  • 22. Customizable content processing: How to Index Content by Location?
      • Address, intersection, zip code, names, etc.
          – One Microsoft Way, Redmond, WA
      • Geodetic coordinates (latitude & longitude)
          – 47.639767, -122.129755
          – Degrees, minutes, seconds: 47° 38’ 23.16” N, 122° 7’ 47.1” W
      • Universal Transverse Mercator (UTM)
          – 10N 565367 5276630
      • Military Grid Reference System (MGRS)
          – 10T ET 65367 76630
      • Index Schema (Managed Properties)
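The decimal and degrees/minutes/seconds forms on this slide describe the same point, and the conversion is simple arithmetic: decimal = degrees + minutes/60 + seconds/3600, negated for the southern and western hemispheres. A minimal Python sketch follows; the function name `dms_to_decimal` is an illustrative choice, not something from the deck.

```python
def dms_to_decimal(degrees: float, minutes: float, seconds: float, hemisphere: str) -> float:
    """Convert degrees/minutes/seconds plus a hemisphere letter to signed decimal degrees."""
    if hemisphere not in ("N", "S", "E", "W"):
        raise ValueError(f"unknown hemisphere: {hemisphere!r}")
    value = degrees + minutes / 60 + seconds / 3600
    # South latitudes and west longitudes are negative in decimal notation.
    return -value if hemisphere in ("S", "W") else value


# The slide's example: 47° 38' 23.16" N, 122° 7' 47.1" W
lat = dms_to_decimal(47, 38, 23.16, "N")
lon = dms_to_decimal(122, 7, 47.1, "W")
print(lat, lon)
```

The results agree with the slide's decimal pair (47.639767, -122.129755) to within about 0.00001 degrees, which is the rounding of the seconds field.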
  • 23. Geographic entity extraction
      • Requirement
          – Parse elements from text
          – Tag documents with the individual values
      • Solution
          – Custom regular expression extraction
          – Call Bing Maps API
          – Return latitude and longitude and store as crawled property
      • Example: { name: 'Microsoft', address: 'One Microsoft Way, Redmond, WA 98052', phone: '1-800-Microsoft (642-7676)', path: 'http://www.microsoft.com', latitude: '47.639767', longitude: '-122.129755' }
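The extract-then-geocode flow described on this slide can be sketched in a few lines of Python. Everything here is an assumption made for illustration: the regular expression is a deliberately naive "street, city, ST 12345" pattern, `extract_location` is a hypothetical name, and the geocoder is passed in as a plain function so that no real Bing Maps call is implied. The stand-in `fake_geocode` just returns the coordinates printed on the slide.

```python
import re
from typing import Callable, Optional, Tuple

# Naive pattern: "street, city, ST 12345". Real addresses need far more care.
ADDRESS_RE = re.compile(
    r"(?P<street>[^,]+),\s*(?P<city>[^,]+),\s*(?P<state>[A-Z]{2})\s+(?P<zip>\d{5})"
)

Geocoder = Callable[[str], Optional[Tuple[float, float]]]


def extract_location(text: str, geocode: Geocoder) -> Optional[dict]:
    """Parse address parts from text, then tag them with coordinates if the geocoder knows them."""
    match = ADDRESS_RE.search(text)
    if match is None:
        return None
    entity = match.groupdict()
    coords = geocode(match.group(0))
    if coords is not None:
        # Stored as strings, mirroring the property values shown on the slide.
        entity["latitude"], entity["longitude"] = (str(c) for c in coords)
    return entity


def fake_geocode(address: str) -> Optional[Tuple[float, float]]:
    """Stand-in for a geocoding service; returns only the slide's example point."""
    known = {"One Microsoft Way, Redmond, WA 98052": (47.639767, -122.129755)}
    return known.get(address)


doc = extract_location("One Microsoft Way, Redmond, WA 98052", fake_geocode)
print(doc)
```

Injecting the geocoder keeps the parsing step testable offline; in a content processing pipeline the same slot would be filled by whatever service call the deployment actually uses.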
  • 24. How they did it: Geo-coding with Bing Maps API
      [Architecture diagram. Labels: Data Sources; Content Feeder; Content Processor (Format Conversion, Language Detection, Lemmatization, Entity Extraction, Mapper); Indexer; Index Partition; Query Processor; Federation Source (OpenSearch); Search Center; End Users]
  • 25. Geographic queries
      • Template: and(YOUR_TERM(s)_HERE, maxlatitude:range(LOW_LAT,max), minlatitude:range(min,HIGH_LAT), maxlongitude:range(LOW_LON,max), minlongitude:range(min,HIGH_LON))
      • e.g. and(football, maxlatitude:range(12,max), minlatitude:range(min,34), maxlongitude:range(56,max), minlongitude:range(min,78))
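One way to read the template on this slide: each document carries a bounding box in four managed properties, and the query keeps documents whose box overlaps the search box (the document's maximum must reach at least the low edge, and its minimum must not exceed the high edge). The Python sketch below fills in the template under that reading; `geo_query` is a hypothetical helper, every clause uses the `property:range(...)` form from the template, and longitude wraparound at ±180° is not handled.

```python
def geo_query(term: str, low_lat: float, high_lat: float, low_lon: float, high_lon: float) -> str:
    """Build the bounding-box FQL filter from the slide's template for one search term."""
    if low_lat > high_lat or low_lon > high_lon:
        raise ValueError("low bound must not exceed high bound")
    return (
        f"and({term},"
        f"maxlatitude:range({low_lat},max),"   # document reaches at least the south edge
        f"minlatitude:range(min,{high_lat}),"  # document starts at or below the north edge
        f"maxlongitude:range({low_lon},max),"  # document reaches at least the west edge
        f"minlongitude:range(min,{high_lon}))"  # document starts at or before the east edge
    )


example = geo_query("football", 12, 34, 56, 78)
print(example)
```

`term` is passed through unchanged, so it must already be a valid FQL expression.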
  • 26. Takeaways
      • Search ain’t beanbag
      • http://www.well.com/~doctorow/metacrap.htm
      • FAST Search for SharePoint provides tools to extract MEANING from content and queries
  • 27. Scaling FAST Search: What it takes
  • 28. FAST Search for SharePoint scaleout
      • Scale-out multiple “dimensions”: query volume; content volume; indexing freshness; redundancy options
      • Performance targets*: 15M docs/node; 25 QPS/node; 50 docs/sec
      • No theoretical upper bounds!
      • [Diagram labels: Query Volume; Content Volume; Search and Indexing; Query and Result Processing; Crawling and Content Processing]
      *Depends on content and hardware specifics
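The performance targets on this slide lend themselves to a back-of-the-envelope sizing calculation. The Python sketch below is only that: it assumes the three targets scale linearly and independently, treats 50 docs/sec as the overall feeding rate, and ignores redundancy. As the slide's footnote says, real numbers depend on content and hardware. The names `SizingEstimate` and `estimate` are illustrative.

```python
import math
from dataclasses import dataclass

# Targets taken from the slide; "depends on content and hardware specifics".
DOCS_PER_NODE = 15_000_000
QPS_PER_NODE = 25
DOCS_PER_SECOND = 50


@dataclass
class SizingEstimate:
    content_nodes: int       # nodes needed to hold the corpus
    query_nodes: int         # nodes needed to serve peak query load
    full_index_hours: float  # time to feed the whole corpus once


def estimate(total_docs: int, peak_qps: float) -> SizingEstimate:
    """Rough sizing under the assumption of linear, independent scaling."""
    return SizingEstimate(
        content_nodes=math.ceil(total_docs / DOCS_PER_NODE),
        query_nodes=math.ceil(peak_qps / QPS_PER_NODE),
        full_index_hours=total_docs / DOCS_PER_SECOND / 3600,
    )


plan = estimate(total_docs=40_000_000, peak_qps=60)
print(plan)
```

For 40 million documents and 60 queries per second this gives three nodes on each dimension and a little over 222 hours for a full feed at the stated rate, which shows why indexing freshness is listed as its own scaling dimension.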
  • 29. Don’t forget SharePoint!
      [Diagram of the FAST Content SSA crawl flow. Components: Web; FAST Content SSA; Admin DB; Admin component; Crawl DB; Master crawl component; crawl components; FAST Search. Flows: request crawl; log request; poll request; distribute work; crawl data; crawl history; crawl queue additions; document batches]
  • 30. SharePoint Search components
      • All components on one server; all databases on one instance
      • [Diagram: SharePoint Server with Admin, Query (Index P1), and Crawl components; Database Server with Admin, Crawl, and Props databases]
  • 31. Search deployment: Query layer build out
      • Query components on multiple servers; index re-partitioned; all databases on one instance
      • [Diagram: query servers holding Index P1 and Index P2; SharePoint Server with Admin and Crawl components; Database Server with Admin, Crawl, and Props databases]
  • 32. Search deployment: Crawl layer build out
      • Query components on multiple servers; index re-partitioned
      • Crawl components on multiple servers
      • All databases on one instance
      • [Diagram: query servers holding Index P1 and Index P2; SharePoint Servers with Admin and Crawl components; Database Server with Admin, Crawl, and Props databases]
  • 33. Thank You
      • Royal Oak: 306 S. Washington Ave., Suite 212, Royal Oak, MI 48067, p: (248) 546-1800
      • Grand Rapids: 15 Ionia SW, Suite 270, Grand Rapids, MI 49503, p: (616) 776-1600
      • www.cdh.com
      • (c) C/D/H 2007. All rights reserved