Suche senden
Hochladen
SFBay Area Solr Meetup - June 18th: Box + Solr = Content Search for Business
•
Als PPTX, PDF herunterladen
•
0 gefällt mir
•
2,875 views
Lucidworks (Archived)
Folgen
"Box + Solr = Content Search for Business" - Wei Zhao, Box
Weniger lesen
Mehr lesen
Technologie
Melden
Teilen
Melden
Teilen
1 von 39
Jetzt herunterladen
Empfohlen
Content Search for Business Using Solr: Presented by Wei Zhao, Box
Content Search for Business Using Solr: Presented by Wei Zhao, Box
Lucidworks
Solr search engine with multiple table relation
Solr search engine with multiple table relation
Jay Bharat
Summit v4 dave wolcott
Summit v4 dave wolcott
Data Con LA
FOCA 2.5.5 Training
FOCA 2.5.5 Training
Chema Alonso
Updated: Getting Ready for Due-Diligence
Updated: Getting Ready for Due-Diligence
Marty Kaszubowski
Simbad marinela
Simbad marinela
guest986e5ae
ICT Tool Sharing
ICT Tool Sharing
Republic Polytechnic
The Gaiety Hotel
The Gaiety Hotel
dummypackages
Empfohlen
Content Search for Business Using Solr: Presented by Wei Zhao, Box
Content Search for Business Using Solr: Presented by Wei Zhao, Box
Lucidworks
Solr search engine with multiple table relation
Solr search engine with multiple table relation
Jay Bharat
Summit v4 dave wolcott
Summit v4 dave wolcott
Data Con LA
FOCA 2.5.5 Training
FOCA 2.5.5 Training
Chema Alonso
Updated: Getting Ready for Due-Diligence
Updated: Getting Ready for Due-Diligence
Marty Kaszubowski
Simbad marinela
Simbad marinela
guest986e5ae
ICT Tool Sharing
ICT Tool Sharing
Republic Polytechnic
The Gaiety Hotel
The Gaiety Hotel
dummypackages
Creep
Creep
tanica
Tennis
Tennis
aritz
Bob dylan
Bob dylan
tanica
Davis mark advanced search analytics in 20 minutes
Davis mark advanced search analytics in 20 minutes
Lucidworks (Archived)
Windows 8 で魅力的なWeb サイトを作る
Windows 8 で魅力的なWeb サイトを作る
彰 村地
Web Design Course Overview
Web Design Course Overview
CMD Training Institute
презентация по книге дуг де карло "экстримальное управление проектами"
презентация по книге дуг де карло "экстримальное управление проектами"
tarodnova
Portades
Portades
guest6bfe1581
Cmd Training Institute - New Premises
Cmd Training Institute - New Premises
CMD Training Institute
Searching The United States Code with Solr/Lucene
Searching The United States Code with Solr/Lucene
Lucidworks (Archived)
Technology opportunities in hampton roads (kaszubowski ), nasa technology day...
Technology opportunities in hampton roads (kaszubowski ), nasa technology day...
Marty Kaszubowski
Column Stride Fields aka. DocValues
Column Stride Fields aka. DocValues
Lucidworks (Archived)
Building SaaS Solutions for Online Media Using Apache Solr
Building SaaS Solutions for Online Media Using Apache Solr
Lucidworks (Archived)
Building specialized industry applications using Solr, and migration from FAS...
Building specialized industry applications using Solr, and migration from FAS...
Lucidworks (Archived)
Gaiety Hotel - full version
Gaiety Hotel - full version
dummypackages
The scene- I love you like a love song Selena Gomez
The scene- I love you like a love song Selena Gomez
tanica
Highly Relevant Search Result Ranking for Law Enforcement
Highly Relevant Search Result Ranking for Law Enforcement
Lucidworks (Archived)
Impact of open source search on the intelligence community
Impact of open source search on the intelligence community
Lucidworks (Archived)
Solr 3.1 and beyond
Solr 3.1 and beyond
Lucidworks (Archived)
Joan Miro
Joan Miro
guest986e5ae
Box + Solr = Content Search for Business
Box + Solr = Content Search for Business
Lucidworks
Big Search 4 Big Data War Stories
Big Search 4 Big Data War Stories
OpenSource Connections
Weitere ähnliche Inhalte
Andere mochten auch
Creep
Creep
tanica
Tennis
Tennis
aritz
Bob dylan
Bob dylan
tanica
Davis mark advanced search analytics in 20 minutes
Davis mark advanced search analytics in 20 minutes
Lucidworks (Archived)
Windows 8 で魅力的なWeb サイトを作る
Windows 8 で魅力的なWeb サイトを作る
彰 村地
Web Design Course Overview
Web Design Course Overview
CMD Training Institute
презентация по книге дуг де карло "экстримальное управление проектами"
презентация по книге дуг де карло "экстримальное управление проектами"
tarodnova
Portades
Portades
guest6bfe1581
Cmd Training Institute - New Premises
Cmd Training Institute - New Premises
CMD Training Institute
Searching The United States Code with Solr/Lucene
Searching The United States Code with Solr/Lucene
Lucidworks (Archived)
Technology opportunities in hampton roads (kaszubowski ), nasa technology day...
Technology opportunities in hampton roads (kaszubowski ), nasa technology day...
Marty Kaszubowski
Column Stride Fields aka. DocValues
Column Stride Fields aka. DocValues
Lucidworks (Archived)
Building SaaS Solutions for Online Media Using Apache Solr
Building SaaS Solutions for Online Media Using Apache Solr
Lucidworks (Archived)
Building specialized industry applications using Solr, and migration from FAS...
Building specialized industry applications using Solr, and migration from FAS...
Lucidworks (Archived)
Gaiety Hotel - full version
Gaiety Hotel - full version
dummypackages
The scene- I love you like a love song Selena Gomez
The scene- I love you like a love song Selena Gomez
tanica
Highly Relevant Search Result Ranking for Law Enforcement
Highly Relevant Search Result Ranking for Law Enforcement
Lucidworks (Archived)
Impact of open source search on the intelligence community
Impact of open source search on the intelligence community
Lucidworks (Archived)
Solr 3.1 and beyond
Solr 3.1 and beyond
Lucidworks (Archived)
Joan Miro
Joan Miro
guest986e5ae
Andere mochten auch
(20)
Creep
Creep
Tennis
Tennis
Bob dylan
Bob dylan
Davis mark advanced search analytics in 20 minutes
Davis mark advanced search analytics in 20 minutes
Windows 8 で魅力的なWeb サイトを作る
Windows 8 で魅力的なWeb サイトを作る
Web Design Course Overview
Web Design Course Overview
презентация по книге дуг де карло "экстримальное управление проектами"
презентация по книге дуг де карло "экстримальное управление проектами"
Portades
Portades
Cmd Training Institute - New Premises
Cmd Training Institute - New Premises
Searching The United States Code with Solr/Lucene
Searching The United States Code with Solr/Lucene
Technology opportunities in hampton roads (kaszubowski ), nasa technology day...
Technology opportunities in hampton roads (kaszubowski ), nasa technology day...
Column Stride Fields aka. DocValues
Column Stride Fields aka. DocValues
Building SaaS Solutions for Online Media Using Apache Solr
Building SaaS Solutions for Online Media Using Apache Solr
Building specialized industry applications using Solr, and migration from FAS...
Building specialized industry applications using Solr, and migration from FAS...
Gaiety Hotel - full version
Gaiety Hotel - full version
The scene- I love you like a love song Selena Gomez
The scene- I love you like a love song Selena Gomez
Highly Relevant Search Result Ranking for Law Enforcement
Highly Relevant Search Result Ranking for Law Enforcement
Impact of open source search on the intelligence community
Impact of open source search on the intelligence community
Solr 3.1 and beyond
Solr 3.1 and beyond
Joan Miro
Joan Miro
Ähnlich wie SFBay Area Solr Meetup - June 18th: Box + Solr = Content Search for Business
Box + Solr = Content Search for Business
Box + Solr = Content Search for Business
Lucidworks
Big Search 4 Big Data War Stories
Big Search 4 Big Data War Stories
OpenSource Connections
3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides
3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides
DuraSpace
Extracts from AS/400 Concepts & Tools workshop
Extracts from AS/400 Concepts & Tools workshop
Ramesh Joshi
How to build your own google
How to build your own google
Data Science Warsaw
Realtimestream and realtime fastcatsearch
Realtimestream and realtime fastcatsearch
상욱 송
Integrate ManifoldCF with Solr
Integrate ManifoldCF with Solr
francelabs
A Practical Introduction to Apache Solr
A Practical Introduction to Apache Solr
Angel Borroy López
[IC Manage] Workspace Acceleration & Network Storage Reduction
[IC Manage] Workspace Acceleration & Network Storage Reduction
Perforce
Temporal and semantic analysis of richly typed social networks from user-gene...
Temporal and semantic analysis of richly typed social networks from user-gene...
Zide Meng
Fedora Commons in the CLARIN Infrastructure
Fedora Commons in the CLARIN Infrastructure
Menzo Windhouwer
Data Matters for AGU Early Career Conference
Data Matters for AGU Early Career Conference
Carly Strasser
Data Infrastructure at LinkedIn
Data Infrastructure at LinkedIn
Amy W. Tang
LOD2 Webinar Series: Zemanta / Open refine
LOD2 Webinar Series: Zemanta / Open refine
LOD2 Creating Knowledge out of Interlinked Data
Amundsen: From discovering to security data
Amundsen: From discovering to security data
markgrover
Introduction to Lucene & Solr and Usecases
Introduction to Lucene & Solr and Usecases
Rahul Jain
Dancing faster in the datasphere
Dancing faster in the datasphere
J T "Tom" Johnson
Continuous Applications at Scale of 100 Teams with Databricks Delta and Struc...
Continuous Applications at Scale of 100 Teams with Databricks Delta and Struc...
Databricks
Second Thoughts about Metadata Standards for Data
Second Thoughts about Metadata Standards for Data
University of California Curation Center
The Materials Data Facility: A Distributed Model for the Materials Data Commu...
The Materials Data Facility: A Distributed Model for the Materials Data Commu...
Ben Blaiszik
Ähnlich wie SFBay Area Solr Meetup - June 18th: Box + Solr = Content Search for Business
(20)
Box + Solr = Content Search for Business
Box + Solr = Content Search for Business
Big Search 4 Big Data War Stories
Big Search 4 Big Data War Stories
3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides
3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides
Extracts from AS/400 Concepts & Tools workshop
Extracts from AS/400 Concepts & Tools workshop
How to build your own google
How to build your own google
Realtimestream and realtime fastcatsearch
Realtimestream and realtime fastcatsearch
Integrate ManifoldCF with Solr
Integrate ManifoldCF with Solr
A Practical Introduction to Apache Solr
A Practical Introduction to Apache Solr
[IC Manage] Workspace Acceleration & Network Storage Reduction
[IC Manage] Workspace Acceleration & Network Storage Reduction
Temporal and semantic analysis of richly typed social networks from user-gene...
Temporal and semantic analysis of richly typed social networks from user-gene...
Fedora Commons in the CLARIN Infrastructure
Fedora Commons in the CLARIN Infrastructure
Data Matters for AGU Early Career Conference
Data Matters for AGU Early Career Conference
Data Infrastructure at LinkedIn
Data Infrastructure at LinkedIn
LOD2 Webinar Series: Zemanta / Open refine
LOD2 Webinar Series: Zemanta / Open refine
Amundsen: From discovering to security data
Amundsen: From discovering to security data
Introduction to Lucene & Solr and Usecases
Introduction to Lucene & Solr and Usecases
Dancing faster in the datasphere
Dancing faster in the datasphere
Continuous Applications at Scale of 100 Teams with Databricks Delta and Struc...
Continuous Applications at Scale of 100 Teams with Databricks Delta and Struc...
Second Thoughts about Metadata Standards for Data
Second Thoughts about Metadata Standards for Data
The Materials Data Facility: A Distributed Model for the Materials Data Commu...
The Materials Data Facility: A Distributed Model for the Materials Data Commu...
Mehr von Lucidworks (Archived)
Integrating Hadoop & Solr
Integrating Hadoop & Solr
Lucidworks (Archived)
The Data-Driven Paradigm
The Data-Driven Paradigm
Lucidworks (Archived)
Downtown SF Lucene/Solr Meetup - September 17: Thoth: Real-time Solr Monitori...
Downtown SF Lucene/Solr Meetup - September 17: Thoth: Real-time Solr Monitori...
Lucidworks (Archived)
SFBay Area Solr Meetup - July 15th: Integrating Hadoop and Solr
SFBay Area Solr Meetup - July 15th: Integrating Hadoop and Solr
Lucidworks (Archived)
SFBay Area Solr Meetup - June 18th: Benchmarking Solr Performance
SFBay Area Solr Meetup - June 18th: Benchmarking Solr Performance
Lucidworks (Archived)
Chicago Solr Meetup - June 10th: This Ain't Your Parents' Search Engine
Chicago Solr Meetup - June 10th: This Ain't Your Parents' Search Engine
Lucidworks (Archived)
Chicago Solr Meetup - June 10th: Exploring Hadoop with Search
Chicago Solr Meetup - June 10th: Exploring Hadoop with Search
Lucidworks (Archived)
What's new in solr june 2014
What's new in solr june 2014
Lucidworks (Archived)
Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr
Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr
Lucidworks (Archived)
Minneapolis Solr Meetup - May 28, 2014: Target.com Search
Minneapolis Solr Meetup - May 28, 2014: Target.com Search
Lucidworks (Archived)
Exploration of multidimensional biomedical data in pub chem, Presented by Lia...
Exploration of multidimensional biomedical data in pub chem, Presented by Lia...
Lucidworks (Archived)
Unstructured Or: How I Learned to Stop Worrying and Love the xml, Presented...
Unstructured Or: How I Learned to Stop Worrying and Love the xml, Presented...
Lucidworks (Archived)
Building a Lightweight Discovery Interface for Chinese Patents, Presented by ...
Building a Lightweight Discovery Interface for Chinese Patents, Presented by ...
Lucidworks (Archived)
Big Data Challenges, Presented by Wes Caldwell at SolrExchage DC
Big Data Challenges, Presented by Wes Caldwell at SolrExchage DC
Lucidworks (Archived)
What's New in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DC
What's New in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DC
Lucidworks (Archived)
Solr At AOL, Presented by Sean Timm at SolrExchage DC
Solr At AOL, Presented by Sean Timm at SolrExchage DC
Lucidworks (Archived)
Intro to Solr Cloud, Presented by Tim Potter at SolrExchage DC
Intro to Solr Cloud, Presented by Tim Potter at SolrExchage DC
Lucidworks (Archived)
Test Driven Relevancy, Presented by Doug Turnbull at SolrExchage DC
Test Driven Relevancy, Presented by Doug Turnbull at SolrExchage DC
Lucidworks (Archived)
Building a data driven search application with LucidWorks SiLK
Building a data driven search application with LucidWorks SiLK
Lucidworks (Archived)
Introducing LucidWorks App for Splunk Enterprise webinar
Introducing LucidWorks App for Splunk Enterprise webinar
Lucidworks (Archived)
Mehr von Lucidworks (Archived)
(20)
Integrating Hadoop & Solr
Integrating Hadoop & Solr
The Data-Driven Paradigm
The Data-Driven Paradigm
Downtown SF Lucene/Solr Meetup - September 17: Thoth: Real-time Solr Monitori...
Downtown SF Lucene/Solr Meetup - September 17: Thoth: Real-time Solr Monitori...
SFBay Area Solr Meetup - July 15th: Integrating Hadoop and Solr
SFBay Area Solr Meetup - July 15th: Integrating Hadoop and Solr
SFBay Area Solr Meetup - June 18th: Benchmarking Solr Performance
SFBay Area Solr Meetup - June 18th: Benchmarking Solr Performance
Chicago Solr Meetup - June 10th: This Ain't Your Parents' Search Engine
Chicago Solr Meetup - June 10th: This Ain't Your Parents' Search Engine
Chicago Solr Meetup - June 10th: Exploring Hadoop with Search
Chicago Solr Meetup - June 10th: Exploring Hadoop with Search
What's new in solr june 2014
What's new in solr june 2014
Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr
Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr
Minneapolis Solr Meetup - May 28, 2014: Target.com Search
Minneapolis Solr Meetup - May 28, 2014: Target.com Search
Exploration of multidimensional biomedical data in pub chem, Presented by Lia...
Exploration of multidimensional biomedical data in pub chem, Presented by Lia...
Unstructured Or: How I Learned to Stop Worrying and Love the xml, Presented...
Unstructured Or: How I Learned to Stop Worrying and Love the xml, Presented...
Building a Lightweight Discovery Interface for Chinese Patents, Presented by ...
Building a Lightweight Discovery Interface for Chinese Patents, Presented by ...
Big Data Challenges, Presented by Wes Caldwell at SolrExchage DC
Big Data Challenges, Presented by Wes Caldwell at SolrExchage DC
What's New in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DC
What's New in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DC
Solr At AOL, Presented by Sean Timm at SolrExchage DC
Solr At AOL, Presented by Sean Timm at SolrExchage DC
Intro to Solr Cloud, Presented by Tim Potter at SolrExchage DC
Intro to Solr Cloud, Presented by Tim Potter at SolrExchage DC
Test Driven Relevancy, Presented by Doug Turnbull at SolrExchage DC
Test Driven Relevancy, Presented by Doug Turnbull at SolrExchage DC
Building a data driven search application with LucidWorks SiLK
Building a data driven search application with LucidWorks SiLK
Introducing LucidWorks App for Splunk Enterprise webinar
Introducing LucidWorks App for Splunk Enterprise webinar
Kürzlich hochgeladen
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
Alex Barbosa Coqueiro
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
Mattias Andersson
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
null - The Open Security Community
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
Lonnie McRorey
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
Fwdays
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
Enterprise Knowledge
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
Fwdays
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
Commit University
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
comworks
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
2toLead Limited
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
Scott Keck-Warren
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
Stephanie Beckett
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Precisely
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
Florian Wilhelm
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
Alan Dix
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
Dilum Bandara
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
Alfredo García Lavilla
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
DianaGray10
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Zilliz
How to write a Business Continuity Plan
How to write a Business Continuity Plan
Databarracks
Kürzlich hochgeladen
(20)
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
How to write a Business Continuity Plan
How to write a Business Continuity Plan
SFBay Area Solr Meetup - June 18th: Box + Solr = Content Search for Business
1.
1 June 2014 Box +
Solr = Content Search for Business
2.
2 Wei Zhao Box backend
engineer wzhao@box.com
3.
3 to make organizations
more productive, competitive and collaborative by connecting people and their most important information Box mission
4.
4 25MM+ Users 225K+ Businesses 99% Fortune 500
5.
5 Box search mission
is to make user content easy to discover.
6.
6 10Billion+ Documents 10TB+ Index size 100M+ Daily requests Box
uses Solr for search
7.
7 Quick Search
8.
8 Quick Search
9.
9 Full Search
10.
10 Sharding – splitting
the index Agenda Highly available search A few more things 1 2 3 4 5 Q&A Currently working on
11.
11 We shard things
12.
12 Shard ID =
File ID % Total Shards
13.
13 Multi-tenant – One
big logical index for all users Solr index Shard1 Shard2 Shard3 ShardN
14.
14 Search scope
15.
15 File ID: 12345 OwnerID:
user1 Parent Folders IDs: folder1, folder2 File Name: Solr.ppt File Content: blah ...... A typical Solr Document
16.
16 Owner: User1 Parent: Folder1 Owner:
User2 Parent: Folder3 Owner: User2 Parent: Folder2 Owner: User1 Parent: Folder1 Folder4 File 1 File 2 File 3 File 4
17.
17 User1 with no
share folder Owner: User1 Parent: Folder1 Owner: User2 Parent: Folder3 Owner: User2 Parent: Folder2 Owner: User1 Parent: Folder1 Folder4 File 1 File 2 File 3 File 4
18.
18 User2 shares Folder2
with User1 Owner: User1 Parent: Folder1 Owner: User2 Parent: Folder3 Owner: User2 Parent: Folder2 Owner: User1 Parent: Folder1 Folder4 File 1 File 2 File 3 File 4
19.
19 User2 shares Folder2
with User1 Owner: User1 Parent: Folder1 Owner: User2 Parent: Folder3 Owner: User2 Parent: Folder2 Owner: User1 Parent: Folder1 Folder4 File 1 File 2 File 3 File 4
20.
20 User2 shares Folder2
with User1 Owner: User1 Parent: Folder1 Owner: User2 Parent: Folder3 Owner: User2 Parent: Folder5 Owner: User1 Parent: Folder1 Folder4 File 1 File 2 File 3 File 4 Removed out of Folder2
21.
21 User2 shares Folder2
with User1 Owner: User1 Parent: Folder1 Owner: User2 Parent: Folder3 Owner: User2 Parent: Folder5 Owner: User1 Parent: Folder1 Folder4 File 1 File 2 File 3 File 4 Removed out of Folder2
22.
22 Highly Available Search
23.
23 • Index is
highly available • Search functionality is highly available
24.
24 Index workflow
25.
25 Box Front End Upload Index Queue Queue 1 Queue
2 Queue 3 Indexer 1 Indexer 3 Indexer 2 MySQL Index1 Index2 Index2
26.
26 Search workflow
27.
27 Box Front End query HA Proxy Head node HA Proxy 1
2 3 N Box Front End query HA Proxy Head node HA Proxy 1 2 3 N Data center boundary
28.
28 A few more
things
29.
29 File Content Search
30.
30 Box Front End Upload MySQL Box File Storage Indexer Solr Index Text
Extraction Extracted Text
31.
31 Multi-language support
32.
32 Raw file content Language detector English tokenizer Spanish
tokenizer Japanese tokenizer German tokenizer file_content_en File_content_es {hola} file_content_ja . . . . File_content_de
33.
33 To Dos • Scale
language support • Support document with mixed languages
34.
34 Search Warm-up
35.
35 • Front end
informs backend to warm up on keyboard focus • Backend prepares the search filter and caches it in a search session • Backend sends a warm-up query to Solr
36.
36 What we are
working on
37.
37 • Search suggestions •
Search operators • Use machine learning to influence ranking • Logical sharding Things we are working on
38.
38 Question?
39.
39 Contact: wzhao@box.com We are
hiring!
Hinweis der Redaktion
This has changed many of the access patterns to data in the enterprise.
This has changed many of the access patterns to data in the enterprise.
This has changed many of the access patterns to data in the enterprise.
Jetzt herunterladen