Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Search and Analytics (using Elasticsearch)
1. Copyright Elasticsearch 2013. Copying, publishing and/or distributing without written permission is strictly prohibited
Search and Analytics
(using Elasticsearch)
Costin Leau
2. Copyright Elasticsearch 2013. Copying, publishing and/or distributing without written permission is strictly prohibited
Why search?
3. Copyright Elasticsearch 2013. Copying, publishing and/or distributing without written permission is strictly prohibited
Search – what’s the big deal?
Basic/Metadata retrieval
“Find banks with more then (x) accounts”
4. Copyright Elasticsearch 2013. Copying, publishing and/or distributing without written permission is strictly prohibited
Search – what’s the big deal?
Basic/Metadata retrieval
“Find banks near my location”
5. Copyright Elasticsearch 2013. Copying, publishing and/or distributing without written permission is strictly prohibited
Search – What we’re all about
6. Copyright Elasticsearch 2013. Copying, publishing and/or distributing without written permission is strictly prohibited
Search categories
Basic/Metadata retrieval
Full-text search
Highlighting
Geolocation
Fuzzy search (“did-you-mean”)
Natural Language
7. Copyright Elasticsearch 2013. Copying, publishing and/or distributing without written permission is strictly prohibited
Search categories
Basic/Metadata retrieval
Full-text search
Highlighting
Geolocation
Fuzzy search (“did-you-mean”)
Natural Language
data stores
search engines
8. Copyright Elasticsearch 2013. Copying, publishing and/or distributing without written permission is strictly prohibited
‘Players’ in the search market
Search engines
- Google/Bing/Yahoo!/Ask.com/Yandex/Baidu
Open-Source
- Sphinx
- Apache Lucene
- Elasticsearch
- Solr
- Sensei
Enterprise Search
- Oracle Endeca / MDEX
- HP Autonomy
- Exalead
- IBM Enterprise Search
9. Copyright Elasticsearch 2013. Copying, publishing and/or distributing without written permission is strictly prohibited
Elasticsearch
10. Copyright Elasticsearch 2013. Copying, publishing and/or distributing without written permission is strictly prohibited
Elasticsearch
Open-Source Search & Analytics engine
- Structured & Unstructured Data
- Real Time
- Analytics capabilities (facets)
- REST based
Distributed
- Designed for the Cloud
- Designed for Big Data
11. Copyright Elasticsearch 2013. Copying, publishing and/or distributing without written permission is strictly prohibited
Elasticsearch
Open-Source Search & Analytics engine
- Structured & Unstructured Data
- Real Time
- Analytics capabilities (facets)
- REST based
Distributed
- Designed for the Cloud
- Designed for Big Data
Lightweight
12. Copyright Elasticsearch 2013. Copying, publishing and/or distributing without written permission is strictly prohibited
Elasticsearch
Open-Source Search & Analytics engine
- Structured & Unstructured Data
- Real Time
- Analytics capabilities (facets)
- REST based
Distributed
- Designed for the Cloud
- Designed for Big Data
Lightweight
Popular: >200K downloads/month
13. Copyright Elasticsearch 2013. Copying, publishing and/or distributing without written permission is strictly prohibited
Users
14. Copyright Elasticsearch 2013. Copying, publishing and/or distributing without written permission is strictly prohibited
Users
15. Copyright Elasticsearch 2013. Copying, publishing and/or distributing without written permission is strictly prohibited
Platform Adoption
http://www.thoughtworks.com/radar#platforms 2013
16. Copyright Elasticsearch 2013. Copying, publishing and/or distributing without written permission is strictly prohibited
Platform Adoption
http://www.thoughtworks.com/radar#platforms 2013
17. Copyright Elasticsearch 2013. Copying, publishing and/or distributing without written permission is strictly prohibited
Use Case - Text Search
https://github.com/blog/1381-a-whole-new-code-search
18. Copyright Elasticsearch 2013. Copying, publishing and/or distributing without written permission is strictly prohibited
Searches 50,000,000 venues every day using
Elasticsearch
Use Case - Geolocation
19. Copyright Elasticsearch 2013. Copying, publishing and/or distributing without written permission is strictly prohibited
Use Case – Support/Reporting
20. Copyright Elasticsearch 2013. Copying, publishing and/or distributing without written permission is strictly prohibited
Use Case - Centralized Logging
21. Copyright Elasticsearch 2013. Copying, publishing and/or distributing without written permission is strictly prohibited
Use Case - Pure Analytics
22. Copyright Elasticsearch 2013. Copying, publishing and/or distributing without written permission is strictly prohibited
Search and Big Data
23. Copyright Elasticsearch 2013. Copying, publishing and/or distributing without written permission is strictly prohibited
A Holistic View of a Big Data System
ETL
Real
Time
Streams
Unstructured Data (HDFS)
RT Semi
structured
Database
(hBase,
Cassandra,
Mongo)
Big SQL
(Greenplum,
AsterData,
Etc…)
Batch
ProcessingReal-Time
Processing
(s4, storm)
Analytics
24. Copyright Elasticsearch 2013. Copying, publishing and/or distributing without written permission is strictly prohibited
A Holistic View of a Big Data System
ETL
Real
Time
Streams
Unstructured Data (HDFS)
RT Semi
structured
Database
(hBase,
Cassandra,
Mongo)
Big SQL
(Greenplum,
AsterData,
Etc…)
Batch
Processing
Analytics
Real-Time
Processing
(s4, storm)
25. Copyright Elasticsearch 2013. Copying, publishing and/or distributing without written permission is strictly prohibited
Hadoop eco-system
Hadoop Distributed File System (HDFS)
Map Reduce Framework (MapRed)
26. Copyright Elasticsearch 2013. Copying, publishing and/or distributing without written permission is strictly prohibited
Hadoop eco-system
Hadoop Distributed File System (HDFS)
Map Reduce Framework (MapRed)
27. Copyright Elasticsearch 2013. Copying, publishing and/or distributing without written permission is strictly prohibited
Elasticsearch + Hadoop
0
10
20
30
40
50
60
M/R Pig Hive
Raw w/ ES
0
10
20
30
40
50
60
M/R Pig Hive
Raw w/ ES
Writing Reading / Querying
28. Copyright Elasticsearch 2013. Copying, publishing and/or distributing without written permission is strictly prohibited
Explore data through
(Elastic)Search
29. Copyright Elasticsearch 2013. Copying, publishing and/or distributing without written permission is strictly prohibited
Thank you!
@costinl
http://www.elasticsearch.org/