5. Lucidworks Fusion Is Search-Driven Everything
â˘Drive next generation relevance
via Content, Collaboration and
Context
â˘Harness best in class Open
Source: Apache Solr + Spark
â˘Simplify application
development and reduce
ongoing maintenance
CATALOG
DYNAMIC NAVIGATION
AND LANDING PAGES
INSTANT INSIGHTS AND
ANALYTICS
PERSONALIZED
SHOPPING EXPERIENCE
PROMOTIONS USER HISTORY
Data Acquisition
Indexing & Streaming
Smart Access API
Recommendations &â¨
Alerts
Analytics & InsightsExtreme Relevancy
Access data from
anywhere to build
intelligent, data-
driven applications.
9. ⢠System:
⢠Improved Javascript Stage performance
⢠Updated Versions for: Solr (5.4.1), Tika (1.12), Spark (1.6.1)
⢠Security:
⢠SAML-based security support
⢠API password-redaction capabilities
⢠Connectors:
⢠Box now supports JWT authentication, for easier setup
⢠Azure now supports incremental crawling
⢠HDFS and Windows Shares now support Kerberos authentication
⢠Additional controls for Github crawling
General Improvements
10. ⢠Sample your data source and preview documents
without indexing
⢠Build and test custom pipelines without affecting the
original deďŹnitions
⢠Copy, save, merge pipelines upon completion
Enhanced Data Modeling via Index Pipeline Previews
11. ⢠Greatly simplify the care and feeding of
time-based indexes
⢠Point and click creation of time series
shards
⢠Total control over number of shards and
replication
⢠Easily deďŹned retention and archiving
policies (e.g. 30 day retention)
⢠Intelligent query parsing optimizes shard
access
⢠Ideal for log data and signals
Time Series Done Right
12. ⢠User Interface designed for quickly getting
started with Fusion and easy customization
⢠Popular features are pre-conďŹgured
⢠Built on AngularJS and Apache-licensed open
source
⢠Built in templates for viewing a variety of data
sources
⢠Learn more: https://lucidworks.com/products/
view/
⢠Fork on Github: https://github.com/lucidworks/
lucidworks-view
Lucidworks View
14. ⢠Improved Spark streaming and data locality
integration resulting in signiďŹcant performance
improvements
⢠$FUSION_HOME/bin/spark-shell available for rapid
prototyping and testing of Spark in the Fusion
environment using the command line
⢠Check out: http://github.com/lucidworks/spark-solr
Spark FTW
15. ⢠Support for new Spark Job types:
⢠Aggregations, Script, Item Similarity, Quality
⢠Spark Job API now available at â/spark/jobsâ
⢠Create and run your own Spark jobs
⢠Leverage best in class libraries like MLLib, Mahout
and DL4J
Fusion: Creating Jobs for Engineers Since 2015
16. ⢠Spark has very basic text handling capabilities built-in
(whitespace tokenization and a few others)
⢠Lucene has a fast, capable text analysis system built-
in, hence:
⢠Weâve made Lucene Analyzers work nicely in Spark!
⢠Learn more at:
⢠https://lucidworks.com/blog/2016/04/13/spark-
solr-lucenetextanalyzer/
⢠https://github.com/lucidworks/spark-solr/blob/
master/src/main/scala/com/lucidworks/spark/
analysis/LuceneTextAnalyzer.scala
Lucene + Spark: Getting Past the Whitespace
17. ⢠Fusion can now capture and calculate
common search metrics like:
⢠Mean Reciprocal Rank
⢠Precision/Recall
⢠NDCG (Normalized Discounted
Cumulative Gain)
⢠Uses the same framework as signals and
aggregations, meaning you can easily track
and report across time
Speaking of QualityâŚ
18. Demo
Spark Shell, run k-Means, index clusters:
https://github.com/lucidworks/fusion-examples/tree/master/fusion-2.3-webinar/src/main/spark-shell
19. ⢠Next Release will be 3.0 (June/July timeframe)
⢠Java 8 and above
⢠Solr 6.x
⢠Query Pipeline Builder
⢠Enhanced Machine Learning capabilities
⢠Preview in 2.3, but marked experimental
⢠Full featured Experiment Management framework with
support for multi-arm bandit optimization
⢠Easy import/export for moving from Dev -> QA -> Staging
-> Production
Looking Ahead
20. ⢠Fusion 2.3 will be available week of April 25th
⢠Learn more about Fusion at: http://www.lucidworks.com/products/fusion
⢠Learn more about Lucidworks View: https://lucidworks.com/products/view/
⢠Fusion docs available at http://docs.lucidworks.com
Questions?