Datafari - Building an Open Source
Enterprise Search Solution from
Popular Building Blocks
CEDRIC ULMER
FRANCE LABS
II-SDV
25/04/17
Datafari
So what is Datafari?
• “Packaged solution” to analyse and search for documents and data
• Can index heterogeneous data formats from multiple sources
• Federated search interface
• Apache v2 licence
Why Datafari?
Choice of the Apache Solr and Elasticsearch technologies (more about this later...)
Three ways to meet a customer's requirements:
• Use a packaged solution available on the market from a 3rd party
• Start from Apache Solr or Elasticsearch (or others)
• Develop or gather the necessary components for each customer's needs
• Ensure “production-ready” material: docs, processes, tests
• Create our own packaged solution (yeah!)
Why Datafari?
Problems with 3rd party proprietary solutions:
• Black box
• Unclear roadmap
• Resilience (bankruptcy, acquisition…)
Problems with 3rd party open source solutions:
• Lack of technical documentation
• Difficulty setting up an understandable debug environment
• Delays in updating the embedded components, in particular Solr or ES
• License issues (mostly viral ones)
• Lack of resilience from the makers
=> This required us to develop our own solution to better address our customers' needs
Why Datafari?
Idea:
• Gather the best of both worlds:
• The “packaged” aspect of existing solutions
• Many functionalities
• All in one
• The flexibility of a solution based on Solr and ES
• All of that with an Apache v2 licence ☺
• Focus on Enterprise Search:
• Admin for search experts
• Admin for search admin
• Eased AD/LDAP management
• Search and data analytics
Based on 4 building blocks:
• Apache Solr
• The heart of the search engine
• Apache ManifoldCF
• Crawling documents
• AjaxFranceLabs
• UI
• Elasticsearch
• Data analytics
Datafari 3.1 architecture (diagram)
• Data sources: CMS, DB, fileshares, web
• Security (AD, LDAP)
• Apache Tomcat 7
• Datafari Search / Admin
• Apache ManifoldCF 2.5: Crawler Service, Authorization Service
• Apache Solr 5.5: Document Index, Statistics Index
• PostgreSQL
• Cassandra (user management)
• ELK
Apache Solr
Lucene based Full text search engine
Apache Top Level project
Large community (users/devs)
Efficient/Reliable
Scalable
• High availability
• Queries
• Index volume
Apache Solr
Java web application
REST APIs XML/HTTP
• Indexing
• Querying
Caching
Web admin interface
Configuration through XML config files or APIs
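As a rough illustration of the HTTP querying API mentioned above, the sketch below builds a request URL for Solr's standard `/select` handler. The host, port and core name (`mycore`) are assumptions made for the example and do not come from the slides; `q`, `rows` and `wt` are standard Solr query parameters. Nothing is sent over the network here.

```python
from urllib.parse import urlencode


def build_select_url(base_url, core, query, rows=10):
    """Build a URL for Solr's /select request handler.

    base_url and core are placeholders for a real installation.
    """
    params = {"q": query, "rows": rows, "wt": "json"}
    return f"{base_url}/solr/{core}/select?{urlencode(params)}"


# Hypothetical local Solr instance and core name.
url = build_select_url("http://localhost:8983", "mycore", "title:datafari")
print(url)
```

The resulting URL can then be fetched with any HTTP client; indexing works along the same lines through the `/update` handler.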
Apache Lucene/Solr – Some refs
Apache Solr for Datafari
Search core of Datafari
Preconfigured index for rich documents
• Language detection
• Standard facets
• Autocomplete
• Spellchecker
Indexing user queries
• Enables analytics on users' search behavior
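To make the "standard facets" item more concrete: with its default JSON settings, Solr returns the counts for each facet field as a flat list that alternates value and count. The helper below is a minimal sketch that turns such a list into a dictionary. The field name `language` and the sample response are hand-written for illustration; they are not actual Datafari output.

```python
def facet_counts(response, field):
    """Convert Solr's flat [value, count, value, count, ...] list to a dict."""
    flat = response["facet_counts"]["facet_fields"][field]
    return dict(zip(flat[0::2], flat[1::2]))


# Hand-written sample shaped like a Solr facet response (not real data).
sample = {
    "facet_counts": {
        "facet_fields": {"language": ["en", 120, "fr", 45, "de", 7]}
    }
}
counts = facet_counts(sample, "language")
print(counts)
```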
Apache ManifoldCF
Framework for data crawling
Management of incremental crawling
Authorization management
Programmable crawls (time windows, loads, regex…)
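The idea behind incremental crawling can be sketched as follows: keep a version marker per document (for example a modification timestamp) and, on the next crawl, re-index only what changed and delete what disappeared. This is a generic illustration under that assumption, not ManifoldCF's actual implementation or API; all names are hypothetical.

```python
def plan_incremental_crawl(previous, current):
    """Compare two snapshots mapping document id -> version string.

    Returns (to_index, to_delete): documents that are new or changed,
    and documents that no longer exist in the source.
    """
    to_index = sorted(
        doc_id for doc_id, version in current.items()
        if previous.get(doc_id) != version
    )
    to_delete = sorted(doc_id for doc_id in previous if doc_id not in current)
    return to_index, to_delete


# Hypothetical snapshots from two successive crawls.
last_crawl = {"a.pdf": "v1", "b.docx": "v1", "c.txt": "v3"}
this_crawl = {"a.pdf": "v1", "b.docx": "v2", "d.xlsx": "v1"}
to_index, to_delete = plan_incremental_crawl(last_crawl, this_crawl)
print(to_index, to_delete)
```

Unchanged documents (`a.pdf` here) are skipped entirely, which is what keeps repeated crawls of large sources affordable.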
Apache ManifoldCF
Many off-the-shelf connectors:
• FileShare (Samba)
• JDBC
• Website
• Alfresco
• CMIS
• SharePoint
• Mail
• Dropbox
• LDAP/AD
Apache ManifoldCF for Datafari
Manages data crawling
Manages authentication
Preconfigured integration with our Solr
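The slides do not detail how access rights are enforced at search time. One common approach to document-level security is to store "allow" and "deny" access tokens with each indexed document and to compare them with the tokens of the searching user. The sketch below illustrates that idea only; the field names `allow` and `deny` and the token format are assumptions, and this is not Datafari's or ManifoldCF's code.

```python
def is_visible(doc, user_tokens):
    """Return True if the user may see the document.

    A deny token always wins; otherwise at least one allow token must match.
    """
    tokens = set(user_tokens)
    if tokens & set(doc.get("deny", [])):
        return False
    return bool(tokens & set(doc.get("allow", [])))


# Hypothetical documents and user group tokens.
docs = [
    {"id": "budget.xlsx", "allow": ["finance"], "deny": ["interns"]},
    {"id": "handbook.pdf", "allow": ["everyone"], "deny": []},
]
visible = [d["id"] for d in docs if is_visible(d, ["everyone", "interns"])]
print(visible)
```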
Datafari Search
Front-End
User UI
• AjaxFranceLabs
Authentication
Interactions with Solr (SolrJ)
Indexing user queries
Admin UI
• Solr
• ManifoldCF
• Statistics
AjaxFranceLabs
Inspired by AjaxSolr
JavaScript/Ajax client
Provides several components:
• Manager: backend connection
• Widgets
• Graphical/Logical components
• (Advanced) Search
• Facet
• Geolocation (based on OpenStreetMap)
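The Manager/widget split described above can be sketched as follows: a manager owns the backend connection, runs the query once, and hands the response to every registered widget. AjaxFranceLabs itself is a JavaScript library; Python is used here only to illustrate the pattern, and every class, method and field name below is hypothetical rather than the library's API.

```python
class Manager:
    """Owns the backend connection and dispatches responses to widgets."""

    def __init__(self, backend):
        self.backend = backend  # callable: query string -> response dict
        self.widgets = []

    def add_widget(self, widget):
        self.widgets.append(widget)

    def search(self, query):
        response = self.backend(query)
        return [widget.render(response) for widget in self.widgets]


class ResultWidget:
    def render(self, response):
        return f"{len(response['docs'])} result(s)"


class FacetWidget:
    def __init__(self, field):
        self.field = field

    def render(self, response):
        counts = response["facets"].get(self.field, {})
        return ", ".join(f"{value} ({count})" for value, count in counts.items())


def fake_backend(query):
    """Stand-in for the search servlet; returns canned data."""
    return {
        "docs": [{"title": "a.pdf"}, {"title": "b.docx"}],
        "facets": {"extension": {"pdf": 1, "docx": 1}},
    }


manager = Manager(fake_backend)
manager.add_widget(ResultWidget())
manager.add_widget(FacetWidget("extension"))
output = manager.search("datafari")
print(output)
```

Keeping the backend call in one place means new widgets can be added without touching the connection logic.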
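AjaxFranceLabs request flow (diagram)
• Browser: Datafari Search, with a Manager and its widgets (SearchBarWidget, ResultWidget, FacetWidget)
• Ajax requests to the Datafari Search Servlet
• Datafari Server: Datafari Search Servlet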
Use case 1 – Oil and Gas
Sources:
• SharePoint
• Documentum
• Fileshare
• DB
Volume: 28 TB
Users: Geoscientists
Use case 2 – Nuclear
Sources:
• Fileshare
• Oracle
• DB
Volume: 15 M docs
Users: Maintenance operators
Demo!
Technical Roadmap (1/2)
New advanced search
Solr 6
Graphical SolrCloud management
Always more documentation
Annotator
Technical roadmap (2/2)
New languages
Consolidation
Unit test framework
More dashboards in ELK
Learning-to-Rank
Where can I find Datafari
Main hub: http://www.datafari.com/en
Source code available on GitHub:
• https://code.google.com/p/datafari/
Install packages for Debian 7 and Windows available on:
• www.datafari.com
Forum:
• https://groups.google.com/forum/#!forum/datafari
Documentation on Confluence
• Technical and functional
Tickets and releases on Jira
Want to follow Datafari?
@francelabs
#datafari
francelabs
francelabs
Become a Datafarian! ☺
We are always open to suggestions
• “Reorganise your docs…”
Contribution
• What about a German version?!
• UI widgets ?
Most important: your use cases and usage feedback
CONTACT
Don’t hesitate to reach out to us for any info
Our corporate website: www.francelabs.com
Email: contact@francelabs.com
Tel: 09 72 43 72 85
Fax: 09 72 29 28 14

 
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
 
AI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance CenterAI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance Center
 
AI-SDV 2022: Lighthouse IP
AI-SDV 2022: Lighthouse IPAI-SDV 2022: Lighthouse IP
AI-SDV 2022: Lighthouse IP
 
AI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOCAI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOC
 
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
 
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
 

II-SDV 2017: Datafari - Building an Open Source Enterprise Search Solution from Popular Building Blocks

  • 1. Datafari - Building an Open Source Enterprise Search Solution from Popular Building Blocks CEDRIC ULMER FRANCE LABS II-SDV 25/04/17
  • 2. Datafari So what is Datafari? • « Packaged solution » to analyse and search for documents and data • Can index heterogeneous data formats from multiple sources • Federated search interface • Apache v2 licence
  • 3.
  • 4. Why Datafari? Choice of the Apache Solr and Elasticsearch technologies (more about this later...) Three ways to meet a customer's requirements: • Use a packaged solution available on the market from a 3rd party • Start from Apache Solr or Elasticsearch (or others) • Develop and gather the necessary components for each customer's needs • Ensure « production-ready » material: docs, processes, tests • Create our own packaged solution (yeah!)
  • 5. Why Datafari? Problems with 3rd party proprietary solutions: • Black box • Roadmap not clear • Resilience (bankruptcy, acquisition…) Problems with 3rd party open source solutions: • Lack of technical documentation • Difficulty setting up an understandable debug environment • Delays in updating the embedded components, in particular Solr or ES • License issues (mostly viral ones) • Lack of resilience from the makers => Required us to develop our own solution to better address our customers' needs
  • 6. Why Datafari? Idea: • Gather the best of both worlds: • The “packaged” aspect of existing solutions • Many functionalities • All in one • The flexibility of a solution based on Solr and ES • All of that with an Apache v2 licence ☺ • Focus on Enterprise Search: • Admin for search experts • Admin for search admin • Eased AD/LDAP management • Search and data analytics
  • 7. Based on 4 building blocks: • Apache Solr • The heart of the search engine • Apache ManifoldCF • Crawling documents • AjaxFranceLabs • UI • Elasticsearch • Data analytics
  • 8. Datafari 3.1 Apache Tomcat 7 Data Sources Datafari Search / Admin Apache ManifoldCF CMS DB Fileshares Web Security (AD, LDAP) PostgreSQL Apache Solr 5.5 Document Index Statistics Index Apache ManifoldCF 2.5 Crawler Service Authorization Service ELK Cassandra (User Management)
  • 9. Apache Solr Lucene based Full text search engine Apache Top Level project Large community (users/devs) Efficient/Reliable Scalable • High availability • Queries • Index volume
  • 10. Apache Solr Webapp Java REST APIs XML/HTTP • Indexing • Querying Caching Web admin interface Configuration through XML config files or APIs
  • 12. Apache Solr for Datafari Search core of Datafari Preconfigured index for rich documents • Language detection • Standard facets • Autocomplete • Spellchecker Indexing user queries • Enables analytics on search users' behavior
  • 13. Datafari 3.1 Apache Tomcat 7 Data Sources Datafari Search / Admin Apache ManifoldCF CMS DB Fileshares Web Security (AD, LDAP) PostgreSQL Apache Solr 5.5 Document Index Statistics Index Apache ManifoldCF 2.5 Crawler Service Authorization Service ELK Cassandra (User Management)
  • 14. Apache Manifold CF Framework for data crawling Management of incremental crawling Authorization management Programmable crawls (time windows, loads, regex…)
  • 15. Apache Manifold CF Many off the shelf connectors: • FileShare (Samba) • JDBC • Website • Alfresco • CMIS • Sharepoint • Mail • Dropbox • LDAP/AD
  • 16. Apache Manifold CF for Datafari Manages data crawling Manages authentication Preconfigured integration with our Solr
  • 17. Datafari 3.1 Apache Tomcat 7 Data Sources Datafari Search / Admin Apache ManifoldCF CMS DB Fileshares Web Security (AD, LDAP) PostgreSQL Apache Solr 5.5 Document Index Statistics Index Apache ManifoldCF 2.5 Crawler Service Authorization Service ELK Cassandra (User Management)
  • 18. Datafari Search Front-End User UI • AjaxFranceLabs Authentication Interactions with Solr (SolrJ) Indexing user queries Admin UI • Solr • ManifoldCF • Statistics
  • 19. AjaxFranceLabs Inspired by AjaxSolr JavaScript/Ajax client Provides several components: • Manager: backend connection • Widgets • Graphical/Logical components • (Advanced) Search • Facet • Geolocation (Based on OpenStreetMap)
  • 21. Use case 1 – Oil and Gas Sources: • Sharepoint • Documentum • Fileshare • DB Volume: 28 TB Users: Geoscientists
  • 22. Use case 2 – Nuclear Sources: • Fileshare • Oracle • DB Volume: 15 M docs Users: Maintenance operators
  • 24. Technical Roadmap (1/2) New advanced search Solr 6 Graphical SolrCloud management Always more documentation Annotator
  • 25. Technical roadmap (2/2) New languages Consolidation Unit test framework More dashboards in ELK Learning-to-Rank
  • 26. Where can I find Datafari? Main hub: http://www.datafari.com/en Source code available on GitHub: • https://code.google.com/p/datafari/ Install packages for Debian 7 and Windows available on: • www.datafari.com Forum: • https://groups.google.com/forum/#!forum/datafari Documentation on Confluence • Technical and functional Tickets and releases on Jira
  • 27. Want to follow Datafari ? @francelabs #datafari francelabs francelabs
  • 28. Become a Datafarian ! ☺ We are always open to suggestions • “Reorganise your docs…” Contribution • What about a German version ?! • UI widgets ? Most important: your use cases and usage feedback
  • 29. CONTACT Don’t hesitate to reach out to us for any info Our corporate website: www.francelabs.com Email: contact@francelabs.com Tel: 09 72 43 72 85 Fax: 09 72 29 28 14
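
Slide 10 mentions that Solr is queried over HTTP, and slide 12 lists standard facets as part of the preconfigured index. As a rough illustration only, the sketch below builds a faceted query URL for Solr's standard `select` handler. The host, port, core name (`documents`) and facet field names (`language`, `extension`) are assumptions made up for the example and are not taken from the Datafari configuration; only the `q`, `rows`, `wt`, `facet` and `facet.field` parameters are standard Solr query parameters.

```python
from urllib.parse import urlencode, urlsplit, parse_qs


def build_select_url(base_url, core, text, facet_fields=(), rows=10):
    """Build a URL for Solr's /select handler with optional field facets.

    base_url, core and the facet field names are hypothetical values
    supplied by the caller; nothing here is specific to Datafari.
    """
    params = [("q", text), ("rows", str(rows)), ("wt", "json")]
    if facet_fields:
        # Faceting is switched on once, then one facet.field per field.
        params.append(("facet", "true"))
        params.extend(("facet.field", field) for field in facet_fields)
    return f"{base_url.rstrip('/')}/{core}/select?{urlencode(params)}"


# Hypothetical local Solr instance and core name.
url = build_select_url(
    "http://localhost:8983/solr",
    "documents",
    "drilling report",
    facet_fields=["language", "extension"],
)
print(url)

# Decode the query string again to inspect the parameters that would be sent.
print(parse_qs(urlsplit(url).query))
```

The function only constructs the URL, so it can be tried without a running Solr server; sending the request and reading the JSON response would be a separate step.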