1. Distributed Data Processing Workshop
Shahid Beheshti Campus
Faculty of Computer Science and Engineering
Course: Distributed Databases
Instructor: Dr. Hadi Tabatabaee
Presented by: Abolfazl Sedighi
Azar 1393 (November/December 2014)
4. Topics
● Assumptions
● First Node
– Java Installation
– Downloading and Extracting Elasticsearch
– Configuration
● Cloning
● Starting ES Cluster
● ES REST API
● ES General Concepts
– Index, Shard, Segment
– Plugins
● River
● CSV
● JDBC
● Feeder
● ES Commands
● ES GUIs
– Cluster Monitoring
– Analytical Search and BI
5. Assumptions
● You already know about Linux.
– http://www.slideshare.net/AmirSedighi/distrinuted-data-processing-workshop-sbu
7. Downloading and Extracting
● https://www.elastic.co/downloads/elasticsearch
● $ tar -zxvf elasticsearch-1.3.4.tar.gz
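The extraction step can be wrapped in a small check so that a wrong or incomplete archive is noticed early. This is only a sketch: `extract_es` is a hypothetical helper name, and it assumes the archive follows the `elasticsearch-<version>.tar.gz` naming and unpacks into a directory of the same name.

```shell
#!/bin/sh
# Hypothetical helper (not part of Elasticsearch): extract the tarball
# and confirm that the launcher script is where we expect it.
extract_es() {
  archive=$1            # e.g. elasticsearch-1.3.4.tar.gz
  dest=${2:-.}          # target directory, defaults to the current one
  tar -zxf "$archive" -C "$dest" || return 1
  # Assumption: the top-level directory is the archive name minus .tar.gz
  es_home=$dest/$(basename "$archive" .tar.gz)
  if [ -x "$es_home/bin/elasticsearch" ]; then
    echo "$es_home"     # print the installation directory on success
  else
    echo "bin/elasticsearch not found under $es_home" >&2
    return 1
  fi
}
```

Usage might look like `ES_HOME=$(extract_es elasticsearch-1.3.4.tar.gz "$HOME")`, after which `$ES_HOME/bin/elasticsearch` is the launcher used on the later slides.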
8. Elasticsearch Configuration
● Edit elasticsearch.yml and append the following
minimal configuration:
cluster.name: hexican
node.name: "node1"
node.master: true
node.data: false
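The same settings can be appended from the shell, which is handy when the step has to be repeated. A minimal sketch, assuming the file lives at `config/elasticsearch.yml` inside the extracted directory (the layout used by the 1.x tarball); `append_min_config` is a hypothetical helper name.

```shell
#!/bin/sh
# Hypothetical helper: append the minimal settings from this slide to
# the given elasticsearch.yml, unless a cluster.name is already set.
append_min_config() {
  conf=$1   # path to elasticsearch.yml
  if grep -q '^cluster\.name:' "$conf" 2>/dev/null; then
    echo "cluster.name already set in $conf, nothing appended" >&2
    return 0
  fi
  cat >> "$conf" <<'EOF'
cluster.name: hexican
node.name: "node1"
node.master: true
node.data: false
EOF
}
```

For example: `append_min_config elasticsearch-1.3.4/config/elasticsearch.yml`. With `node.data: false` this first node holds no shards itself, so the data presumably lives on the nodes added later.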
11. Cloning
● Clone the first machine and extend your cluster.
– Find the instructions here:
● http://www.slideshare.net/AmirSedighi/distrinuted-data-processing-workshop-sbu
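A cloned machine carries a copy of the first node's elasticsearch.yml, so each clone would presumably need at least its own node name. The sketch below is one way to change a single setting in place; `set_yaml_key` is a hypothetical helper, and it only handles flat `key: value` lines like the ones on the configuration slide.

```shell
#!/bin/sh
# Hypothetical helper: set "key: value" in a flat YAML file, replacing
# the existing line for that key or appending one if it is missing.
set_yaml_key() {
  file=$1 key=$2 value=$3
  tmp=$(mktemp) || return 1
  awk -v k="$key" -v v="$value" '
    index($0, k ":") == 1 { print k ": " v; found = 1; next }
    { print }
    END { if (!found) print k ": " v }
  ' "$file" > "$tmp" || { rm -f "$tmp"; return 1; }
  # Write back through the original file so its permissions are kept
  cat "$tmp" > "$file"
  rm -f "$tmp"
}
```

On the second machine this could be used as `set_yaml_key elasticsearch-1.3.4/config/elasticsearch.yml node.name '"node2"'`. Whether the clones should also switch to `node.data: true` depends on the intended cluster layout, which the slides do not spell out.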
15. Starting Elasticsearch Cluster
● You can run nodes one by one
– $ elasticsearch-1.3.4/bin/elasticsearch
● You can run all nodes at once using DSH
– $ dsh -M -a -- 'elasticsearch-1.3.4/bin/elasticsearch'
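Once the nodes are started, one way to see whether they found each other is the cluster health endpoint of the REST API. A minimal sketch, assuming the default HTTP port 9200 and that `curl` is installed; the helper names `health_url` and `cluster_health` are made up for this example.

```shell
#!/bin/sh
# Build the cluster-health URL for a node.
# Defaults: host "localhost", port 9200 (the Elasticsearch default).
health_url() {
  echo "http://${1:-localhost}:${2:-9200}/_cluster/health?pretty"
}

# Ask one node for the cluster health; any node that has joined the
# cluster should report the state of the whole cluster.
cluster_health() {
  curl -s --max-time 5 "$(health_url "$@")"
}
```

Running `cluster_health node1` should return a JSON document; the `status` and `number_of_nodes` fields show whether all started nodes have joined the `hexican` cluster.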