Infochimps Hadoop Summit 2013

•Als PPTX, PDF herunterladen•

0 gefällt mir•1,561 views

Jim Kaskade

Technologie Business

0101010101010101010101010101010
010101010101010101010101010101
01010101010101010101010101010
0101010101010101010101010101
01010101010101010101010101
0101010101010101010101010
010101010101010101010101
01010101010101010101010
0101010101010101010101
01010101010101010101
010101010101010101
01010101010101010
0101010101010101
010101010101010
01010101010101
0101010101010
010101010101
01010101010
1010101010
010101010
10101010
0101010
101010
0101
101
Enterprise
Big Data
Turning Data
Into
Revenue

8/17/2013 2
Which Do You Prefer?
24 Months 30 Days
Over Budget 10% of Budget
Failed Big
Data Project
Creating
Huge Value

Real-Time
Ad-hoc
Batch
Applications
Cloud Infrastructure
Analytics
Public Virtual
Private
Private

Batch Analytics
Ad hoc Analytics
Real-time Analytics

Infochimps Big Data Platform
HBase
Elastic-
search
Hadoop
Command
Center
Platform
API
Zabbix
Zookeepers Chef MySQLNFS
Backup
Scheduler
Listener Queue
Storm
HTTP(S)
Syslog
Archive
Storage
You only worry about a tiny
part of the overall platform.

8/17/2013 7
Hybrid Big Data Cloud
Public Virtual Private Private

Variety, Velocity, & Volume
LOGTXT
CSV
XML
HTTP
JSON
Input Data
Cloud::Streams
Your Application
Command Center
A complete managed service for
custom analytics in the
public, private, or hybrid cloud.
Cloud::Queries
Cloud::Hadoop

Cloud::Streams
LOGTXT
CSV
XML
HTTP
JSON
Universal
Listeners
Data
Queueing
JSON
Archiving
Downstream
Data Loading
Cloud::Hadoop
Tuples
Direct
Data Loading
Cloud::Queries
Tuples
Streaming Analytics
happen in real time
Applications
Your Application

HBaseor
Elasticsearch
Cloud::Queries
Cloud::Streams
Tuple
Cloud::Hadoop Archiving
Ad Hoc and Interactive
Analytics on aggregates.
Your Application

Cloud::Hadoop
Archiving
HDFS
HDFS
HDFS
Data ScienceCluster
File
File
Cloud::QueriesCloud::Streams
Tuple
Run batch analytics against
all of your historical data.
Applications
Your Application

Infochimps Cloud Pillars
Fast
• Completely Integrated &
Unified Architecture
• Deployed in hours
• Expanded in minutes
8/17/2013 Infochimps Confidential 13
Simple
• We focus on
Infrastructure Managed
Services
• Customers focus on data
& applications
Flexible
• Cloud Agnostic
• Modular
• Portable
• Open Standards Based
Scalable
• Elastic Cloud
Infrastructure
• Linearly Scalable Across
All Big Data Functions
• Enterprise Class

Empfohlen

External data perspectives: Key ExhibitsGalytix Limited

Clueda - Baader Investment Conference 2014Clueda AG

Ocient PresoIanBertram5

Woodside Glens Neighborhood Plan - Amended 1999Jim Kaskade

Big analytics best practices @ PARCJim Kaskade

Infochimps Cloudcon 2012Jim Kaskade

Infochimps TieCon 2013Jim Kaskade

Jim Kaskade BiographyJim Kaskade

Empfohlen

External data perspectives: Key ExhibitsGalytix Limited

Clueda - Baader Investment Conference 2014Clueda AG

Ocient PresoIanBertram5

Woodside Glens Neighborhood Plan - Amended 1999Jim Kaskade

Big analytics best practices @ PARCJim Kaskade

Infochimps Cloudcon 2012Jim Kaskade

Infochimps TieCon 2013Jim Kaskade

Jim Kaskade BiographyJim Kaskade

Enterprise IT Uncertainty Around Big Data Initiatives in 2015SnapLogic

Charles Verdon - Samedi SQL - Futur de l'intelligence d'affaire MSDEVMTL

Azure-cloud-presentation-Security-Privacy-EDM.pptxcernatdragos1

Webinar: The Death of Traditional Data IntegrationSnapLogic

Riding the wave of change in manufacturingAndreas Schwarzenbrunner

Money All Time Aditi Shrivastava

1825_6_JochenFrancois_TacklingTheVelocityOfBigDataJochen François

EN_ADP_VUCA_Report_Animated_V1Lee Saunders

How big data is helping business?Rachale Adam

Wall Street Tech Conference_2015_Pooneh Mohazzabipooneh mohazzabi

The Risks and Rewards of AISplunk

Big Data.compressedOctavian Donnelly

Automated FP EvolutionDoug Rudolph

Getting started with the Internet of ThingsW. David Stephenson

Yu info 2015 final jgJozek Gruskovnjak

Data & Analytic Innovations: 5 lessons from our customersNick Smith

2018 McRock Capital IIoT Symposium: The Road to IIoT and the Algorithm Econom...MTechHub

Corporate flyertechnolabs

From Big Data to Big ValueDatacratic

CGIAR Platform for Big Data in AgricultureCIAT

Jim kaskade biography (updated)Jim Kaskade

Woodside Residential Design GuidelinesJim Kaskade

Weitere ähnliche Inhalte

Ähnlich wie Infochimps Hadoop Summit 2013

Enterprise IT Uncertainty Around Big Data Initiatives in 2015SnapLogic

Charles Verdon - Samedi SQL - Futur de l'intelligence d'affaire MSDEVMTL

Azure-cloud-presentation-Security-Privacy-EDM.pptxcernatdragos1

Webinar: The Death of Traditional Data IntegrationSnapLogic

Riding the wave of change in manufacturingAndreas Schwarzenbrunner

Money All Time Aditi Shrivastava

1825_6_JochenFrancois_TacklingTheVelocityOfBigDataJochen François

EN_ADP_VUCA_Report_Animated_V1Lee Saunders

How big data is helping business?Rachale Adam

Wall Street Tech Conference_2015_Pooneh Mohazzabipooneh mohazzabi

The Risks and Rewards of AISplunk

Big Data.compressedOctavian Donnelly

Automated FP EvolutionDoug Rudolph

Getting started with the Internet of ThingsW. David Stephenson

Yu info 2015 final jgJozek Gruskovnjak

Data & Analytic Innovations: 5 lessons from our customersNick Smith

2018 McRock Capital IIoT Symposium: The Road to IIoT and the Algorithm Econom...MTechHub

Corporate flyertechnolabs

From Big Data to Big ValueDatacratic

CGIAR Platform for Big Data in AgricultureCIAT

Ähnlich wie Infochimps Hadoop Summit 2013 (20)

Enterprise IT Uncertainty Around Big Data Initiatives in 2015

Charles Verdon - Samedi SQL - Futur de l'intelligence d'affaire

Azure-cloud-presentation-Security-Privacy-EDM.pptx

Webinar: The Death of Traditional Data Integration

Riding the wave of change in manufacturing

Money All Time

1825_6_JochenFrancois_TacklingTheVelocityOfBigData

EN_ADP_VUCA_Report_Animated_V1

How big data is helping business?

Wall Street Tech Conference_2015_Pooneh Mohazzabi

The Risks and Rewards of AI

Big Data.compressed

Automated FP Evolution

Getting started with the Internet of Things

Yu info 2015 final jg

Data & Analytic Innovations: 5 lessons from our customers

2018 McRock Capital IIoT Symposium: The Road to IIoT and the Algorithm Econom...

Corporate flyer

From Big Data to Big Value

CGIAR Platform for Big Data in Agriculture

Mehr von Jim Kaskade

Jim kaskade biography (updated)Jim Kaskade

Woodside Residential Design GuidelinesJim Kaskade

Vmware Serengeti - Based on Infochimps IronfanJim Kaskade

Infochimps CxO Seminar @ PARCJim Kaskade

Big Data & Cloud - Infinite Monkey TheoremJim Kaskade

Marketing & SalesJim Kaskade

Outsourcing ClassJim Kaskade

Online Video and Next-gen StorageJim Kaskade

Rapid Social Game Development & DeploymentJim Kaskade

Application Model for Cloud DeploymentJim Kaskade

Next-Gen Security (using Cloud)Jim Kaskade

CISCO Visual Networking Index Forecast and Methodology, 2009-14Jim Kaskade

$CISCO\'s Take On Internet Video$ $CISCO\'s Take On Internet Video$

CISCO\'s Take On Internet VideoJim Kaskade

Private Cloud Platform as a ServiceJim Kaskade

Advertising Exchange WhitepaperJim Kaskade

Broadband Video Ad ExchangeJim Kaskade

Mobile VideoJim Kaskade

Broadband Video ReviewJim Kaskade

Video SaaS OverviewJim Kaskade

Mehr von Jim Kaskade (19)

Jim kaskade biography (updated)

Woodside Residential Design Guidelines

Vmware Serengeti - Based on Infochimps Ironfan

Infochimps CxO Seminar @ PARC

Big Data & Cloud - Infinite Monkey Theorem

Marketing & Sales

Outsourcing Class

Online Video and Next-gen Storage

Rapid Social Game Development & Deployment

Application Model for Cloud Deployment

Next-Gen Security (using Cloud)

CISCO Visual Networking Index Forecast and Methodology, 2009-14

$CISCO\'s Take On Internet Video$ $CISCO\'s Take On Internet Video$

CISCO\'s Take On Internet Video

Private Cloud Platform as a Service

Advertising Exchange Whitepaper

Broadband Video Ad Exchange

Mobile Video

Broadband Video Review

Video SaaS Overview

Kürzlich hochgeladen

TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc

Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez

Exploring Multimodal Embeddings with MilvusZilliz

FWD Group - Insurer Innovation Award 2024The Digital Insurer

CNIC Information System with Pakdata Cf In Pakistandanishmna97

presentation ICT roal in 21st century educationjfdjdjcjdnsjd

EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot

Why Teams call analytics are critical to your entire businesspanagenda

MINDCTI Revenue Release Quarter One 2024MIND CTI

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software

Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays

Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1

[BuildWithAI] Introduction to Gemini.pdfSandro Moreira

Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays

DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity

Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10

Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh

MS Copilot expands with MS Graph connectorsNanddeep Nachan

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays

Kürzlich hochgeladen (20)

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery

Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood

Exploring Multimodal Embeddings with Milvus

FWD Group - Insurer Innovation Award 2024

CNIC Information System with Pakdata Cf In Pakistan

presentation ICT roal in 21st century education

EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER

Why Teams call analytics are critical to your entire business

MINDCTI Revenue Release Quarter One 2024

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME

Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...

Boost Fertility New Invention Ups Success Rates.pdf

[BuildWithAI] Introduction to Gemini.pdf

Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...

DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam

Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...

Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model

MS Copilot expands with MS Graph connectors

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe

Infochimps Hadoop Summit 2013

1. 0101010101010101010101010101010 010101010101010101010101010101 01010101010101010101010101010 0101010101010101010101010101 01010101010101010101010101 0101010101010101010101010 010101010101010101010101 01010101010101010101010 0101010101010101010101 01010101010101010101 010101010101010101 01010101010101010 0101010101010101 010101010101010 01010101010101 0101010101010 010101010101 01010101010 1010101010 010101010 10101010 0101010 101010 0101 101 Enterprise Big Data Turning Data Into Revenue

2. 8/17/2013 2 Which Do You Prefer? 24 Months 30 Days Over Budget 10% of Budget Failed Big Data Project Creating Huge Value

3. Real-Time Ad-hoc Batch Applications Cloud Infrastructure Analytics Public Virtual Private Private

4. Batch Analytics Ad hoc Analytics Real-time Analytics

5. Infochimps Big Data Platform HBase Elastic- search Hadoop Command Center Platform API Zabbix Zookeepers Chef MySQLNFS Backup Scheduler Listener Queue Storm HTTP(S) Syslog Archive Storage You only worry about a tiny part of the overall platform.

6. 8/17/2013 Infochimps Confidential 6

7. 8/17/2013 7 Hybrid Big Data Cloud Public Virtual Private Private

8. #1 Big Data Cloud

9. Variety, Velocity, & Volume LOGTXT CSV XML HTTP JSON Input Data Cloud::Streams Your Application Command Center A complete managed service for custom analytics in the public, private, or hybrid cloud. Cloud::Queries Cloud::Hadoop

10. Cloud::Streams LOGTXT CSV XML HTTP JSON Universal Listeners Data Queueing JSON Archiving Downstream Data Loading Cloud::Hadoop Tuples Direct Data Loading Cloud::Queries Tuples Streaming Analytics happen in real time Applications Your Application

11. HBaseor Elasticsearch Cloud::Queries Cloud::Streams Tuple Cloud::Hadoop Archiving Ad Hoc and Interactive Analytics on aggregates. Your Application

12. Cloud::Hadoop Archiving HDFS HDFS HDFS Data ScienceCluster File File Cloud::QueriesCloud::Streams Tuple Run batch analytics against all of your historical data. Applications Your Application

13. Infochimps Cloud Pillars Fast • Completely Integrated & Unified Architecture • Deployed in hours • Expanded in minutes 8/17/2013 Infochimps Confidential 13 Simple • We focus on Infrastructure Managed Services • Customers focus on data & applications Flexible • Cloud Agnostic • Modular • Portable • Open Standards Based Scalable • Elastic Cloud Infrastructure • Linearly Scalable Across All Big Data Functions • Enterprise Class

Hinweis der Redaktion

The only part you have to worry about is in the yellow circle. This is the same deploy pack that runs on your local machine for development.
Key MessagesWe help you leverage the people and resources you already have.Infochimps Cloud eliminates all the implementation headaches caused by Big Data enabling your Big Data applications to be completed quickly and fully achieve their objectives.Working with Big Data shouldn’t require you to hire rocket scientists or send your team to 12 weeks of Hadoop boot camp. Infochimps and our partners empower your existing teams to implement any data-driven application your Big Data vision requires. How it WorksYour largest, fastest data sources are streamed in to the Infochimps cloud, where real-time transformation, aggregation, decoration, and matching can be done.Data is then saved to a database for querying (typically Elasticsearch, Hbase, or MySQL).Simultaneously, data is saved to Hadoop for things like historical processing.Technical PointsInfochimps is a managed cloud service provider, and handles everything except your application. Also, all your application has to worry about is pointing to the database for querying data.The ETA for records is far less than 5 seconds, and we have customers who have SLA’s of under 1 second.
Key MessagesFrom: http://www.infochimps.com/infochimps-cloud/cloud-services/cloud-streams/Streaming data and real-time analytics -- Easily handle millions of events per second with in-stream ETL and analyticsIt’s not enough anymore to simply perform historical analysis and batch reports. In situations where you need to make well-informed decisions in real-time, the data and insights must also be timely and immediately actionable. Cloud::Streams lets you process data as it flows into your application, powering real-time dashboards and on-the-fly analytics and delivering data seamlessly to Hadoop clusters and NoSQL databases.Single-purpose ETL solutions are rapidly being replaced with multi-node, multi-purpose data integration platforms — the universal glue that connects systems together and makes Big Data analytics feasible. Cloud::Streams is a linearly scalable, fault-tolerant distributed routing framework for data integration, collection, and streaming data processing. Ready-to-go integration connectors allow you to tap into virtually any internal or external data source that your application needs.BenefitsEasily integrate with virtually any data source, both live/in-motion as well as bulk/at restProcess data as it flows, at scale – not only generating real-time insights, but also delivering data to databases and Hadoop clusters that has already been cleaned, transformed, and augmented/enhancedSolve any business use case with the ability to handle any complexity business logic and parallel stream computingWrite your analytics once when leveraging Wukong – then run in both real-time with Cloud::Streams and in batch with Cloud::Hadoop
Key MessagesAd hoc and interactive analytics -- power your Big Data applications with data you can queryCloud::Queries, a cloud service delivered by Infochimps Cloud, enables advanced distributed text search, any-format document storage and database tables with more than 1B rows — structured and un-structured. Databases and data storage are provided as a cloud service, including worry-free database maintenance, updates and support. Depending on your application requirements, multiple storage technologies may be appropriate including NoSQL and New SQL databases such as HBase, Cassandra, Elasticsearch, MongoDB or even MySQL. Whatever your needs, with Cloud::Queries you’ll have the most powerful cloud database for the job, scaling to the needs of your business and providing APIs that will support your most demanding ad hoc and interactive queries and applications.BenefitsEliminate frustrations of large-scale database administration and data managementTight integration with Big Data processing workflows and delivery paths results for a truly comprehensive Big Data stackLinearly scalable, distributed systems support of the most demanding applications and analytics queries
From http://www.infochimps.com/infochimps-cloud/cloud-services/cloud-hadoop/Key MessagesElastic Hadoop and large-scale batch analytics -- The easiest way to configure and manage Hadoop clusters in the cloudYour team recognizes the power that massively parallel data analysis can provide, and Hadoop is the standard to handle massively scalable data. Cloud::Hadoop, a cloud service delivered by Infochimps™ Cloud, is the ideal Hadoop solution. Turn clusters on at a moment’s notice with advanced elastic spin-up/spin-down capabilities, scale and customize on the fly and leverage tools such as Pig, Hive and Wukong that make Hadoop easier to use and much more useful for enterprises.BenefitsFocus on building applications and answering business questions, not on keeping an extremely complex Hadoop cluster happy and performantScale up to meet any data processing demand through superior elasticityBe more efficient with resources, while still having quick access to HDFS data, with instantly elastic and high performing clustersWrite your analytics once when leveraging Wukong, then run both in batch with Cloud::Hadoop and in real-time streaming with Cloud::Streams