4. BigQuery: 100% serverless data warehouse
Google
BigQuery
Fully Managed and Serverless
Google Cloud’s Enterprise Data
Warehouse for Analytics
Petabyte-Scale and Fast
Convenience of SQL
Encrypted, Durable and Highly
Available
5. BigQuery is a great choice because:
Near-real
time analysis
of massive
datasets
No-ops;
Pay for use
Durable
(replicated),
inexpensive
storage
Immutable
audit logs
Mashing up
different
datasets to
derive insights
6. 10 B rows
Sample query - Processes over 10
billion rows in less than 10
seconds
SELECT
language, SUM(views) as views
FROM
wikipedia_benchmark.Wiki10B
WHERE
regexp_match(title,"G.*o.*o.*g")
GROUP by language
ORDER by views DESC
7. BigQuery = Massively Parallel Processing query
with the petabit network and thousands of servers
SQL QueryPetabit Network
BigQuery
Storage Compute
Streaming Ingest
Fast Batch Load
DataFlow
DataProc
8. Load data using bq tool, web UI, or API
Create, append or
overwrite table
CSV, JSON or
AVRO format
10. For business analysts
Beautiful reports
Drive-based collaboration experience
No technical expertise required
Connects to many sources: BigQuery, Adwords, Google
Analytics, Google Sheets, YouTube Analytics, etc.
11. Integrating with Google Data Studio
1 Navigate to DataStudio
to create a new
dashboard
2 Create a new Data
Source
3 Select the type of Data
Source to use
4 Authorize
15. Why Machine Learning?
★ Allows to solve problems we don’t have exact solution
for.
○ E.g. recommendations, predictions, clustering.
★ Given y = F(X), where we observe y, we can estimate F.
★ Becomes better with more data
○ when hard coded solution usually becomes worse with more code :)
46. Google Home — voice-activated speaker powered
The Google Assistant — A conversation between you and
Google that helps you get more done in your world.
Actions on Google — How developers can extend the
assistant (via Conversation Actions)
51. “Ok Google, talk to personal chef”
Conversation API, Actions SDK
Invoke “personal chef” action
“Sure, here’s personal chef.
Hi, I’m your personal chef, what
are you in the mood for?”
Speech to Text
“What protein would you
like to use?”
“Well, it’s kind of cold outside, so I’d like
something to …”
Text to Speech
“Sure, here’s your personal chef”
Speech to Text, NLP,
Knowledge Graph,
ML Ranking, User
Profile, Text to
Speech
53. Confidential & ProprietaryGoogle Cloud Platform 53
So…. Why APIs?
{ Google Cloud Platform }
1. We want to offer businesses the tools to differentiate by offering a powerful set of APIs
that enable apps to see, hear and understand the world
2. Reduce your Time to Market (TMM) when launching your next-generation app
3. Provide you easy access to machine learning technology to give any developer the
freedom to work in the language and tools they want
4. Provide virtually limitless scalability to your application without needing to manage
back-end servers running deep learning
54. Pre-Trained Machine Learning Models
Fully trained ML models from Google Cloud that allow a general developer to
take advantage of rich machine learning capabilities with simple REST based
services.
56. Confidential & ProprietaryGoogle Cloud Platform 56
Features
Extract sentence, identify parts of
speech and create dependency parse
trees for each sentence
Identify entities and label by types such
as person, organization, location, events,
products and media
Understand the overall sentiment of a
block of text
Access via REST API. Text can be
uploaded in the request or integrated
with Google Cloud Storage
Syntax Analysis Entity Recognition
Sentiment Analysis Integrated REST API
58. Confidential & ProprietaryGoogle Cloud Platform 58
Faces: Faces, facial landmarks,
emotions
OCR: Read and extract text, with
support for > 10 languages
Photo credit Getty Images
Label: Detect entities from furniture to
transportation
Logos: Identify product logos
Landmarks & Image Properties
Detect landmarks & dominant
color of image
Safe Search: Detect explicit content -
adult, violent, medical and spoof
Cloud Vision API
Call API from anywhere, with support for embeddable images, and Google Cloud Storage
69. Main shellplus_contacts = get_plus_contacts()
print "Processing %d contacts" % len(plus_contacts)
for plus_id in plus_contacts:
plus_profile = get_plus_profile(plus_id)
image_uri = plus_profile['image']['url'].replace("?sz=50","?sz=250")
image_data = analyze_img(image_uri)
if image_data is not None:
print(image_uri)
if 'labelAnnotations' in image_data['responses'][0]:
for label in image_data['responses'][0]['labelAnnotations']:
print label['description']; label['score']; image_uri
70. get_plus_contacts: oAuth
storage = Storage('/home/almo/dev/keys/ex1/oAuth_credentials.dat')
credentials = storage.get()
if credentials is None or credentials.invalid:
PEOPLE_API='https://www.googleapis.com/auth/contacts.readonly'
flow = flow_from_clientsecrets('/home/almo/dev/keys/ex1/oAuth_key.json',
scope=[PEOPLE_API])
credentials = run_flow(flow, storage)
http = credentials.authorize(httplib2.Http())
service = build('people','v1',http=http)
request = service.people().connections().list(resourceName='people/me',
pageSize=500)