SlideShare ist ein Scribd-Unternehmen logo
1 von 6
Downloaden Sie, um offline zu lesen
BIG DATA
FIRST INTERNSHIP
Sep 2013 – Jan 2014
Organized by Contemi Vietnam
Outcomes
• Contemi
• Big Data presence
• Big Data know-how
• Big Data experience
• Interns
• Linux
• R / Python language
• Machine Learning practice
• Process
• Scrum
• Cross Industry Standard Process
for Data Mining (CRISM-DM)
• Kaggle profile
• Hadoop
Preparation
• Platform: Ubuntu 12.04 LTS
• Process:
• Scrum
• Cross Industry Standard Process for Data Mining (CRISP DM)
• Weekly blog
• http://contanalytics.wordpress.com/
Headstart for Dung
• 16/09 – 30/09
• Learn R / Python
• Try Digit Recognizer competition on Kaggle.com
• Join in Introduction to Recommender System and Web Intelligence and Big
Data on Coursera.com
3 month plan
• 1/10 – 31/10
• Go through all typical Machine Learning algorithms, implement, demo and present to Contemi
• 1/11 – 15/11
• Compete for AMS 2013-2014 Solar Energy Prediction Contest
• URL: http://www.kaggle.com/c/ams-2014-solar-energy-prediction-contest
• 16/11 – 22/11
• Compete for Accelerometer Biometric Competition
• URL: http://www.kaggle.com/c/accelerometer-biometric-competition
• 23/11 – 31/12 (end of internship)
• Deploy Hadoop
• Learn Java
• Run Word counting and Sorting experiments with large data (> 1GB)
• Compete for Facebook Recruting III – Keyword Extraction (personally)
• Re-optimize built model basing on Hadoop
Next plan for next internships
• App using Singapore open datasets
• Stock prediction app for Vietnam market
• Visualization
• GitHub
• R-Bloggers

Weitere ähnliche Inhalte

Was ist angesagt?

Free Your On-Premises Data
Free Your On-Premises DataFree Your On-Premises Data
Free Your On-Premises DataSam Basu
 
Sharing and Deploying Data Science with KNIME Server
Sharing and Deploying Data Science with KNIME ServerSharing and Deploying Data Science with KNIME Server
Sharing and Deploying Data Science with KNIME ServerKNIMESlides
 
Sap open connectors #sitcph
Sap open connectors #sitcphSap open connectors #sitcph
Sap open connectors #sitcphDaniel Graversen
 
AWS Finland Meetup 2019 April
AWS Finland Meetup 2019 AprilAWS Finland Meetup 2019 April
AWS Finland Meetup 2019 AprilRolf Koski
 
All About Angular and ArcGIS - Developers Forum - AC18
All About Angular and ArcGIS - Developers Forum - AC18All About Angular and ArcGIS - Developers Forum - AC18
All About Angular and ArcGIS - Developers Forum - AC18Esri UK
 
Cloud Developer Days - BigQuery
Cloud Developer Days - BigQueryCloud Developer Days - BigQuery
Cloud Developer Days - BigQueryWlodek Bielski
 
DevTest Labs en Azure (por Iván Cañizares)
DevTest Labs en Azure (por Iván Cañizares)DevTest Labs en Azure (por Iván Cañizares)
DevTest Labs en Azure (por Iván Cañizares)Jorge Millán Cabrera
 
Intro to Quantum GIS Desktop GIS
Intro to Quantum GIS Desktop GISIntro to Quantum GIS Desktop GIS
Intro to Quantum GIS Desktop GISJoanne Cook
 
An Introduction to Cloud Computing
An Introduction to Cloud ComputingAn Introduction to Cloud Computing
An Introduction to Cloud ComputingMike Frampton
 
Cloud computing
Cloud computingCloud computing
Cloud computingYash Patel
 
API Management: La Puerta de enlace (por Francisco Nieto)
API Management: La Puerta de enlace (por Francisco Nieto)API Management: La Puerta de enlace (por Francisco Nieto)
API Management: La Puerta de enlace (por Francisco Nieto)Jorge Millán Cabrera
 
Power Apps Deep Dive - Munchen 2019
Power Apps Deep Dive - Munchen 2019Power Apps Deep Dive - Munchen 2019
Power Apps Deep Dive - Munchen 2019Timo Pertilä
 
Open Source Story and what’s new in KNIME Software
Open Source Story and what’s new in KNIME SoftwareOpen Source Story and what’s new in KNIME Software
Open Source Story and what’s new in KNIME SoftwareKNIMESlides
 
Scaling and Fault-resistance strategies and geography
Scaling and Fault-resistance strategies and geographyScaling and Fault-resistance strategies and geography
Scaling and Fault-resistance strategies and geographyJason Smith
 
AWS Finland meetup 2017 August
AWS Finland meetup 2017 AugustAWS Finland meetup 2017 August
AWS Finland meetup 2017 AugustRolf Koski
 
dotCMS - Move Forward, Build Faster, Get Farther
dotCMS - Move Forward, Build Faster, Get FartherdotCMS - Move Forward, Build Faster, Get Farther
dotCMS - Move Forward, Build Faster, Get FartherJason Smith
 

Was ist angesagt? (20)

Free Your On-Premises Data
Free Your On-Premises DataFree Your On-Premises Data
Free Your On-Premises Data
 
Sharing and Deploying Data Science with KNIME Server
Sharing and Deploying Data Science with KNIME ServerSharing and Deploying Data Science with KNIME Server
Sharing and Deploying Data Science with KNIME Server
 
Sap open connectors #sitcph
Sap open connectors #sitcphSap open connectors #sitcph
Sap open connectors #sitcph
 
AWS Finland Meetup 2019 April
AWS Finland Meetup 2019 AprilAWS Finland Meetup 2019 April
AWS Finland Meetup 2019 April
 
All About Angular and ArcGIS - Developers Forum - AC18
All About Angular and ArcGIS - Developers Forum - AC18All About Angular and ArcGIS - Developers Forum - AC18
All About Angular and ArcGIS - Developers Forum - AC18
 
DevOpsDays Amsterdam 2016 workshop
DevOpsDays Amsterdam 2016 workshopDevOpsDays Amsterdam 2016 workshop
DevOpsDays Amsterdam 2016 workshop
 
Cloud Developer Days - BigQuery
Cloud Developer Days - BigQueryCloud Developer Days - BigQuery
Cloud Developer Days - BigQuery
 
DevTest Labs en Azure (por Iván Cañizares)
DevTest Labs en Azure (por Iván Cañizares)DevTest Labs en Azure (por Iván Cañizares)
DevTest Labs en Azure (por Iván Cañizares)
 
Intro to Quantum GIS Desktop GIS
Intro to Quantum GIS Desktop GISIntro to Quantum GIS Desktop GIS
Intro to Quantum GIS Desktop GIS
 
An Introduction to Cloud Computing
An Introduction to Cloud ComputingAn Introduction to Cloud Computing
An Introduction to Cloud Computing
 
Cloud computing
Cloud computingCloud computing
Cloud computing
 
API Management: La Puerta de enlace (por Francisco Nieto)
API Management: La Puerta de enlace (por Francisco Nieto)API Management: La Puerta de enlace (por Francisco Nieto)
API Management: La Puerta de enlace (por Francisco Nieto)
 
Power Apps Deep Dive - Munchen 2019
Power Apps Deep Dive - Munchen 2019Power Apps Deep Dive - Munchen 2019
Power Apps Deep Dive - Munchen 2019
 
Open Source Story and what’s new in KNIME Software
Open Source Story and what’s new in KNIME SoftwareOpen Source Story and what’s new in KNIME Software
Open Source Story and what’s new in KNIME Software
 
Scaling and Fault-resistance strategies and geography
Scaling and Fault-resistance strategies and geographyScaling and Fault-resistance strategies and geography
Scaling and Fault-resistance strategies and geography
 
Sitech
SitechSitech
Sitech
 
AWS Dev Day 2018
AWS Dev Day 2018AWS Dev Day 2018
AWS Dev Day 2018
 
AWS Finland meetup 2017 August
AWS Finland meetup 2017 AugustAWS Finland meetup 2017 August
AWS Finland meetup 2017 August
 
dotCMS - Move Forward, Build Faster, Get Farther
dotCMS - Move Forward, Build Faster, Get FartherdotCMS - Move Forward, Build Faster, Get Farther
dotCMS - Move Forward, Build Faster, Get Farther
 
Real time serverless data pipelines on AWS
Real time serverless data pipelines on AWSReal time serverless data pipelines on AWS
Real time serverless data pipelines on AWS
 

Ähnlich wie Big data internship plan at Contemi Vietnam

SIC Finale Status Report August 6.pptx
SIC Finale Status Report August 6.pptxSIC Finale Status Report August 6.pptx
SIC Finale Status Report August 6.pptxShaista Ansari
 
Day 00 - Introduction to machine learning with big data
Day 00 - Introduction to machine learning with big dataDay 00 - Introduction to machine learning with big data
Day 00 - Introduction to machine learning with big datassusere5ddd6
 
Choosing a Cloud Provider: Public-Private-Hybrid
Choosing a Cloud Provider: Public-Private-HybridChoosing a Cloud Provider: Public-Private-Hybrid
Choosing a Cloud Provider: Public-Private-HybridRobert Starmer
 
SAP Teched 2012 Session Tec3438 Automate IaaS SAP deployments
SAP Teched 2012 Session Tec3438 Automate IaaS SAP deploymentsSAP Teched 2012 Session Tec3438 Automate IaaS SAP deployments
SAP Teched 2012 Session Tec3438 Automate IaaS SAP deploymentsChris Kernaghan
 
Automating Infrastructure as a Service Deployments and monitoring – TEC213
Automating Infrastructure as a Service Deployments and monitoring – TEC213Automating Infrastructure as a Service Deployments and monitoring – TEC213
Automating Infrastructure as a Service Deployments and monitoring – TEC213Chris Kernaghan
 
Getting started with GCP ( Google Cloud Platform)
Getting started with GCP ( Google  Cloud Platform)Getting started with GCP ( Google  Cloud Platform)
Getting started with GCP ( Google Cloud Platform)bigdata trunk
 
Connecting DMPs & Repositories
Connecting DMPs & RepositoriesConnecting DMPs & Repositories
Connecting DMPs & RepositoriesSarah Jones
 
Train the Trainers: Cloud Computing & Big Data Workshop
Train the Trainers:  Cloud Computing & Big Data WorkshopTrain the Trainers:  Cloud Computing & Big Data Workshop
Train the Trainers: Cloud Computing & Big Data WorkshopIMC Institute
 
With Automated ML, is Everyone an ML Engineer?
With Automated ML, is Everyone an ML Engineer?With Automated ML, is Everyone an ML Engineer?
With Automated ML, is Everyone an ML Engineer?Dan Sullivan, Ph.D.
 
Digital Asset Management: Intro & Career Path for Librarians
Digital Asset Management: Intro & Career Path for LibrariansDigital Asset Management: Intro & Career Path for Librarians
Digital Asset Management: Intro & Career Path for LibrariansLaura Fu
 
Google Associate Cloud Engineer Certification Tips
Google Associate Cloud Engineer Certification TipsGoogle Associate Cloud Engineer Certification Tips
Google Associate Cloud Engineer Certification TipsDaniel Zivkovic
 
Challenges of Operationalising Data Science in Production
Challenges of Operationalising Data Science in ProductionChallenges of Operationalising Data Science in Production
Challenges of Operationalising Data Science in Productioniguazio
 
Google Cloud Machine Learning
 Google Cloud Machine Learning  Google Cloud Machine Learning
Google Cloud Machine Learning India Quotient
 
Serverless Big Data Architecture on Google Cloud Platform at Credit OK
Serverless Big Data Architecture on Google Cloud Platform at Credit OKServerless Big Data Architecture on Google Cloud Platform at Credit OK
Serverless Big Data Architecture on Google Cloud Platform at Credit OKKriangkrai Chaonithi
 
20140116 Tim Willoughby and James Fogarty FOSS in Local Government
20140116   Tim Willoughby and James Fogarty FOSS in Local Government20140116   Tim Willoughby and James Fogarty FOSS in Local Government
20140116 Tim Willoughby and James Fogarty FOSS in Local GovernmentTim Willoughby
 
Architecting govCMS: Australian Government as a Service -
Architecting govCMS: Australian Government as a Service - Architecting govCMS: Australian Government as a Service -
Architecting govCMS: Australian Government as a Service - David Peterson
 
22 May 2014 CDE competition: Information processing and sensemaking presentation
22 May 2014 CDE competition: Information processing and sensemaking presentation22 May 2014 CDE competition: Information processing and sensemaking presentation
22 May 2014 CDE competition: Information processing and sensemaking presentationDefence and Security Accelerator
 

Ähnlich wie Big data internship plan at Contemi Vietnam (20)

SIC Finale Status Report August 6.pptx
SIC Finale Status Report August 6.pptxSIC Finale Status Report August 6.pptx
SIC Finale Status Report August 6.pptx
 
Day 00 - Introduction to machine learning with big data
Day 00 - Introduction to machine learning with big dataDay 00 - Introduction to machine learning with big data
Day 00 - Introduction to machine learning with big data
 
Choosing a Cloud Provider: Public-Private-Hybrid
Choosing a Cloud Provider: Public-Private-HybridChoosing a Cloud Provider: Public-Private-Hybrid
Choosing a Cloud Provider: Public-Private-Hybrid
 
SAP Teched 2012 Session Tec3438 Automate IaaS SAP deployments
SAP Teched 2012 Session Tec3438 Automate IaaS SAP deploymentsSAP Teched 2012 Session Tec3438 Automate IaaS SAP deployments
SAP Teched 2012 Session Tec3438 Automate IaaS SAP deployments
 
Data Science in the Cloud
Data Science in the CloudData Science in the Cloud
Data Science in the Cloud
 
Automating Infrastructure as a Service Deployments and monitoring – TEC213
Automating Infrastructure as a Service Deployments and monitoring – TEC213Automating Infrastructure as a Service Deployments and monitoring – TEC213
Automating Infrastructure as a Service Deployments and monitoring – TEC213
 
Getting started with GCP ( Google Cloud Platform)
Getting started with GCP ( Google  Cloud Platform)Getting started with GCP ( Google  Cloud Platform)
Getting started with GCP ( Google Cloud Platform)
 
Connecting DMPs & Repositories
Connecting DMPs & RepositoriesConnecting DMPs & Repositories
Connecting DMPs & Repositories
 
Train the Trainers: Cloud Computing & Big Data Workshop
Train the Trainers:  Cloud Computing & Big Data WorkshopTrain the Trainers:  Cloud Computing & Big Data Workshop
Train the Trainers: Cloud Computing & Big Data Workshop
 
With Automated ML, is Everyone an ML Engineer?
With Automated ML, is Everyone an ML Engineer?With Automated ML, is Everyone an ML Engineer?
With Automated ML, is Everyone an ML Engineer?
 
Digital Asset Management: Intro & Career Path for Librarians
Digital Asset Management: Intro & Career Path for LibrariansDigital Asset Management: Intro & Career Path for Librarians
Digital Asset Management: Intro & Career Path for Librarians
 
Google Associate Cloud Engineer Certification Tips
Google Associate Cloud Engineer Certification TipsGoogle Associate Cloud Engineer Certification Tips
Google Associate Cloud Engineer Certification Tips
 
Challenges of Operationalising Data Science in Production
Challenges of Operationalising Data Science in ProductionChallenges of Operationalising Data Science in Production
Challenges of Operationalising Data Science in Production
 
QuSandbox+NVIDIA Rapids
QuSandbox+NVIDIA RapidsQuSandbox+NVIDIA Rapids
QuSandbox+NVIDIA Rapids
 
Google Cloud Machine Learning
 Google Cloud Machine Learning  Google Cloud Machine Learning
Google Cloud Machine Learning
 
Top 7 mistakes
Top 7 mistakesTop 7 mistakes
Top 7 mistakes
 
Serverless Big Data Architecture on Google Cloud Platform at Credit OK
Serverless Big Data Architecture on Google Cloud Platform at Credit OKServerless Big Data Architecture on Google Cloud Platform at Credit OK
Serverless Big Data Architecture on Google Cloud Platform at Credit OK
 
20140116 Tim Willoughby and James Fogarty FOSS in Local Government
20140116   Tim Willoughby and James Fogarty FOSS in Local Government20140116   Tim Willoughby and James Fogarty FOSS in Local Government
20140116 Tim Willoughby and James Fogarty FOSS in Local Government
 
Architecting govCMS: Australian Government as a Service -
Architecting govCMS: Australian Government as a Service - Architecting govCMS: Australian Government as a Service -
Architecting govCMS: Australian Government as a Service -
 
22 May 2014 CDE competition: Information processing and sensemaking presentation
22 May 2014 CDE competition: Information processing and sensemaking presentation22 May 2014 CDE competition: Information processing and sensemaking presentation
22 May 2014 CDE competition: Information processing and sensemaking presentation
 

Mehr von Quang Nguyen

Big Data Internship Program @Natural Science University
Big Data Internship Program @Natural Science UniversityBig Data Internship Program @Natural Science University
Big Data Internship Program @Natural Science UniversityQuang Nguyen
 
Hồ sơ mời tài trợ - ĐI MÔ FC
Hồ sơ mời tài trợ - ĐI MÔ FCHồ sơ mời tài trợ - ĐI MÔ FC
Hồ sơ mời tài trợ - ĐI MÔ FCQuang Nguyen
 
Ho so xin tai tro giai bong da dong hanh den giang duong 2013
Ho so xin tai tro giai bong da dong hanh den giang duong 2013Ho so xin tai tro giai bong da dong hanh den giang duong 2013
Ho so xin tai tro giai bong da dong hanh den giang duong 2013Quang Nguyen
 
Gioi thieu du an dong hanh den giang duong 2013 vn
Gioi thieu du an dong hanh den giang duong 2013 vnGioi thieu du an dong hanh den giang duong 2013 vn
Gioi thieu du an dong hanh den giang duong 2013 vnQuang Nguyen
 
[Giaibongda.com] fi 7 min slides
[Giaibongda.com] fi 7 min slides[Giaibongda.com] fi 7 min slides
[Giaibongda.com] fi 7 min slidesQuang Nguyen
 
[Echelon 2012 vietnam satellite] giaibongda.com pitch deck
[Echelon 2012 vietnam satellite] giaibongda.com pitch deck[Echelon 2012 vietnam satellite] giaibongda.com pitch deck
[Echelon 2012 vietnam satellite] giaibongda.com pitch deckQuang Nguyen
 
[Giaibongda.com] FI 3-min pitch
[Giaibongda.com] FI 3-min pitch[Giaibongda.com] FI 3-min pitch
[Giaibongda.com] FI 3-min pitchQuang Nguyen
 
giaibongda.com pitch deck
giaibongda.com pitch deckgiaibongda.com pitch deck
giaibongda.com pitch deckQuang Nguyen
 
Agile development @open consultant offline
Agile development @open consultant offlineAgile development @open consultant offline
Agile development @open consultant offlineQuang Nguyen
 
[4interns.vn] hành trang thực tập
[4interns.vn] hành trang thực tập[4interns.vn] hành trang thực tập
[4interns.vn] hành trang thực tậpQuang Nguyen
 
Lean startup overview @ipl offline
Lean startup overview @ipl offlineLean startup overview @ipl offline
Lean startup overview @ipl offlineQuang Nguyen
 
Keep the eyes beyond competition
Keep the eyes beyond competitionKeep the eyes beyond competition
Keep the eyes beyond competitionQuang Nguyen
 

Mehr von Quang Nguyen (15)

Intro to Hadoop
Intro to HadoopIntro to Hadoop
Intro to Hadoop
 
Big Data Internship Program @Natural Science University
Big Data Internship Program @Natural Science UniversityBig Data Internship Program @Natural Science University
Big Data Internship Program @Natural Science University
 
Intro to Big Data
Intro to Big DataIntro to Big Data
Intro to Big Data
 
Hồ sơ mời tài trợ - ĐI MÔ FC
Hồ sơ mời tài trợ - ĐI MÔ FCHồ sơ mời tài trợ - ĐI MÔ FC
Hồ sơ mời tài trợ - ĐI MÔ FC
 
Ho so xin tai tro giai bong da dong hanh den giang duong 2013
Ho so xin tai tro giai bong da dong hanh den giang duong 2013Ho so xin tai tro giai bong da dong hanh den giang duong 2013
Ho so xin tai tro giai bong da dong hanh den giang duong 2013
 
Gioi thieu du an dong hanh den giang duong 2013 vn
Gioi thieu du an dong hanh den giang duong 2013 vnGioi thieu du an dong hanh den giang duong 2013 vn
Gioi thieu du an dong hanh den giang duong 2013 vn
 
[Giaibongda.com] fi 7 min slides
[Giaibongda.com] fi 7 min slides[Giaibongda.com] fi 7 min slides
[Giaibongda.com] fi 7 min slides
 
[Echelon 2012 vietnam satellite] giaibongda.com pitch deck
[Echelon 2012 vietnam satellite] giaibongda.com pitch deck[Echelon 2012 vietnam satellite] giaibongda.com pitch deck
[Echelon 2012 vietnam satellite] giaibongda.com pitch deck
 
[Giaibongda.com] FI 3-min pitch
[Giaibongda.com] FI 3-min pitch[Giaibongda.com] FI 3-min pitch
[Giaibongda.com] FI 3-min pitch
 
giaibongda.com pitch deck
giaibongda.com pitch deckgiaibongda.com pitch deck
giaibongda.com pitch deck
 
Agile development @open consultant offline
Agile development @open consultant offlineAgile development @open consultant offline
Agile development @open consultant offline
 
[4interns.vn] hành trang thực tập
[4interns.vn] hành trang thực tập[4interns.vn] hành trang thực tập
[4interns.vn] hành trang thực tập
 
Lean startup overview @ipl offline
Lean startup overview @ipl offlineLean startup overview @ipl offline
Lean startup overview @ipl offline
 
Lean startup
Lean startupLean startup
Lean startup
 
Keep the eyes beyond competition
Keep the eyes beyond competitionKeep the eyes beyond competition
Keep the eyes beyond competition
 

Kürzlich hochgeladen

Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxRemote DBA Services
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Bhuvaneswari Subramani
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityWSO2
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 

Kürzlich hochgeladen (20)

Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 

Big data internship plan at Contemi Vietnam

  • 1. BIG DATA FIRST INTERNSHIP Sep 2013 – Jan 2014 Organized by Contemi Vietnam
  • 2. Outcomes • Contemi • Big Data presence • Big Data know-how • Big Data experience • Interns • Linux • R / Python language • Machine Learning practice • Process • Scrum • Cross Industry Standard Process for Data Mining (CRISM-DM) • Kaggle profile • Hadoop
  • 3. Preparation • Platform: Ubuntu 12.04 LTS • Process: • Scrum • Cross Industry Standard Process for Data Mining (CRISP DM) • Weekly blog • http://contanalytics.wordpress.com/
  • 4. Headstart for Dung • 16/09 – 30/09 • Learn R / Python • Try Digit Recognizer competition on Kaggle.com • Join in Introduction to Recommender System and Web Intelligence and Big Data on Coursera.com
  • 5. 3 month plan • 1/10 – 31/10 • Go through all typical Machine Learning algorithms, implement, demo and present to Contemi • 1/11 – 15/11 • Compete for AMS 2013-2014 Solar Energy Prediction Contest • URL: http://www.kaggle.com/c/ams-2014-solar-energy-prediction-contest • 16/11 – 22/11 • Compete for Accelerometer Biometric Competition • URL: http://www.kaggle.com/c/accelerometer-biometric-competition • 23/11 – 31/12 (end of internship) • Deploy Hadoop • Learn Java • Run Word counting and Sorting experiments with large data (> 1GB) • Compete for Facebook Recruting III – Keyword Extraction (personally) • Re-optimize built model basing on Hadoop
  • 6. Next plan for next internships • App using Singapore open datasets • Stock prediction app for Vietnam market • Visualization • GitHub • R-Bloggers