Personal Information
Unternehmen/Arbeitsplatz
San Francisco Bay Area United States
Branche
Electronics / Computer Hardware
Info
I've built data pipeline using Apache Spark, Hadoop & scikit-learn & I've done data munging - cleaning up of the data for processing, feature engineering as well as creating ML model with the clean & transformed data. I've solved ML both supervised & unsupervised ML. using Python, Scala & R.
Additionally, I've done sentiment analysis, text analysis, & ML projects.
As I was the only engineer in my team & picked up required technology like Hadoop, Apache Spark on my own & evaluated Python, R & Scala programming language.
Automate the Sales Credit Allocation for the sales transaction. Nearly 5% of X million sales transactions need to be manually allocated to the right Sales Account Team f...
Tags
collaborative computing
hadoop
networking
apache mahout
association rule
clustering
data mining
Mehr anzeigen
Präsentationen
(8)Gefällt mir
(8)Frustration-Reduced PySpark: Data engineering with DataFrames
Ilya Ganelin
•
Vor 8 Jahren
sparklyr - Jeff Allen
Sri Ambati
•
Vor 7 Jahren
A lightweight browser start page - 3x3 Links
Federico Elles
•
Vor 15 Jahren
The Secret Sauce of Successful Teams
Sven Peters
•
Vor 7 Jahren
Web Services Testing
Vladimir Soghoyan
•
Vor 10 Jahren
Network Intrusion Detection Analysis using Random Forest Algorithm on Apache Mahout
Cisco
•
Vor 9 Jahren
Clustering and Association Rule
Cisco
•
Vor 9 Jahren
Time Series Forecasting for Google Inc. and Break-even analysis for Google glass.
Cisco
•
Vor 9 Jahren
Personal Information
Unternehmen/Arbeitsplatz
San Francisco Bay Area United States
Branche
Electronics / Computer Hardware
Info
I've built data pipeline using Apache Spark, Hadoop & scikit-learn & I've done data munging - cleaning up of the data for processing, feature engineering as well as creating ML model with the clean & transformed data. I've solved ML both supervised & unsupervised ML. using Python, Scala & R.
Additionally, I've done sentiment analysis, text analysis, & ML projects.
As I was the only engineer in my team & picked up required technology like Hadoop, Apache Spark on my own & evaluated Python, R & Scala programming language.
Automate the Sales Credit Allocation for the sales transaction. Nearly 5% of X million sales transactions need to be manually allocated to the right Sales Account Team f...
Tags
collaborative computing
hadoop
networking
apache mahout
association rule
clustering
data mining
Mehr anzeigen