Talk given by Dr. Rommel Novaes Carvalho, General Coordinator of the Observatório da Despesa Pública (Public Spending Observatory) and Professor in the Professional Master's Program in Applied Computing at UnB.
Event: Brasil 100% Digital: Integration and transparency in service of society
Website: http://www.brasildigital.gov.br/
Date: 10/11/2016 (DD/MM/YYYY)
Video: https://www.youtube.com/watch?v=3WYQlPR-RLw&feature=youtu.be&t=2h4m44s
How to turn civil servants into data scientists and narrow the gap between academia and government
1. BIG DATA IN GOVERNMENT
HOW TO TURN CIVIL SERVANTS INTO DATA SCIENTISTS AND NARROW THE GAP BETWEEN ACADEMIA AND GOVERNMENT
by Dr. Rommel Novaes Carvalho
Institute of Exact Sciences
Department of Computer Science
Master's Program in Applied Computing
2. – GARTNER
“Big data is high-volume, high-velocity and high-variety information
assets that demand cost-effective, innovative forms of information
processing for enhanced insight and decision making.”
5. – MCKINSEY GLOBAL INSTITUTE
“There will be a shortage of talent necessary for organizations to take advantage of big
data. By 2018, the United States alone could face a shortage of 140,000 to 190,000
people with deep analytical skills as well as 1.5 million managers and analysts with the
know-how to use the analysis of big data to make effective decisions.”
8.
Enrollment in the PPCA
R$ 600.00
Publishing and presenting an international paper
R$ 15,000.00
Changing Brazil and having your work covered by the national media
PRICELESS!!!
9. – UDACITY
“To situate yourself for success as data analyst, you should be
familiar with five core competencies: programming, statistics,
machine learning, data munging, and data visualization.”
STATISTICAL ANALYSIS OF DATA AND INFORMATION
- Basic Programming in R
- Descriptive and Inferential Statistics
- Data Preprocessing
- Basic Data Visualization
DATA AND TEXT MINING
- Intermediate Programming in R
- Advanced Statistics
- Classical Machine Learning
- Intermediate Data Visualization
MASSIVE DATA MINING
- Parallel Programming in R
- MapReduce and Hadoop
- Advanced Machine Learning
- Advanced Data Visualization
10. – DHL
Success Factors for Implementing Big Data Analytics
“But there needs to be more than a positive assessment of business
value. The following five success factors must also be in place.”
BUSINESS AND IT ALIGNMENT
DATA TRANSPARENCY AND GOVERNANCE
DATA PRIVACY
DATA SCIENCE SKILLS
APPROPRIATE TECHNOLOGY USAGE
11. BIG DATA IN BRAZIL
HOW IS THE BIG DATA WAVE DOING IN BRAZIL?
12.
https://youtu.be/OQtxja08oro
13. BIG DATA ANALYTICS THROUGH GOVERNMENT/ACADEMIA PARTNERSHIP
Analysis of variables and construction of predictive models to improve the selection of tax credit compensation processes
Application of Data Mining to prevent discharges in the Brazilian Army
Analyzing Suspicious Medical Visit Claims from Individual Healthcare Service Providers Using K-Means Clustering
Using Political Party Affiliation Data to Measure Civil Servants’ Risk of Corruption
[Figure: scale from 0% to 100%, labeled "SERVIDOR" (civil servant)]
Application of text mining techniques for classification of documents: a study of automation of complaints screening in a Brazilian Federal Agency
“Who is their mother?”: A classification work to get answers over registration people databases
14. BIG DATA ANALYTICS THROUGH GOVERNMENT/ACADEMIA PARTNERSHIP
Data mining: classification of Information Technology goods in the materials catalog of the Compras Governamentais (Government Procurement) system
Classification of low-income bank customers
Mining techniques applied in a supervised Service Desk environment
Application of Data Mining Techniques to Define SLAs in IT Incident and Service Request Management Processes
Estimating the growth of an electronic document base using time series forecasting models
Proposal for a new search engine for a MediaWiki knowledge base, assisted by a Neural Network
Application of text mining techniques to classify Cobol source code: a study to optimize the software maintenance process
15. GOVERNMENT/ACADEMIA PARTNERSHIP
FINAL PROJECT MDT 2016
https://youtu.be/hnY3pJ2vVT8?list=PLPHt3Ge65Mt903IsAnQWGE7-g73a1Es0D
16. DISSEMINATION OF DATA ANALYSIS IN THE FEDERAL GOVERNMENT
http://www.brasildigital.gov.br/brasil-digital/eventos-anteriores/brasil-100-digital-2/o-evento/
http://www.brasildigital.gov.br/brasil-digital/eventos-anteriores/brasil-100-digital/o-evento/
17. REFERENCES
• Gartner IT Glossary - Big Data @ http://www.gartner.com/it-glossary/big-data/
• 2014: The Year Big Data Adoption Goes Mainstream In The Enterprise @ http://www.forbes.com/sites/louiscolumbus/2014/01/12/2014-the-year-big-data-adoption-goes-mainstream-in-the-enterprise/
• Research Report on Big Data @ http://www.idgenterprise.com/report/big-data-2
• Power of 1% Improvement – ROI & Use Cases for Industrial Big Data @ http://practicalanalytics.wordpress.com/2013/06/25/power-of-1-the-business-case-for-industrial-big-data/
• Extract Big Returns from Investments in Big Data and Predictive Analytics in the Energy Industry Infographic @ http://www.slideshare.net/SAPanalytics/extract-big-returns-from-investments-in-big-data-and-predictive-analytics-in-the-energy-industry-infographic
• Big Data Can Help Marketers Unlock Up To $200 Billion @ http://www.businessinsider.com/big-data-can-boost-marketing-roi-2013-11#ixzz3LYA7DBSM
• How Retailers Can Tap Actionable Big Data Competitively (Infographic) @ http://upstreamcommerce.com/blog/2013/03/13/retailers-tap-actionable-big-data-competitively-infographic
18. REFERENCES
• Big data: The next frontier for innovation, competition, and productivity @ http://www.mckinsey.com/insights/business_technology/big_data_the_next_frontier_for_innovation
• PayScale - Data Scientist, IT Salary (United States) @ http://www.payscale.com/research/US/Job=Data_Scientist%2c_IT/Salary
• PayScale - Senior Data Scientist, IT Salary (United States) @ http://www.payscale.com/research/US/Job=Senior_Data_Scientist%2c_IT/Salary
• Data Analysts: What You’ll Make and Where You’ll Make It @ http://blog.udacity.com/2014/11/data-analysts-what-youll-make.html
• Using Bayesian Networks to Identify and Prevent Split Purchases in Brazil
  • Presentation @ https://www.youtube.com/watch?v=UVOsztdSQ3A
  • Slides @ http://www.slideshare.net/rommelnc/bmaw-2014-using-bayesian-networks-to-identify-and-prevent-split-purchases-in-brazil
  • Paper @ http://ceur-ws.org/Vol-1218/bmaw2014_paper_7.pdf
19. REFERENCES
• Methodology for Creating the Brazilian Government Reference Price Database @ http://www.lbd.dcc.ufmg.br/bdbcomp/servlet/Trabalho?id=20394
• Análise das Eleições nas Redes Sociais (Analysis of the Elections on Social Networks) @ http://eleicoesnasredessociais.blogspot.com.br/
• Probabilistic Ontology Representation and Modeling Methodology
  • Presentation @ https://www.youtube.com/watch?v=Zl5rmag6BqY
  • Slides @ http://pt.slideshare.net/rommelnc/probabilistic-ontology-representation-and-modeling-methodology-8647132
  • Dissertation @ http://digilib.gmu.edu:8080/handle/1920/6616
• Professional Master's Program in Applied Computing - CIC/UnB @ http://ppca.unb.br/
• “Who is their mother?”: A classification work to get answers over registration people databases @ http://www.lbd.dcc.ufmg.br/bdbcomp/servlet/Trabalho?id=21572
• Application of text mining techniques for classification of documents: a study of automation of complaints screening in a Brazilian Federal Agency @ http://www.lbd.dcc.ufmg.br/bdbcomp/servlet/Trabalho?id=21567
• Using Political Party Affiliation Data to Measure Civil Servants’ Risk of Corruption @ http://www.computer.org/csdl/proceedings/bracis/2014/5618/00/5618a166.pdf
20. REFERENCES
• Receita Federal expects to recover R$ 16 billion through investigation operations (in Portuguese) @ http://www1.folha.uol.com.br/mercado/2016/10/1819387-receita-federal-espera-recuperar-mais-de-r-16-bilhoes-com-tres-operacoes.shtml
• How artificial intelligence is helping to fight corruption in Brazil (in Portuguese) @ http://m.gizmodo.uol.com.br/corrupcao-inteligencia-artificial/
• 26th DEXA Conferences and Workshops @ http://www.dexa.org/previous/dexa2015/
• Analyzing Suspicious Medical Visit Claims from Individual Healthcare Service Providers Using K-Means Clustering @ http://www.springer.com/us/book/9783319223889
• Predictive Models on Tax Refund Claims - Essays of Data Mining in Brazilian Tax Administration @ http://www.springer.com/us/book/9783319223889
• 27th DEXA Conferences and Workshops @ http://dexa.org/
• Identifying the Main Problems in IT Auditing: A Comparison Between Unsupervised and Supervised Learning
21. REFERENCES
• 13th Annual Bayesian Modeling Applications Workshop @ http://c4i.gmu.edu/bmaw/2016/
• Measuring the Risk of Public Contracts Using Bayesian Classifiers @ http://ceur-ws.org/Vol-1663/bmaw2016_paper_2.pdf
• Bayesian Networks on Income Tax Audit Selection - A Case Study of Brazilian Tax Administration @ http://ceur-ws.org/Vol-1663/bmaw2016_paper_3.pdf
• Bayesian Models to Assess Risk of Corruption of Federal Management Units @ http://ceur-ws.org/Vol-1663/bmaw2016_paper_5.pdf
• 15th IEEE International Conference on Machine Learning and Applications (IEEE ICMLA’16) @ http://www.icmla-conference.org/icmla16/
• Identifying IT purchases anomalies in the Brazilian Government Procurement System using Deep Learning @ http://www.icmla-conference.org/icmla16/papers.pdf
• Predicting Recovery of Credit Operations on a Brazilian Bank @ http://www.icmla-conference.org/icmla16/papers.pdf
• Deep Learning Anomaly Detection as Support Fraud Investigation in Brazilian Exports and Anti-Money Laundering @ http://www.icmla-conference.org/icmla16/papers.pdf