SlideShare a Scribd company logo
1 of 41
1
Weiyi Shang
Supervisor: Dr. Ahmed E. Hassan
Log Engineering: Towards Systematic Log
Mining to Support the Development of
Ultra-largeScale Systems
Automated profiling & instrumentation are
widely used in software engineering
Overhead No domain knowledgeLarge scale
2
Logs are a valuable source of information about
system execution
3
Field informationDeveloper experience
foo() {
…
Log_statement(“operation
started”);
…
}
Overview of log mining
4
Software
System
Log collection Log analysis
Log
transformation
Goal
Finding 1. Little research focuses on logging
statements that reside in the source code.
Finding 2. Little research focuses on logs
generated during the development of system.
5
Software
System
Log collection Log analysis
Log
transformation
Goal
• Types of Logs:
• Platform logs: Hadoop logs [Tan et al.]
• Application logs: Dell DVD store logs [Jiang et al.]
• Sources of Logs:
• Logs from the field: [Kavulia et al.]
• Logs during development: [Jiang et al.]
Finding 3. Prior research primarily uses ad hoc
log transformation techniques.
6
Software
System
Log collection Log analysis
Log
transformation
Goal
• Abstracted logs: Log events [Jiang et al.]
• Vectors or sets: Pairs [Jiang et al.], Sequence [Jiang et al.], Suffix arrays
[Nagappan et al.], Time series [Bitincka et al.]
• Graphs: State machines [Tan et al.], Directed Graph[Nagappan et al.]
• Matrixes: [Lou et al.]
Finding 4. Prior log mining research does not
address the scalability challenges.
7
Software
System
Log collection Log analysis
Log
transformation
Goal
• Simple calculation: filtering [Salfner et
al. ]
• Directed Graph-based algorithms:
[Nagappan et al.]
• Static analysis: [Yuan et al.]
• Model checking: [Beschastnikh et al.]
• Visualization: [De Pauw et al.]
• Statistical methods: PCA [Xu et al.]
• Data mining techniques: Co-occurrence
analysis [Lou et al.]
• Machine learning techniques: Prediction
[Salfner et al.]
• Other analysis techniques: Compression
[Hassan et al.]
Finding 5. There exists limited software log
mining research to support software
development activities
8
Software
System
Log collection Log analysis
Log
transformation
Goal
• Log mining platforms: [Bitincka et al.]
• Log improvements: [Yuan et al.]
• Log mining for system administration
• Anomaly detection [Xu et al.]
• System monitoring [Rabkin et al.]
• Work load recovery and capacity
planning [Kavulia et al.]
• Log mining for software engineering
• Program comprehension:
[Beschastnikh et al.]
• Software testing: [Jiang et al.]
• Empirical studies: [Yuan et al.]
Thesis statement
Logs are a valuable yet rarely explored source of knowledge
about a software system and its operation. There is little
research regarding the understanding and evolution of logs.
Systematic and scalable log mining approaches are needed
to support various software development activities (e.g.,
code quality improvement, large scale testing and
deployment of ultra-large scale applications).
9
10
Part 1: Study the challenges associated with
understanding and evolving logging statements
Part 2: Log engineering approaches to support
software development activities
What are the challenges in understanding logging
statements? [Submitted to ICSM 2014]
How do logging statements evolve?
[WCRE 2011 , JESP]
Prioritizing code review and testing efforts using logs
and their churn. [EMSE]
Verifying deployment of Big Data Analytics applications
using logs. [ICSE 2013 ]
11
Part 1: Study the challenges associated with
understanding and evolving logging statements
Part 2: Log engineering approaches to support
software development activities
What are the challenges in understanding
logging statements?
How do logging statements evolve?
Prioritizing code review and testing efforts
using logs and their churn.
Verifying deployment of Big Data Analytics
applications using logs.
Motivation: Log understanding is challenging
12
User mailing lists
Hadoop
Cassandra
Zookeeper
14 inquiries asked
about 5 types of
information
2
11
1
6
1
0
5
10
Meaning Cause Context Solution Impact
# inquires
[ICSM 2014 in submission]
Approach: Attaching development knowledge
to logs
13
Code
commit
Issue reports
Source code
/*
…
*/
Call graph
Code comments
[ICSM 2014 in submission]
Development knowledge can resolve real-life
inquiries
Development knowledge can provide help in resolving 9 out of 14
real-life inquiries from the user mailing list
0
2
4
6
8
10
12
Meaning Cause Context Solution Impact
# not answered inquires
# answered inquires
14
[ICSM 2014 in submission]
15
Part 1: Study the challenges associated with
understanding and evolving logging statements
Part 2: Log engineering approaches to support
software development activities
What are the challenges in understanding
logging statements?
How do logging statements evolve?
Prioritizing code review and testing efforts
using logs and their churn.
Verifying deployment of Big Data Analytics
applications using logs.
Motivation:
How to keep Log Processing Apps in sync with
logs?
Release 1 Release 2 Release 3
16
[WCRE 2011 best paper, JSEP]
Approach:
Studying log evolution at the execution level
Data
Collection
Log
Abstraction
System
Deployment
time=1, Trying to launch, TaskID=01A
time=$t, Trying to launch, TaskID=$id
Enterprise Application (EA)
17
Log
Events
[WCRE 2011 best paper, JSEP]
Generating
Abstract
Syntax Tree
Identifying
logging
statements
Source code
Log.info (“time=%d, Trying to launch,
TaskID=%s”, time, taskid);
time=$t, Trying to launch, TaskID=$id
18
Logging
statements
Approach:
Studying log evolution at the code level
[WCRE 2011 best paper, JSEP]
How do log evolve
over time?
19
Growing &
changing
Document &
track
What types of
modifications
happen to logs?
What information is
conveyed by the
short-lived logs?
Quantity Type Content
8 types
Are mostly
avoidable
Implementation-
level details
Fragile
Maintenance
effort
Results
[WCRE 2011 best paper, JSEP]
20
Part 1: Study the challenges associated with
understanding and evolving logging statements
Part 2: Log engineering approaches to support
software development activities
What are the challenges in understanding
logging statements?
How do logging statements evolve?
Prioritizing code review and testing efforts
using logs and their churn.
Verifying deployment of Big Data Analytics
applications using logs.
Approach: Building statistical models for post-
release defects
21
Logistic
Regression
Model
Traditional metrics
Traditional metrics Log-related metrics
Logistic
Regression
Model
• Are log-related metrics significant in the models?
• How much explanatory power improvement can
log-related metrics provide over traditional
metrics?[EMSE]
22
Log density
Average logging level
Log add density
Log delete density
Co-change of log and bug fix
Product Process
Approach:
Defining log-related metrics
Lines of code
Pre-release defects
Total prior commits
log-related metrics
Traditional metrics
Product Process
[EMSE]
23
There is relationship between logging characteristics and software quality.
Results
• In 7 out of 8 studied releases, at least one log-related
metric is statistically significant in enhancing the model
with only traditional metrics.
• The log-related metrics provide up to 40%
improvement over the explanatory power of the
traditional metrics.
0.16.0 to 0.19.0 3.0 to 4.0
[EMSE]
24
Part 1: Study the challenges associated with
understanding and evolving logging statements
Part 2: Log engineering approaches to support
software development activities
What are the challenges in understanding
logging statements?
How do logging statements evolve?
Prioritizing code review and testing efforts
using logs and their churn.
Verifying deployment of Big Data Analytics
applications using logs.
How to verify the deployment of Big Data
Analytics Apps?
25
Small sample data and pseudo
cloud
Big data and real-life cloud
How to verify
[ICSE 2013 distinguished paper]
Traditional approach for verifying BDA apps
26
Keyword scan
Many false positives!!
Large results, too much
effort to manually
examine
[ICSE 2013 distinguished paper]
Overview of our approach
27
Small sample data and pseudo
cloud
Big data and real-life cloud
Underlying platform Underlying platform
Execution
sequences
Execution
sequences
Execution
sequence
delta
[ICSE 2013 distinguished paper]
Comparing small and large runs
28
Logs from
testing run
with small
data
Logs from
run with
large data
Execution sequence
E1, E2, E3, E5, E6
Execution sequence
E1, E2, E3, E5, E6
E1, E2, E3, E7, E5, E6
Execution sequence delta
E1, E2, E3, E7, E5, E6
[ICSE 2013 distinguished paper]
How precise is our
approach?
Precision
29
Effort Reduction
How much effort
reduction does our
approach provide?
Reduce logs for
manual inspection
by over 86%
Less false positive
[ICSE 2013 distinguished paper]
Thesis contribution
• We demonstrate the challenges of understanding
logs.
• We show that logging statements continually
evolve.
• We show that there is a relationship between
logging characteristics and software defects.
• We propose approaches that leverage logs to
verify the deployment of Big Data Analytics
applications.
30
31
32
Where else can we find the requested
information?
33
Code
commit
Issue reports
Source code
/*
…
*/
Code
comments
Call graph
fetch failure
From method
checkAndInformJobTracker
of file ShuffleScheduler.java
Where else can we find the requested
information?
34
Code
commit
Issue reports
Source code
/*
…
*/
Code
comments
Call graph
fetch failure
Notify the JobTracker after every read error, if
`reportReadErrorImmediately' is true or after
every `maxFetchFailuresBeforeReporting' failures
Where else can we find the requested
information?
35
Code
commit
Issue reports
Source code
/*
…
*/
Code
comments
Call graph
fetch failure
Called by method
copyFailed in class ShuffleScheduler
Where else can we find the requested
information?
36
Code
commit
Issue reports
Source code
/*
…
*/
Code
comments
Call graph
fetch failure
Allow shuffle retries and read-error
reporting to be configurable. Contributed
by Amareshwari Sriramadasu.
Where else can we find the requested
information?
37
Code
commit
Issue reports
Source code
/*
…
*/
Code
comments
Call graph
fetch failure
MAPREDUCE-1171.
… This is caused by a behavioral change in
hadoop 0.20.1. …
…One solution I could see is "Provide a config
option... ”…
Where else can we find the requested
information?
38
Code
commit
Issue reports
Source code
/*
…
*/
Code
comments
Call graph
fetch failure
Meaning: There is a data reading error.
Cause: One of the possible reasons is a configuration.
Context: The event happens during the shuffle period, while
copying data.
Impact: The event impacts the jobtracker.
Solution: Changing a configuration option would solve the issue.
Amareshwari Sriramadasu is the expert to go to.
Step 1: Log Abstraction
reduces the size of logs
39
Log
abstraction
Log Linking
Simplifying
sequences
Example of log lines
Execution events
Jiang et al. JSME 2008
Step 2: Log linking
provides context for logs
40
Log
abstraction
Log Linking
Simplifying
sequences
Example of log lines
Execution events
Step 3: Sequence simplification
deals with repeated logs
41
Log
abstraction
Log Linking
Simplifying
sequences
Repeated logs:
task t1 read file A.
task t1 read file A.
task t1 read file A.
Remove repetition
and order of events

More Related Content

What's hot

Animated Visualization of Software History Using Software Evolution Storyboards
Animated Visualization of Software History Using Software Evolution StoryboardsAnimated Visualization of Software History Using Software Evolution Storyboards
Animated Visualization of Software History Using Software Evolution StoryboardsSAIL_QU
 
Using Control Charts for Detecting and Understanding Performance Regressions ...
Using Control Charts for Detecting and Understanding Performance Regressions ...Using Control Charts for Detecting and Understanding Performance Regressions ...
Using Control Charts for Detecting and Understanding Performance Regressions ...SAIL_QU
 
Human factors in software reliability engineering - Research Paper
Human factors in software reliability engineering - Research PaperHuman factors in software reliability engineering - Research Paper
Human factors in software reliability engineering - Research PaperMuhammad Ahmad Zia
 
An Industrial Case Study of Automatically Identifying Performance Regression-...
An Industrial Case Study of Automatically Identifying Performance Regression-...An Industrial Case Study of Automatically Identifying Performance Regression-...
An Industrial Case Study of Automatically Identifying Performance Regression-...SAIL_QU
 
Testing throughout the software life cycle
Testing throughout the software life cycleTesting throughout the software life cycle
Testing throughout the software life cycleAlfa Rizki Harahap
 
Bottom-up Adoption of Continuous Delivery in a Stage-gate Managed Software Or...
Bottom-up Adoption of Continuous Delivery in a Stage-gate Managed Software Or...Bottom-up Adoption of Continuous Delivery in a Stage-gate Managed Software Or...
Bottom-up Adoption of Continuous Delivery in a Stage-gate Managed Software Or...Eero Laukkanen
 
naveed-kamran-software-architecture-agile
naveed-kamran-software-architecture-agilenaveed-kamran-software-architecture-agile
naveed-kamran-software-architecture-agileNaveed Kamran
 
Testing throughout the software life cycle
Testing throughout the software life cycleTesting throughout the software life cycle
Testing throughout the software life cycleSelvy Ariska
 
Testing throughout the software life cycle
Testing throughout the software life cycleTesting throughout the software life cycle
Testing throughout the software life cycleadeafsa
 
Icse 2011 ds_1
Icse 2011 ds_1Icse 2011 ds_1
Icse 2011 ds_1SAIL_QU
 
Key Findings from the 2019 State of DevOps Report
Key Findings from the 2019 State of DevOps ReportKey Findings from the 2019 State of DevOps Report
Key Findings from the 2019 State of DevOps ReportPuppet
 
Software Engineering (Requirements Engineering & Software Maintenance)
Software Engineering (Requirements Engineering  & Software Maintenance)Software Engineering (Requirements Engineering  & Software Maintenance)
Software Engineering (Requirements Engineering & Software Maintenance)ShudipPal
 
Testing throughout the software life cycle
Testing throughout the software life cycleTesting throughout the software life cycle
Testing throughout the software life cyclefajarayuningrum
 
Software devlopment security
Software devlopment securitySoftware devlopment security
Software devlopment securitySuraj Singh
 
'Automated Reliability Testing via Hardware Interfaces' by Bryan Bakker
'Automated Reliability Testing via Hardware Interfaces' by Bryan Bakker'Automated Reliability Testing via Hardware Interfaces' by Bryan Bakker
'Automated Reliability Testing via Hardware Interfaces' by Bryan BakkerTEST Huddle
 
Software Testing Basics
Software Testing BasicsSoftware Testing Basics
Software Testing BasicsBelal Raslan
 
ISTQB - Software development life cycle
ISTQB - Software development life cycleISTQB - Software development life cycle
ISTQB - Software development life cycleHoangThiHien1
 

What's hot (20)

Animated Visualization of Software History Using Software Evolution Storyboards
Animated Visualization of Software History Using Software Evolution StoryboardsAnimated Visualization of Software History Using Software Evolution Storyboards
Animated Visualization of Software History Using Software Evolution Storyboards
 
Using Control Charts for Detecting and Understanding Performance Regressions ...
Using Control Charts for Detecting and Understanding Performance Regressions ...Using Control Charts for Detecting and Understanding Performance Regressions ...
Using Control Charts for Detecting and Understanding Performance Regressions ...
 
Human factors in software reliability engineering - Research Paper
Human factors in software reliability engineering - Research PaperHuman factors in software reliability engineering - Research Paper
Human factors in software reliability engineering - Research Paper
 
Software metrics
Software metricsSoftware metrics
Software metrics
 
Sop test planning
Sop test planningSop test planning
Sop test planning
 
An Industrial Case Study of Automatically Identifying Performance Regression-...
An Industrial Case Study of Automatically Identifying Performance Regression-...An Industrial Case Study of Automatically Identifying Performance Regression-...
An Industrial Case Study of Automatically Identifying Performance Regression-...
 
Testing throughout the software life cycle
Testing throughout the software life cycleTesting throughout the software life cycle
Testing throughout the software life cycle
 
Bottom-up Adoption of Continuous Delivery in a Stage-gate Managed Software Or...
Bottom-up Adoption of Continuous Delivery in a Stage-gate Managed Software Or...Bottom-up Adoption of Continuous Delivery in a Stage-gate Managed Software Or...
Bottom-up Adoption of Continuous Delivery in a Stage-gate Managed Software Or...
 
naveed-kamran-software-architecture-agile
naveed-kamran-software-architecture-agilenaveed-kamran-software-architecture-agile
naveed-kamran-software-architecture-agile
 
Testing throughout the software life cycle
Testing throughout the software life cycleTesting throughout the software life cycle
Testing throughout the software life cycle
 
Testing throughout the software life cycle
Testing throughout the software life cycleTesting throughout the software life cycle
Testing throughout the software life cycle
 
Icse 2011 ds_1
Icse 2011 ds_1Icse 2011 ds_1
Icse 2011 ds_1
 
Key Findings from the 2019 State of DevOps Report
Key Findings from the 2019 State of DevOps ReportKey Findings from the 2019 State of DevOps Report
Key Findings from the 2019 State of DevOps Report
 
Software Engineering (Requirements Engineering & Software Maintenance)
Software Engineering (Requirements Engineering  & Software Maintenance)Software Engineering (Requirements Engineering  & Software Maintenance)
Software Engineering (Requirements Engineering & Software Maintenance)
 
Testing throughout the software life cycle
Testing throughout the software life cycleTesting throughout the software life cycle
Testing throughout the software life cycle
 
Software devlopment security
Software devlopment securitySoftware devlopment security
Software devlopment security
 
'Automated Reliability Testing via Hardware Interfaces' by Bryan Bakker
'Automated Reliability Testing via Hardware Interfaces' by Bryan Bakker'Automated Reliability Testing via Hardware Interfaces' by Bryan Bakker
'Automated Reliability Testing via Hardware Interfaces' by Bryan Bakker
 
Software Testing Basics
Software Testing BasicsSoftware Testing Basics
Software Testing Basics
 
Software engineering
Software  engineeringSoftware  engineering
Software engineering
 
ISTQB - Software development life cycle
ISTQB - Software development life cycleISTQB - Software development life cycle
ISTQB - Software development life cycle
 

Viewers also liked

Medios publicitarios
Medios publicitarios Medios publicitarios
Medios publicitarios Giarline
 
Proyecto de sociologia
Proyecto de sociologiaProyecto de sociologia
Proyecto de sociologiaruth Apellidos
 
Iniciativa del Senador Luis Humberto Fernández sobre coaliciones entre indepe...
Iniciativa del Senador Luis Humberto Fernández sobre coaliciones entre indepe...Iniciativa del Senador Luis Humberto Fernández sobre coaliciones entre indepe...
Iniciativa del Senador Luis Humberto Fernández sobre coaliciones entre indepe...Senadores PRD
 
Case History Carlo Cairoli
Case History Carlo CairoliCase History Carlo Cairoli
Case History Carlo CairoliCarlo Cairoli
 
Large-Scale Empirical Studies of Mobile Apps
Large-Scale Empirical Studies of Mobile AppsLarge-Scale Empirical Studies of Mobile Apps
Large-Scale Empirical Studies of Mobile AppsSAIL_QU
 
C de marchi_tirocinio
C de marchi_tirocinioC de marchi_tirocinio
C de marchi_tirocinioCladema
 
비아그라 판매 =<7cc.kr>=비아그라 정품 판매~비아그라판매±비아그라 정품판매∏비아그라 50mg판매,프릴리지 50mg판매,흥분제 ...
비아그라 판매 =<7cc.kr>=비아그라 정품 판매~비아그라판매±비아그라 정품판매∏비아그라 50mg판매,프릴리지 50mg판매,흥분제 ...비아그라 판매 =<7cc.kr>=비아그라 정품 판매~비아그라판매±비아그라 정품판매∏비아그라 50mg판매,프릴리지 50mg판매,흥분제 ...
비아그라 판매 =<7cc.kr>=비아그라 정품 판매~비아그라판매±비아그라 정품판매∏비아그라 50mg판매,프릴리지 50mg판매,흥분제 ...成 金
 
Singapore startup ecosystem and entrepreneur toolbox - Aug 2015
Singapore startup ecosystem and entrepreneur toolbox - Aug 2015Singapore startup ecosystem and entrepreneur toolbox - Aug 2015
Singapore startup ecosystem and entrepreneur toolbox - Aug 2015Arnaud Bonzom
 
Powering of bangladesh- Vision 2021
Powering of bangladesh- Vision 2021Powering of bangladesh- Vision 2021
Powering of bangladesh- Vision 2021Mukhlasur Rahman
 
Sustainability Day Leeds 2017
Sustainability Day Leeds 2017Sustainability Day Leeds 2017
Sustainability Day Leeds 20174 All of Us
 
Lean Canvas evolved - FTE Canvas
Lean Canvas evolved - FTE CanvasLean Canvas evolved - FTE Canvas
Lean Canvas evolved - FTE CanvasGorilla Ventures Oy
 

Viewers also liked (12)

Medios publicitarios
Medios publicitarios Medios publicitarios
Medios publicitarios
 
Proyecto de sociologia
Proyecto de sociologiaProyecto de sociologia
Proyecto de sociologia
 
Iniciativa del Senador Luis Humberto Fernández sobre coaliciones entre indepe...
Iniciativa del Senador Luis Humberto Fernández sobre coaliciones entre indepe...Iniciativa del Senador Luis Humberto Fernández sobre coaliciones entre indepe...
Iniciativa del Senador Luis Humberto Fernández sobre coaliciones entre indepe...
 
Case History Carlo Cairoli
Case History Carlo CairoliCase History Carlo Cairoli
Case History Carlo Cairoli
 
Large-Scale Empirical Studies of Mobile Apps
Large-Scale Empirical Studies of Mobile AppsLarge-Scale Empirical Studies of Mobile Apps
Large-Scale Empirical Studies of Mobile Apps
 
C de marchi_tirocinio
C de marchi_tirocinioC de marchi_tirocinio
C de marchi_tirocinio
 
비아그라 판매 =<7cc.kr>=비아그라 정품 판매~비아그라판매±비아그라 정품판매∏비아그라 50mg판매,프릴리지 50mg판매,흥분제 ...
비아그라 판매 =<7cc.kr>=비아그라 정품 판매~비아그라판매±비아그라 정품판매∏비아그라 50mg판매,프릴리지 50mg판매,흥분제 ...비아그라 판매 =<7cc.kr>=비아그라 정품 판매~비아그라판매±비아그라 정품판매∏비아그라 50mg판매,프릴리지 50mg판매,흥분제 ...
비아그라 판매 =<7cc.kr>=비아그라 정품 판매~비아그라판매±비아그라 정품판매∏비아그라 50mg판매,프릴리지 50mg판매,흥분제 ...
 
Importance of Eye Contact in Hospitality
Importance of Eye Contact in HospitalityImportance of Eye Contact in Hospitality
Importance of Eye Contact in Hospitality
 
Singapore startup ecosystem and entrepreneur toolbox - Aug 2015
Singapore startup ecosystem and entrepreneur toolbox - Aug 2015Singapore startup ecosystem and entrepreneur toolbox - Aug 2015
Singapore startup ecosystem and entrepreneur toolbox - Aug 2015
 
Powering of bangladesh- Vision 2021
Powering of bangladesh- Vision 2021Powering of bangladesh- Vision 2021
Powering of bangladesh- Vision 2021
 
Sustainability Day Leeds 2017
Sustainability Day Leeds 2017Sustainability Day Leeds 2017
Sustainability Day Leeds 2017
 
Lean Canvas evolved - FTE Canvas
Lean Canvas evolved - FTE CanvasLean Canvas evolved - FTE Canvas
Lean Canvas evolved - FTE Canvas
 

Similar to Log Engineering: Towards Systematic Log Mining to Support the Development of Ultra-large Scale Systems

Software Analytics - Achievements and Challenges
Software Analytics - Achievements and ChallengesSoftware Analytics - Achievements and Challenges
Software Analytics - Achievements and ChallengesTao Xie
 
Towards Software Sustainability Guides for Industrial Software Systems
Towards Software Sustainability Guides for Industrial Software SystemsTowards Software Sustainability Guides for Industrial Software Systems
Towards Software Sustainability Guides for Industrial Software SystemsHeiko Koziolek
 
Association Rule Mining Scheme for Software Failure Analysis
Association Rule Mining Scheme for Software Failure AnalysisAssociation Rule Mining Scheme for Software Failure Analysis
Association Rule Mining Scheme for Software Failure AnalysisEditor IJMTER
 
Why Monitoring and Logging are Important in DevOps.pdf
Why Monitoring and Logging are Important in DevOps.pdfWhy Monitoring and Logging are Important in DevOps.pdf
Why Monitoring and Logging are Important in DevOps.pdfDatacademy.ai
 
Agile maintenance
Agile maintenanceAgile maintenance
Agile maintenancearalikatte
 
boughtonalexand jdjdjfjjfjfjfjnfjfjjjfkdifij
boughtonalexand jdjdjfjjfjfjfjnfjfjjjfkdifijboughtonalexand jdjdjfjjfjfjfjnfjfjjjfkdifij
boughtonalexand jdjdjfjjfjfjfjnfjfjjjfkdifijakd3143
 
Evolvea Frameworkfor SelectingPrime Software DevelopmentProcess
Evolvea Frameworkfor SelectingPrime Software DevelopmentProcessEvolvea Frameworkfor SelectingPrime Software DevelopmentProcess
Evolvea Frameworkfor SelectingPrime Software DevelopmentProcessIJMER
 
Mining Development Knowledge to Understand and Support Software Logging Pract...
Mining Development Knowledge to Understand and Support Software Logging Pract...Mining Development Knowledge to Understand and Support Software Logging Pract...
Mining Development Knowledge to Understand and Support Software Logging Pract...SAIL_QU
 
PhD Proposal talk
PhD Proposal talkPhD Proposal talk
PhD Proposal talkRay Buse
 
Integrated Analysis of Traditional Requirements Engineering Process with Agil...
Integrated Analysis of Traditional Requirements Engineering Process with Agil...Integrated Analysis of Traditional Requirements Engineering Process with Agil...
Integrated Analysis of Traditional Requirements Engineering Process with Agil...zillesubhan
 
Improving the Performance of Database-Centric Applications Through Program An...
Improving the Performance of Database-Centric Applications Through Program An...Improving the Performance of Database-Centric Applications Through Program An...
Improving the Performance of Database-Centric Applications Through Program An...Concordia University
 
Process and Project Metrics-1
Process and Project Metrics-1Process and Project Metrics-1
Process and Project Metrics-1Saqib Raza
 
Ooad lab manual(original)
Ooad lab manual(original)Ooad lab manual(original)
Ooad lab manual(original)dipenpatelpatel
 
V1_I2_2012_Paper3.doc
V1_I2_2012_Paper3.docV1_I2_2012_Paper3.doc
V1_I2_2012_Paper3.docpraveena06
 
Improvement of Software Maintenance and Reliability using Data Mining Techniques
Improvement of Software Maintenance and Reliability using Data Mining TechniquesImprovement of Software Maintenance and Reliability using Data Mining Techniques
Improvement of Software Maintenance and Reliability using Data Mining Techniquesijdmtaiir
 

Similar to Log Engineering: Towards Systematic Log Mining to Support the Development of Ultra-large Scale Systems (20)

Software Analytics - Achievements and Challenges
Software Analytics - Achievements and ChallengesSoftware Analytics - Achievements and Challenges
Software Analytics - Achievements and Challenges
 
Towards Software Sustainability Guides for Industrial Software Systems
Towards Software Sustainability Guides for Industrial Software SystemsTowards Software Sustainability Guides for Industrial Software Systems
Towards Software Sustainability Guides for Industrial Software Systems
 
Msr2021 tutorial-di penta
Msr2021 tutorial-di pentaMsr2021 tutorial-di penta
Msr2021 tutorial-di penta
 
Association Rule Mining Scheme for Software Failure Analysis
Association Rule Mining Scheme for Software Failure AnalysisAssociation Rule Mining Scheme for Software Failure Analysis
Association Rule Mining Scheme for Software Failure Analysis
 
Why Monitoring and Logging are Important in DevOps.pdf
Why Monitoring and Logging are Important in DevOps.pdfWhy Monitoring and Logging are Important in DevOps.pdf
Why Monitoring and Logging are Important in DevOps.pdf
 
Agile maintenance
Agile maintenanceAgile maintenance
Agile maintenance
 
boughtonalexand jdjdjfjjfjfjfjnfjfjjjfkdifij
boughtonalexand jdjdjfjjfjfjfjnfjfjjjfkdifijboughtonalexand jdjdjfjjfjfjfjnfjfjjjfkdifij
boughtonalexand jdjdjfjjfjfjfjnfjfjjjfkdifij
 
Evolvea Frameworkfor SelectingPrime Software DevelopmentProcess
Evolvea Frameworkfor SelectingPrime Software DevelopmentProcessEvolvea Frameworkfor SelectingPrime Software DevelopmentProcess
Evolvea Frameworkfor SelectingPrime Software DevelopmentProcess
 
Mining Development Knowledge to Understand and Support Software Logging Pract...
Mining Development Knowledge to Understand and Support Software Logging Pract...Mining Development Knowledge to Understand and Support Software Logging Pract...
Mining Development Knowledge to Understand and Support Software Logging Pract...
 
A Regression Analysis Approach for Building a Prediction Model for System Tes...
A Regression Analysis Approach for Building a Prediction Model for System Tes...A Regression Analysis Approach for Building a Prediction Model for System Tes...
A Regression Analysis Approach for Building a Prediction Model for System Tes...
 
PhD Proposal talk
PhD Proposal talkPhD Proposal talk
PhD Proposal talk
 
Integrated Analysis of Traditional Requirements Engineering Process with Agil...
Integrated Analysis of Traditional Requirements Engineering Process with Agil...Integrated Analysis of Traditional Requirements Engineering Process with Agil...
Integrated Analysis of Traditional Requirements Engineering Process with Agil...
 
Datamingse
DatamingseDatamingse
Datamingse
 
Improving the Performance of Database-Centric Applications Through Program An...
Improving the Performance of Database-Centric Applications Through Program An...Improving the Performance of Database-Centric Applications Through Program An...
Improving the Performance of Database-Centric Applications Through Program An...
 
Process and Project Metrics-1
Process and Project Metrics-1Process and Project Metrics-1
Process and Project Metrics-1
 
Software Development Life Cycle
Software Development Life CycleSoftware Development Life Cycle
Software Development Life Cycle
 
Sdlc 4
Sdlc 4Sdlc 4
Sdlc 4
 
Ooad lab manual(original)
Ooad lab manual(original)Ooad lab manual(original)
Ooad lab manual(original)
 
V1_I2_2012_Paper3.doc
V1_I2_2012_Paper3.docV1_I2_2012_Paper3.doc
V1_I2_2012_Paper3.doc
 
Improvement of Software Maintenance and Reliability using Data Mining Techniques
Improvement of Software Maintenance and Reliability using Data Mining TechniquesImprovement of Software Maintenance and Reliability using Data Mining Techniques
Improvement of Software Maintenance and Reliability using Data Mining Techniques
 

More from SAIL_QU

Studying the Integration Practices and the Evolution of Ad Libraries in the G...
Studying the Integration Practices and the Evolution of Ad Libraries in the G...Studying the Integration Practices and the Evolution of Ad Libraries in the G...
Studying the Integration Practices and the Evolution of Ad Libraries in the G...SAIL_QU
 
Studying the Dialogue Between Users and Developers of Free Apps in the Google...
Studying the Dialogue Between Users and Developers of Free Apps in the Google...Studying the Dialogue Between Users and Developers of Free Apps in the Google...
Studying the Dialogue Between Users and Developers of Free Apps in the Google...SAIL_QU
 
Improving the testing efficiency of selenium-based load tests
Improving the testing efficiency of selenium-based load testsImproving the testing efficiency of selenium-based load tests
Improving the testing efficiency of selenium-based load testsSAIL_QU
 
Studying User-Developer Interactions Through the Distribution and Reviewing M...
Studying User-Developer Interactions Through the Distribution and Reviewing M...Studying User-Developer Interactions Through the Distribution and Reviewing M...
Studying User-Developer Interactions Through the Distribution and Reviewing M...SAIL_QU
 
Studying online distribution platforms for games through the mining of data f...
Studying online distribution platforms for games through the mining of data f...Studying online distribution platforms for games through the mining of data f...
Studying online distribution platforms for games through the mining of data f...SAIL_QU
 
Understanding the Factors for Fast Answers in Technical Q&A Websites: An Empi...
Understanding the Factors for Fast Answers in Technical Q&A Websites: An Empi...Understanding the Factors for Fast Answers in Technical Q&A Websites: An Empi...
Understanding the Factors for Fast Answers in Technical Q&A Websites: An Empi...SAIL_QU
 
Investigating the Challenges in Selenium Usage and Improving the Testing Effi...
Investigating the Challenges in Selenium Usage and Improving the Testing Effi...Investigating the Challenges in Selenium Usage and Improving the Testing Effi...
Investigating the Challenges in Selenium Usage and Improving the Testing Effi...SAIL_QU
 
Which Log Level Should Developers Choose For a New Logging Statement?
Which Log Level Should Developers Choose For a New Logging Statement?Which Log Level Should Developers Choose For a New Logging Statement?
Which Log Level Should Developers Choose For a New Logging Statement?SAIL_QU
 
Towards Just-in-Time Suggestions for Log Changes
Towards Just-in-Time Suggestions for Log ChangesTowards Just-in-Time Suggestions for Log Changes
Towards Just-in-Time Suggestions for Log ChangesSAIL_QU
 
The Impact of Task Granularity on Co-evolution Analyses
The Impact of Task Granularity on Co-evolution AnalysesThe Impact of Task Granularity on Co-evolution Analyses
The Impact of Task Granularity on Co-evolution AnalysesSAIL_QU
 
A Framework for Evaluating the Results of the SZZ Approach for Identifying Bu...
A Framework for Evaluating the Results of the SZZ Approach for Identifying Bu...A Framework for Evaluating the Results of the SZZ Approach for Identifying Bu...
A Framework for Evaluating the Results of the SZZ Approach for Identifying Bu...SAIL_QU
 
How are Discussions Associated with Bug Reworking? An Empirical Study on Open...
How are Discussions Associated with Bug Reworking? An Empirical Study on Open...How are Discussions Associated with Bug Reworking? An Empirical Study on Open...
How are Discussions Associated with Bug Reworking? An Empirical Study on Open...SAIL_QU
 
A Study of the Relation of Mobile Device Attributes with the User-Perceived Q...
A Study of the Relation of Mobile Device Attributes with the User-Perceived Q...A Study of the Relation of Mobile Device Attributes with the User-Perceived Q...
A Study of the Relation of Mobile Device Attributes with the User-Perceived Q...SAIL_QU
 
A Large-Scale Study of the Impact of Feature Selection Techniques on Defect C...
A Large-Scale Study of the Impact of Feature Selection Techniques on Defect C...A Large-Scale Study of the Impact of Feature Selection Techniques on Defect C...
A Large-Scale Study of the Impact of Feature Selection Techniques on Defect C...SAIL_QU
 
Studying the Dialogue Between Users and Developers of Free Apps in the Google...
Studying the Dialogue Between Users and Developers of Free Apps in the Google...Studying the Dialogue Between Users and Developers of Free Apps in the Google...
Studying the Dialogue Between Users and Developers of Free Apps in the Google...SAIL_QU
 
What Do Programmers Know about Software Energy Consumption?
What Do Programmers Know about Software Energy Consumption?What Do Programmers Know about Software Energy Consumption?
What Do Programmers Know about Software Energy Consumption?SAIL_QU
 
Threshold for Size and Complexity Metrics: A Case Study from the Perspective ...
Threshold for Size and Complexity Metrics: A Case Study from the Perspective ...Threshold for Size and Complexity Metrics: A Case Study from the Perspective ...
Threshold for Size and Complexity Metrics: A Case Study from the Perspective ...SAIL_QU
 
Revisiting the Experimental Design Choices for Approaches for the Automated R...
Revisiting the Experimental Design Choices for Approaches for the Automated R...Revisiting the Experimental Design Choices for Approaches for the Automated R...
Revisiting the Experimental Design Choices for Approaches for the Automated R...SAIL_QU
 
Measuring Program Comprehension: A Large-Scale Field Study with Professionals
Measuring Program Comprehension: A Large-Scale Field Study with ProfessionalsMeasuring Program Comprehension: A Large-Scale Field Study with Professionals
Measuring Program Comprehension: A Large-Scale Field Study with ProfessionalsSAIL_QU
 
On the Unreliability of Bug Severity Data
On the Unreliability of Bug Severity DataOn the Unreliability of Bug Severity Data
On the Unreliability of Bug Severity DataSAIL_QU
 

More from SAIL_QU (20)

Studying the Integration Practices and the Evolution of Ad Libraries in the G...
Studying the Integration Practices and the Evolution of Ad Libraries in the G...Studying the Integration Practices and the Evolution of Ad Libraries in the G...
Studying the Integration Practices and the Evolution of Ad Libraries in the G...
 
Studying the Dialogue Between Users and Developers of Free Apps in the Google...
Studying the Dialogue Between Users and Developers of Free Apps in the Google...Studying the Dialogue Between Users and Developers of Free Apps in the Google...
Studying the Dialogue Between Users and Developers of Free Apps in the Google...
 
Improving the testing efficiency of selenium-based load tests
Improving the testing efficiency of selenium-based load testsImproving the testing efficiency of selenium-based load tests
Improving the testing efficiency of selenium-based load tests
 
Studying User-Developer Interactions Through the Distribution and Reviewing M...
Studying User-Developer Interactions Through the Distribution and Reviewing M...Studying User-Developer Interactions Through the Distribution and Reviewing M...
Studying User-Developer Interactions Through the Distribution and Reviewing M...
 
Studying online distribution platforms for games through the mining of data f...
Studying online distribution platforms for games through the mining of data f...Studying online distribution platforms for games through the mining of data f...
Studying online distribution platforms for games through the mining of data f...
 
Understanding the Factors for Fast Answers in Technical Q&A Websites: An Empi...
Understanding the Factors for Fast Answers in Technical Q&A Websites: An Empi...Understanding the Factors for Fast Answers in Technical Q&A Websites: An Empi...
Understanding the Factors for Fast Answers in Technical Q&A Websites: An Empi...
 
Investigating the Challenges in Selenium Usage and Improving the Testing Effi...
Investigating the Challenges in Selenium Usage and Improving the Testing Effi...Investigating the Challenges in Selenium Usage and Improving the Testing Effi...
Investigating the Challenges in Selenium Usage and Improving the Testing Effi...
 
Which Log Level Should Developers Choose For a New Logging Statement?
Which Log Level Should Developers Choose For a New Logging Statement?Which Log Level Should Developers Choose For a New Logging Statement?
Which Log Level Should Developers Choose For a New Logging Statement?
 
Towards Just-in-Time Suggestions for Log Changes
Towards Just-in-Time Suggestions for Log ChangesTowards Just-in-Time Suggestions for Log Changes
Towards Just-in-Time Suggestions for Log Changes
 
The Impact of Task Granularity on Co-evolution Analyses
The Impact of Task Granularity on Co-evolution AnalysesThe Impact of Task Granularity on Co-evolution Analyses
The Impact of Task Granularity on Co-evolution Analyses
 
A Framework for Evaluating the Results of the SZZ Approach for Identifying Bu...
A Framework for Evaluating the Results of the SZZ Approach for Identifying Bu...A Framework for Evaluating the Results of the SZZ Approach for Identifying Bu...
A Framework for Evaluating the Results of the SZZ Approach for Identifying Bu...
 
How are Discussions Associated with Bug Reworking? An Empirical Study on Open...
How are Discussions Associated with Bug Reworking? An Empirical Study on Open...How are Discussions Associated with Bug Reworking? An Empirical Study on Open...
How are Discussions Associated with Bug Reworking? An Empirical Study on Open...
 
A Study of the Relation of Mobile Device Attributes with the User-Perceived Q...
A Study of the Relation of Mobile Device Attributes with the User-Perceived Q...A Study of the Relation of Mobile Device Attributes with the User-Perceived Q...
A Study of the Relation of Mobile Device Attributes with the User-Perceived Q...
 
A Large-Scale Study of the Impact of Feature Selection Techniques on Defect C...
A Large-Scale Study of the Impact of Feature Selection Techniques on Defect C...A Large-Scale Study of the Impact of Feature Selection Techniques on Defect C...
A Large-Scale Study of the Impact of Feature Selection Techniques on Defect C...
 
Studying the Dialogue Between Users and Developers of Free Apps in the Google...
Studying the Dialogue Between Users and Developers of Free Apps in the Google...Studying the Dialogue Between Users and Developers of Free Apps in the Google...
Studying the Dialogue Between Users and Developers of Free Apps in the Google...
 
What Do Programmers Know about Software Energy Consumption?
What Do Programmers Know about Software Energy Consumption?What Do Programmers Know about Software Energy Consumption?
What Do Programmers Know about Software Energy Consumption?
 
Threshold for Size and Complexity Metrics: A Case Study from the Perspective ...
Threshold for Size and Complexity Metrics: A Case Study from the Perspective ...Threshold for Size and Complexity Metrics: A Case Study from the Perspective ...
Threshold for Size and Complexity Metrics: A Case Study from the Perspective ...
 
Revisiting the Experimental Design Choices for Approaches for the Automated R...
Revisiting the Experimental Design Choices for Approaches for the Automated R...Revisiting the Experimental Design Choices for Approaches for the Automated R...
Revisiting the Experimental Design Choices for Approaches for the Automated R...
 
Measuring Program Comprehension: A Large-Scale Field Study with Professionals
Measuring Program Comprehension: A Large-Scale Field Study with ProfessionalsMeasuring Program Comprehension: A Large-Scale Field Study with Professionals
Measuring Program Comprehension: A Large-Scale Field Study with Professionals
 
On the Unreliability of Bug Severity Data
On the Unreliability of Bug Severity DataOn the Unreliability of Bug Severity Data
On the Unreliability of Bug Severity Data
 

Recently uploaded

How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerThousandEyes
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AIABDERRAOUF MEHENNI
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...MyIntelliSource, Inc.
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...MyIntelliSource, Inc.
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionSolGuruz
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfkalichargn70th171
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVshikhaohhpro
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812
 
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female service
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female serviceCALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female service
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female serviceanilsa9823
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...kellynguyen01
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsJhone kinadey
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...harshavardhanraghave
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfkalichargn70th171
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...panagenda
 
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️anilsa9823
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai
 

Recently uploaded (20)

How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with Precision
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS LiveVip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female service
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female serviceCALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female service
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female service
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 

Log Engineering: Towards Systematic Log Mining to Support the Development of Ultra-large Scale Systems

  • 1. 1 Weiyi Shang Supervisor: Dr. Ahmed E. Hassan Log Engineering: Towards Systematic Log Mining to Support the Development of Ultra-largeScale Systems
  • 2. Automated profiling & instrumentation are widely used in software engineering Overhead No domain knowledgeLarge scale 2
  • 3. Logs are a valuable source of information about system execution 3 Field informationDeveloper experience foo() { … Log_statement(“operation started”); … }
  • 4. Overview of log mining 4 Software System Log collection Log analysis Log transformation Goal
  • 5. Finding 1. Little research focuses on logging statements that reside in the source code. Finding 2. Little research focuses on logs generated during the development of system. 5 Software System Log collection Log analysis Log transformation Goal • Types of Logs: • Platform logs: Hadoop logs [Tan et al.] • Application logs: Dell DVD store logs [Jiang et al.] • Sources of Logs: • Logs from the field: [Kavulia et al.] • Logs during development: [Jiang et al.]
  • 6. Finding 3. Prior research primarily uses ad hoc log transformation techniques. 6 Software System Log collection Log analysis Log transformation Goal • Abstracted logs: Log events [Jiang et al.] • Vectors or sets: Pairs [Jiang et al.], Sequence [Jiang et al.], Suffix arrays [Nagappan et al.], Time series [Bitincka et al.] • Graphs: State machines [Tan et al.], Directed Graph[Nagappan et al.] • Matrixes: [Lou et al.]
  • 7. Finding 4. Prior log mining research does not address the scalability challenges. 7 Software System Log collection Log analysis Log transformation Goal • Simple calculation: filtering [Salfner et al. ] • Directed Graph-based algorithms: [Nagappan et al.] • Static analysis: [Yuan et al.] • Model checking: [Beschastnikh et al.] • Visualization: [De Pauw et al.] • Statistical methods: PCA [Xu et al.] • Data mining techniques: Co-occurrence analysis [Lou et al.] • Machine learning techniques: Prediction [Salfner et al.] • Other analysis techniques: Compression [Hassan et al.]
  • 8. Finding 5. There exists limited software log mining research to support software development activities 8 Software System Log collection Log analysis Log transformation Goal • Log mining platforms: [Bitincka et al.] • Log improvements: [Yuan et al.] • Log mining for system administration • Anomaly detection [Xu et al.] • System monitoring [Rabkin et al.] • Work load recovery and capacity planning [Kavulia et al.] • Log mining for software engineering • Program comprehension: [Beschastnikh et al.] • Software testing: [Jiang et al.] • Empirical studies: [Yuan et al.]
  • 9. Thesis statement Logs are a valuable yet rarely explored source of knowledge about a software system and its operation. There is little research regarding the understanding and evolution of logs. Systematic and scalable log mining approaches are needed to support various software development activities (e.g., code quality improvement, large scale testing and deployment of ultra-large scale applications). 9
  • 10. 10 Part 1: Study the challenges associated with understanding and evolving logging statements Part 2: Log engineering approaches to support software development activities What are the challenges in understanding logging statements? [Submitted to ICSM 2014] How do logging statements evolve? [WCRE 2011 , JESP] Prioritizing code review and testing efforts using logs and their churn. [EMSE] Verifying deployment of Big Data Analytics applications using logs. [ICSE 2013 ]
  • 11. 11 Part 1: Study the challenges associated with understanding and evolving logging statements Part 2: Log engineering approaches to support software development activities What are the challenges in understanding logging statements? How do logging statements evolve? Prioritizing code review and testing efforts using logs and their churn. Verifying deployment of Big Data Analytics applications using logs.
  • 12. Motivation: Log understanding is challenging 12 User mailing lists Hadoop Cassandra Zookeeper 14 inquiries asked about 5 types of information 2 11 1 6 1 0 5 10 Meaning Cause Context Solution Impact # inquires [ICSM 2014 in submission]
  • 13. Approach: Attaching development knowledge to logs 13 Code commit Issue reports Source code /* … */ Call graph Code comments [ICSM 2014 in submission]
  • 14. Development knowledge can resolve real-life inquiries Development knowledge can provide help in resolving 9 out of 14 real-life inquiries from the user mailing list 0 2 4 6 8 10 12 Meaning Cause Context Solution Impact # not answered inquires # answered inquires 14 [ICSM 2014 in submission]
  • 15. 15 Part 1: Study the challenges associated with understanding and evolving logging statements Part 2: Log engineering approaches to support software development activities What are the challenges in understanding logging statements? How do logging statements evolve? Prioritizing code review and testing efforts using logs and their churn. Verifying deployment of Big Data Analytics applications using logs.
  • 16. Motivation: How to keep Log Processing Apps in sync with logs? Release 1 Release 2 Release 3 16 [WCRE 2011 best paper, JSEP]
  • 17. Approach: Studying log evolution at the execution level Data Collection Log Abstraction System Deployment time=1, Trying to launch, TaskID=01A time=$t, Trying to launch, TaskID=$id Enterprise Application (EA) 17 Log Events [WCRE 2011 best paper, JSEP]
  • 18. Generating Abstract Syntax Tree Identifying logging statements Source code Log.info (“time=%d, Trying to launch, TaskID=%s”, time, taskid); time=$t, Trying to launch, TaskID=$id 18 Logging statements Approach: Studying log evolution at the code level [WCRE 2011 best paper, JSEP]
  • 19. How do log evolve over time? 19 Growing & changing Document & track What types of modifications happen to logs? What information is conveyed by the short-lived logs? Quantity Type Content 8 types Are mostly avoidable Implementation- level details Fragile Maintenance effort Results [WCRE 2011 best paper, JSEP]
  • 20. 20 Part 1: Study the challenges associated with understanding and evolving logging statements Part 2: Log engineering approaches to support software development activities What are the challenges in understanding logging statements? How do logging statements evolve? Prioritizing code review and testing efforts using logs and their churn. Verifying deployment of Big Data Analytics applications using logs.
  • 21. Approach: Building statistical models for post- release defects 21 Logistic Regression Model Traditional metrics Traditional metrics Log-related metrics Logistic Regression Model • Are log-related metrics significant in the models? • How much explanatory power improvement can log-related metrics provide over traditional metrics?[EMSE]
  • 22. 22 Log density Average logging level Log add density Log delete density Co-change of log and bug fix Product Process Approach: Defining log-related metrics Lines of code Pre-release defects Total prior commits log-related metrics Traditional metrics Product Process [EMSE]
  • 23. 23 There is relationship between logging characteristics and software quality. Results • In 7 out of 8 studied releases, at least one log-related metric is statistically significant in enhancing the model with only traditional metrics. • The log-related metrics provide up to 40% improvement over the explanatory power of the traditional metrics. 0.16.0 to 0.19.0 3.0 to 4.0 [EMSE]
  • 24. 24 Part 1: Study the challenges associated with understanding and evolving logging statements Part 2: Log engineering approaches to support software development activities What are the challenges in understanding logging statements? How do logging statements evolve? Prioritizing code review and testing efforts using logs and their churn. Verifying deployment of Big Data Analytics applications using logs.
  • 25. How to verify the deployment of Big Data Analytics Apps? 25 Small sample data and pseudo cloud Big data and real-life cloud How to verify [ICSE 2013 distinguished paper]
  • 26. Traditional approach for verifying BDA apps 26 Keyword scan Many false positives!! Large results, too much effort to manually examine [ICSE 2013 distinguished paper]
  • 27. Overview of our approach 27 Small sample data and pseudo cloud Big data and real-life cloud Underlying platform Underlying platform Execution sequences Execution sequences Execution sequence delta [ICSE 2013 distinguished paper]
  • 28. Comparing small and large runs 28 Logs from testing run with small data Logs from run with large data Execution sequence E1, E2, E3, E5, E6 Execution sequence E1, E2, E3, E5, E6 E1, E2, E3, E7, E5, E6 Execution sequence delta E1, E2, E3, E7, E5, E6 [ICSE 2013 distinguished paper]
  • 29. How precise is our approach? Precision 29 Effort Reduction How much effort reduction does our approach provide? Reduce logs for manual inspection by over 86% Less false positive [ICSE 2013 distinguished paper]
  • 30. Thesis contribution • We demonstrate the challenges of understanding logs. • We show that logging statements continually evolve. • We show that there is a relationship between logging characteristics and software defects. • We propose approaches that leverage logs to verify the deployment of Big Data Analytics applications. 30
  • 31. 31
  • 32. 32
  • 33. Where else can we find the requested information? 33 Code commit Issue reports Source code /* … */ Code comments Call graph fetch failure From method checkAndInformJobTracker of file ShuffleScheduler.java
  • 34. Where else can we find the requested information? 34 Code commit Issue reports Source code /* … */ Code comments Call graph fetch failure Notify the JobTracker after every read error, if `reportReadErrorImmediately' is true or after every `maxFetchFailuresBeforeReporting' failures
  • 35. Where else can we find the requested information? 35 Code commit Issue reports Source code /* … */ Code comments Call graph fetch failure Called by method copyFailed in class ShuffleScheduler
  • 36. Where else can we find the requested information? 36 Code commit Issue reports Source code /* … */ Code comments Call graph fetch failure Allow shuffle retries and read-error reporting to be configurable. Contributed by Amareshwari Sriramadasu.
  • 37. Where else can we find the requested information? 37 Code commit Issue reports Source code /* … */ Code comments Call graph fetch failure MAPREDUCE-1171. … This is caused by a behavioral change in hadoop 0.20.1. … …One solution I could see is "Provide a config option... ”…
  • 38. Where else can we find the requested information? 38 Code commit Issue reports Source code /* … */ Code comments Call graph fetch failure Meaning: There is a data reading error. Cause: One of the possible reasons is a configuration. Context: The event happens during the shuffle period, while copying data. Impact: The event impacts the jobtracker. Solution: Changing a configuration option would solve the issue. Amareshwari Sriramadasu is the expert to go to.
  • 39. Step 1: Log Abstraction reduces the size of logs 39 Log abstraction Log Linking Simplifying sequences Example of log lines Execution events Jiang et al. JSME 2008
  • 40. Step 2: Log linking provides context for logs 40 Log abstraction Log Linking Simplifying sequences Example of log lines Execution events
  • 41. Step 3: Sequence simplification deals with repeated logs 41 Log abstraction Log Linking Simplifying sequences Repeated logs: task t1 read file A. task t1 read file A. task t1 read file A. Remove repetition and order of events

Editor's Notes

  1. Introduce my self and topic Title large
  2. To understand system behavior Example of why we need it No-domain knowledge* Entire system rather than an important part
  3. Logs record important events of system, developers put logs there
  4. Title 5 tto 9
  5. Title 5 tto 9
  6. Priority change figure
  7. Churn=> how logs change
  8. Disconnect between dev and system admin
  9. neon
  10. Emphysize on this slide more for take-home
  11. Priority change figure
  12. Explain LPA
  13. Example of the workload. Hadoop wordcount. Dell DVD store.
  14. By knowing the logging method, I get the logging statements e.g, log4j
  15. Example of the log types: rephrase Short-lived logs: what’s that Fragile=> break the LPA
  16. Priority change figure
  17. What traditional metrics is: well studied in practice and accepted by community
  18. Logging level: what’s that
  19. Priority change figure
  20. BDA
  21. Underlying platform: hadoop
  22. 3 applications, real ones
  23. First part, we study logging in practice, second part, we propose log mining techniques
  24. neon
  25. neon
  26. neon
  27. neon
  28. neon
  29. neon
  30. +animation here