SlideShare ist ein Scribd-Unternehmen logo
1 von 15
Downloaden Sie, um offline zu lesen
STAKEHOLDER-CENTRED IDENTIFICATION OF
DATA QUALITY ISSUES:
KNOWLEDGE THAT CAN SAVE YOUR BUSINESS
The International Conference on Intelligent Data Science Technologies and Applications (IDSTA2021)
November 15-16, 2021. Tartu, Estonia (web-based)
Anastasija Nikiforova, Natalija Kozmina
“Innovative Information Technologies” Laboratory, Programming Department
Faculty of Computing, University of Latvia
AIM & RESEARCH QUESTIONS
(RQ1) What are the main data quality issues to be considered when conducting data quality analysis?
(RQ2) What do users with advanced data quality knowledge think of a list of defined data quality issues and requirements as
a result of the literature analysis, i.e., are all these issues important in their view?
(RQ3) Are the data quality requirements identified while answering previous RQs valid for real-world data?
(RQ4) What is the list of data quality requirements to be included in the data quality analysis and in the
specification of the data quality tool?
The goal of this study is to determine the most common data quality issues (i.e.,
defects) that affect users' experience with data and their reuse, as well as intent for
their use in the future, potentially resulting in financial losses for businesses.
19% of businesses had lost
their customers using inaccurate
or incomplete data in 2019
“Global Marketing Alliance, The cost of bad
data: have you done the math?”, 2020
The 2020 edition of “Magic
Quadrant for Data Quality
Solutions” found that organizations
estimate the average cost of poor
data quality at more than $12
million per year
Gartner Magic Quadrant for Data Quality Solutions,
2020,
RELATED RESEARCHES
‹«  This state of affairs has led to much confusion within the data quality community and is even
more bewildering for those who are new to the discipline and more importantly to business
stakeholders »
(DAMA UK, 2018)
** In different proposals, dimensions of the same name can have different semantics and vice versa.
(Batini, 2016)
General studies on data and information quality - define different
dimensions of quality and their groupings
✘ The key data quality dimensions are not universally agreed upon*;
✘ There is no agreement on their meanings and usability **;
✘ Each dimension can be supplied with one or more metrics that varies from
one solution to another;
✘ The number of different data quality dimensions, their definitions and
grouping are often useful for only particular solution.
Question: How to relate particular dimension (and which one?) to a particular use-case???
RESEARCH DESIGN
Step Ia: results of the literature review Step Ib: results of the brainstorming session, identifying
and removing duplicates (30 DQ-users)
Step II: results of DELPHI
analysis (12 experts)
(Laranjeiro et al., 2015) - 22 studies
(Scannapieco et al., 2002) – 6 studies
(ISO/IEC, 2008)
(Torchiano et al., 2017)
(Rafique et al., 2012)
(Askham et al., 2013)
(Utamachant et al., 2018)
(Wang and Strong, 1996)
1.accuracy/ correctness
2.objectivity
3.reputation/ traceability
4.believability/ credibility
5.timeliness
6.completeness
7.relevancy
8.value-added
9.interpretability
10.access security
11.currentness
12.representational consistency
13.consistency/ concise representation
14.accessibility
15.precision
16.efficiency
17.recoverability
18.portability
19.response time
20.adequacy
21.confidentiality (privacy, security)
22.understandability (ease of understanding, interpretability)
1.accuracy/ correctness
2.traceability
3.believability/ credibility
4.timeliness, currentness
5.completeness
6.consistency
7.accessibility
8.confidentiality/
privacy, security
9.understandability
(ease of understanding, clarity,
interpretability)
DATA QUALITY DIMENSIONS: 2-STEP IDENTIFICATION
Dimension* Level
DT/DS
Data quality issue associated
accuracy/
correctness
DT Incorrect/inaccurate values that do not belong to the domain
Misspelling
Precision
Special characters
Duplicates/uniqueness violations
Incorrect references
Different aggregation levels
traceability DS
DT
untraceable
believability/
credibility
DS non-credible
timeliness,
currentness
DS
DT
Outdated temporal data
completeness DT Missing value
... ... ...
DATA QUALITY DIMENSIONS AND ASSOCIATED DATA
QUALITY ISSUES IDENTIFIED (PART I)
*For definition of each dimension we have used, please, refer to the article
Dimension Level DT/DS Data quality issue associated
... ... ...
consistency DS
DT
Different representations (intra-relational constraint)
Different word orderings between values of one attribute
Use of synonyms / multiple notation for one object in scope of one attribute
Use of synonyms / multiple notation for one object in scope of different datasets
Different encoding formats, Wrong data type
Different aggregation levels
Different units
Special characters
accessibility DS Special characters
Misspelling, Different encoding formats
Different aggregation levels
Different units
Use of synonyms / multiple notation for one object in scope of different datasets
Bulk download
confidentiality/
privacy, security
DS unsecure / non-confidential
understandability
(ease of understanding, clarity,
interpretability)
DS
DT
unclear
DATA QUALITY DIMENSIONS AND ASSOCIATED DATA
QUALITY ISSUES IDENTIFIED (PART II)
Step I: results of the literature review Step II: results of the brainstorming session, identifying and
removing duplicates (30 DQ-users)
Step III: results of DELPHI analysis
(12 experts)
(Laranjeiro et al., 2015) - 22 studies
(Scannapieco et al., 2002) – 6 studies
(ISO/IEC, 2008)
(Torchiano et al., 2017)
(Rafique et al., 2012)
(Askham et al., 2013)
(Utamachant et al., 2018)
(Wang and Strong, 1996)
1.accuracy/ correctness
2.objectivity
3.reputation/ traceability
4.believability/ credibility
5.timeliness
6.completeness
7.relevancy
8.value-added
9.interpretability
10.access security
11.currentness
12.representational consistency
13.consistency/ concise representation
14.accessibility
15.precision
16.efficiency
17.recoverability
18.portability
19.response time
20.adequacy
21.confidentiality (privacy, security)
22.understandability (ease of understanding, interpretability)
1.accuracy/ correctness
2.traceability
3.believability/ credibility
4.timeliness, currentness
5.completeness
6.consistency
7.accessibility
8.confidentiality/
privacy, security
9.understandability
(ease of understanding, clarity,
interpretability)
DATA QUALITY DIMENSIONS: STEP III
Data quality problem in question Frequency of
checks
(datasets)
Frequency of issues in DS
(#defective data sets/#total)
Frequency of issues
(#defective parameters/ #total)
QD1: Incorrect/inaccurate values that does not belong to the
domain
40.00% 16.67% 15.38%
QD1: Misspelling 86.67% 7.69% 3.33%
QD1: Precision 40.00% 0 0
QD1: Special characters 10% 13.33% 25.93%
QD1: Duplicates / uniqueness violations 93.33% 28.57% 18.18%
QD1: Incorrect references 80.00% 16.67% 13.33%
QD1: Different aggregation levels 80.00% 16.67% 13.33%
QD2: Traceability (DT) 66.67% 0 0
QD2: Traceability (DS) 93.33% 14.29% 6.67%
QD3: Believability/ credibility 100% 13.33% 2.27%
QD4: Outdated temporal data (DT) 93.33% 7.14% 10.00%
QD4: Outdated temporal data (DS) 93.33% 64.29% 28.82%
QD5: Completeness 93.33% 64.29% 28.82%
... ... ... ...
RESULTS OF APPLYING DATA QUALITY REQUIREMENTS
TO OPEN GOVERNMENT DATA (part I)
Data quality problem in question Frequency of checks
(datasets)
Frequency of issues in
DS (#defective data
sets/#total)
Frequency of
issues (#defective
parameters/ #total)
QD6: Different representations (Intra-relational constraint) 86.67% 61.54% 61.90%
QD6: Different word orderings between values of one attribute 93.33% 42.86% 25.00%
QD6: Use of synonyms / multiple notation for one object in scope of one
attribute
86.67% 61.54% 61.90%
QD6: Use of synonyms / multiple notation for one object in different datasets 93.33% 50.00% 26.32%
QD6:Different encoding formats 80.00% 0 0
QD6: Wrong data type 86.67% 7.69% 0.80%
QD6:Different aggregation levels 46.67% 57.14% 25.93%
QD6: Different units 53.33% 25.00% 21.74%
QD6: Special characters 46.67% 57.14% 25.93%
QD7: Special characters 86.67% 7.69% 8.57%
QD7: Misspelling 90.00% 6.67% 8.33%
QD7: Different encoding formats 33.33% 0 0
QD7: Different aggregation levels 80.00% 8.33% 10.00%
QD7: Different units 80.00% 16.67% 21.74%
QD7: Use of synonyms / multiple notation for one object in scope of different
datasets
86.67% 30.77% 21.74%
QD7: Bulk download 100.00% 20.00% 20.00%
QD8: Confidentiality/ privacy, security 0 0 0
QD9: Understandability (DT) 100.00% 20.00% 11.76%
QD9: Understandability (DS) 100.00% 66.67% 25.93%
RESULTS
This study has raised and answered 4 research questions:
the list of main data quality issues to be considered when conducting data quality analysis was identified in
course of the literature analysis, which was then filtered out during the brainstorming session.
in terms of the DELPHI analysis with 12 experts the list was reduced to 9 data quality dimensions and 15 data
quality issues mapped to each other, dividing data quality issues into two categories depending on their level,
i.e., data and data set levels.
 the validity of the data quality issues identified was examined by applying the list of data quality
requirements set in RQ1 and RQ2 to 30 real open government data sets from the Latvian open government data
portal.
14 data quality issues to be transformed into requirements for the web-based tool under development
have been identified with 6 more appearing in some cases (<10% of data sets) to be considered for
implementation.
CONCLUSIONS I
The concept and topic of “data quality” attracts researchers for more than three decades, and its popularity
certainly will not change in the future - the data are not only an integral part of our lives and business. With
the popularity of the open government data, their value now is even higher than ever.
The paradigm according to which the data quality control and management is performed in closed systems,
is no longer valid.
This leads to the modification of already existing and the development of new data quality dimensions,
their classification, data quality issues, etc.
CONCLUSIONS I
The results showed that most of the defects are representative for OGD available to each stakeholder.
The OGD have data quality issues which, as demonstrated by OGD-related studies, have a negative impact on users’
readiness and willingness to re-use these data for their purposes such as innovative service and solutions.
Let's keep in mind that the data are worth reusing only if they are usable both in terms of their value and quality, otherwise
bringing businesses losses.
Further studies on the topic include the development of the web-based data quality analysis tool where the knowledge obtained
during this study will serve as a specification of the functionality to be covered by it.
DATA AVAILABILITY
Data are available in Open Access (under CC-BY)  DOI: https://doi.org/10.5281/zenodo.4604656
https://www.eosc-hub.eu/open-science-info
THANK YOU FOR
ATTENTION!
QUESTIONS?
For more information, see ResearchGate
See also anastasijanikiforova.com
For questions or any other queries, contact
me via email - Anastasija.Nikiforova@lu.lv

Weitere Àhnliche Inhalte

Was ist angesagt?

Knowledge graphs dedicated to the memory of amrapali zaveri 3388748
Knowledge graphs dedicated to the memory of amrapali zaveri 3388748Knowledge graphs dedicated to the memory of amrapali zaveri 3388748
Knowledge graphs dedicated to the memory of amrapali zaveri 3388748Jyotindra Zaveri
 
Crowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality AssessmentCrowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality AssessmentAmrapali Zaveri, PhD
 
1.2 steps and functionalities
1.2 steps and functionalities1.2 steps and functionalities
1.2 steps and functionalitiesRajendran
 
Efficient, Scalable, and Provenance-Aware Management of Linked Data
Efficient, Scalable, and Provenance-Aware Management of Linked DataEfficient, Scalable, and Provenance-Aware Management of Linked Data
Efficient, Scalable, and Provenance-Aware Management of Linked DataeXascale Infolab
 
Linked Data Quality Assessment: A Survey
Linked Data Quality Assessment: A SurveyLinked Data Quality Assessment: A Survey
Linked Data Quality Assessment: A SurveyAmrapali Zaveri, PhD
 
A SURVEY OF LINK MINING AND ANOMALIES DETECTION
A SURVEY OF LINK MINING AND ANOMALIES DETECTIONA SURVEY OF LINK MINING AND ANOMALIES DETECTION
A SURVEY OF LINK MINING AND ANOMALIES DETECTIONIJDKP
 
Ghhh
GhhhGhhh
Ghhhagammya
 
Metadata Quality Assurance
Metadata Quality AssuranceMetadata Quality Assurance
Metadata Quality AssurancePĂ©ter KirĂĄly
 
PhD Consortium ADBIS presetation.
PhD Consortium ADBIS presetation.PhD Consortium ADBIS presetation.
PhD Consortium ADBIS presetation.Giuseppe Ricci
 
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)Rinke Hoekstra
 
Evaluation Mechanism for Similarity-Based Ranked Search Over Scientific Data
Evaluation Mechanism for Similarity-Based Ranked Search Over Scientific DataEvaluation Mechanism for Similarity-Based Ranked Search Over Scientific Data
Evaluation Mechanism for Similarity-Based Ranked Search Over Scientific DataAM Publications
 
Knowledge Representation on the Web
Knowledge Representation on the WebKnowledge Representation on the Web
Knowledge Representation on the WebRinke Hoekstra
 
Tutorial Knowledge Discovery
Tutorial Knowledge DiscoveryTutorial Knowledge Discovery
Tutorial Knowledge DiscoverySSSW
 

Was ist angesagt? (19)

Knowledge graphs dedicated to the memory of amrapali zaveri 3388748
Knowledge graphs dedicated to the memory of amrapali zaveri 3388748Knowledge graphs dedicated to the memory of amrapali zaveri 3388748
Knowledge graphs dedicated to the memory of amrapali zaveri 3388748
 
Crowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality AssessmentCrowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality Assessment
 
1.2 steps and functionalities
1.2 steps and functionalities1.2 steps and functionalities
1.2 steps and functionalities
 
Efficient, Scalable, and Provenance-Aware Management of Linked Data
Efficient, Scalable, and Provenance-Aware Management of Linked DataEfficient, Scalable, and Provenance-Aware Management of Linked Data
Efficient, Scalable, and Provenance-Aware Management of Linked Data
 
Linked Data Quality Assessment: A Survey
Linked Data Quality Assessment: A SurveyLinked Data Quality Assessment: A Survey
Linked Data Quality Assessment: A Survey
 
A SURVEY OF LINK MINING AND ANOMALIES DETECTION
A SURVEY OF LINK MINING AND ANOMALIES DETECTIONA SURVEY OF LINK MINING AND ANOMALIES DETECTION
A SURVEY OF LINK MINING AND ANOMALIES DETECTION
 
Ghhh
GhhhGhhh
Ghhh
 
Metadata Quality Assurance
Metadata Quality AssuranceMetadata Quality Assurance
Metadata Quality Assurance
 
The Genopolis Microarray database
The Genopolis Microarray databaseThe Genopolis Microarray database
The Genopolis Microarray database
 
PhD Consortium ADBIS presetation.
PhD Consortium ADBIS presetation.PhD Consortium ADBIS presetation.
PhD Consortium ADBIS presetation.
 
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
 
Evaluation Mechanism for Similarity-Based Ranked Search Over Scientific Data
Evaluation Mechanism for Similarity-Based Ranked Search Over Scientific DataEvaluation Mechanism for Similarity-Based Ranked Search Over Scientific Data
Evaluation Mechanism for Similarity-Based Ranked Search Over Scientific Data
 
Knowledge Representation on the Web
Knowledge Representation on the WebKnowledge Representation on the Web
Knowledge Representation on the Web
 
PhD defense
PhD defense PhD defense
PhD defense
 
Artificial Intelligence in Data Curation
Artificial Intelligence in Data CurationArtificial Intelligence in Data Curation
Artificial Intelligence in Data Curation
 
Konrad cedem praesi
Konrad cedem praesiKonrad cedem praesi
Konrad cedem praesi
 
Open data quality
Open data qualityOpen data quality
Open data quality
 
Tutorial Knowledge Discovery
Tutorial Knowledge DiscoveryTutorial Knowledge Discovery
Tutorial Knowledge Discovery
 
Amrapali Zaveri Defense
Amrapali Zaveri DefenseAmrapali Zaveri Defense
Amrapali Zaveri Defense
 

Ähnlich wie Stakeholder-centred Identification of Data Quality Issues: Knowledge that Can Save Your Business

A step towards a data quality theory
 A step towards a data quality theory A step towards a data quality theory
A step towards a data quality theoryAnastasija Nikiforova
 
How to clean data less through Linked (Open Data) approach?
How to clean data less through Linked (Open Data) approach?How to clean data less through Linked (Open Data) approach?
How to clean data less through Linked (Open Data) approach?andrea huang
 
Evaluating the effectiveness of data quality framework in software engineering
Evaluating the effectiveness of data quality framework in  software engineeringEvaluating the effectiveness of data quality framework in  software engineering
Evaluating the effectiveness of data quality framework in software engineeringIJECEIAES
 
Big data processing using - Hadoop Technology
Big data processing using - Hadoop TechnologyBig data processing using - Hadoop Technology
Big data processing using - Hadoop TechnologyShital Kat
 
Privacy Requirements Engineering in Agile Software Development
Privacy Requirements Engineering in Agile Software DevelopmentPrivacy Requirements Engineering in Agile Software Development
Privacy Requirements Engineering in Agile Software DevelopmentRequirementsEngineeringLaboratory
 
Conformed Dimensions of Data Quality – An Organized Approach to Data Quality ...
Conformed Dimensions of Data Quality – An Organized Approach to Data Quality ...Conformed Dimensions of Data Quality – An Organized Approach to Data Quality ...
Conformed Dimensions of Data Quality – An Organized Approach to Data Quality ...DATAVERSITY
 
Data Mining : Concepts and Techniques
Data Mining : Concepts and TechniquesData Mining : Concepts and Techniques
Data Mining : Concepts and TechniquesDeepaR42
 
Prof. Melinda Laituri, Colorado State University | Map Data Integrity | SotM ...
Prof. Melinda Laituri, Colorado State University | Map Data Integrity | SotM ...Prof. Melinda Laituri, Colorado State University | Map Data Integrity | SotM ...
Prof. Melinda Laituri, Colorado State University | Map Data Integrity | SotM ...Kathmandu Living Labs
 
Nikita rajbhoj(a 50)
Nikita rajbhoj(a 50)Nikita rajbhoj(a 50)
Nikita rajbhoj(a 50)NikitaRajbhoj
 
TUW-ASE- Summer 2014: Analyzing and Specifying Concerns for DaaS
TUW-ASE- Summer 2014: Analyzing and Specifying Concerns for DaaSTUW-ASE- Summer 2014: Analyzing and Specifying Concerns for DaaS
TUW-ASE- Summer 2014: Analyzing and Specifying Concerns for DaaSHong-Linh Truong
 
META DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSING
META DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSINGMETA DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSING
META DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSINGIJCSEIT Journal
 
RES812 U4 Individual Project
RES812  U4 Individual ProjectRES812  U4 Individual Project
RES812 U4 Individual ProjectThienSi Le
 
RES812 U4 Individual Project
RES812  U4 Individual ProjectRES812  U4 Individual Project
RES812 U4 Individual ProjectThienSi Le
 
Standards and Standardization - A Research Project
Standards and Standardization - A Research ProjectStandards and Standardization - A Research Project
Standards and Standardization - A Research ProjectSandeep Purao
 
Profiling Linked Open Data
Profiling Linked Open DataProfiling Linked Open Data
Profiling Linked Open DataBlerina Spahiu
 
Metadata quality Assurance Framework at QQML2016 - short
Metadata quality Assurance Framework at QQML2016 - shortMetadata quality Assurance Framework at QQML2016 - short
Metadata quality Assurance Framework at QQML2016 - shortPĂ©ter KirĂĄly
 
IMPORTANCE OF PROCESS MINING FOR BIG DATA REQUIREMENTS ENGINEERING
IMPORTANCE OF PROCESS MINING FOR BIG DATA REQUIREMENTS ENGINEERINGIMPORTANCE OF PROCESS MINING FOR BIG DATA REQUIREMENTS ENGINEERING
IMPORTANCE OF PROCESS MINING FOR BIG DATA REQUIREMENTS ENGINEERINGijcsit
 
Importance of Process Mining for Big Data Requirements Engineering
Importance of Process Mining for Big Data Requirements EngineeringImportance of Process Mining for Big Data Requirements Engineering
Importance of Process Mining for Big Data Requirements EngineeringAIRCC Publishing Corporation
 
Modeling Difficulty in Recommender Systems
Modeling Difficulty in Recommender SystemsModeling Difficulty in Recommender Systems
Modeling Difficulty in Recommender Systemskib_83
 

Ähnlich wie Stakeholder-centred Identification of Data Quality Issues: Knowledge that Can Save Your Business (20)

A step towards a data quality theory
 A step towards a data quality theory A step towards a data quality theory
A step towards a data quality theory
 
How to clean data less through Linked (Open Data) approach?
How to clean data less through Linked (Open Data) approach?How to clean data less through Linked (Open Data) approach?
How to clean data less through Linked (Open Data) approach?
 
Evaluating the effectiveness of data quality framework in software engineering
Evaluating the effectiveness of data quality framework in  software engineeringEvaluating the effectiveness of data quality framework in  software engineering
Evaluating the effectiveness of data quality framework in software engineering
 
Big data processing using - Hadoop Technology
Big data processing using - Hadoop TechnologyBig data processing using - Hadoop Technology
Big data processing using - Hadoop Technology
 
Privacy Requirements Engineering in Agile Software Development
Privacy Requirements Engineering in Agile Software DevelopmentPrivacy Requirements Engineering in Agile Software Development
Privacy Requirements Engineering in Agile Software Development
 
Conformed Dimensions of Data Quality – An Organized Approach to Data Quality ...
Conformed Dimensions of Data Quality – An Organized Approach to Data Quality ...Conformed Dimensions of Data Quality – An Organized Approach to Data Quality ...
Conformed Dimensions of Data Quality – An Organized Approach to Data Quality ...
 
Data Mining : Concepts and Techniques
Data Mining : Concepts and TechniquesData Mining : Concepts and Techniques
Data Mining : Concepts and Techniques
 
Prof. Melinda Laituri, Colorado State University | Map Data Integrity | SotM ...
Prof. Melinda Laituri, Colorado State University | Map Data Integrity | SotM ...Prof. Melinda Laituri, Colorado State University | Map Data Integrity | SotM ...
Prof. Melinda Laituri, Colorado State University | Map Data Integrity | SotM ...
 
Nikita rajbhoj(a 50)
Nikita rajbhoj(a 50)Nikita rajbhoj(a 50)
Nikita rajbhoj(a 50)
 
TUW-ASE- Summer 2014: Analyzing and Specifying Concerns for DaaS
TUW-ASE- Summer 2014: Analyzing and Specifying Concerns for DaaSTUW-ASE- Summer 2014: Analyzing and Specifying Concerns for DaaS
TUW-ASE- Summer 2014: Analyzing and Specifying Concerns for DaaS
 
META DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSING
META DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSINGMETA DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSING
META DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSING
 
RES812 U4 Individual Project
RES812  U4 Individual ProjectRES812  U4 Individual Project
RES812 U4 Individual Project
 
RES812 U4 Individual Project
RES812  U4 Individual ProjectRES812  U4 Individual Project
RES812 U4 Individual Project
 
Standards and Standardization - A Research Project
Standards and Standardization - A Research ProjectStandards and Standardization - A Research Project
Standards and Standardization - A Research Project
 
DCW Data Quality 1992
DCW Data Quality 1992DCW Data Quality 1992
DCW Data Quality 1992
 
Profiling Linked Open Data
Profiling Linked Open DataProfiling Linked Open Data
Profiling Linked Open Data
 
Metadata quality Assurance Framework at QQML2016 - short
Metadata quality Assurance Framework at QQML2016 - shortMetadata quality Assurance Framework at QQML2016 - short
Metadata quality Assurance Framework at QQML2016 - short
 
IMPORTANCE OF PROCESS MINING FOR BIG DATA REQUIREMENTS ENGINEERING
IMPORTANCE OF PROCESS MINING FOR BIG DATA REQUIREMENTS ENGINEERINGIMPORTANCE OF PROCESS MINING FOR BIG DATA REQUIREMENTS ENGINEERING
IMPORTANCE OF PROCESS MINING FOR BIG DATA REQUIREMENTS ENGINEERING
 
Importance of Process Mining for Big Data Requirements Engineering
Importance of Process Mining for Big Data Requirements EngineeringImportance of Process Mining for Big Data Requirements Engineering
Importance of Process Mining for Big Data Requirements Engineering
 
Modeling Difficulty in Recommender Systems
Modeling Difficulty in Recommender SystemsModeling Difficulty in Recommender Systems
Modeling Difficulty in Recommender Systems
 

Mehr von Anastasija Nikiforova

Data Quality for AI or AI for Data quality: advances in Data Quality Manageme...
Data Quality for AI or AI for Data quality: advances in Data Quality Manageme...Data Quality for AI or AI for Data quality: advances in Data Quality Manageme...
Data Quality for AI or AI for Data quality: advances in Data Quality Manageme...Anastasija Nikiforova
 
Towards High-Value Datasets determination for data-driven development: a syst...
Towards High-Value Datasets determination for data-driven development: a syst...Towards High-Value Datasets determination for data-driven development: a syst...
Towards High-Value Datasets determination for data-driven development: a syst...Anastasija Nikiforova
 
Public data ecosystems in and for smart cities: how to make open / Big / smar...
Public data ecosystems in and for smart cities: how to make open / Big / smar...Public data ecosystems in and for smart cities: how to make open / Big / smar...
Public data ecosystems in and for smart cities: how to make open / Big / smar...Anastasija Nikiforova
 
Artificial Intelligence for open data or open data for artificial intelligence?
Artificial Intelligence for open data or open data for artificial intelligence?Artificial Intelligence for open data or open data for artificial intelligence?
Artificial Intelligence for open data or open data for artificial intelligence?Anastasija Nikiforova
 
Overlooked aspects of data governance: workflow framework for enterprise data...
Overlooked aspects of data governance: workflow framework for enterprise data...Overlooked aspects of data governance: workflow framework for enterprise data...
Overlooked aspects of data governance: workflow framework for enterprise data...Anastasija Nikiforova
 
Data Quality as a prerequisite for you business success: when should I start ...
Data Quality as a prerequisite for you business success: when should I start ...Data Quality as a prerequisite for you business success: when should I start ...
Data Quality as a prerequisite for you business success: when should I start ...Anastasija Nikiforova
 
Framework for understanding quantum computing use cases from a multidisciplin...
Framework for understanding quantum computing use cases from a multidisciplin...Framework for understanding quantum computing use cases from a multidisciplin...
Framework for understanding quantum computing use cases from a multidisciplin...Anastasija Nikiforova
 
Data Lake or Data Warehouse? Data Cleaning or Data Wrangling? How to Ensure t...
Data Lake or Data Warehouse? Data Cleaning or Data Wrangling? How to Ensure t...Data Lake or Data Warehouse? Data Cleaning or Data Wrangling? How to Ensure t...
Data Lake or Data Warehouse? Data Cleaning or Data Wrangling? How to Ensure t...Anastasija Nikiforova
 
Putting FAIR Principles in the Context of Research Information: FAIRness for ...
Putting FAIR Principles in the Context of Research Information: FAIRness for ...Putting FAIR Principles in the Context of Research Information: FAIRness for ...
Putting FAIR Principles in the Context of Research Information: FAIRness for ...Anastasija Nikiforova
 
Open data hackathon as a tool for increased engagement of Generation Z: to h...
Open data hackathon as a tool for increased engagement of Generation Z:  to h...Open data hackathon as a tool for increased engagement of Generation Z:  to h...
Open data hackathon as a tool for increased engagement of Generation Z: to h...Anastasija Nikiforova
 
Barriers to Openly Sharing Government Data: Towards an Open Data-adapted Inno...
Barriers to Openly Sharing Government Data: Towards an Open Data-adapted Inno...Barriers to Openly Sharing Government Data: Towards an Open Data-adapted Inno...
Barriers to Openly Sharing Government Data: Towards an Open Data-adapted Inno...Anastasija Nikiforova
 
Combining Data Lake and Data Wrangling for Ensuring Data Quality in CRIS
Combining Data Lake and Data Wrangling for Ensuring Data Quality in CRISCombining Data Lake and Data Wrangling for Ensuring Data Quality in CRIS
Combining Data Lake and Data Wrangling for Ensuring Data Quality in CRISAnastasija Nikiforova
 
The role of open data in the development of sustainable smart cities and smar...
The role of open data in the development of sustainable smart cities and smar...The role of open data in the development of sustainable smart cities and smar...
The role of open data in the development of sustainable smart cities and smar...Anastasija Nikiforova
 
Data security as a top priority in the digital world: preserve data value by ...
Data security as a top priority in the digital world: preserve data value by ...Data security as a top priority in the digital world: preserve data value by ...
Data security as a top priority in the digital world: preserve data value by ...Anastasija Nikiforova
 
Invited talk "Open Data as a driver of Society 5.0: how you and your scientif...
Invited talk "Open Data as a driver of Society 5.0: how you and your scientif...Invited talk "Open Data as a driver of Society 5.0: how you and your scientif...
Invited talk "Open Data as a driver of Society 5.0: how you and your scientif...Anastasija Nikiforova
 
TIMELINESS OF OPEN DATA IN OPEN GOVERNMENT DATA PORTALS THROUGH PANDEMIC-RELA...
TIMELINESS OF OPEN DATA IN OPEN GOVERNMENT DATA PORTALS THROUGH PANDEMIC-RELA...TIMELINESS OF OPEN DATA IN OPEN GOVERNMENT DATA PORTALS THROUGH PANDEMIC-RELA...
TIMELINESS OF OPEN DATA IN OPEN GOVERNMENT DATA PORTALS THROUGH PANDEMIC-RELA...Anastasija Nikiforova
 
ATVĒRTO DATU SAVLAICÄȘGUMS NACIONĀLAJOS ATVĒRTO DATU PORTĀLOS AR PANDĒMIJU SAI...
ATVĒRTO DATU SAVLAICÄȘGUMS NACIONĀLAJOS ATVĒRTO DATU PORTĀLOS AR PANDĒMIJU SAI...ATVĒRTO DATU SAVLAICÄȘGUMS NACIONĀLAJOS ATVĒRTO DATU PORTĀLOS AR PANDĒMIJU SAI...
ATVĒRTO DATU SAVLAICÄȘGUMS NACIONĀLAJOS ATVĒRTO DATU PORTĀLOS AR PANDĒMIJU SAI...Anastasija Nikiforova
 
Towards a Concurrence Analysis in Business Processes
Towards a Concurrence Analysis in Business ProcessesTowards a Concurrence Analysis in Business Processes
Towards a Concurrence Analysis in Business ProcessesAnastasija Nikiforova
 
DATA QUALITY MODEL-BASED TESTING OF INFORMATION SYSTEMS: THE USE-CASE OF E-SC...
DATA QUALITY MODEL-BASED TESTING OF INFORMATION SYSTEMS: THE USE-CASE OF E-SC...DATA QUALITY MODEL-BASED TESTING OF INFORMATION SYSTEMS: THE USE-CASE OF E-SC...
DATA QUALITY MODEL-BASED TESTING OF INFORMATION SYSTEMS: THE USE-CASE OF E-SC...Anastasija Nikiforova
 

Mehr von Anastasija Nikiforova (20)

Data Quality for AI or AI for Data quality: advances in Data Quality Manageme...
Data Quality for AI or AI for Data quality: advances in Data Quality Manageme...Data Quality for AI or AI for Data quality: advances in Data Quality Manageme...
Data Quality for AI or AI for Data quality: advances in Data Quality Manageme...
 
Towards High-Value Datasets determination for data-driven development: a syst...
Towards High-Value Datasets determination for data-driven development: a syst...Towards High-Value Datasets determination for data-driven development: a syst...
Towards High-Value Datasets determination for data-driven development: a syst...
 
Public data ecosystems in and for smart cities: how to make open / Big / smar...
Public data ecosystems in and for smart cities: how to make open / Big / smar...Public data ecosystems in and for smart cities: how to make open / Big / smar...
Public data ecosystems in and for smart cities: how to make open / Big / smar...
 
Artificial Intelligence for open data or open data for artificial intelligence?
Artificial Intelligence for open data or open data for artificial intelligence?Artificial Intelligence for open data or open data for artificial intelligence?
Artificial Intelligence for open data or open data for artificial intelligence?
 
Overlooked aspects of data governance: workflow framework for enterprise data...
Overlooked aspects of data governance: workflow framework for enterprise data...Overlooked aspects of data governance: workflow framework for enterprise data...
Overlooked aspects of data governance: workflow framework for enterprise data...
 
Data Quality as a prerequisite for you business success: when should I start ...
Data Quality as a prerequisite for you business success: when should I start ...Data Quality as a prerequisite for you business success: when should I start ...
Data Quality as a prerequisite for you business success: when should I start ...
 
Framework for understanding quantum computing use cases from a multidisciplin...
Framework for understanding quantum computing use cases from a multidisciplin...Framework for understanding quantum computing use cases from a multidisciplin...
Framework for understanding quantum computing use cases from a multidisciplin...
 
Data Lake or Data Warehouse? Data Cleaning or Data Wrangling? How to Ensure t...
Data Lake or Data Warehouse? Data Cleaning or Data Wrangling? How to Ensure t...Data Lake or Data Warehouse? Data Cleaning or Data Wrangling? How to Ensure t...
Data Lake or Data Warehouse? Data Cleaning or Data Wrangling? How to Ensure t...
 
Putting FAIR Principles in the Context of Research Information: FAIRness for ...
Putting FAIR Principles in the Context of Research Information: FAIRness for ...Putting FAIR Principles in the Context of Research Information: FAIRness for ...
Putting FAIR Principles in the Context of Research Information: FAIRness for ...
 
Open data hackathon as a tool for increased engagement of Generation Z: to h...
Open data hackathon as a tool for increased engagement of Generation Z:  to h...Open data hackathon as a tool for increased engagement of Generation Z:  to h...
Open data hackathon as a tool for increased engagement of Generation Z: to h...
 
Barriers to Openly Sharing Government Data: Towards an Open Data-adapted Inno...
Barriers to Openly Sharing Government Data: Towards an Open Data-adapted Inno...Barriers to Openly Sharing Government Data: Towards an Open Data-adapted Inno...
Barriers to Openly Sharing Government Data: Towards an Open Data-adapted Inno...
 
Combining Data Lake and Data Wrangling for Ensuring Data Quality in CRIS
Combining Data Lake and Data Wrangling for Ensuring Data Quality in CRISCombining Data Lake and Data Wrangling for Ensuring Data Quality in CRIS
Combining Data Lake and Data Wrangling for Ensuring Data Quality in CRIS
 
The role of open data in the development of sustainable smart cities and smar...
The role of open data in the development of sustainable smart cities and smar...The role of open data in the development of sustainable smart cities and smar...
The role of open data in the development of sustainable smart cities and smar...
 
Data security as a top priority in the digital world: preserve data value by ...
Data security as a top priority in the digital world: preserve data value by ...Data security as a top priority in the digital world: preserve data value by ...
Data security as a top priority in the digital world: preserve data value by ...
 
Invited talk "Open Data as a driver of Society 5.0: how you and your scientif...
Invited talk "Open Data as a driver of Society 5.0: how you and your scientif...Invited talk "Open Data as a driver of Society 5.0: how you and your scientif...
Invited talk "Open Data as a driver of Society 5.0: how you and your scientif...
 
Atvērto datu potenciāls
Atvērto datu potenciālsAtvērto datu potenciāls
Atvērto datu potenciāls
 
TIMELINESS OF OPEN DATA IN OPEN GOVERNMENT DATA PORTALS THROUGH PANDEMIC-RELA...
TIMELINESS OF OPEN DATA IN OPEN GOVERNMENT DATA PORTALS THROUGH PANDEMIC-RELA...TIMELINESS OF OPEN DATA IN OPEN GOVERNMENT DATA PORTALS THROUGH PANDEMIC-RELA...
TIMELINESS OF OPEN DATA IN OPEN GOVERNMENT DATA PORTALS THROUGH PANDEMIC-RELA...
 
ATVĒRTO DATU SAVLAICÄȘGUMS NACIONĀLAJOS ATVĒRTO DATU PORTĀLOS AR PANDĒMIJU SAI...
ATVĒRTO DATU SAVLAICÄȘGUMS NACIONĀLAJOS ATVĒRTO DATU PORTĀLOS AR PANDĒMIJU SAI...ATVĒRTO DATU SAVLAICÄȘGUMS NACIONĀLAJOS ATVĒRTO DATU PORTĀLOS AR PANDĒMIJU SAI...
ATVĒRTO DATU SAVLAICÄȘGUMS NACIONĀLAJOS ATVĒRTO DATU PORTĀLOS AR PANDĒMIJU SAI...
 
Towards a Concurrence Analysis in Business Processes
Towards a Concurrence Analysis in Business ProcessesTowards a Concurrence Analysis in Business Processes
Towards a Concurrence Analysis in Business Processes
 
DATA QUALITY MODEL-BASED TESTING OF INFORMATION SYSTEMS: THE USE-CASE OF E-SC...
DATA QUALITY MODEL-BASED TESTING OF INFORMATION SYSTEMS: THE USE-CASE OF E-SC...DATA QUALITY MODEL-BASED TESTING OF INFORMATION SYSTEMS: THE USE-CASE OF E-SC...
DATA QUALITY MODEL-BASED TESTING OF INFORMATION SYSTEMS: THE USE-CASE OF E-SC...
 

KĂŒrzlich hochgeladen

Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
Just Call Vip call girls Erode Escorts ☎9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎9352988975 Two shot with one girl (E...Just Call Vip call girls Erode Escorts ☎9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎9352988975 Two shot with one girl (E...gajnagarg
 
Call Girls In Shivaji Nagar ☎ 7737669865 đŸ„” Book Your One night Stand
Call Girls In Shivaji Nagar ☎ 7737669865 đŸ„” Book Your One night StandCall Girls In Shivaji Nagar ☎ 7737669865 đŸ„” Book Your One night Stand
Call Girls In Shivaji Nagar ☎ 7737669865 đŸ„” Book Your One night Standamitlee9823
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectBoston Institute of Analytics
 
Call Girls In Nandini Layout ☎ 7737669865 đŸ„” Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 đŸ„” Book Your One night StandCall Girls In Nandini Layout ☎ 7737669865 đŸ„” Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 đŸ„” Book Your One night Standamitlee9823
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangaloreamitlee9823
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraGovindSinghDasila
 
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...Elaine Werffeli
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...only4webmaster01
 
Just Call Vip call girls Mysore Escorts ☎9352988975 Two shot with one girl (...
Just Call Vip call girls Mysore Escorts ☎9352988975 Two shot with one girl (...Just Call Vip call girls Mysore Escorts ☎9352988975 Two shot with one girl (...
Just Call Vip call girls Mysore Escorts ☎9352988975 Two shot with one girl (...gajnagarg
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...amitlee9823
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
Call Girls In Hsr Layout ☎ 7737669865 đŸ„” Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 đŸ„” Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 đŸ„” Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 đŸ„” Book Your One night Standamitlee9823
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...amitlee9823
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums
 
âž„đŸ” 7737669865 đŸ”â–» Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
âž„đŸ” 7737669865 đŸ”â–» Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...âž„đŸ” 7737669865 đŸ”â–» Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...
âž„đŸ” 7737669865 đŸ”â–» Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...amitlee9823
 

KĂŒrzlich hochgeladen (20)

Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Just Call Vip call girls Erode Escorts ☎9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎9352988975 Two shot with one girl (E...Just Call Vip call girls Erode Escorts ☎9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎9352988975 Two shot with one girl (E...
 
Call Girls In Shivaji Nagar ☎ 7737669865 đŸ„” Book Your One night Stand
Call Girls In Shivaji Nagar ☎ 7737669865 đŸ„” Book Your One night StandCall Girls In Shivaji Nagar ☎ 7737669865 đŸ„” Book Your One night Stand
Call Girls In Shivaji Nagar ☎ 7737669865 đŸ„” Book Your One night Stand
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
Call Girls In Nandini Layout ☎ 7737669865 đŸ„” Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 đŸ„” Book Your One night StandCall Girls In Nandini Layout ☎ 7737669865 đŸ„” Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 đŸ„” Book Your One night Stand
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - Almora
 
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 
Just Call Vip call girls Mysore Escorts ☎9352988975 Two shot with one girl (...
Just Call Vip call girls Mysore Escorts ☎9352988975 Two shot with one girl (...Just Call Vip call girls Mysore Escorts ☎9352988975 Two shot with one girl (...
Just Call Vip call girls Mysore Escorts ☎9352988975 Two shot with one girl (...
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Call Girls In Hsr Layout ☎ 7737669865 đŸ„” Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 đŸ„” Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 đŸ„” Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 đŸ„” Book Your One night Stand
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
âž„đŸ” 7737669865 đŸ”â–» Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
âž„đŸ” 7737669865 đŸ”â–» Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...âž„đŸ” 7737669865 đŸ”â–» Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...
âž„đŸ” 7737669865 đŸ”â–» Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
 

Stakeholder-centred Identification of Data Quality Issues: Knowledge that Can Save Your Business

  • 1. STAKEHOLDER-CENTRED IDENTIFICATION OF DATA QUALITY ISSUES: KNOWLEDGE THAT CAN SAVE YOUR BUSINESS The International Conference on Intelligent Data Science Technologies and Applications (IDSTA2021) November 15-16, 2021. Tartu, Estonia (web-based) Anastasija Nikiforova, Natalija Kozmina “Innovative Information Technologies” Laboratory, Programming Department Faculty of Computing, University of Latvia
  • 2. AIM & RESEARCH QUESTIONS (RQ1) What are the main data quality issues to be considered when conducting data quality analysis? (RQ2) What do users with advanced data quality knowledge think of a list of defined data quality issues and requirements as a result of the literature analysis, i.e., are all these issues important in their view? (RQ3) Are the data quality requirements identified while answering previous RQs valid for real-world data? (RQ4) What is the list of data quality requirements to be included in the data quality analysis and in the specification of the data quality tool? The goal of this study is to determine the most common data quality issues (i.e., defects) that affect users' experience with data and their reuse, as well as intent for their use in the future, potentially resulting in financial losses for businesses. 19% of businesses had lost their customers using inaccurate or incomplete data in 2019 “Global Marketing Alliance, The cost of bad data: have you done the math?”, 2020 The 2020 edition of “Magic Quadrant for Data Quality Solutions” found that organizations estimate the average cost of poor data quality at more than $12 million per year Gartner Magic Quadrant for Data Quality Solutions, 2020,
  • 3. RELATED RESEARCHES ‹«  This state of affairs has led to much confusion within the data quality community and is even more bewildering for those who are new to the discipline and more importantly to business stakeholders » (DAMA UK, 2018) ** In different proposals, dimensions of the same name can have different semantics and vice versa. (Batini, 2016) General studies on data and information quality - define different dimensions of quality and their groupings ✘ The key data quality dimensions are not universally agreed upon*; ✘ There is no agreement on their meanings and usability **; ✘ Each dimension can be supplied with one or more metrics that varies from one solution to another; ✘ The number of different data quality dimensions, their definitions and grouping are often useful for only particular solution. Question: How to relate particular dimension (and which one?) to a particular use-case???
  • 5. Step Ia: results of the literature review Step Ib: results of the brainstorming session, identifying and removing duplicates (30 DQ-users) Step II: results of DELPHI analysis (12 experts) (Laranjeiro et al., 2015) - 22 studies (Scannapieco et al., 2002) – 6 studies (ISO/IEC, 2008) (Torchiano et al., 2017) (Rafique et al., 2012) (Askham et al., 2013) (Utamachant et al., 2018) (Wang and Strong, 1996) 1.accuracy/ correctness 2.objectivity 3.reputation/ traceability 4.believability/ credibility 5.timeliness 6.completeness 7.relevancy 8.value-added 9.interpretability 10.access security 11.currentness 12.representational consistency 13.consistency/ concise representation 14.accessibility 15.precision 16.efficiency 17.recoverability 18.portability 19.response time 20.adequacy 21.confidentiality (privacy, security) 22.understandability (ease of understanding, interpretability) 1.accuracy/ correctness 2.traceability 3.believability/ credibility 4.timeliness, currentness 5.completeness 6.consistency 7.accessibility 8.confidentiality/ privacy, security 9.understandability (ease of understanding, clarity, interpretability) DATA QUALITY DIMENSIONS: 2-STEP IDENTIFICATION
  • 6. Dimension* Level DT/DS Data quality issue associated accuracy/ correctness DT Incorrect/inaccurate values that do not belong to the domain Misspelling Precision Special characters Duplicates/uniqueness violations Incorrect references Different aggregation levels traceability DS DT untraceable believability/ credibility DS non-credible timeliness, currentness DS DT Outdated temporal data completeness DT Missing value ... ... ... DATA QUALITY DIMENSIONS AND ASSOCIATED DATA QUALITY ISSUES IDENTIFIED (PART I) *For definition of each dimension we have used, please, refer to the article
  • 7. Dimension Level DT/DS Data quality issue associated ... ... ... consistency DS DT Different representations (intra-relational constraint) Different word orderings between values of one attribute Use of synonyms / multiple notation for one object in scope of one attribute Use of synonyms / multiple notation for one object in scope of different datasets Different encoding formats, Wrong data type Different aggregation levels Different units Special characters accessibility DS Special characters Misspelling, Different encoding formats Different aggregation levels Different units Use of synonyms / multiple notation for one object in scope of different datasets Bulk download confidentiality/ privacy, security DS unsecure / non-confidential understandability (ease of understanding, clarity, interpretability) DS DT unclear DATA QUALITY DIMENSIONS AND ASSOCIATED DATA QUALITY ISSUES IDENTIFIED (PART II)
  • 8. Step I: results of the literature review Step II: results of the brainstorming session, identifying and removing duplicates (30 DQ-users) Step III: results of DELPHI analysis (12 experts) (Laranjeiro et al., 2015) - 22 studies (Scannapieco et al., 2002) – 6 studies (ISO/IEC, 2008) (Torchiano et al., 2017) (Rafique et al., 2012) (Askham et al., 2013) (Utamachant et al., 2018) (Wang and Strong, 1996) 1.accuracy/ correctness 2.objectivity 3.reputation/ traceability 4.believability/ credibility 5.timeliness 6.completeness 7.relevancy 8.value-added 9.interpretability 10.access security 11.currentness 12.representational consistency 13.consistency/ concise representation 14.accessibility 15.precision 16.efficiency 17.recoverability 18.portability 19.response time 20.adequacy 21.confidentiality (privacy, security) 22.understandability (ease of understanding, interpretability) 1.accuracy/ correctness 2.traceability 3.believability/ credibility 4.timeliness, currentness 5.completeness 6.consistency 7.accessibility 8.confidentiality/ privacy, security 9.understandability (ease of understanding, clarity, interpretability) DATA QUALITY DIMENSIONS: STEP III
  • 9. Data quality problem in question Frequency of checks (datasets) Frequency of issues in DS (#defective data sets/#total) Frequency of issues (#defective parameters/ #total) QD1: Incorrect/inaccurate values that does not belong to the domain 40.00% 16.67% 15.38% QD1: Misspelling 86.67% 7.69% 3.33% QD1: Precision 40.00% 0 0 QD1: Special characters 10% 13.33% 25.93% QD1: Duplicates / uniqueness violations 93.33% 28.57% 18.18% QD1: Incorrect references 80.00% 16.67% 13.33% QD1: Different aggregation levels 80.00% 16.67% 13.33% QD2: Traceability (DT) 66.67% 0 0 QD2: Traceability (DS) 93.33% 14.29% 6.67% QD3: Believability/ credibility 100% 13.33% 2.27% QD4: Outdated temporal data (DT) 93.33% 7.14% 10.00% QD4: Outdated temporal data (DS) 93.33% 64.29% 28.82% QD5: Completeness 93.33% 64.29% 28.82% ... ... ... ... RESULTS OF APPLYING DATA QUALITY REQUIREMENTS TO OPEN GOVERNMENT DATA (part I)
  • 10. Data quality problem in question Frequency of checks (datasets) Frequency of issues in DS (#defective data sets/#total) Frequency of issues (#defective parameters/ #total) QD6: Different representations (Intra-relational constraint) 86.67% 61.54% 61.90% QD6: Different word orderings between values of one attribute 93.33% 42.86% 25.00% QD6: Use of synonyms / multiple notation for one object in scope of one attribute 86.67% 61.54% 61.90% QD6: Use of synonyms / multiple notation for one object in different datasets 93.33% 50.00% 26.32% QD6:Different encoding formats 80.00% 0 0 QD6: Wrong data type 86.67% 7.69% 0.80% QD6:Different aggregation levels 46.67% 57.14% 25.93% QD6: Different units 53.33% 25.00% 21.74% QD6: Special characters 46.67% 57.14% 25.93% QD7: Special characters 86.67% 7.69% 8.57% QD7: Misspelling 90.00% 6.67% 8.33% QD7: Different encoding formats 33.33% 0 0 QD7: Different aggregation levels 80.00% 8.33% 10.00% QD7: Different units 80.00% 16.67% 21.74% QD7: Use of synonyms / multiple notation for one object in scope of different datasets 86.67% 30.77% 21.74% QD7: Bulk download 100.00% 20.00% 20.00% QD8: Confidentiality/ privacy, security 0 0 0 QD9: Understandability (DT) 100.00% 20.00% 11.76% QD9: Understandability (DS) 100.00% 66.67% 25.93%
  • 11. RESULTS This study has raised and answered 4 research questions: the list of main data quality issues to be considered when conducting data quality analysis was identified in course of the literature analysis, which was then filtered out during the brainstorming session. in terms of the DELPHI analysis with 12 experts the list was reduced to 9 data quality dimensions and 15 data quality issues mapped to each other, dividing data quality issues into two categories depending on their level, i.e., data and data set levels.  the validity of the data quality issues identified was examined by applying the list of data quality requirements set in RQ1 and RQ2 to 30 real open government data sets from the Latvian open government data portal. 14 data quality issues to be transformed into requirements for the web-based tool under development have been identified with 6 more appearing in some cases (<10% of data sets) to be considered for implementation.
  • 12. CONCLUSIONS I The concept and topic of “data quality” attracts researchers for more than three decades, and its popularity certainly will not change in the future - the data are not only an integral part of our lives and business. With the popularity of the open government data, their value now is even higher than ever. The paradigm according to which the data quality control and management is performed in closed systems, is no longer valid. This leads to the modification of already existing and the development of new data quality dimensions, their classification, data quality issues, etc.
  • 13. CONCLUSIONS I The results showed that most of the defects are representative for OGD available to each stakeholder. The OGD have data quality issues which, as demonstrated by OGD-related studies, have a negative impact on users’ readiness and willingness to re-use these data for their purposes such as innovative service and solutions. Let's keep in mind that the data are worth reusing only if they are usable both in terms of their value and quality, otherwise bringing businesses losses. Further studies on the topic include the development of the web-based data quality analysis tool where the knowledge obtained during this study will serve as a specification of the functionality to be covered by it.
  • 14. DATA AVAILABILITY Data are available in Open Access (under CC-BY)  DOI: https://doi.org/10.5281/zenodo.4604656 https://www.eosc-hub.eu/open-science-info
  • 15. THANK YOU FOR ATTENTION! QUESTIONS? For more information, see ResearchGate See also anastasijanikiforova.com For questions or any other queries, contact me via email - Anastasija.Nikiforova@lu.lv