SlideShare a Scribd company logo
1 of 16
Open Government Data Platform
India
(https://data.gov.in)
Meta Data and Quality of Data
By: Sunil Babbar, Scientist-C, NIC
Data Contributors and Their Role
• Nominated by Chief Data Officer
• Coordinate and Identify datasets which can be
contributed
• Preparing the datasets
– Getting them cleaned
– Metadata preparation for datasets in the predefined format
– Ensuring quality and correctness datasets of his/her
unit/division.
• Contributing Catalogs/Resources(Datasets) through pre-
defined workflow
(Data Contributor  Chief Data Officer(CDO) for review and publish  PMU to publish on OGD Platform)
Resources (Datasets / Apps)
 A data set (or dataset) is a collection of data
 A data set corresponds to the contents of a single table or
statistical data matrix, where
 every column represents a particular variable, and
 each row corresponds to a given member of the data set in
question
 OpenDataFormats:
CSV
XLS
ODF
XML/RDF
JSON
RSS/Atom
KML/GML
Catalog
 Catalogisgroupingof thesimilarresources(Datasets/Apps)
 A catalog represents a collection of resources that you
group together
 Acts like directory of information about resources
 BenefitofCatalog
 To facilitate data access by users who are first interested in a
particular kind of data
 Cataloghelpsingroupingtheresourceswithsametheme/subjectand
thusfacilitatetheuserinsearching aspecificdataset/resourceeasily
 Ministry/Departmentshavelessefforttouploadsamesetofresources
orupdatingthedatasetfornewperiodwithoutwritingthemetadata
againandagain
 Tofacilitatetheusersforeasiernavigationandsearchingforrelevant
data.
Catalog Formation
 Catalogwithsameresourcewithdifferenttimeperiod
(Annual,Quarterly,Monthly,WeeklyandDaily)
 Eg.AnnualRainfallData
 Catalogwithsameresourcebutwithdifferentjurisdiction
(India,States,Districts,Block,Village)
 States/UTs-wise Forest and Tree Cover
 Catalogwithsameresourcebutdifferentcategory
(ScheduleCaste,ScheduleTribe,General,Religionetc.)
 District-wise crimes committed against Schedule Caste
 CatalogwithSimilartypeofresourceundersamereport
(Resourcesofsimilarnature)fromthesamereport/survey
canbegroupedunderthesamecatalog
 Primary Census Abstract 2011 - India and States
MetaData
• Is the information that describes the data
– What is that data (About Data)
– Data source
– Who Created
– When created
– Etc.
• Metadata allows the data to be traced to
a know its origin and quality
Metadata Elements for Catalogs
 Title(Required):Auniquenameforthecatalog(groupofresources)
 Shouldcontainthegeneraltermswhichdescribestheessential
properties/characteristicsofthedatasets/resources
 Should be in plain English and include sufficient detail to facilitate search and
discovery
 Time-periodshouldnotbementionedinthecatalogtitlenormallysothatforthesimilar
resources,containingsametypeofdataforthenexttime-period/periodicupdating,can
beaccommodatedinsamecatalog
 Howeverinexceptionalcases,itcancontaintimeperiodparticularlyforperiodic
surveys/censuswhichcontainsahugenumberofdatasets/resourcesbelongingtothe
sameperiod/year
 Eg.CurrentPopulationSurvey,ConsumerPriceIndex,VarietywiseDailyMarketPrices
Data,StatewiseConstructionofDeepTubewellsovertheyears,etc.
 Description(Required):Provideadetaileddescriptionofthecatalog
 Anabstractdeterminingthenatureandpurposeofthecatalog
 Containsthenameofvariableswhichareavailableinthedatasets
 Canalsocontainsthedefinitionofsomevariable
Metadata Elements for Catalogs
 Keywords(Required):Itisalistof terms,separatedbycommas,
describingandindicatingatthecontentofthecatalog.Example:
rainfall,weather,monthlystatistics.
 Help users discover your dataset; please include terms that would be used by
technical and non-technical users.
 GroupName:ThisisanoptionalfieldtoprovideaGroupNameto
multiplecatalogsinordertoshowthattheymaybepresentedas
agroupora set.
 Sector& SubSector(Required):Choosethe
sectors(s)/subsector(s)thosemostcloselyapply(ies)toyour
catalog.
 AssetJurisdiction(Required):Thisisa requiredfieldtoidentify
theexactlocationorareatowhichthecatalogand
resources(dataset/apps)caterstoviz.entirecountry,
state/province,district,city,etc.
Example - Creation of catalog
 Catalog Title:
 CompanyMasterData2015
 (Incorrect-Contains time frame, so in future if we want to add data under this
catalog e.g Company master data for 2016, it would be not be possible to upload
data under this catalog)
 CompanyMasterData (Correct)
 CatalogDescription:
 GetdataofCompanymasterdata..??
 (Incorrect-Does not contain detail information. Description should contain the name
of variables which are available in the datasets)
 Get data on master details of any company registered with Registrar of Companies (RoC).
Data contains various information like Corporate Identification Number(CIN), Company Name,
Company Status, Company Class, Company Category, Authorized Capital in INR, Paid-up
Capital in INR, Date of Registration, Registered State, Registrar of Companies, Principal
Business Activity, Registered Office Address and Sub Category. (Correct)
 Keywords:
 CompanyMasterData,….??
 (Incorrect-listoftermsdescribingandindicatingthecontentofthecatalog,allthe
possiblesearchkeywordsshouldbeincluded
 RegisteredCompanies,CompanymasterData,CompanyData,IndianCompanies,
Company,CompanyDetails,CorporateIdentificationNumber,CIN,CompanyAddress
(Correct)
Metadata Elements for Resources
 Title(Required):Auniquenameoftheresource
 Shouldbeselfexplanatoryviz.ConsumerPriceIndexfor<Month/Year>etc.
 Resourcetitleshouldcontainthetimeframe,sonoduplicationwilloccurinfutureeg.
ConsumerPriceIndexfromApril-2000toApril-2015,Rainfalloftheyear2012
 AccessMethod(Required):Howuserisgoingtogetthatdata
 UploadaDatasetor
 SingleClickLinktoDataset
 Category(Required):IsitaDatasetoranApplication
 ReferenceURLs:Thismayincludedescriptiontothestudydesign,instrumentation,
implementation,limitations,andappropriateuseofthedatasetortool.Inthecaseof
multipledocumentsorURLs,pleasedelimitwithcommasorenterinseparatelines.
Metadata Elements for Resources
 IfResource Categoryis Dataset
 Granularity of Data:It mentions the time interval over which the
data inside thedatasetiscollected/updatedonaregularbasis(one-
time,annual,hourly,etc.)
 Frequency (Required):It mentions the time interval over which the
dataset ispublishedontheOGDPlatformonaregularinterval(one-
time,annual,hourly,etc.).
 Access Type:Itmentionsthetypeofaccessviz.Open,Priced,Registered
AccessorRestrictedAccess(G2G).
 IfResource Categoryis App
 App Type(Required):ItmentionsthetypeofAppbeingcontributedviz.
WebApp,WebService,MobileApp,WebMapService,RSS,APIsetc.
Metadata Elements for Resources
 DateReleased:ItmentionsthereleasedateoftheDataset/App.
 Note:It mentions the anymore information the contributor/ChiefData
Officer wishes to providetothedataconsumerorabouttheresource
 Resourcenoteshouldcontainproperexplanationsofanyspecial
characters/notationslike*,#,NAetcwhichwasusedinthedatasets
 Otherrelevantinformationregardingthisdatasetshouldalsobeprovidedinthenote
section.
 Informationregardingfiguresinthedatashouldalsobeprovided,i.eFiguresarein
numbers,Unit:(Rs./qtl.)
 FootnoteavailableunderareportshouldbepartofResourceNote
 NDSAPPolicy Compliance: Thisfieldistoindicateifthisdatasetisin
conformitywiththeNationalDataSharingandAccessPolicyoftheGovt.of
India.
Example - Creation of Resource
 Resource Title:
 NumberofRegisteredMotorVehicles (Transport&Non-Transport)inDelhi
 (Incorrect - Resource title should contain the time frame, so no duplication will
occur in future
 NumberofRegisteredMotorVehicles(Transport&Non-Transport)inDelhiduring2009-2010
(correct)
• ResourceNote:
 NIL
 (Incorrect - No note but dataset contains some special notations like *, #
etc, There are some cells contain NA, some other relevant information are
also present for this particular dataset)
 Figuresareinnumbers;NA:Notavailable;$:Category-wisedatanotreceived;*:Includedincars;
Totalsareprovisionalrepresentingsummationofavailabledata (Correct)
 ResourceCategory:
 Application
 (Incorrect–Asit is dataset not application)
 Datasets
Quality of Datasets
• Data Compositeness/Completeness/Consistency
– Check for the constituent elements (variables) within the
dataset
– The dataset should be well explained in terms of the variable
present therein the dataset through a descriptive metadata
– The metadata should well describe the time-period, units,
definitions, frequency, data source, jurisdiction and notes to
special mention in the dataset
– The time series data should be continuous in nature
• Data Coverage
– Dataset should be made available at the lowest possible levels
to allow users correctly describe the phenomena being
measured
Quality of Datasets
• Standard process of “data cleansing” :
– Assigning string, date, character and numbers to the required
fields
– Abbreviations and acronyms to be replaced by full forms.
– No special characters and blank spaces (replaced with NA) in
the matrix.
– Column header should be self-explanatory
– Similar font size with no formulas and merged columns.
– Dataset should be de-normalized without any merged column
– No formula of calculated column should appear in dataset like
Total or Average of available column or rows
– Above all it must be in machine readable format viz. CSV, XML,
JSON, ODS, XLS etc.
– File name should not contain special character except _ and -;
no blank space should not be present in file name.
THANK YOU

More Related Content

What's hot

Briefing on US EPA Open Data Strategy using a Linked Data Approach
Briefing on US EPA Open Data Strategy using a Linked Data ApproachBriefing on US EPA Open Data Strategy using a Linked Data Approach
Briefing on US EPA Open Data Strategy using a Linked Data Approach3 Round Stones
 
Open data presentation 2014 v1.3 - Nov 2014
Open data presentation 2014 v1.3 - Nov 2014Open data presentation 2014 v1.3 - Nov 2014
Open data presentation 2014 v1.3 - Nov 2014Pia Waugh
 
Developing open data tools and portals: experiences of impact delivery
Developing open data tools and portals: experiences of impact deliveryDeveloping open data tools and portals: experiences of impact delivery
Developing open data tools and portals: experiences of impact deliverygodanSec
 
Monika solanki-agrisemantics2021
Monika solanki-agrisemantics2021Monika solanki-agrisemantics2021
Monika solanki-agrisemantics2021Monika Solanki
 
Increasing Role of Government in Geospatial Data Provision and Management
Increasing Role of Government in Geospatial Data Provision and ManagementIncreasing Role of Government in Geospatial Data Provision and Management
Increasing Role of Government in Geospatial Data Provision and ManagementTareq Alemadi
 
Centralized Warehouse to synergize sporadic data sources for efficient emerge...
Centralized Warehouse to synergize sporadic data sources for efficient emerge...Centralized Warehouse to synergize sporadic data sources for efficient emerge...
Centralized Warehouse to synergize sporadic data sources for efficient emerge...Janak Parajuli
 
Using Data in sustainable exploitation and Management of Natural Resources: S...
Using Data in sustainable exploitation and Management of Natural Resources: S...Using Data in sustainable exploitation and Management of Natural Resources: S...
Using Data in sustainable exploitation and Management of Natural Resources: S...African Open Science Platform
 
Australian Government Linked Data Group
Australian Government Linked Data GroupAustralian Government Linked Data Group
Australian Government Linked Data GroupArmin Haller
 
Brief on Linked Data at U.S. EPA to Chief Data Scientist
Brief on Linked Data at U.S. EPA to Chief Data ScientistBrief on Linked Data at U.S. EPA to Chief Data Scientist
Brief on Linked Data at U.S. EPA to Chief Data ScientistBernadette Hyland-Wood
 
US EPA Resource Conservation and Recovery Act published as Linked Open Data
US EPA Resource Conservation and Recovery Act published as Linked Open DataUS EPA Resource Conservation and Recovery Act published as Linked Open Data
US EPA Resource Conservation and Recovery Act published as Linked Open Data3 Round Stones
 
Developing the Story Behind the Data
Developing the Story Behind the DataDeveloping the Story Behind the Data
Developing the Story Behind the DataBill Bass
 
Big Data Analytics in Transportation
Big Data Analytics in TransportationBig Data Analytics in Transportation
Big Data Analytics in TransportationRandeep Sudan
 
Ta4.04 mikkela.20170111 fin-data_advocacy_un2017_jakoon
Ta4.04 mikkela.20170111 fin-data_advocacy_un2017_jakoonTa4.04 mikkela.20170111 fin-data_advocacy_un2017_jakoon
Ta4.04 mikkela.20170111 fin-data_advocacy_un2017_jakoonStatistics South Africa
 
Data Driven Meetup Wellington
Data Driven Meetup WellingtonData Driven Meetup Wellington
Data Driven Meetup Wellingtonenotsluap
 

What's hot (20)

Metadata Framework for Agricultural Resources Information System (AgRIS)
 Metadata Framework for Agricultural Resources Information System (AgRIS) Metadata Framework for Agricultural Resources Information System (AgRIS)
Metadata Framework for Agricultural Resources Information System (AgRIS)
 
Briefing on US EPA Open Data Strategy using a Linked Data Approach
Briefing on US EPA Open Data Strategy using a Linked Data ApproachBriefing on US EPA Open Data Strategy using a Linked Data Approach
Briefing on US EPA Open Data Strategy using a Linked Data Approach
 
Open data presentation 2014 v1.3 - Nov 2014
Open data presentation 2014 v1.3 - Nov 2014Open data presentation 2014 v1.3 - Nov 2014
Open data presentation 2014 v1.3 - Nov 2014
 
Developing open data tools and portals: experiences of impact delivery
Developing open data tools and portals: experiences of impact deliveryDeveloping open data tools and portals: experiences of impact delivery
Developing open data tools and portals: experiences of impact delivery
 
Monika solanki-agrisemantics2021
Monika solanki-agrisemantics2021Monika solanki-agrisemantics2021
Monika solanki-agrisemantics2021
 
Increasing Role of Government in Geospatial Data Provision and Management
Increasing Role of Government in Geospatial Data Provision and ManagementIncreasing Role of Government in Geospatial Data Provision and Management
Increasing Role of Government in Geospatial Data Provision and Management
 
Centralized Warehouse to synergize sporadic data sources for efficient emerge...
Centralized Warehouse to synergize sporadic data sources for efficient emerge...Centralized Warehouse to synergize sporadic data sources for efficient emerge...
Centralized Warehouse to synergize sporadic data sources for efficient emerge...
 
SummaryReportMRM
SummaryReportMRMSummaryReportMRM
SummaryReportMRM
 
Using Data in sustainable exploitation and Management of Natural Resources: S...
Using Data in sustainable exploitation and Management of Natural Resources: S...Using Data in sustainable exploitation and Management of Natural Resources: S...
Using Data in sustainable exploitation and Management of Natural Resources: S...
 
Australian Government Linked Data Group
Australian Government Linked Data GroupAustralian Government Linked Data Group
Australian Government Linked Data Group
 
Primer: Data-Driven Startups
Primer: Data-Driven StartupsPrimer: Data-Driven Startups
Primer: Data-Driven Startups
 
Revised presentation
Revised presentationRevised presentation
Revised presentation
 
Brief on Linked Data at U.S. EPA to Chief Data Scientist
Brief on Linked Data at U.S. EPA to Chief Data ScientistBrief on Linked Data at U.S. EPA to Chief Data Scientist
Brief on Linked Data at U.S. EPA to Chief Data Scientist
 
Open government-plan-4.0-final
Open government-plan-4.0-finalOpen government-plan-4.0-final
Open government-plan-4.0-final
 
US EPA Resource Conservation and Recovery Act published as Linked Open Data
US EPA Resource Conservation and Recovery Act published as Linked Open DataUS EPA Resource Conservation and Recovery Act published as Linked Open Data
US EPA Resource Conservation and Recovery Act published as Linked Open Data
 
Policy Cloud Data Driven - Technical overview
Policy Cloud Data Driven - Technical overviewPolicy Cloud Data Driven - Technical overview
Policy Cloud Data Driven - Technical overview
 
Developing the Story Behind the Data
Developing the Story Behind the DataDeveloping the Story Behind the Data
Developing the Story Behind the Data
 
Big Data Analytics in Transportation
Big Data Analytics in TransportationBig Data Analytics in Transportation
Big Data Analytics in Transportation
 
Ta4.04 mikkela.20170111 fin-data_advocacy_un2017_jakoon
Ta4.04 mikkela.20170111 fin-data_advocacy_un2017_jakoonTa4.04 mikkela.20170111 fin-data_advocacy_un2017_jakoon
Ta4.04 mikkela.20170111 fin-data_advocacy_un2017_jakoon
 
Data Driven Meetup Wellington
Data Driven Meetup WellingtonData Driven Meetup Wellington
Data Driven Meetup Wellington
 

Viewers also liked

Policy on Open Application Programming Interfaces (APIs)
Policy on Open Application Programming Interfaces (APIs)Policy on Open Application Programming Interfaces (APIs)
Policy on Open Application Programming Interfaces (APIs)Data Portal India
 
Open data & apps for agriculture
Open data & apps for agricultureOpen data & apps for agriculture
Open data & apps for agricultureData Portal India
 
Insight of Potential - Chhattisgarh
      Insight of Potential - Chhattisgarh      Insight of Potential - Chhattisgarh
Insight of Potential - ChhattisgarhData Portal India
 
Open Government Data for Transparency & Innovation
Open Government Data for Transparency & InnovationOpen Government Data for Transparency & Innovation
Open Government Data for Transparency & InnovationData Portal India
 
Panel Discussion: Open Government Data: High Value Datasets
Panel Discussion: Open Government Data: High Value DatasetsPanel Discussion: Open Government Data: High Value Datasets
Panel Discussion: Open Government Data: High Value DatasetsData Portal India
 
Community Engagements with Open Government Data (OGD) Platform
Community Engagements with  Open Government Data (OGD) PlatformCommunity Engagements with  Open Government Data (OGD) Platform
Community Engagements with Open Government Data (OGD) PlatformData Portal India
 
Opportunities and challenges of foreign trade open data for economic development
Opportunities and challenges of foreign trade open data for economic developmentOpportunities and challenges of foreign trade open data for economic development
Opportunities and challenges of foreign trade open data for economic developmentData Portal India
 
National Data Sharing and Accessibility Policy [ NDSAP 2012 ]
National Data Sharing and Accessibility Policy [ NDSAP 2012 ]National Data Sharing and Accessibility Policy [ NDSAP 2012 ]
National Data Sharing and Accessibility Policy [ NDSAP 2012 ]Data Portal India
 
Revamping of MMPs/eGov Applications : A Digital India Initiative
Revamping of MMPs/eGov Applications: A Digital India InitiativeRevamping of MMPs/eGov Applications: A Digital India Initiative
Revamping of MMPs/eGov Applications : A Digital India InitiativeData Portal India
 
Open data for innovation in governance
Open data for innovation in governanceOpen data for innovation in governance
Open data for innovation in governanceData Portal India
 
Open Data & Open API in US and Worldwide
Open Data & Open API in US and WorldwideOpen Data & Open API in US and Worldwide
Open Data & Open API in US and WorldwideData Portal India
 
India unlocking the potential of Open Data
India unlocking the potential of Open DataIndia unlocking the potential of Open Data
India unlocking the potential of Open DataData Portal India
 
Data Driven Decision Making in Ministry of Health and Family Welfare
Data Driven Decision Making in Ministry of Health and Family WelfareData Driven Decision Making in Ministry of Health and Family Welfare
Data Driven Decision Making in Ministry of Health and Family WelfareData Portal India
 
Use of Road Accidents Data by Government Stakeholders to reduce Road Accident...
Use of Road Accidents Data by Government Stakeholders to reduce Road Accident...Use of Road Accidents Data by Government Stakeholders to reduce Road Accident...
Use of Road Accidents Data by Government Stakeholders to reduce Road Accident...Data Portal India
 

Viewers also liked (20)

Policy on Open Application Programming Interfaces (APIs)
Policy on Open Application Programming Interfaces (APIs)Policy on Open Application Programming Interfaces (APIs)
Policy on Open Application Programming Interfaces (APIs)
 
Open data & apps for agriculture
Open data & apps for agricultureOpen data & apps for agriculture
Open data & apps for agriculture
 
Open Data for Innovation
Open Data for InnovationOpen Data for Innovation
Open Data for Innovation
 
Open data for healthcare
Open data for healthcareOpen data for healthcare
Open data for healthcare
 
Overview of Data Portal India
Overview of Data Portal IndiaOverview of Data Portal India
Overview of Data Portal India
 
Insight of Potential - Chhattisgarh
      Insight of Potential - Chhattisgarh      Insight of Potential - Chhattisgarh
Insight of Potential - Chhattisgarh
 
Open Government Data for Transparency & Innovation
Open Government Data for Transparency & InnovationOpen Government Data for Transparency & Innovation
Open Government Data for Transparency & Innovation
 
Panel Discussion: Open Government Data: High Value Datasets
Panel Discussion: Open Government Data: High Value DatasetsPanel Discussion: Open Government Data: High Value Datasets
Panel Discussion: Open Government Data: High Value Datasets
 
Community Engagements with Open Government Data (OGD) Platform
Community Engagements with  Open Government Data (OGD) PlatformCommunity Engagements with  Open Government Data (OGD) Platform
Community Engagements with Open Government Data (OGD) Platform
 
Opportunities and challenges of foreign trade open data for economic development
Opportunities and challenges of foreign trade open data for economic developmentOpportunities and challenges of foreign trade open data for economic development
Opportunities and challenges of foreign trade open data for economic development
 
National Data Sharing and Accessibility Policy [ NDSAP 2012 ]
National Data Sharing and Accessibility Policy [ NDSAP 2012 ]National Data Sharing and Accessibility Policy [ NDSAP 2012 ]
National Data Sharing and Accessibility Policy [ NDSAP 2012 ]
 
Revamping of MMPs/eGov Applications : A Digital India Initiative
Revamping of MMPs/eGov Applications: A Digital India InitiativeRevamping of MMPs/eGov Applications: A Digital India Initiative
Revamping of MMPs/eGov Applications : A Digital India Initiative
 
Data Based Intelligence
Data Based Intelligence Data Based Intelligence
Data Based Intelligence
 
Open data for innovation in governance
Open data for innovation in governanceOpen data for innovation in governance
Open data for innovation in governance
 
Open Data & Open API in US and Worldwide
Open Data & Open API in US and WorldwideOpen Data & Open API in US and Worldwide
Open Data & Open API in US and Worldwide
 
India unlocking the potential of Open Data
India unlocking the potential of Open DataIndia unlocking the potential of Open Data
India unlocking the potential of Open Data
 
Open Data Initiative India
Open Data Initiative IndiaOpen Data Initiative India
Open Data Initiative India
 
Data Driven Decision Making in Ministry of Health and Family Welfare
Data Driven Decision Making in Ministry of Health and Family WelfareData Driven Decision Making in Ministry of Health and Family Welfare
Data Driven Decision Making in Ministry of Health and Family Welfare
 
Open data for healthcare apps
Open data for healthcare appsOpen data for healthcare apps
Open data for healthcare apps
 
Use of Road Accidents Data by Government Stakeholders to reduce Road Accident...
Use of Road Accidents Data by Government Stakeholders to reduce Road Accident...Use of Road Accidents Data by Government Stakeholders to reduce Road Accident...
Use of Road Accidents Data by Government Stakeholders to reduce Road Accident...
 

Similar to Meta Data and Quality of Data for OGD Platform India

05. Physical Data Specification Template
05. Physical Data Specification Template05. Physical Data Specification Template
05. Physical Data Specification TemplateAlan D. Duncan
 
Dataset description: DCAT and other vocabularies
Dataset description: DCAT and other vocabulariesDataset description: DCAT and other vocabularies
Dataset description: DCAT and other vocabulariesValeria Pesce
 
Data Mining Presentation on Science Day 2023
Data Mining Presentation on Science Day 2023Data Mining Presentation on Science Day 2023
Data Mining Presentation on Science Day 2023SakshiTiwari490123
 
HANA Performance Efficient Speed and Scale-out for Real-time BI
HANA Performance Efficient Speed and Scale-out for Real-time BIHANA Performance Efficient Speed and Scale-out for Real-time BI
HANA Performance Efficient Speed and Scale-out for Real-time BIIBM India Smarter Computing
 
Specifying data requirments
Specifying data requirmentsSpecifying data requirments
Specifying data requirmentsImran60577
 
20IT501_DWDM_PPT_Unit_II.ppt
20IT501_DWDM_PPT_Unit_II.ppt20IT501_DWDM_PPT_Unit_II.ppt
20IT501_DWDM_PPT_Unit_II.pptPalaniKumarR2
 
20IT501_DWDM_PPT_Unit_II.ppt
20IT501_DWDM_PPT_Unit_II.ppt20IT501_DWDM_PPT_Unit_II.ppt
20IT501_DWDM_PPT_Unit_II.pptSamPrem3
 
Data mining query language
Data mining query languageData mining query language
Data mining query languageGowriLatha1
 
Bca examination 2017 dbms
Bca examination 2017 dbmsBca examination 2017 dbms
Bca examination 2017 dbmsAnjaan Gajendra
 
Dats nih-dccpc-kc7-april2018-prs-uoxf
Dats  nih-dccpc-kc7-april2018-prs-uoxfDats  nih-dccpc-kc7-april2018-prs-uoxf
Dats nih-dccpc-kc7-april2018-prs-uoxfPhilippe Rocca-Serra
 
Data Catalog as a Business Enabler
Data Catalog as a Business EnablerData Catalog as a Business Enabler
Data Catalog as a Business EnablerSrinivasan Sankar
 
Urm concept for sharing information inside of communities
Urm concept for sharing information inside of communitiesUrm concept for sharing information inside of communities
Urm concept for sharing information inside of communitiesKarel Charvat
 
UNIT - 1 Part 2: Data Warehousing and Data Mining
UNIT - 1 Part 2: Data Warehousing and Data MiningUNIT - 1 Part 2: Data Warehousing and Data Mining
UNIT - 1 Part 2: Data Warehousing and Data MiningNandakumar P
 
data structures and its importance
 data structures and its importance  data structures and its importance
data structures and its importance Anaya Zafar
 

Similar to Meta Data and Quality of Data for OGD Platform India (20)

05. Physical Data Specification Template
05. Physical Data Specification Template05. Physical Data Specification Template
05. Physical Data Specification Template
 
Dataset description: DCAT and other vocabularies
Dataset description: DCAT and other vocabulariesDataset description: DCAT and other vocabularies
Dataset description: DCAT and other vocabularies
 
Data Mining Presentation on Science Day 2023
Data Mining Presentation on Science Day 2023Data Mining Presentation on Science Day 2023
Data Mining Presentation on Science Day 2023
 
HANA Performance Efficient Speed and Scale-out for Real-time BI
HANA Performance Efficient Speed and Scale-out for Real-time BIHANA Performance Efficient Speed and Scale-out for Real-time BI
HANA Performance Efficient Speed and Scale-out for Real-time BI
 
Specifying data requirments
Specifying data requirmentsSpecifying data requirments
Specifying data requirments
 
20IT501_DWDM_PPT_Unit_II.ppt
20IT501_DWDM_PPT_Unit_II.ppt20IT501_DWDM_PPT_Unit_II.ppt
20IT501_DWDM_PPT_Unit_II.ppt
 
20IT501_DWDM_PPT_Unit_II.ppt
20IT501_DWDM_PPT_Unit_II.ppt20IT501_DWDM_PPT_Unit_II.ppt
20IT501_DWDM_PPT_Unit_II.ppt
 
Database Concepts
Database ConceptsDatabase Concepts
Database Concepts
 
Data mining query language
Data mining query languageData mining query language
Data mining query language
 
Bca examination 2017 dbms
Bca examination 2017 dbmsBca examination 2017 dbms
Bca examination 2017 dbms
 
Dats nih-dccpc-kc7-april2018-prs-uoxf
Dats  nih-dccpc-kc7-april2018-prs-uoxfDats  nih-dccpc-kc7-april2018-prs-uoxf
Dats nih-dccpc-kc7-april2018-prs-uoxf
 
Data Catalog as a Business Enabler
Data Catalog as a Business EnablerData Catalog as a Business Enabler
Data Catalog as a Business Enabler
 
UNIT I - Data Structures.pdf
UNIT I - Data Structures.pdfUNIT I - Data Structures.pdf
UNIT I - Data Structures.pdf
 
Urm concept for sharing information inside of communities
Urm concept for sharing information inside of communitiesUrm concept for sharing information inside of communities
Urm concept for sharing information inside of communities
 
UNIT II.docx
UNIT II.docxUNIT II.docx
UNIT II.docx
 
UNIT - 1 Part 2: Data Warehousing and Data Mining
UNIT - 1 Part 2: Data Warehousing and Data MiningUNIT - 1 Part 2: Data Warehousing and Data Mining
UNIT - 1 Part 2: Data Warehousing and Data Mining
 
Week 1
Week 1Week 1
Week 1
 
Datamining
DataminingDatamining
Datamining
 
data structures and its importance
 data structures and its importance  data structures and its importance
data structures and its importance
 
Unit i
Unit iUnit i
Unit i
 

More from Data Portal India

#OpenGovDataHack Event Structure - 2017
#OpenGovDataHack Event Structure - 2017#OpenGovDataHack Event Structure - 2017
#OpenGovDataHack Event Structure - 2017Data Portal India
 
OGD India Journey, 2012 - 2017
OGD India  Journey, 2012 - 2017OGD India  Journey, 2012 - 2017
OGD India Journey, 2012 - 2017Data Portal India
 
#OpenGovDataHack Round Table - Problem Statements - July 17
#OpenGovDataHack Round Table -  Problem Statements - July 17#OpenGovDataHack Round Table -  Problem Statements - July 17
#OpenGovDataHack Round Table - Problem Statements - July 17Data Portal India
 
Legal Information Management and Briefing System
Legal Information Management and Briefing SystemLegal Information Management and Briefing System
Legal Information Management and Briefing SystemData Portal India
 
A Case Study on FCI Depot online System
A Case Study on FCI Depot online SystemA Case Study on FCI Depot online System
A Case Study on FCI Depot online SystemData Portal India
 
Data Driven Decision Making in India Budget
Data Driven Decision Making in India BudgetData Driven Decision Making in India Budget
Data Driven Decision Making in India BudgetData Portal India
 
Open Data for Business: The U.S. Experience
Open Data for Business: The U.S. ExperienceOpen Data for Business: The U.S. Experience
Open Data for Business: The U.S. ExperienceData Portal India
 
Guidelines for implementation of open API Policy
Guidelines for implementation of open API PolicyGuidelines for implementation of open API Policy
Guidelines for implementation of open API PolicyData Portal India
 

More from Data Portal India (11)

#OpenGovDataHack Event Structure - 2017
#OpenGovDataHack Event Structure - 2017#OpenGovDataHack Event Structure - 2017
#OpenGovDataHack Event Structure - 2017
 
OGD India Journey, 2012 - 2017
OGD India  Journey, 2012 - 2017OGD India  Journey, 2012 - 2017
OGD India Journey, 2012 - 2017
 
#OpenGovDataHack Round Table - Problem Statements - July 17
#OpenGovDataHack Round Table -  Problem Statements - July 17#OpenGovDataHack Round Table -  Problem Statements - July 17
#OpenGovDataHack Round Table - Problem Statements - July 17
 
Legal Information Management and Briefing System
Legal Information Management and Briefing SystemLegal Information Management and Briefing System
Legal Information Management and Briefing System
 
A Case Study on FCI Depot online System
A Case Study on FCI Depot online SystemA Case Study on FCI Depot online System
A Case Study on FCI Depot online System
 
Data Driven Decision Making in India Budget
Data Driven Decision Making in India BudgetData Driven Decision Making in India Budget
Data Driven Decision Making in India Budget
 
Open Data for Business: The U.S. Experience
Open Data for Business: The U.S. ExperienceOpen Data for Business: The U.S. Experience
Open Data for Business: The U.S. Experience
 
Guidelines for implementation of open API Policy
Guidelines for implementation of open API PolicyGuidelines for implementation of open API Policy
Guidelines for implementation of open API Policy
 
Overview of Open Govt Data
Overview of Open Govt DataOverview of Open Govt Data
Overview of Open Govt Data
 
#OpenDataApps Challenge
#OpenDataApps Challenge#OpenDataApps Challenge
#OpenDataApps Challenge
 
Open Data Apps Contest
Open Data Apps ContestOpen Data Apps Contest
Open Data Apps Contest
 

Recently uploaded

VIP Russian Call Girls in Indore Ishita 💚😋 9256729539 🚀 Indore Escorts
VIP Russian Call Girls in Indore Ishita 💚😋  9256729539 🚀 Indore EscortsVIP Russian Call Girls in Indore Ishita 💚😋  9256729539 🚀 Indore Escorts
VIP Russian Call Girls in Indore Ishita 💚😋 9256729539 🚀 Indore Escortsaditipandeya
 
EDUROOT SME_ Performance upto March-2024.pptx
EDUROOT SME_ Performance upto March-2024.pptxEDUROOT SME_ Performance upto March-2024.pptx
EDUROOT SME_ Performance upto March-2024.pptxaaryamanorathofficia
 
Item # 4 - 231 Encino Ave (Significance Only).pdf
Item # 4 - 231 Encino Ave (Significance Only).pdfItem # 4 - 231 Encino Ave (Significance Only).pdf
Item # 4 - 231 Encino Ave (Significance Only).pdfahcitycouncil
 
VIP Kolkata Call Girl Jatin Das Park 👉 8250192130 Available With Room
VIP Kolkata Call Girl Jatin Das Park 👉 8250192130  Available With RoomVIP Kolkata Call Girl Jatin Das Park 👉 8250192130  Available With Room
VIP Kolkata Call Girl Jatin Das Park 👉 8250192130 Available With Roomishabajaj13
 
Incident Command System xxxxxxxxxxxxxxxxxxxxxxxxx
Incident Command System xxxxxxxxxxxxxxxxxxxxxxxxxIncident Command System xxxxxxxxxxxxxxxxxxxxxxxxx
Incident Command System xxxxxxxxxxxxxxxxxxxxxxxxxPeter Miles
 
VIP High Profile Call Girls Gorakhpur Aarushi 8250192130 Independent Escort S...
VIP High Profile Call Girls Gorakhpur Aarushi 8250192130 Independent Escort S...VIP High Profile Call Girls Gorakhpur Aarushi 8250192130 Independent Escort S...
VIP High Profile Call Girls Gorakhpur Aarushi 8250192130 Independent Escort S...Suhani Kapoor
 
Fair Trash Reduction - West Hartford, CT
Fair Trash Reduction - West Hartford, CTFair Trash Reduction - West Hartford, CT
Fair Trash Reduction - West Hartford, CTaccounts329278
 
Lucknow 💋 Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8...
Lucknow 💋 Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8...Lucknow 💋 Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8...
Lucknow 💋 Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8...anilsa9823
 
Top Rated Pune Call Girls Bhosari ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...
Top Rated  Pune Call Girls Bhosari ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...Top Rated  Pune Call Girls Bhosari ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...
Top Rated Pune Call Girls Bhosari ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...Call Girls in Nagpur High Profile
 
Junnar ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For S...
Junnar ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For S...Junnar ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For S...
Junnar ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For S...tanu pandey
 
Night 7k to 12k Call Girls Service In Navi Mumbai 👉 BOOK NOW 9833363713 👈 ♀️...
Night 7k to 12k  Call Girls Service In Navi Mumbai 👉 BOOK NOW 9833363713 👈 ♀️...Night 7k to 12k  Call Girls Service In Navi Mumbai 👉 BOOK NOW 9833363713 👈 ♀️...
Night 7k to 12k Call Girls Service In Navi Mumbai 👉 BOOK NOW 9833363713 👈 ♀️...aartirawatdelhi
 
Call Girls Nanded City Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Nanded City Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Nanded City Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Nanded City Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
 
The Economic and Organised Crime Office (EOCO) has been advised by the Office...
The Economic and Organised Crime Office (EOCO) has been advised by the Office...The Economic and Organised Crime Office (EOCO) has been advised by the Office...
The Economic and Organised Crime Office (EOCO) has been advised by the Office...nservice241
 
Building the Commons: Community Archiving & Decentralized Storage
Building the Commons: Community Archiving & Decentralized StorageBuilding the Commons: Community Archiving & Decentralized Storage
Building the Commons: Community Archiving & Decentralized StorageTechSoup
 
2024: The FAR, Federal Acquisition Regulations - Part 28
2024: The FAR, Federal Acquisition Regulations - Part 282024: The FAR, Federal Acquisition Regulations - Part 28
2024: The FAR, Federal Acquisition Regulations - Part 28JSchaus & Associates
 
Zechariah Boodey Farmstead Collaborative presentation - Humble Beginnings
Zechariah Boodey Farmstead Collaborative presentation -  Humble BeginningsZechariah Boodey Farmstead Collaborative presentation -  Humble Beginnings
Zechariah Boodey Farmstead Collaborative presentation - Humble Beginningsinfo695895
 
(NEHA) Bhosari Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(NEHA) Bhosari Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(NEHA) Bhosari Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(NEHA) Bhosari Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat
 
Expressive clarity oral presentation.pptx
Expressive clarity oral presentation.pptxExpressive clarity oral presentation.pptx
Expressive clarity oral presentation.pptxtsionhagos36
 
Top Rated Pune Call Girls Hadapsar ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...
Top Rated  Pune Call Girls Hadapsar ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...Top Rated  Pune Call Girls Hadapsar ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...
Top Rated Pune Call Girls Hadapsar ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...Call Girls in Nagpur High Profile
 

Recently uploaded (20)

Call Girls Service Connaught Place @9999965857 Delhi 🫦 No Advance VVIP 🍎 SER...
Call Girls Service Connaught Place @9999965857 Delhi 🫦 No Advance  VVIP 🍎 SER...Call Girls Service Connaught Place @9999965857 Delhi 🫦 No Advance  VVIP 🍎 SER...
Call Girls Service Connaught Place @9999965857 Delhi 🫦 No Advance VVIP 🍎 SER...
 
VIP Russian Call Girls in Indore Ishita 💚😋 9256729539 🚀 Indore Escorts
VIP Russian Call Girls in Indore Ishita 💚😋  9256729539 🚀 Indore EscortsVIP Russian Call Girls in Indore Ishita 💚😋  9256729539 🚀 Indore Escorts
VIP Russian Call Girls in Indore Ishita 💚😋 9256729539 🚀 Indore Escorts
 
EDUROOT SME_ Performance upto March-2024.pptx
EDUROOT SME_ Performance upto March-2024.pptxEDUROOT SME_ Performance upto March-2024.pptx
EDUROOT SME_ Performance upto March-2024.pptx
 
Item # 4 - 231 Encino Ave (Significance Only).pdf
Item # 4 - 231 Encino Ave (Significance Only).pdfItem # 4 - 231 Encino Ave (Significance Only).pdf
Item # 4 - 231 Encino Ave (Significance Only).pdf
 
VIP Kolkata Call Girl Jatin Das Park 👉 8250192130 Available With Room
VIP Kolkata Call Girl Jatin Das Park 👉 8250192130  Available With RoomVIP Kolkata Call Girl Jatin Das Park 👉 8250192130  Available With Room
VIP Kolkata Call Girl Jatin Das Park 👉 8250192130 Available With Room
 
Incident Command System xxxxxxxxxxxxxxxxxxxxxxxxx
Incident Command System xxxxxxxxxxxxxxxxxxxxxxxxxIncident Command System xxxxxxxxxxxxxxxxxxxxxxxxx
Incident Command System xxxxxxxxxxxxxxxxxxxxxxxxx
 
VIP High Profile Call Girls Gorakhpur Aarushi 8250192130 Independent Escort S...
VIP High Profile Call Girls Gorakhpur Aarushi 8250192130 Independent Escort S...VIP High Profile Call Girls Gorakhpur Aarushi 8250192130 Independent Escort S...
VIP High Profile Call Girls Gorakhpur Aarushi 8250192130 Independent Escort S...
 
Fair Trash Reduction - West Hartford, CT
Fair Trash Reduction - West Hartford, CTFair Trash Reduction - West Hartford, CT
Fair Trash Reduction - West Hartford, CT
 
Lucknow 💋 Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8...
Lucknow 💋 Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8...Lucknow 💋 Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8...
Lucknow 💋 Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8...
 
Top Rated Pune Call Girls Bhosari ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...
Top Rated  Pune Call Girls Bhosari ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...Top Rated  Pune Call Girls Bhosari ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...
Top Rated Pune Call Girls Bhosari ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...
 
Junnar ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For S...
Junnar ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For S...Junnar ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For S...
Junnar ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For S...
 
Night 7k to 12k Call Girls Service In Navi Mumbai 👉 BOOK NOW 9833363713 👈 ♀️...
Night 7k to 12k  Call Girls Service In Navi Mumbai 👉 BOOK NOW 9833363713 👈 ♀️...Night 7k to 12k  Call Girls Service In Navi Mumbai 👉 BOOK NOW 9833363713 👈 ♀️...
Night 7k to 12k Call Girls Service In Navi Mumbai 👉 BOOK NOW 9833363713 👈 ♀️...
 
Call Girls Nanded City Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Nanded City Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Nanded City Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Nanded City Call Me 7737669865 Budget Friendly No Advance Booking
 
The Economic and Organised Crime Office (EOCO) has been advised by the Office...
The Economic and Organised Crime Office (EOCO) has been advised by the Office...The Economic and Organised Crime Office (EOCO) has been advised by the Office...
The Economic and Organised Crime Office (EOCO) has been advised by the Office...
 
Building the Commons: Community Archiving & Decentralized Storage
Building the Commons: Community Archiving & Decentralized StorageBuilding the Commons: Community Archiving & Decentralized Storage
Building the Commons: Community Archiving & Decentralized Storage
 
2024: The FAR, Federal Acquisition Regulations - Part 28
2024: The FAR, Federal Acquisition Regulations - Part 282024: The FAR, Federal Acquisition Regulations - Part 28
2024: The FAR, Federal Acquisition Regulations - Part 28
 
Zechariah Boodey Farmstead Collaborative presentation - Humble Beginnings
Zechariah Boodey Farmstead Collaborative presentation -  Humble BeginningsZechariah Boodey Farmstead Collaborative presentation -  Humble Beginnings
Zechariah Boodey Farmstead Collaborative presentation - Humble Beginnings
 
(NEHA) Bhosari Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(NEHA) Bhosari Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(NEHA) Bhosari Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(NEHA) Bhosari Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
 
Expressive clarity oral presentation.pptx
Expressive clarity oral presentation.pptxExpressive clarity oral presentation.pptx
Expressive clarity oral presentation.pptx
 
Top Rated Pune Call Girls Hadapsar ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...
Top Rated  Pune Call Girls Hadapsar ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...Top Rated  Pune Call Girls Hadapsar ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...
Top Rated Pune Call Girls Hadapsar ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...
 

Meta Data and Quality of Data for OGD Platform India

  • 1. Open Government Data Platform India (https://data.gov.in) Meta Data and Quality of Data By: Sunil Babbar, Scientist-C, NIC
  • 2. Data Contributors and Their Role • Nominated by Chief Data Officer • Coordinate and Identify datasets which can be contributed • Preparing the datasets – Getting them cleaned – Metadata preparation for datasets in the predefined format – Ensuring quality and correctness datasets of his/her unit/division. • Contributing Catalogs/Resources(Datasets) through pre- defined workflow (Data Contributor  Chief Data Officer(CDO) for review and publish  PMU to publish on OGD Platform)
  • 3. Resources (Datasets / Apps)  A data set (or dataset) is a collection of data  A data set corresponds to the contents of a single table or statistical data matrix, where  every column represents a particular variable, and  each row corresponds to a given member of the data set in question  OpenDataFormats: CSV XLS ODF XML/RDF JSON RSS/Atom KML/GML
  • 4. Catalog  Catalogisgroupingof thesimilarresources(Datasets/Apps)  A catalog represents a collection of resources that you group together  Acts like directory of information about resources  BenefitofCatalog  To facilitate data access by users who are first interested in a particular kind of data  Cataloghelpsingroupingtheresourceswithsametheme/subjectand thusfacilitatetheuserinsearching aspecificdataset/resourceeasily  Ministry/Departmentshavelessefforttouploadsamesetofresources orupdatingthedatasetfornewperiodwithoutwritingthemetadata againandagain  Tofacilitatetheusersforeasiernavigationandsearchingforrelevant data.
  • 5. Catalog Formation  Catalogwithsameresourcewithdifferenttimeperiod (Annual,Quarterly,Monthly,WeeklyandDaily)  Eg.AnnualRainfallData  Catalogwithsameresourcebutwithdifferentjurisdiction (India,States,Districts,Block,Village)  States/UTs-wise Forest and Tree Cover  Catalogwithsameresourcebutdifferentcategory (ScheduleCaste,ScheduleTribe,General,Religionetc.)  District-wise crimes committed against Schedule Caste  CatalogwithSimilartypeofresourceundersamereport (Resourcesofsimilarnature)fromthesamereport/survey canbegroupedunderthesamecatalog  Primary Census Abstract 2011 - India and States
  • 6. MetaData • Is the information that describes the data – What is that data (About Data) – Data source – Who Created – When created – Etc. • Metadata allows the data to be traced to a know its origin and quality
  • 7. Metadata Elements for Catalogs  Title(Required):Auniquenameforthecatalog(groupofresources)  Shouldcontainthegeneraltermswhichdescribestheessential properties/characteristicsofthedatasets/resources  Should be in plain English and include sufficient detail to facilitate search and discovery  Time-periodshouldnotbementionedinthecatalogtitlenormallysothatforthesimilar resources,containingsametypeofdataforthenexttime-period/periodicupdating,can beaccommodatedinsamecatalog  Howeverinexceptionalcases,itcancontaintimeperiodparticularlyforperiodic surveys/censuswhichcontainsahugenumberofdatasets/resourcesbelongingtothe sameperiod/year  Eg.CurrentPopulationSurvey,ConsumerPriceIndex,VarietywiseDailyMarketPrices Data,StatewiseConstructionofDeepTubewellsovertheyears,etc.  Description(Required):Provideadetaileddescriptionofthecatalog  Anabstractdeterminingthenatureandpurposeofthecatalog  Containsthenameofvariableswhichareavailableinthedatasets  Canalsocontainsthedefinitionofsomevariable
  • 8. Metadata Elements for Catalogs  Keywords(Required):Itisalistof terms,separatedbycommas, describingandindicatingatthecontentofthecatalog.Example: rainfall,weather,monthlystatistics.  Help users discover your dataset; please include terms that would be used by technical and non-technical users.  GroupName:ThisisanoptionalfieldtoprovideaGroupNameto multiplecatalogsinordertoshowthattheymaybepresentedas agroupora set.  Sector& SubSector(Required):Choosethe sectors(s)/subsector(s)thosemostcloselyapply(ies)toyour catalog.  AssetJurisdiction(Required):Thisisa requiredfieldtoidentify theexactlocationorareatowhichthecatalogand resources(dataset/apps)caterstoviz.entirecountry, state/province,district,city,etc.
  • 9. Example - Creation of catalog  Catalog Title:  CompanyMasterData2015  (Incorrect-Contains time frame, so in future if we want to add data under this catalog e.g Company master data for 2016, it would be not be possible to upload data under this catalog)  CompanyMasterData (Correct)  CatalogDescription:  GetdataofCompanymasterdata..??  (Incorrect-Does not contain detail information. Description should contain the name of variables which are available in the datasets)  Get data on master details of any company registered with Registrar of Companies (RoC). Data contains various information like Corporate Identification Number(CIN), Company Name, Company Status, Company Class, Company Category, Authorized Capital in INR, Paid-up Capital in INR, Date of Registration, Registered State, Registrar of Companies, Principal Business Activity, Registered Office Address and Sub Category. (Correct)  Keywords:  CompanyMasterData,….??  (Incorrect-listoftermsdescribingandindicatingthecontentofthecatalog,allthe possiblesearchkeywordsshouldbeincluded  RegisteredCompanies,CompanymasterData,CompanyData,IndianCompanies, Company,CompanyDetails,CorporateIdentificationNumber,CIN,CompanyAddress (Correct)
  • 10. Metadata Elements for Resources  Title(Required):Auniquenameoftheresource  Shouldbeselfexplanatoryviz.ConsumerPriceIndexfor<Month/Year>etc.  Resourcetitleshouldcontainthetimeframe,sonoduplicationwilloccurinfutureeg. ConsumerPriceIndexfromApril-2000toApril-2015,Rainfalloftheyear2012  AccessMethod(Required):Howuserisgoingtogetthatdata  UploadaDatasetor  SingleClickLinktoDataset  Category(Required):IsitaDatasetoranApplication  ReferenceURLs:Thismayincludedescriptiontothestudydesign,instrumentation, implementation,limitations,andappropriateuseofthedatasetortool.Inthecaseof multipledocumentsorURLs,pleasedelimitwithcommasorenterinseparatelines.
  • 11. Metadata Elements for Resources  IfResource Categoryis Dataset  Granularity of Data:It mentions the time interval over which the data inside thedatasetiscollected/updatedonaregularbasis(one- time,annual,hourly,etc.)  Frequency (Required):It mentions the time interval over which the dataset ispublishedontheOGDPlatformonaregularinterval(one- time,annual,hourly,etc.).  Access Type:Itmentionsthetypeofaccessviz.Open,Priced,Registered AccessorRestrictedAccess(G2G).  IfResource Categoryis App  App Type(Required):ItmentionsthetypeofAppbeingcontributedviz. WebApp,WebService,MobileApp,WebMapService,RSS,APIsetc.
  • 12. Metadata Elements for Resources  DateReleased:ItmentionsthereleasedateoftheDataset/App.  Note:It mentions the anymore information the contributor/ChiefData Officer wishes to providetothedataconsumerorabouttheresource  Resourcenoteshouldcontainproperexplanationsofanyspecial characters/notationslike*,#,NAetcwhichwasusedinthedatasets  Otherrelevantinformationregardingthisdatasetshouldalsobeprovidedinthenote section.  Informationregardingfiguresinthedatashouldalsobeprovided,i.eFiguresarein numbers,Unit:(Rs./qtl.)  FootnoteavailableunderareportshouldbepartofResourceNote  NDSAPPolicy Compliance: Thisfieldistoindicateifthisdatasetisin conformitywiththeNationalDataSharingandAccessPolicyoftheGovt.of India.
  • 13. Example - Creation of Resource  Resource Title:  NumberofRegisteredMotorVehicles (Transport&Non-Transport)inDelhi  (Incorrect - Resource title should contain the time frame, so no duplication will occur in future  NumberofRegisteredMotorVehicles(Transport&Non-Transport)inDelhiduring2009-2010 (correct) • ResourceNote:  NIL  (Incorrect - No note but dataset contains some special notations like *, # etc, There are some cells contain NA, some other relevant information are also present for this particular dataset)  Figuresareinnumbers;NA:Notavailable;$:Category-wisedatanotreceived;*:Includedincars; Totalsareprovisionalrepresentingsummationofavailabledata (Correct)  ResourceCategory:  Application  (Incorrect–Asit is dataset not application)  Datasets
  • 14. Quality of Datasets • Data Compositeness/Completeness/Consistency – Check for the constituent elements (variables) within the dataset – The dataset should be well explained in terms of the variable present therein the dataset through a descriptive metadata – The metadata should well describe the time-period, units, definitions, frequency, data source, jurisdiction and notes to special mention in the dataset – The time series data should be continuous in nature • Data Coverage – Dataset should be made available at the lowest possible levels to allow users correctly describe the phenomena being measured
  • 15. Quality of Datasets • Standard process of “data cleansing” : – Assigning string, date, character and numbers to the required fields – Abbreviations and acronyms to be replaced by full forms. – No special characters and blank spaces (replaced with NA) in the matrix. – Column header should be self-explanatory – Similar font size with no formulas and merged columns. – Dataset should be de-normalized without any merged column – No formula of calculated column should appear in dataset like Total or Average of available column or rows – Above all it must be in machine readable format viz. CSV, XML, JSON, ODS, XLS etc. – File name should not contain special character except _ and -; no blank space should not be present in file name.