SlideShare ist ein Scribd-Unternehmen logo
1 von 13
XML FOR DUMMIES
                       http://it-slideshares.blogspot.com


Book author: Lucinda Dykes and Ed Tittle
Part 1 : XML Basics
Lecture 3: Slicing and Dicing Data Categories:
                      The Art of Taxonomy
http://it-slideshares.blogspot.com


             Contents

1     Taking Stock of Your Data



2     Breaking Down Data in Different Ways



3     Developing Your Taxonomy



4     Testing Your Taxonomy



5     Looning Ahead to Validation
http://it-slideshares.blogspot.com

              1. Taking Stock of Your Data
 Looking at business practices and partners.
   ♦ Taking a close look at the flow information in your business will help you
  identify the components of your content.
   ♦ Each different process is a specialized use of information.
   ♦ Take some time to talk to those people who create or          frequently process
  the data.Find out:
      ○ What users do with individual pieces of information.
      ○ What data users think is impossible to live without.
      ○ What data is unnecessary or optional.
 Gathering some content.
    ♦ The more complete your collection of sameples is, the better chance you
  have of creating markup that file fits all your content. Here are some      ideas:
      ○ Get data from multiple source
      ○ Get a lot of data.
      ○ Get a lot of data from multiple sources.
http://it-slideshares.blogspot.com

                      Taking Stock of Your Data(cont..)


 Checking whether a DTD or schema already exists.
  ♦ When you create a document according to a DTD or schema, you use a predefined
  structure that specifies how the components of markup should be used to describe a
  particular kink of content.
  ♦ Predifined DTDs and schemas usually come from a couple of different sources:
          ○ Industry groups or organizations: that want a common format for
             standard data such as OFX, CML.
          ○ Application builders: who created their systems to run with content
             described by a particular set of markup. For example : CFML, ASP.
 Searching for a schemaa repository.
   ♦ You could search for a schema or DTD, or add one of your own to the repository.
  Online at sites such as www.Biztalk.org or www.schema.net
   ♦ There is one still schema repository hosted by OASIS at ww.xml.org/xml/registry.jsp.
   OASIS provides a every comprehensive list of proposed XML applicatuons and industry initiatives at
  www.oasis-open.org.xml.html#applications.
   ♦ The whole point of using XML is to make your content as accessible to a system as
  possible.
   ♦ Content analysis with XML in mind is much easier when you have a handle on the ins and
  outs of XML Schemas and DTDs and how to put them together.
http://it-slideshares.blogspot.com




        2. Breaking Down Data in Different Ways.
When developed our hypothetical book-selling business, we went through the same
  data-analysis process we’re sharing with you. After we gathered our documents and
  familiarized ourselves with them, we took a good hard look at what we learned
  about our content. Here’s what we came up with:
   - Books can be categorized in a number if different ways, includeing: Author,
         title, publication date, publisher, edition, language, number of pages, size,
  type(fiction, nonfiction),special features, format, price, ISBN.
   - The customer information we collect includes: first name, last name,        address,
  city, state, zip code, email address, phone number.
   - The sales information we gather in addition to customer information includes:
  data , item number, price, total cost
 Winnowing out the wheat from the chaff.
   ♦ When we analyzed our content, we made some judgments about what
  information we needed to collect.
   ♦ We chose to exclude not useful information from our taxonomy strategy.
   ♦ When you content-analysis process for knowing the purpose of your markup
  can help you keep your goals in sight.
http://it-
                        slideshares.blogspot.com


    Breaking Down Data in Different Ways(cont..).
   Types of data that can be stored in XML.
        ♦ XML content can be divided into two main groups: data-intensive
          and document or text-intensive.
        ♦ On the data end of the spectrum, you find collections of data like
          those that reside in a database. Each collection consists of a more
    or less abitrary number of record structures, in which       record contains:
               ○ A unique identifier or key.
               ○ A common collection of named, organized values.
        ♦ XML can capture and represent data that describes other
    collection of data.
        ♦ XML can handle many kinds of data and can accommodate binary
    information, it can supply data to other computer            applications
    outside XML’s control.
        ♦ XML document can reference anything that a computer can
    represent.
http://it-
                       slideshares.blogspot.com



               3. Developing Your Taxonomy
 After you look at your content, you can start breaking it down into
  categories and subcategories.
 Here’s how we broke it down for our hypothetical book bisiness:
    ● Book                                ●Sales
         ○ Item Number                         ○ Item Number
         ○ Title                               ○ Price
                                               ○ Shipping
         ○ Author                              ○ Total Cost
         ○ Publisher                           ○ Date
         ○ Price                               ○ Source
         ○ Content Type
         ○ Format
         ○ ISBN.
http://it-
                 slideshares.blogspot.com




●Customer                  ♦ As you can see, Item Number appears as
                           a subcategory in both the Book and the
   ○Customer Number
                           Sales categories.
   ○First Name
                           ♦ The Item Number is unique to each copy
   ○Last Name              of a book, which makes it easy to keep track
                           of sales and inventory
   ○Address
   ○City
   ○State
   ○Zip Code
   ○E-mail Address
   ○Phone Number.
http://it-
                     slideshares.blogspot.com

                4. Testing Your Taxonomy
Using      trial and error for the best fit.
        ♦ You work with your markup, experiment with using
 combinations of elements and attributes until you get the best results.
 For example:We used two nested elements to specify the content type
 for a book.
             <book>
                 <contentType>Fiction</contentType>
             </book>
     ♦ The markup would use as many ‘content Type’ elements
     within the ‘book’ but we decided to go with ‘content Type’ as
     an attribute of the ‘book’ element instead, as show here.
           <book contentType=”Fiction”/>
     ♦ We decided on this route because we thought that we’d
     want to predefine the category names and require valid
     documents choose one of the names from the list in our
     DTD or schema.
http://it-
                       slideshares.blogspot.com

               Testing Your Taxonomy(cont..)
   Testing Your content analysis.
     ♦ The best way to your final markup is to apply it to as many
        content samples as you can lay     your hands on.
    -Shows the final draft of our bookstore markup.
           <?xml version=”1.0” standalone=”yes”?>
              <books>
                <book contentType=”Fiction” format=”Hardback”>
                  <bookInfo>
                      <title>The Da Vinci Code</title>
                      <author>Brown, Dan</author>
                      <publisher>Doubleday</publisher>
                      <isbn>0385504209</isbn>
                  </bookInfo>
                  <salesInfo>
                      <price priceType=”Retail”>$24.95</price>
                      <itemNumber>0385504209-1</itemNumber>
http://it-
                         slideshares.blogspot.com
                  <date>January 12, 2005</date>
                  <source sourceType=”Retail” />
                  <shipping>$5.00</shipping>
                  <cost>$29.95</cost>
             </salesInfo>
        </book>
        <totalCost>$29.95</totalCost>
        <customer custType=”newRetail”>
                 <custNumber>5594</custNumber>
                 <lastName>Blow</lastName>
                 <firstName>Joe</firstName>
                 <address>52 Joetta Lane</address>
                 <city>Cottage Grove</city>
                 <state>OR</state>
                 <zip>97424</zip>
                 <phone>767-3333</phone>
                 <email>jblow@pacinfo.com</email>
        </customer>

♦ The first line in our code <?xml version=”1.0” standalone=”yes”?> is an XML
declaration. You’ll learn all about XML declaratuons and all the other details of
XML syntax in Chapter 5.
http://it-
              slideshares.blogspot.com
          5. Looking Ahead to Validation
   You can get to make up as many rules as you want or need to
    make the markup do what you want it to.
   The rules that you create with XML can dictate which elements
    make up an XML document.
   Creating XML document descriptions enables you to state the
    rules that a whole class of documents must follow.
   The two main forms of XML document descriptions in use today
    are DTDs and XML schemas.
   DTDs work well for validating XML with text-intensive content,
    while XML schemas work well for validating XML with data-
    intensive content.
http://it-
    slideshares.blogspot.com




Thank you
  The end chapter 3

Weitere ähnliche Inhalte

Mehr von phanleson

Learning spark ch01 - Introduction to Data Analysis with Spark
Learning spark ch01 - Introduction to Data Analysis with SparkLearning spark ch01 - Introduction to Data Analysis with Spark
Learning spark ch01 - Introduction to Data Analysis with Sparkphanleson
 
Firewall - Network Defense in Depth Firewalls
Firewall - Network Defense in Depth FirewallsFirewall - Network Defense in Depth Firewalls
Firewall - Network Defense in Depth Firewallsphanleson
 
Mobile Security - Wireless hacking
Mobile Security - Wireless hackingMobile Security - Wireless hacking
Mobile Security - Wireless hackingphanleson
 
Authentication in wireless - Security in Wireless Protocols
Authentication in wireless - Security in Wireless ProtocolsAuthentication in wireless - Security in Wireless Protocols
Authentication in wireless - Security in Wireless Protocolsphanleson
 
E-Commerce Security - Application attacks - Server Attacks
E-Commerce Security - Application attacks - Server AttacksE-Commerce Security - Application attacks - Server Attacks
E-Commerce Security - Application attacks - Server Attacksphanleson
 
Hacking web applications
Hacking web applicationsHacking web applications
Hacking web applicationsphanleson
 
HBase In Action - Chapter 04: HBase table design
HBase In Action - Chapter 04: HBase table designHBase In Action - Chapter 04: HBase table design
HBase In Action - Chapter 04: HBase table designphanleson
 
HBase In Action - Chapter 10 - Operations
HBase In Action - Chapter 10 - OperationsHBase In Action - Chapter 10 - Operations
HBase In Action - Chapter 10 - Operationsphanleson
 
Hbase in action - Chapter 09: Deploying HBase
Hbase in action - Chapter 09: Deploying HBaseHbase in action - Chapter 09: Deploying HBase
Hbase in action - Chapter 09: Deploying HBasephanleson
 
Learning spark ch11 - Machine Learning with MLlib
Learning spark ch11 - Machine Learning with MLlibLearning spark ch11 - Machine Learning with MLlib
Learning spark ch11 - Machine Learning with MLlibphanleson
 
Learning spark ch10 - Spark Streaming
Learning spark ch10 - Spark StreamingLearning spark ch10 - Spark Streaming
Learning spark ch10 - Spark Streamingphanleson
 
Learning spark ch09 - Spark SQL
Learning spark ch09 - Spark SQLLearning spark ch09 - Spark SQL
Learning spark ch09 - Spark SQLphanleson
 
Learning spark ch07 - Running on a Cluster
Learning spark ch07 - Running on a ClusterLearning spark ch07 - Running on a Cluster
Learning spark ch07 - Running on a Clusterphanleson
 
Learning spark ch06 - Advanced Spark Programming
Learning spark ch06 - Advanced Spark ProgrammingLearning spark ch06 - Advanced Spark Programming
Learning spark ch06 - Advanced Spark Programmingphanleson
 
Learning spark ch05 - Loading and Saving Your Data
Learning spark ch05 - Loading and Saving Your DataLearning spark ch05 - Loading and Saving Your Data
Learning spark ch05 - Loading and Saving Your Dataphanleson
 
Learning spark ch04 - Working with Key/Value Pairs
Learning spark ch04 - Working with Key/Value PairsLearning spark ch04 - Working with Key/Value Pairs
Learning spark ch04 - Working with Key/Value Pairsphanleson
 
Learning spark ch01 - Introduction to Data Analysis with Spark
Learning spark ch01 - Introduction to Data Analysis with SparkLearning spark ch01 - Introduction to Data Analysis with Spark
Learning spark ch01 - Introduction to Data Analysis with Sparkphanleson
 
Hướng Dẫn Đăng Ký LibertaGia - A guide and introduciton about Libertagia
Hướng Dẫn Đăng Ký LibertaGia - A guide and introduciton about LibertagiaHướng Dẫn Đăng Ký LibertaGia - A guide and introduciton about Libertagia
Hướng Dẫn Đăng Ký LibertaGia - A guide and introduciton about Libertagiaphanleson
 
Lecture 1 - Getting to know XML
Lecture 1 - Getting to know XMLLecture 1 - Getting to know XML
Lecture 1 - Getting to know XMLphanleson
 
Lecture 4 - Adding XTHML for the Web
Lecture  4 - Adding XTHML for the WebLecture  4 - Adding XTHML for the Web
Lecture 4 - Adding XTHML for the Webphanleson
 

Mehr von phanleson (20)

Learning spark ch01 - Introduction to Data Analysis with Spark
Learning spark ch01 - Introduction to Data Analysis with SparkLearning spark ch01 - Introduction to Data Analysis with Spark
Learning spark ch01 - Introduction to Data Analysis with Spark
 
Firewall - Network Defense in Depth Firewalls
Firewall - Network Defense in Depth FirewallsFirewall - Network Defense in Depth Firewalls
Firewall - Network Defense in Depth Firewalls
 
Mobile Security - Wireless hacking
Mobile Security - Wireless hackingMobile Security - Wireless hacking
Mobile Security - Wireless hacking
 
Authentication in wireless - Security in Wireless Protocols
Authentication in wireless - Security in Wireless ProtocolsAuthentication in wireless - Security in Wireless Protocols
Authentication in wireless - Security in Wireless Protocols
 
E-Commerce Security - Application attacks - Server Attacks
E-Commerce Security - Application attacks - Server AttacksE-Commerce Security - Application attacks - Server Attacks
E-Commerce Security - Application attacks - Server Attacks
 
Hacking web applications
Hacking web applicationsHacking web applications
Hacking web applications
 
HBase In Action - Chapter 04: HBase table design
HBase In Action - Chapter 04: HBase table designHBase In Action - Chapter 04: HBase table design
HBase In Action - Chapter 04: HBase table design
 
HBase In Action - Chapter 10 - Operations
HBase In Action - Chapter 10 - OperationsHBase In Action - Chapter 10 - Operations
HBase In Action - Chapter 10 - Operations
 
Hbase in action - Chapter 09: Deploying HBase
Hbase in action - Chapter 09: Deploying HBaseHbase in action - Chapter 09: Deploying HBase
Hbase in action - Chapter 09: Deploying HBase
 
Learning spark ch11 - Machine Learning with MLlib
Learning spark ch11 - Machine Learning with MLlibLearning spark ch11 - Machine Learning with MLlib
Learning spark ch11 - Machine Learning with MLlib
 
Learning spark ch10 - Spark Streaming
Learning spark ch10 - Spark StreamingLearning spark ch10 - Spark Streaming
Learning spark ch10 - Spark Streaming
 
Learning spark ch09 - Spark SQL
Learning spark ch09 - Spark SQLLearning spark ch09 - Spark SQL
Learning spark ch09 - Spark SQL
 
Learning spark ch07 - Running on a Cluster
Learning spark ch07 - Running on a ClusterLearning spark ch07 - Running on a Cluster
Learning spark ch07 - Running on a Cluster
 
Learning spark ch06 - Advanced Spark Programming
Learning spark ch06 - Advanced Spark ProgrammingLearning spark ch06 - Advanced Spark Programming
Learning spark ch06 - Advanced Spark Programming
 
Learning spark ch05 - Loading and Saving Your Data
Learning spark ch05 - Loading and Saving Your DataLearning spark ch05 - Loading and Saving Your Data
Learning spark ch05 - Loading and Saving Your Data
 
Learning spark ch04 - Working with Key/Value Pairs
Learning spark ch04 - Working with Key/Value PairsLearning spark ch04 - Working with Key/Value Pairs
Learning spark ch04 - Working with Key/Value Pairs
 
Learning spark ch01 - Introduction to Data Analysis with Spark
Learning spark ch01 - Introduction to Data Analysis with SparkLearning spark ch01 - Introduction to Data Analysis with Spark
Learning spark ch01 - Introduction to Data Analysis with Spark
 
Hướng Dẫn Đăng Ký LibertaGia - A guide and introduciton about Libertagia
Hướng Dẫn Đăng Ký LibertaGia - A guide and introduciton about LibertagiaHướng Dẫn Đăng Ký LibertaGia - A guide and introduciton about Libertagia
Hướng Dẫn Đăng Ký LibertaGia - A guide and introduciton about Libertagia
 
Lecture 1 - Getting to know XML
Lecture 1 - Getting to know XMLLecture 1 - Getting to know XML
Lecture 1 - Getting to know XML
 
Lecture 4 - Adding XTHML for the Web
Lecture  4 - Adding XTHML for the WebLecture  4 - Adding XTHML for the Web
Lecture 4 - Adding XTHML for the Web
 

Kürzlich hochgeladen

SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfUmakantAnnand
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991RKavithamani
 

Kürzlich hochgeladen (20)

SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and Actinides
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.Compdf
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
 

Lecture 3 - Slicing and Dicing Data Categories -The Art of Taxonomy

  • 1. XML FOR DUMMIES http://it-slideshares.blogspot.com Book author: Lucinda Dykes and Ed Tittle Part 1 : XML Basics Lecture 3: Slicing and Dicing Data Categories: The Art of Taxonomy
  • 2. http://it-slideshares.blogspot.com Contents 1 Taking Stock of Your Data 2 Breaking Down Data in Different Ways 3 Developing Your Taxonomy 4 Testing Your Taxonomy 5 Looning Ahead to Validation
  • 3. http://it-slideshares.blogspot.com 1. Taking Stock of Your Data  Looking at business practices and partners. ♦ Taking a close look at the flow information in your business will help you identify the components of your content. ♦ Each different process is a specialized use of information. ♦ Take some time to talk to those people who create or frequently process the data.Find out: ○ What users do with individual pieces of information. ○ What data users think is impossible to live without. ○ What data is unnecessary or optional.  Gathering some content. ♦ The more complete your collection of sameples is, the better chance you have of creating markup that file fits all your content. Here are some ideas: ○ Get data from multiple source ○ Get a lot of data. ○ Get a lot of data from multiple sources.
  • 4. http://it-slideshares.blogspot.com Taking Stock of Your Data(cont..)  Checking whether a DTD or schema already exists. ♦ When you create a document according to a DTD or schema, you use a predefined structure that specifies how the components of markup should be used to describe a particular kink of content. ♦ Predifined DTDs and schemas usually come from a couple of different sources: ○ Industry groups or organizations: that want a common format for standard data such as OFX, CML. ○ Application builders: who created their systems to run with content described by a particular set of markup. For example : CFML, ASP.  Searching for a schemaa repository. ♦ You could search for a schema or DTD, or add one of your own to the repository. Online at sites such as www.Biztalk.org or www.schema.net ♦ There is one still schema repository hosted by OASIS at ww.xml.org/xml/registry.jsp. OASIS provides a every comprehensive list of proposed XML applicatuons and industry initiatives at www.oasis-open.org.xml.html#applications. ♦ The whole point of using XML is to make your content as accessible to a system as possible. ♦ Content analysis with XML in mind is much easier when you have a handle on the ins and outs of XML Schemas and DTDs and how to put them together.
  • 5. http://it-slideshares.blogspot.com 2. Breaking Down Data in Different Ways. When developed our hypothetical book-selling business, we went through the same data-analysis process we’re sharing with you. After we gathered our documents and familiarized ourselves with them, we took a good hard look at what we learned about our content. Here’s what we came up with: - Books can be categorized in a number if different ways, includeing: Author, title, publication date, publisher, edition, language, number of pages, size, type(fiction, nonfiction),special features, format, price, ISBN. - The customer information we collect includes: first name, last name, address, city, state, zip code, email address, phone number. - The sales information we gather in addition to customer information includes: data , item number, price, total cost  Winnowing out the wheat from the chaff. ♦ When we analyzed our content, we made some judgments about what information we needed to collect. ♦ We chose to exclude not useful information from our taxonomy strategy. ♦ When you content-analysis process for knowing the purpose of your markup can help you keep your goals in sight.
  • 6. http://it- slideshares.blogspot.com Breaking Down Data in Different Ways(cont..).  Types of data that can be stored in XML. ♦ XML content can be divided into two main groups: data-intensive and document or text-intensive. ♦ On the data end of the spectrum, you find collections of data like those that reside in a database. Each collection consists of a more or less abitrary number of record structures, in which record contains: ○ A unique identifier or key. ○ A common collection of named, organized values. ♦ XML can capture and represent data that describes other collection of data. ♦ XML can handle many kinds of data and can accommodate binary information, it can supply data to other computer applications outside XML’s control. ♦ XML document can reference anything that a computer can represent.
  • 7. http://it- slideshares.blogspot.com 3. Developing Your Taxonomy  After you look at your content, you can start breaking it down into categories and subcategories.  Here’s how we broke it down for our hypothetical book bisiness: ● Book ●Sales ○ Item Number ○ Item Number ○ Title ○ Price ○ Shipping ○ Author ○ Total Cost ○ Publisher ○ Date ○ Price ○ Source ○ Content Type ○ Format ○ ISBN.
  • 8. http://it- slideshares.blogspot.com ●Customer ♦ As you can see, Item Number appears as a subcategory in both the Book and the ○Customer Number Sales categories. ○First Name ♦ The Item Number is unique to each copy ○Last Name of a book, which makes it easy to keep track of sales and inventory ○Address ○City ○State ○Zip Code ○E-mail Address ○Phone Number.
  • 9. http://it- slideshares.blogspot.com 4. Testing Your Taxonomy Using trial and error for the best fit. ♦ You work with your markup, experiment with using combinations of elements and attributes until you get the best results. For example:We used two nested elements to specify the content type for a book. <book> <contentType>Fiction</contentType> </book> ♦ The markup would use as many ‘content Type’ elements within the ‘book’ but we decided to go with ‘content Type’ as an attribute of the ‘book’ element instead, as show here. <book contentType=”Fiction”/> ♦ We decided on this route because we thought that we’d want to predefine the category names and require valid documents choose one of the names from the list in our DTD or schema.
  • 10. http://it- slideshares.blogspot.com Testing Your Taxonomy(cont..)  Testing Your content analysis. ♦ The best way to your final markup is to apply it to as many content samples as you can lay your hands on. -Shows the final draft of our bookstore markup. <?xml version=”1.0” standalone=”yes”?> <books> <book contentType=”Fiction” format=”Hardback”> <bookInfo> <title>The Da Vinci Code</title> <author>Brown, Dan</author> <publisher>Doubleday</publisher> <isbn>0385504209</isbn> </bookInfo> <salesInfo> <price priceType=”Retail”>$24.95</price> <itemNumber>0385504209-1</itemNumber>
  • 11. http://it- slideshares.blogspot.com <date>January 12, 2005</date> <source sourceType=”Retail” /> <shipping>$5.00</shipping> <cost>$29.95</cost> </salesInfo> </book> <totalCost>$29.95</totalCost> <customer custType=”newRetail”> <custNumber>5594</custNumber> <lastName>Blow</lastName> <firstName>Joe</firstName> <address>52 Joetta Lane</address> <city>Cottage Grove</city> <state>OR</state> <zip>97424</zip> <phone>767-3333</phone> <email>jblow@pacinfo.com</email> </customer> ♦ The first line in our code <?xml version=”1.0” standalone=”yes”?> is an XML declaration. You’ll learn all about XML declaratuons and all the other details of XML syntax in Chapter 5.
  • 12. http://it- slideshares.blogspot.com 5. Looking Ahead to Validation  You can get to make up as many rules as you want or need to make the markup do what you want it to.  The rules that you create with XML can dictate which elements make up an XML document.  Creating XML document descriptions enables you to state the rules that a whole class of documents must follow.  The two main forms of XML document descriptions in use today are DTDs and XML schemas.  DTDs work well for validating XML with text-intensive content, while XML schemas work well for validating XML with data- intensive content.
  • 13. http://it- slideshares.blogspot.com Thank you The end chapter 3