Notes
Outline
Slide 1
Slide 2
Slide 3
Slide 4
Slide 5
Slide 6
Slide 7
HTML
Hypertext Markup Language
Pervasive and supported means of describing information for Web transmission
Describes data in terms of appearance – what it is supposed to look like (e.g. – Bold, Center, 13 Point)
Limited structure, reuse, and interchange
In other words, Appearance Based not Structure Based
Slide 9
SGML & XML
Describes data in terms of structure – what it is (e.g. Footnote, Reference, Affiliation, etc…)
Does not describe how your data is supposed to appear
In other words, Structure Based not Appearance Based
Slide 11
Slide 12
Slide 13
Differences Between SGML & XML
Mostly marketing
XML is more difficult to author DTD
More XML tools will be available than SGML
With XML, you will have an increased probability of interfacing with other systems
Slide 15
Slide 16
Slide 17
"Quick to market existing products"
Quick to market existing products
Quick to develop new products
More products (feasibility of publishing for smaller audiences)
Richer user experience (Searching, Linking, etc..)
Even though I have to produce multiple outputs (Paper, PDA x2, Web x2, CD)
Even though my inputs keep on increasing
Slide 19
Slide 20
Slide 21
Slide 22
Slide 23
Slide 24
Slide 25
Slide 26
Slide 27
Slide 28
Slide 29
Slide 30
Slide 31
What is a Document Type Definition (DTD)
Identifies structures within document to be tagged (Header, Abstract, Footnote, etc…)
Identifies what are the names of the tags (<fn> or <foot> for Footnote)
Contains rules regarding the order that tags can occur within the document
Type of tagging (Content vs. Structured)
Slide 33
Slide 34
Slide 35
Slide 36
Keys to Consistent SGML/XML
Bulletproof DTD
Tagging Rules have to be crystal clear
Structured Tagging done by computer
Content–Level Tagging aided by computer
Slide 38
DTD Identification/Development
Understand Organizational Goals
Understand the technology being implemented to realize goals
Identify Micro Requirements
Organizational Goals
Technology Requirements
Manuscript Submission Process
Authoring/Editing System
Content Management System
Rendering Environment
Distribution Plan
Micro Requirements
Type of data (Tables, Equations, Formulae, Figures)
References (Numeric, Harvard Style)
Variety of Data (Books, Journals, Newsletters)
Content Tagging vs. Structural Tagging
DTD Selection Possibilities
Public DTD
Aggregator DTD
Custom DTD
Combination
Slide 44
Slide 45
Slide 46
Slide 47
Slide 48
Slide 49
Slide 50
Slide 51
Slide 52
Slide 53
Slide 54
Slide 55
Slide 56
Slide 57
If…
5000 page project
5 minutes per page to fix
25,000 minutes => 417 hours
At 7 hours per day (no breaks) => 59.6 days
Or worse…
The tagging is inconsistent resulting in:
 The Content/Document Management System will be useless
You won’t be able to send it to an Aggregator
Data won’t render correctly
Slide 60
Slide 61
Slide 62
Slide 63
Slide 64
"Example:"
Example:  Automated Cross-References
See figure 15.5
See fig. 15.5
Refer to figure below
As illustrated on previous page
… drawing 15.5 years at hard labor.
See figure 15.1 in volume II of …
Slide 66
Slide 67
Slide 68
Slide 69
Slide 70
Slide 71
Slide 72
Slide 73
Slide 74
Slide 75
Slide 76
Slide 77
Slide 78
Slide 79
Slide 80
Slide 81
Slide 82
Post Comp Production Process  # 1
Slide 84
Post Comp Production Process - Improved
Slide 86
Slide 87
Slide 88
Slide 89
Slide 90
Slide 91
Slide 92
Slide 93