|
|
|
|
WHITE PAPER Tell me again ... Why
should I care about XML? (PAGE 2) Return to
page 1
... Because XML data is
well-fielded it
can also be directly adapted into non-traditional publishing outlets such
as in IETM's (Interactive Electronic Technical Manuals) or for use with
field maintenance reference software. This is of particular importance in
military applications.
Costs of
XML
- Training
- XML is conceptually different from the appearance-based text
representations that most people are used to. Thus, document authors and
maintainers must be educated about the differences between XML and
Microsoft Word, for example. And they need to be made aware of the new
requirements implied by XML's benefits.
These are:
- All text
must be tagged. This is a requirement most authors aren't
accustomed to.
- No
formatting is applied at authoring time. This violates the habits
of many authors. (Studies show that removing the formatting
requirement from authors can dramatically increase their productivity;
they focus on writing, not on making a small section of text look
"just so".)
- The
document structure, as defined by the DTD, must be adhered to.
XML authoring tools don't allow the writer to put a <heading
2> in front of a <heading 1>, even if that seems,
"OK, just this once."
Nonetheless,
today's XML tools support WYSIWYG interfaces, drag-and-drop technology,
and the other functions that non-technical computer users already
understand. The cost of additional training is more than offset by the
benefits of using XML. Not only that, but the cost of training is
declining as the learning curve becomes less and less
steep.
- Specialized
software - This is more a perceived cost than a real one. Today,
XML-compliant software is available at all price levels and many
mainstream word processing tools support XML. Plus Internet Explorer
and Netscape, and other browsers, support it natively. Naturally the high-end
tools offer more features, support larger environments, and provide
greater benefits than their low-end counterparts. A cost/benefits
analysis needs to be done to determine the best tool set for each
particular installation. If you do need a high-end solution, this will
give you a clear picture of the additional benefits you are getting for
your money.
- Legacy
conversions - To get the most out of XML, it is important that an
enterprise's entire active document set is moved over to an XML
environment. It is not enough to adopt XML on a "from here on in"
basis. XML conversions can be complex. But they needn't tie up internal
human resources. In fact, because expertise and experience are essential
for a successful conversion to XML, it is advisable to outsource the
conversion to a specialized XML conversion vendor. This is less
expensive in the long run than doing conversion in-house. Using a
single, experienced XML conversion vendor guarantees you'll get a
quality XML end product. Not only will your documents be technically
valid, they will also be meaningfully and consistently tagged.
Unless documents are professionally tagged, you won't get the benefits
of XML.
PDF: An
alternative to XML? PDF is a proprietary page
representation format developed by Adobe Systems. It puts documents in a "container" that
preserves not only the text but also the image of the page. PDF can be
generated directly by many traditional word processing packages. It can
also be generated by scanning paper documents.
PDF does not have
any of the content tagging capabilities of XML (except for limited
linking). And, although widely accepted, PDF is not a recognized
independent standard. PDF files are binary; besides text they may contain
images of various types, postscript, and other binary information. All
this is useful, but means PDF is not as portable as
XML.
Furthermore, when
PDF is generated from paper, text accuracy is very poor. Although readers
may see what appears to be a perfectly usable page, what is actually being
displayed is a bitmap image of the page. The text itself, extracted via an
OCR process during the PDF conversion, is not directly visible. It is
searchable -- but if the accuracy is poor, as is inevitable with
uncorrected OCR, the searches will be inaccurate, missing many potentially
important "hits" and producing irrelevant hits. Correction is possible,
but difficult and expensive -- possibly exceeding the cost of an XML
conversion.
PDF files are
generally large and unwieldy, especially when the page image is preserved
in bitmap form (usually the case when PDF was generated from paper). This
means they are difficult to transport over networks or to make available
over the web.
Data Conversion
Laboratory can and does do PDF conversions where appropriate. We
recommend, however, that they be limited to situations where paper is
being eliminated for space reasons, and the documents are not frequently
accessed, but must be available when required. We recommend XML for "live" data that needs to be frequently accessed, modified, or
searched.
Conclusion: Use XML! It's
just better ... DCL has a wide variety of
experience converting data from many formats into many formats. Our
expertise extends well beyond the domains of XML/XML, so we don't have an
XML axe to grind. But we believe that XML should be the format of choice
for all industries who need to manage their "intellectual capital." And we recommend the use of XML
in these circumstances. Not because it is legally mandated -- though in
many cases it is -- but because it provides the most attractive package of
benefits at justifiable cost. The truth is, we often find ourselves
saying: "Use XML! It's just better."
DCLnews
Editorial 12/3/2002
Read more
XML articles at DCL
Library
|

|
FREE Tech
Newsletter! Subscribe to
DCLnews for the latest tech, XML/XML, and e-Publishing news.
Plus top stories, reports, and interviews. Click here to
subscribe. |
Return to
top
|
|
|
|
|
|
Best Practices Santa Fe, NM, September 15-17, 2008. More…
XyUser Phoenix, AZ, September 22-24, 2008. More…
9th Annual Vasont Users' Group Meeting, Hershey, PA, October 6-8, 2008. More…
DITA/TECHCOMM 2008, Raleigh, NC, November 3-6 2008. More…
ATA e-Business Europe. Details TBA.
|
| |
|
|
Doc Train Life Sciences Indianapolis, IN, June 23-25, 2008. More…
X-Pubs London, England, June 22-24, 2008. More…
Mark Logic User San Francisco, CA, June 10-12, 2008. More…
PTC User Long Beach, CA, June 2-4, 2008. More…
Ultramain User Conference 2008, Albuquerque, NM, May 11-15, 2008. More…
Documentation and Training West 2008 Vancouver, BC, May 6-9, 2008. More…
CMS/DITA Santa Clara, CA, April 7-9, 2008. More…
DIA Med Comm Orlando, FL, March 10-11, 2008. More…
DIA EDM Philadelphia, PA, February 5-7, 2008. More…
Gilbane Boston Conference Boston, MA, November 29, 2007. More…
The LavaCon Conference on Advanced Technical Communication and Project Management New Orleans, LA, October 27-30, 2007. More…
2007 ATA e-Business Forum Miami, Florida, Oct 17-19, 2007. More…
DITA 2007™-East, Raleigh, North Carolina, October 4-6, 2007. More…
2007 XyUser Group Fall Conference, Boston, MA, Sept 23-26, 2007. More…
Mark Logic 2007 User Conference, San Francisco, CA, May 15-17, 2007. More…
Content Management Strategies/DITA North America
Conference 2007, Boston, MA, March 26-28, 2007. More…
DIA 18th Annual Workshop,
San Diego, CA. March 4-7, 2007. More…
DIA 2007 EDM & CDM Conference, Philadelphia, PA, Feb 6 - 8, 2007. More…
DITA 2007 – West, San Jose, CA, February 5-7, 2007. More…
Framemaker 2006 Chautauqua, Austin, TX, Nov 8-10, 2006. More…
PTC/User World Event 2006, Grapevine, TX, June 4-6. More…
19th Annual DIA Conference Philadelphia, PA, February 7-9. More…
XyUser's Conference, San Diego, California, September 11-14. DCL's Don Bridges delivered a presentation on "Content Reuse" More…
Structured Product Labeling, Washington, DC, August 23-24. More…
Tri-XML 2005, Raleigh, NC , July 28. DCL's Don Bridges delivered a presentation on "Content Reuse" More…
Pharmaceutical Labeling and Product Identification, Whippany, NJ, June 16-17. DCL's Don Bridges delivered a presentation on "Structured Product Labeling (SPL) and the Implications of Implementing an XML Solution." More…
More…
|
|