Data Conversion Laboratory, Revolutionizing Publishing for the Digital Age 
  DCLab.com | About DCL | Tech Info | Press Info | Contact Us | DCLNews | Partners | Wiki | Client Area     
menu
Data Conversion Lab

About DCL
  Why go to DCL?
  Clients
  Company Background
  Management
  DCL in the News
  Events
  Holiday Calendar
  Mission

DCL News
  Current Issue
  Back Issues
  Subscribe

Technology
  Technology Resources
  FAQ's
  Glossary
  Presentations
  DCL Work Tracking

Press Info

Clients' Area

Contact DCL
  Directions
  Request Estimate
  Positions

Books2Bytes
Popular Pages
* Current Issue of DCLnews
* DCL featured in The Columbia Guide to Digital Publishing
* Slash Document Costs
* Ann Rockley on ROI in CM
* PDF Resources
* XML Conversion Resources
* Roundtrip Document Conversion
* DCL Resources Library
*

Converting Legacy Data...

*

Aviation & Aerospace

*

PDF Conversion to XML & MS-Word

*

PDF Conversion

*

Quark to XML

* Getting Content into XML
Fact Sheets
* Public Access for Research Materials
* S1000D Conversion
* Content Reuse Assessment
* Document Conversion
* SPL - Pharmaceutical Industry
* Harmonizer™
* Jeppesen Map Revision Service
Technical Papers
* Why STM Publishers Should Use XML...
* Department of Defense and the Power of XML
* Your Data in XML
* SGML to SGML 1
* SGML to SGML 2
* Quark to XML
* Plan Ahead
* Do it Yourself?
* Encyclopedia
Presentations
* Conversion to XML: Documents versus Data (11/2003)
* Data Migration Considerations  (6/2003)
* Technology for Cost-Containment and Efficiency  (4/2003)
* Converting Textbooks to Meet the National XML Standard for Accessibility  (3/2003)
* More Presentations

WHITE PAPER
Tell me again ...
Why should I care about XML?

(
PAGE 2)
Return to page 1

    ... Because XML data is well-fielded it can also be directly adapted into non-traditional publishing outlets such as in IETM's (Interactive Electronic Technical Manuals) or for use with field maintenance reference software. This is of particular importance in military applications.

Costs of XML

  • Training - XML is conceptually different from the appearance-based text representations that most people are used to. Thus, document authors and maintainers must be educated about the differences between XML and Microsoft Word, for example. And they need to be made aware of the new requirements implied by XML's benefits.
  • These are:

    • All text must be tagged. This is a requirement most authors aren't accustomed to.
    • No formatting is applied at authoring time. This violates the habits of many authors. (Studies show that removing the formatting requirement from authors can dramatically increase their productivity; they focus on writing, not on making a small section of text look "just so".)
    • The document structure, as defined by the DTD, must be adhered to. XML authoring tools don't allow the writer to put a <heading 2> in front of a <heading 1>, even if that seems, "OK, just this once."

    Nonetheless, today's XML tools support WYSIWYG interfaces, drag-and-drop technology, and the other functions that non-technical computer users already understand. The cost of additional training is more than offset by the benefits of using XML. Not only that, but the cost of training is declining as the learning curve becomes less and less steep.

  • Specialized software - This is more a perceived cost than a real one. Today, XML-compliant software is available at all price levels and many mainstream word processing tools support XML. Plus Internet Explorer and Netscape, and other browsers, support it natively. Naturally the high-end tools offer more features, support larger environments, and provide greater benefits than their low-end counterparts. A cost/benefits analysis needs to be done to determine the best tool set for each particular installation. If you do need a high-end solution, this will give you a clear picture of the additional benefits you are getting for your money.
  • Legacy conversions - To get the most out of XML, it is important that an enterprise's entire active document set is moved over to an XML environment. It is not enough to adopt XML on a "from here on in" basis. XML conversions can be complex. But they needn't tie up internal human resources. In fact, because expertise and experience are essential for a successful conversion to XML, it is advisable to outsource the conversion to a specialized XML conversion vendor. This is less expensive in the long run than doing conversion in-house. Using a single, experienced XML conversion vendor guarantees you'll get a quality XML end product. Not only will your documents be technically valid, they will also be meaningfully and consistently tagged. Unless documents are professionally tagged, you won't get the benefits of XML.

PDF: An alternative to XML?
PDF is a proprietary page representation format developed by Adobe Systems. It puts documents in a "container" that preserves not only the text but also the image of the page. PDF can be generated directly by many traditional word processing packages. It can also be generated by scanning paper documents.

PDF does not have any of the content tagging capabilities of XML (except for limited linking). And, although widely accepted, PDF is not a recognized independent standard. PDF files are binary; besides text they may contain images of various types, postscript, and other binary information. All this is useful, but means PDF is not as portable as XML.

Furthermore, when PDF is generated from paper, text accuracy is very poor. Although readers may see what appears to be a perfectly usable page, what is actually being displayed is a bitmap image of the page. The text itself, extracted via an OCR process during the PDF conversion, is not directly visible. It is searchable -- but if the accuracy is poor, as is inevitable with uncorrected OCR, the searches will be inaccurate, missing many potentially important "hits" and producing irrelevant hits. Correction is possible, but difficult and expensive -- possibly exceeding the cost of an XML conversion.

PDF files are generally large and unwieldy, especially when the page image is preserved in bitmap form (usually the case when PDF was generated from paper). This means they are difficult to transport over networks or to make available over the web.

Data Conversion Laboratory can and does do PDF conversions where appropriate. We recommend, however, that they be limited to situations where paper is being eliminated for space reasons, and the documents are not frequently accessed, but must be available when required. We recommend XML for "live" data that needs to be frequently accessed, modified, or searched.

For further information on PDF, read:
- Converting PDF to XML: Can it be done easily? (FAQ)
- PDF Conversion: How, For Whom, And When? (Part 1)
PDF Conversion: How, For Whom, And When? (Part 2)

Conclusion: Use XML! It's just better ...
DCL has a wide variety of experience converting data from many formats into many formats. Our expertise extends well beyond the domains of XML/XML, so we don't have an XML axe to grind. But we believe that XML should be the format of choice for all industries who need to manage their "intellectual capital." And we recommend the use of XML in these circumstances. Not because it is legally mandated -- though in many cases it is -- but because it provides the most attractive package of benefits at justifiable cost. The truth is, we often find ourselves saying: "Use XML! It's just better."

DCLnews Editorial
12/3/2002

Read more...Read more XML articles at DCL Library

FREE Tech Newsletter!
Subscribe to DCLnews for the latest tech, XML/XML, and e-Publishing news. Plus top stories, reports, and interviews. Click here to subscribe.

Return to top

  Structured Product Labeling

Content Reuse

Subscribe

Books2Bytes

DCL Library

Columbia Guide
GSA Schedule
AIA Member
DCL Calendar

Best Practices Santa Fe, NM, September 15-17, 2008. More…
XyUser Phoenix, AZ, September 22-24, 2008. More…
9th Annual Vasont Users' Group Meeting, Hershey, PA, October 6-8, 2008. More…

DITA/TECHCOMM 2008, Raleigh, NC, November 3-6 2008. More…

ATA e-Business Europe. Details TBA.

 
Recent News

Doc Train Life Sciences Indianapolis, IN, June 23-25, 2008. More…

X-Pubs London, England, June 22-24, 2008. More…

Mark Logic User San Francisco, CA, June 10-12, 2008. More…

PTC User Long Beach, CA, June 2-4, 2008. More…

Ultramain User Conference 2008, Albuquerque, NM, May 11-15, 2008. More…

Documentation and Training West 2008 Vancouver, BC, May 6-9, 2008. More…

CMS/DITA Santa Clara, CA, April 7-9, 2008. More…

DIA Med Comm Orlando, FL, March 10-11, 2008. More…

DIA EDM Philadelphia, PA, February 5-7, 2008. More…

Gilbane Boston Conference Boston, MA, November 29, 2007. More…

The LavaCon Conference on Advanced Technical Communication and Project Management New Orleans, LA, October 27-30, 2007. More…

2007 ATA e-Business Forum Miami, Florida, Oct 17-19, 2007. More…

DITA 2007™-East, Raleigh, North Carolina, October 4-6, 2007. More…

2007 XyUser Group Fall Conference, Boston, MA, Sept 23-26, 2007. More…

Mark Logic 2007 User Conference, San Francisco, CA, May 15-17, 2007. More…

Content Management Strategies/DITA North America Conference 2007, Boston, MA, March 26-28, 2007. More…

DIA 18th Annual Workshop, San Diego, CA. March 4-7, 2007. More…

DIA 2007 EDM & CDM Conference, Philadelphia, PA, Feb 6 - 8, 2007. More…

DITA 2007 – West, San Jose, CA, February 5-7, 2007. More…

Framemaker 2006 Chautauqua, Austin, TX, Nov 8-10, 2006. More…

PTC/User World Event 2006, Grapevine, TX, June 4-6. More…

19th Annual DIA Conference Philadelphia, PA, February 7-9. More…

XyUser's Conference, San Diego, California, September 11-14. DCL's Don Bridges delivered a presentation on "Content Reuse" More…

Structured Product Labeling, Washington, DC, August 23-24. More…

Tri-XML 2005, Raleigh, NC , July 28. DCL's Don Bridges delivered a presentation on "Content Reuse" More…

Pharmaceutical Labeling and Product Identification, Whippany, NJ, June 16-17. DCL's Don Bridges delivered a presentation on "Structured Product Labeling (SPL) and the Implications of Implementing an XML Solution." More…

More…

Data Conversion Laboratory, Inc.   61-18 190th St., 2nd Floor, Fresh Meadows, NY 11365   718-357-8700   convert@dclab.com

Copyright © 1997-2008  Data Conversion Laboratory, Inc. All rights reserved.