Data Conversion Laboratory, Revolutionizing Publishing for the Digital Age 
  DCLab.com | About DCL | Tech Info | Press Info | Contact Us | DCLNews | Partners | Wiki | Client Area     
menu
Data Conversion Lab

About DCL
  Why go to DCL?
  Clients
  Company Background
  Management
  DCL in the News
  Events
  Mission

DCL News
  Current Issue
  Back Issues
  Subscribe

Technology
  Technology Resources
  FAQ's
  Glossary
  Presentations
  DCL Work Tracking

Press Info

Clients' Area

Contact DCL
  Directions
  Request Estimate
  Positions

Books2Bytes
Popular Pages
* Current Issue of DCLnews
* DCL featured in The Columbia Guide to Digital Publishing
* Slash Document Costs
* Ann Rockley on ROI in CM
* PDF Resources
* XML Conversion Resources
* Roundtrip Document Conversion
* DCL Resources Library
*

Converting Legacy Data...

*

Aviation & Aerospace

*

PDF Conversion to XML & MS-Word

*

PDF Conversion

*

Quark to XML

* Getting Content into XML
Fact Sheets
* Public Access for Research Materials
* S1000D Conversion
* Content Reuse Assessment
* Document Conversion
* SPL - Pharmaceutical Industry
* Harmonizer™
* Jeppesen Map Revision Service
Technical Papers
* Why STM Publishers Should Use XML...
* Department of Defense and the Power of XML
* Your Data in XML
* SGML to SGML 1
* SGML to SGML 2
* Quark to XML
* Plan Ahead
* Do it Yourself?
* Encyclopedia
Presentations
* Conversion to XML: Documents versus Data (11/2003)
* Data Migration Considerations  (6/2003)
* Technology for Cost-Containment and Efficiency  (4/2003)
* Converting Textbooks to Meet the National XML Standard for Accessibility  (3/2003)
* More Presentations

Beating Data Redundancy With Content Reuse

DCL study reveals up to 70% of data can be redundant in many firms and organizations. Big savings result from simply identifying duplication and reusing content, writes Mark Gross, president of DCL.

Interested in putting your data on a diet?

We are accepting a limited number of applications to join an “Early Adopters” program that affords you leading edge technology that won't have your budget on the bleeding edge. Interested?
Please click here for more details.

If you maintain a large document collection, you’re probably aware that information is frequently repeated throughout that collection. In technical documents, repair procedures, parts lists, and warnings will crop up in identical or near identical form throughout the manuals that accompany different versions of a product. In the legal world, contract clauses will be the same or very similar throughout many types of documents. And in the finance industry a large amount of boilerplate text is used and repeated on forms, regulatory documents, sales literature, and in the “small print” sections. It’s the same story with software Help files.

What surprised us was the extent of redundancy. At DCL, we recently surveyed a number of document collections using our new measurement software. We found that in each case duplication was over 40% and in some cases over 70%. Furthermore, in every case, the calculated number was far higher than the number guessed by the document owner.


"Don’t drown in content - reuse it."


This has major cost implications. Why? Because managing less content means lower costs. The data we’ve collected would suggest that the return-on-investment (ROI) for content management and for XML is far higher than usually thought. It’s even more pronounced in cases where documentation is deployed globally and has to be translated into multiple languages, or where documentation has significant regulatory review requirements.

Why so much redundancy?

In many areas, new documents lean heavily on information you’ve already put together in the past. When new documents are created or compiled, the traditional approach is to grab the stock information you need by copying and pasting it. Not only is this slow and prone to human error, it creates more and more content, which is costly to maintain. The result? You end up drowning in content because you are storing a lot of copies of what is essentially the same thing.

While this is the way documents have been managed since the advent of modern computing, it doesn’t have to be this way now - not with the latest content management tools.


"Our new measurement software found duplication was over 40% and in some cases over 70%..."


At DCL, we’ve been interested in this issue for a long time. Much of our business involves preparing documents so that they can be integrated into Content Management Systems (CMS) - which allow you to take advantage of content reuse. With this in mind, we have been particularly interested in helping quantify the benefits you might expect, and particularly the potential for content reuse in clients’ data sets.

Time & revenue savings

We found that most people have a lot more redundancy than they think. This became apparent when we started objectively measuring redundancy with software tools. Simpler approaches underestimate the amount of repetition because either they work primarily through intuition or they are only looking for exact matches. Beneath the surface, as our tools revealed, there are the “close matches”, such as small changes in wording and slight errors, and even sections of the text that have been forgotten about.

Once you realize the extent of duplication, it becomes clear that there is real potential for saving time and revenue by finding and removing repeated data, then implementing a content reuse strategy. Just imagine how much easier your life will be with half as much content to manage.

Mark Gross
July 20th, 2004

>>>Stay tuned for future DCLnews articles on the specifics of implementing content reuse in your organization.


Key benefits of reusing data:

1) Reduced cost and faster turnaround
• Costs less to maintain a smaller document set.
• Costs less to convert and distribute less data.
• Storage and computing costs go down.

2) Multiple benefits for some materials
• Materials needing extensive legal and technical review
• Materials needing translation into multiple languages.

3) Improved information consistency
• Reduces risk that something will go wrong.

4) Improved authoring productivity
• Writers focus on content, not on “tech” work.



FURTHER INFORMATION

Content Reuse - The Unseen Revolution
http://www.dclab.com/unseen_revolution.asp

Content Reuse, The Killer App
http://www.dclab.com/content_reuse.asp

Technical Documents Go Online At Continental Airlines http://www.dclab.com/coair_digdocs.asp

Documents Pulped By Bulldozer - XML To The Rescue! http://www.dclab.com/xmlrescue.asp

TI's Tech Docs Travel At The Speed Of Light http://www.dclab.com/texas_instruments_case_study.asp

Tech Docs Down The Well
http://www.dclab.com/schlumberger.asp


Interested in putting your data on a diet?

We are accepting a limited number of applications to join an “Early Adopters” program that affords you leading edge technology that won't have your budget on the bleeding edge. Interested?
Please click here for more details.

  Structured Product Labeling

Content Reuse

Subscribe

Books2Bytes

DCL Library

Columbia Guide
GSA Schedule
AIA Member
DCL Calendar

Ultramain User Conference 2008, Albuquerque, NM, May 11-15, 2008. More…

PTC User Long Beach, CA, June 2-4, 2008. More…

Mark Logic User San Francisco, CA, June 10-12, 2008. More…

X-Pubs London, England, June 22-24, 2008. More…

Doc Train Life Sciences Indianapolis, IN, June 23-25, 2008. More…

Best Practices Santa Fe, NM, September 15-17, 2008. More…
XyUser Phoenix, AZ, September 22-24, 2008. More…
9th Annual Vasont Users' Group Meeting, Hershey, PA, October 6-8, 2008. More…

DITA/TECHCOMM 2008, Raleigh, NC, November 3-6 2008. More…

ATA e-Business Europe. Details TBA.

 
DCL Calendar

Documentation and Training West 2008 Vancouver, BC, May 6-9, 2008. More…

 
Recent News

CMS/DITA Santa Clara, CA, April 7-9, 2008. More…

DIA Med Comm Orlando, FL, March 10-11, 2008. More…

DIA EDM Philadelphia, PA, February 5-7, 2008. More…

Gilbane Boston Conference Boston, MA, November 29, 2007. More…

The LavaCon Conference on Advanced Technical Communication and Project Management New Orleans, LA, October 27-30, 2007. More…

2007 ATA e-Business Forum Miami, Florida, Oct 17-19, 2007. More…

DITA 2007™-East, Raleigh, North Carolina, October 4-6, 2007. More…

2007 XyUser Group Fall Conference, Boston, MA, Sept 23-26, 2007. More…

Mark Logic 2007 User Conference, San Francisco, CA, May 15-17, 2007. More…

Content Management Strategies/DITA North America Conference 2007, Boston, MA, March 26-28, 2007. More…

DIA 18th Annual Workshop, San Diego, CA. March 4-7, 2007. More…

DIA 2007 EDM & CDM Conference, Philadelphia, PA, Feb 6 - 8, 2007. More…

DITA 2007 – West, San Jose, CA, February 5-7, 2007. More…

Framemaker 2006 Chautauqua, Austin, TX, Nov 8-10, 2006. More…

PTC/User World Event 2006, Grapevine, TX, June 4-6. More…

19th Annual DIA Conference Philadelphia, PA, February 7-9. More…

XyUser's Conference, San Diego, California, September 11-14. DCL's Don Bridges delivered a presentation on "Content Reuse" More…

Structured Product Labeling, Washington, DC, August 23-24. More…

Tri-XML 2005, Raleigh, NC , July 28. DCL's Don Bridges delivered a presentation on "Content Reuse" More…

Pharmaceutical Labeling and Product Identification, Whippany, NJ, June 16-17. DCL's Don Bridges delivered a presentation on "Structured Product Labeling (SPL) and the Implications of Implementing an XML Solution." More…

More…

Data Conversion Laboratory, Inc.   61-18 190th St., 2nd Floor, Fresh Meadows, NY 11365   718-357-8700   convert@dclab.com

Copyright © 1997-2008  Data Conversion Laboratory, Inc. All rights reserved.