What Lies Beneath Your Documents May Embarrass, Hurt or Cost You |
|
|
|
|
Rarely does a month go by without the media disclosing that a government entity or business organization had published a document containing content not intended for publication. In many countries, new content and data protection laws prohibit the disclosure or reuse of certain information - The Data Protection Act of the UK. Given that the delivery and publishing of documents via e‐mail and the Internet will continue to increase, organizations should protect against the risks inherent in attached or published documents. However, very few organizations have proactive policies, procedures or protections against this type of inadvertent content disclosure. In summary, what you see is not all you are sharing. This whitepaper details the Top 10 content elements that impart risk, defines paths toward auditing for that risk and suggests rules and procedures which aid in minimizing those risks. Many of these items cannot be detected using existing "metadata scrubbing" products. Summary: Top 10 Content Elements That Impart Risk
Risk #1: Microsoft Word's Track Changes set "not visible" when saving or opening If the document's final editor ‘hides' the markup, then saves the document, Word stores this view setting (along with several others) as a document preference, thus "hiding" tracked changes from the view of anyone subsequently opening it. The setting is only changed when it is toggled to ‘show' changes and the document is resaved. Each tracked change remains active in the document until it is "Accepted" or "Rejected", thus can be passed along to each successive document created from it. Because the document's tracked changes view setting is ‘hidden,' subsequent editors or owners of the document may never know what lies within the file until the initiation of a tracked changes session, or a change is made to "Show" tracked changes. Word 2003 provides a "Security" setting, "on" by default, which forces such changes to be visible when such a document is opened. Posing an additional challenge is one of version‐specificity: Although change tracking has been a feature since Word 2.0, it was not rewritten until Word 2002/Office XP - thus ignoring hundreds of core product functionality enhancements during the process. As a result, "tracked" collaborations between users of dis‐similar versions of Word consistently challenge the document, initiators and contributors alike. These issues further increase the risk of hiding previously‐tracked changes in the depths of the Word binary file. Risk #2: Text marked with a "Hidden" font attribute or text obscured from view Generally, some type of low‐level automation is offered to remove hidden text once the document is finalized, yet either users fail to run these routines or the routines fail to perform completely. For example, complex areas of the document are often missed - text boxes within the header, for example. Also categorized as ‘hidden text' are several formatting foibles that obscure authored text from view. While not hidden in the sense of a font color or attribute, text is nonetheless hiding within non‐visible sub‐structures: Each of these scenarios occurs when time‐consuming and complex formatting tasks cause authors, content developers and administrative staff to take common but dangerous shortcuts: copy or cut and paste from other documents, or opening an old document as the means for jumpstarting the creation of a new work. Risk #3: Text redacted with a white font color or covered by graphic lines or boxes While effective to the paper‐printed page, this method is not effective when "printing" to a PDF file. In the resultant PDF, the image layer displays the desired effect, while the text layer of the PDF file contains the original text. On an unsecured PDF file, text can be easily‐retrieved through a Clipboard copy, a PDF‐to‐Word conversion tool - even turning up in electronic searches. A recent case of this was the release of classified data in a military PDF documenting the killing of an Italian journalist at a military checkpoint. Risk #4: Text in Comments Risk #5: File | Properties content, such as Title, Author, Keywords, Comments In most Enterprises, these properties can be used to tag documents for integration within a larger taxonomy or document management system, usually for purposes of classification and easy retrieval via search. If the use of Word's Keywords property is used to organize documentas "Adversary" or "Prospective Client," this disclosure may reveal a posture thought to remain proprietary inside your organization. Risk #6: Passing content elements 1‐5 (above) into a resulting PDF rendition As a result, Adobe's PDF file format has become a frequent method for electronic distribution. In recent years, Adobe's extended integration within Microsoft Office applications - most notably Word - permits the seamless creation and attachment of a PDF file to an email, all occurring within a single‐click. Issues arise, however, when these 1‐click operations are inadvertently configured to incorporate active functionality from Word: Additionally, Adobe's PDF format can be used to ‘package' electronic source, permitting the publisher to include copies of the source file as attachments to the PDF. This results in the recipient being able to open and reuse the editable source when perhaps that was not the intention. In situations where the organization's IT group has eliminated the ability to directly integrate functionality with Acrobat, users simply File | Print to the Adobe PDF, or other PDF print driver. In this scenario, inadvertent use of Word's ‘sticky' Print feature - a "Document Showing Markup," which exposes all comments and tracked changes, or printing "Hidden Text" - delivers all suppressed content into the PDF. Couple these risk factors with the common practice of emailing PDFs straight from the authoring application without first reviewing it, these content disclosures are no longer protected from view. Other emerging workflows, such as the enablement of PDFs as a vehicle for passing comments or edits using Acrobat and Reader 7 - do minimize the risk by establishing a change management process that conceals the source, yet these remarks are imparted into the PDF itself. This means they remain accessible to anyone viewing or printing the PDF unless proper security or distribution restrictions are also applied. Risk #7: Internal Word versions - whole editions of a document, saved within a single .doc file Once again, these settings become preferences that travel with the document - along with all its previous versions and any commenting, tracking or editing history intact - that is, until all versions are deleted (one‐by‐one), the checkbox is cleared, and the document is resaved. Risk #8: Automatically saved data such as "last 10 authors" or previously‐deleted text Exposing previously‐deleted text can be controlled by disallowing "Fast Saves", however, many IT organizations remain completely unaware of the danger this option exposes. It should be noted that the "Last 10 Authors" listing is an automatic feature of Word and cannot be turned off. Shedding the detail requires the file be converted, the content be moved to a new blank document shell where saving stores only one previous author, or the binary file be externally edited. It should be noted, however, that Word 2003 no longer stores this detail with the file. Risk #9: Custom document properties, particularly email routing information One such workflow automatically locates the originating document that launched the review cycle, then automates merging participants' comments and changes as their annotated file is opened. These exchanges are made possible by electronic Routing Slips, Custom Document Properties and document preferences that are automatically imparted into documents involved in the reviewing cycle. Custom Document Properties (File | Document Properties | Custom) will identify the sender, sender's email address and the subject line of the email - a location where participants in a review are less likely to monitor propriety: they assume only internal participants will see it. Risk #10: Inserting embedded objects, such as Excel spreadsheets, Visio drawings; or inserting ‘cropped' pictures In the case of drawings or pictures, users often leverage the Office applications' "Picture Toolbar" feature called ‘Crop.' This tool allows the user the ability to pan into the portion of the picture they want represented. But what users may not realize is that the whole picture becomes embedded in the document, with its other elements available to anyone who double‐clicks on the drawing. Solutions Step One: Conduct a Practical Risk Assessment Next, identify the documents to be audited. Most organizations have well‐controlled document or file management systems, and can identify work product by client, matter or other business relationship. This facilitates a measurable, definable scope for your audit. Additionally, work with your internal practice or workgroups to identify high‐profile, high‐impact projects where documents must "leave the building," and documents that are reused to create ones that will. Finally, the sheer volume of documents and discrepancies between them rules a manual approach to your audit both impractical and impossible. Thus, secure an automated process such as Microsystems DocXamine to look across work product types - Word and PDF files being the most widely deployed - making sure it can thoroughly research all sub‐structures of the document, can configure and sort those items deemed of most risk to the organization, and can extract and track documents ‘found' from your various document management systems, as well as detailed reporting. Step Two: Raise awareness, educate document distributors, propose Best Practices or provide elevated levels of support This whitepaper, along with other content available from Microsystems, Microsoft and Adobe can form the basis for such documentation. Microsystems DocXtools running on the user's desktop can detect presence of such risks, and deliver this guidance in the open document. Establish a hotline, Intranet location or other internal communications vehicle to focus on identifying and mitigating risks found. Arm this response team with expertise on all versions of Adobe Acrobat and Microsoft Word, along with knowledge on dissemination methods such as extranets, Outlook or Web pages, taking care to sensitize them to business issues, risks and practices of your organization. Step Three: Develop Policies & Procedures Step Four: Support with Technology Solutions The Microsystems Advantage Knowledge Partnership Software DocXtools - The production, diagnostic and clean‐up tool: DocXtools assesses, cleans, and formats documents, aiding in the production of a high‐quality document. Many of DocXtools features are available as "one‐click fixes" and aid firms in: DocXamine - The analysis and quality control tool: DocXamine delivers an automated analysis of Word and PDF documents. The analysis detects common problems, summarizes results and provides recommendations for solution through the use of DocXtools. Pre‐Hire Skills Assessment - The Pre‐Hire Skills Assessment module enables the automated evaluation of the advanced Word skills of potential new hires. It is targeted at an individual test taker, lists what he/she did right and wrong and provides a total score. It enables firms to make hiring decisions based upon concrete information and it fosters the creation of targeted training plans for the potential new hire. Support - Solutions Center, Document Emergency Room Training & Education On‐Demand Training: Learn more about Microsystems tools and Microsoft Word from our extensive collection of on‐demand training resources, which include How‐to‐videos, Word resources and tips and tricks. Client & Consulting Services To begin the first step toward improving the document creation, quality control and delivery processes to meet firm goals, please call 630.598.1100 or e‐mail our Sales Team at sales.inquiry@microsystems.com. Microsystems |






