Microsoft Office 2003 Editions: Overview of Developer Technologies, September 2004
possible. This enables documents to have an acceptable transfer format for document-centric solutions.
Formatting information being inserted. Formatting is always an important part of how a Word document presents information. However, many documents need content from databases, which arrives in a raw format with no associated display information. You can use XML transforms to associate XML data streams with specific formatting so that when the transformed XML data arrives in Word, it is already formatted to fit that document solution. XML makes it easier for you to provide this capability, which spares information workers the routine formatting work they would otherwise have to do by hand.
XML Reference Schema Technology
The next sections highlight the XML reference schema capability in each application in Office 2003 Editions.
Microsoft Office Word 2003. The Word 2003 XML format or schema, called WordprocessingML, is available in all versions of Word 2003. This format is an alternative to the well-known .doc binary format and saves all of the same information that is saved in the Word binary format. This means that if you save a document as WordprocessingML and reopen it, Word retains every document feature. Word writes a processing instruction at the top of each WordprocessingML file so that when a user opens the file from Microsoft Windows® Explorer or Microsoft Internet Explorer, Windows opens the XML file directly in Word. Organizations or individuals can configure Word, or specific templates in Word, to save in the WordprocessingML format by default.
Saving a full Word document as XML results in a detailed XML file with several namespaces. However, the structure of WordprocessingML is not complex. The simplest WordprocessingML document consists of just five elements and one namespace:
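As a minimal sketch of such a document, the following Python script builds one with the standard library. It assumes the five elements are w:wordDocument, w:body, w:p, w:r, and w:t, all in the Word 2003 namespace http://schemas.microsoft.com/office/word/2003/wordml, and that the processing instruction Word writes is mso-application with progid "Word.Document". The function name and the sample text are illustrative only.

```python
import xml.etree.ElementTree as ET

# Assumed Word 2003 XML namespace and processing instruction.
W_NS = "http://schemas.microsoft.com/office/word/2003/wordml"
WORD_PI = '<?mso-application progid="Word.Document"?>'


def build_minimal_wordml(text):
    """Return a five-element WordprocessingML document as a string."""
    ET.register_namespace("w", W_NS)

    def w(tag):
        # Qualify a local tag name with the Word namespace.
        return f"{{{W_NS}}}{tag}"

    doc = ET.Element(w("wordDocument"))   # 1: document root
    body = ET.SubElement(doc, w("body"))  # 2: document body
    para = ET.SubElement(body, w("p"))    # 3: paragraph
    run = ET.SubElement(para, w("r"))     # 4: run of text
    run_text = ET.SubElement(run, w("t")) # 5: the text itself
    run_text.text = text

    xml_decl = '<?xml version="1.0" encoding="UTF-8" standalone="yes"?>'
    return "\n".join([xml_decl, WORD_PI, ET.tostring(doc, encoding="unicode")])


minimal_doc = build_minimal_wordml("Hello, World!")
print(minimal_doc)
```

Saving the printed output to a file with an .xml extension should give a document that Word 2003 opens directly, provided the assumptions above hold.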
Although Word saves all of the document information in the XML file, several types of information are stored as binary data through base-64 encoding (a standard technique for XML). Word saves every detail to which it has access as text. Base-64 encoding is needed for items such as images (which are binary), Microsoft ActiveX® controls (which don’t support a protocol for saving in an XML format), and Microsoft Visual Basic® for Applications (VBA) projects (which don’t provide a mechanism for Word to save them as text).
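The base-64 technique itself can be sketched in a few lines of Python. The container and element names below are hypothetical placeholders rather than actual WordprocessingML elements, and the payload is simply the 8-byte signature that begins every PNG file. The sketch shows binary data surviving a round trip through XML text.

```python
import base64
import xml.etree.ElementTree as ET


def embed_binary(parent, tag, payload):
    """Store raw bytes as base-64 text in a new child element."""
    child = ET.SubElement(parent, tag)
    child.text = base64.b64encode(payload).decode("ascii")
    return child


def extract_binary(element):
    """Decode the base-64 text of an element back into raw bytes."""
    return base64.b64decode(element.text)


# Hypothetical element names, used for illustration only.
container = ET.Element("container")
png_signature = b"\x89PNG\r\n\x1a\n"  # first 8 bytes of any PNG file
node = embed_binary(container, "binData", png_signature)

# The serialized XML contains only text characters.
serialized = ET.tostring(container, encoding="unicode")
print(serialized)

# Parsing the XML again recovers the original bytes exactly.
restored = extract_binary(ET.fromstring(serialized).find("binData"))
print(restored == png_signature)
```

Base-64 represents every 3 bytes as 4 text characters, so embedded images and projects enlarge the XML file by roughly one third compared with their binary size.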
© 2006 Microsoft Corporation. All rights reserved. By using or providing feedback on these materials, you agree to the attached license agreement.
To comment on this paper or request more documentation on these developer features, contact us at . We look forward to hearing from you.