The first step in our conversion process is to create the conversion script that converts the Word files into XML files conforming to the S1000D standard. Our consultant has over 20 years’ experience in this field, and his conversion scripts are written to minimise the amount of costly and time-consuming manual clean-up required after conversion.
Using the conversion script, the Word files are converted to well-formed XML. If required, these files can be split further into separate files that match the section levels in the Word file (e.g. section 1.1, section 1.2). The split files can be named according to the section levels in the document or according to data module code (DMC) naming conventions, if these have been supplied electronically. Once any manual post-conversion clean-up is complete, the resulting split files will be XML files that parse successfully and conform to the S1000D standard.
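As an illustration only, the splitting and naming step could be sketched as follows. This is not the consultant's actual script: the `section` element, its `number` attribute, the `split_sections` function and the example DMC value are all hypothetical placeholders, and real S1000D data modules have a much richer structure.

```python
import xml.etree.ElementTree as ET


def split_sections(xml_text, dmc_map=None):
    """Return {file_name: xml_string}, one entry per <section> element.

    dmc_map is an optional, hypothetical lookup from section number to DMC.
    """
    dmc_map = dmc_map or {}
    root = ET.fromstring(xml_text)
    outputs = {}
    for section in root.iter("section"):
        number = section.get("number")
        # Prefer a supplied DMC; otherwise fall back to the section number.
        stem = dmc_map.get(number, "section-" + number)
        outputs[stem + ".xml"] = ET.tostring(section, encoding="unicode")
    return outputs


# Assumed intermediate XML, standing in for the converted Word document.
sample = """<document>
  <section number="1.1"><title>General</title></section>
  <section number="1.2"><title>Description</title></section>
</document>"""

# Made-up DMC used purely as an example value.
files = split_sections(sample, {"1.1": "DMC-EXAMPLE-A-00-00-00-00A-040A-D"})
for name in sorted(files):
    print(name)
```

Section 1.1 takes its name from the supplied mapping, while section 1.2 falls back to a name based on its section number, mirroring the two naming options described above.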
Graphics in the Word document are handled in the conversion as follows. For every graphic encountered, a graphic element is placed in the resultant XML file and an entity is created. As with the document text, if DMC naming conventions for the images have been supplied electronically, these can be incorporated into the XML file. Graphics can be extracted from the Word file, but they will be at a resolution of 72 dpi, which will reduce their quality and may make them unsuitable for press. Alternatively, if the graphics are supplied separately, together with an electronic document mapping each source file name to its target file name, the graphics can be renamed accordingly during the conversion process.
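The graphic handling described above might look something like the sketch below. It assumes that each graphic is referenced through an unparsed entity and a `graphic` element with an `infoEntityIdent` attribute; the attribute name, the `graphic_markup` function and the file names are illustrative assumptions and should be checked against the S1000D issue in use.

```python
from pathlib import PurePosixPath


def graphic_markup(source_names, rename_map=None):
    """Build entity declarations and graphic elements for a list of graphics.

    rename_map is an optional, hypothetical lookup from source file name
    to target file name, as supplied by the client.
    """
    rename_map = rename_map or {}
    entities, elements = [], []
    for source in source_names:
        # Use the supplied target name if there is one, else keep the source.
        target = rename_map.get(source, source)
        path = PurePosixPath(target)
        entity_name = path.stem
        notation = path.suffix.lstrip(".").lower()
        entities.append(
            f'<!ENTITY {entity_name} SYSTEM "{target}" NDATA {notation}>'
        )
        elements.append(f'<graphic infoEntityIdent="{entity_name}"/>')
    return entities, elements


# Made-up file names used purely as an example.
entities, elements = graphic_markup(
    ["image1.png", "image2.png"],
    {"image1.png": "ICN-EXAMPLE-00001.png"},
)
for line in entities + elements:
    print(line)
```

The first graphic is renamed using the supplied mapping; the second has no mapping entry and keeps its source name.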
On completion of the conversion, the client is supplied with:
- an XML file for each component part of the original Word document(s), assigned the relevant DMC number if available
- a file that maps the component sections in the Word file to the output file names
- a file that details the graphics in the documents. If the source figures do not have DMC numbers, the file simply lists the figures; if the source figures do have a DMC number, the file maps the figure names to the appropriate DMC number.
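The graphics file in the last item could, for example, be produced as a simple CSV report. The sketch below is an assumption about the layout, not the format actually delivered; `graphics_report`, the column headings and the sample values are hypothetical.

```python
import csv
import io


def graphics_report(figures, dmc_map=None):
    """Return CSV text listing figures, with a DMC column when DMCs exist."""
    dmc_map = dmc_map or {}
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    if dmc_map:
        # DMC numbers were supplied: map each figure name to its DMC.
        writer.writerow(["figure", "dmc"])
        for figure in figures:
            writer.writerow([figure, dmc_map.get(figure, "")])
    else:
        # No DMC numbers: simply list the figures.
        writer.writerow(["figure"])
        for figure in figures:
            writer.writerow([figure])
    return buffer.getvalue()


# Made-up figure names and DMC used purely as an example.
plain = graphics_report(["fig1.png", "fig2.png"])
mapped = graphics_report(["fig1.png"], {"fig1.png": "DMC-EXAMPLE-0001"})
print(plain)
print(mapped)
```

The same approach would serve for the file that maps Word sections to output file names, with section numbers in the first column.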
System Integration
With skills across a range of programming languages, we offer expertise both in development environments and in the industries in which the technology is to be applied. This is particularly important where standards such as AECMA/S1000D, Def Stan 00-60 and ATA iSpec 2200 need to be clearly understood, so that development fits the client’s requirements and maintains the integrity of the standard.
The development team also has experience of building integrations with Common Source Databases (CSDBs), enabling users to check documentation out of the database onto their local drives for editing and to check it back in for document control.
Development also plays a part in large data conversion projects, where custom scripting keeps the manual intervention required to a minimum.