Document formats


win 1251
 



Document formats

We developped our own filters for following file formats (DTP):

  • RTF (Word)
  • MIF (FrameMaker)
  • HTML
  • SGML
  • XML
  • InterLeaf ASCII
  • PowerPoint
  • TROFF/NROFF

Filters allow to extract a text to be translated, while totally saving a file format. It means, that all the format information (template, structure, fonts/styles, tables, figures and references) is saved.

Filter possibilities and features for each document format are specified below:

RTF (Word) Selection of the text from the document and its subsequent insertion. The text from main stream is selected only. By default, a text from WordDraw, floating frames and similar objects is not extracted. It is possible to "include" manually similar objects to main stream of the text. Using preliminary (manual) processing it is possible to protect (mask) required document parts from translation.
MIF (FrameMaker) Selection of the text from the document and its subsequent insertion. All the text is selected, including the information available on special pages (reference and master page). The text can be sorted out for convenience of the subsequent (manual) postediting (the internal order of storing a text in MIF differs from its logical representation in the document). It is possibile to process variables separately, and to specify which variables are not to be translated. In addition, it is possible to specify a text not to be translated, using following parameters: character style, a name of a paragraph style or a structure element (for structured documents).
HTML
SGML
XML
Selection of the text from the document and its subsequent insertion. It is possible to specify a text not to be translated, using a combination of tags and attributes in tags. An operation mode without disclosing entities into appropriate text representations is supported.
InterLeaf ASCII Selection of the text from the document and its subsequent insertion. The operation mode without extracting such units as strings and/or microdocuments is supported.
PowerPoint Selection of the text from the document and its subsequent insertion.
TROFF/NROFF Selection of the text from the document and its subsequent insertion.
Due to the lack of a uniform standard for macros used when creating documents, it is necessary to customize preliminary this filter according to specific documents (taking into account macros packages used).

The list of our filters is continuously expanded. So, in the near future a new filter will be designed which will "extract" text segments from Transit and Trados Translation Memory systems.


© "Argonaut" Ltd. 2002