Styles are very important and a very powerful feature of Microsoft Word. In MetaTexis, you can control the segmentation process by adding styles to the list of styles not to be segmented or to the list of styles to be skipped.
To add styles:
1. Activate the relevant checkbox first.
2. Click the style to be added in the drop-down list located below the styles list.
Two different kinds of styles can be added: Paragraph styles and characters styles. Paragraph styles are marked with "P", characters styles with "C".
You have the following possibilities:
· If you add a paragraph style to the list of styles not to be segmented, each paragraph with that style will not be segmented. The paragraph will be treated as one segment.
· If you add a paragraph style to the list of styles to be skipped, each paragraph with this style will neither be segmented nor shown to you in a TU. MetaTexis will simply skip it.
· If you add a character style to the list of styles not to be segmented, the groups of characters with this style will not be segmented. They are a part of the surrounding segment. This allows you to protect any combination of characters against segmentation (e.g. tags).
· If you add a character style to the list of styles to be skipped, groups of characters with this style will be neither segmented nor shown to you as part of a segment. MetaTexis will simply ignore them.
A style cannot be added to both lists because this would lead to logical conflicts. You can either skip or not segment a style, but not both.
Example: You can use character styles to
deal with tagged texts that are not supported by MetaTexis. You can define two
different styles: one for tags not to be segmented [e.g. tags for italics, bold,
special characters etc.], and the other tag for tags to be skipped [e.g.
"external" XML tags ("<seg>", "<\seg>").
After you have defined the characters styles, just assign the styles to the tags concerned, e.g. via the Microsoft Word search and replace function.
When you have done this, MetaTexis will skip the external styles, while it will not segment the internal styles (even if they contain a dot-space combination).