Example situation
Let's assume we have a Word document structured in this way:
Topic 1
Topic A
Topic B
Topic 2
Topic C
Topic D
Topic 3
Topic E
Topic F
Topic F1
Topic F2
Topic G
We can see in this structure that three heading levels exist. Let's assume the titles of each section were marked with the standard heading levels Heading 1, Heading 2 and Heading 3. To configure Hyper.Net to identify this document structure and reproduce it in the Rich Hypertext rendition, we need to use Style Matchers to look for these styles in the incoming content.
Let's make the example more difficult by introducing the following wrinkle: let's say that none of what we would like to use as level three headings were marked with Heading 3. Instead, the documents of this content type were authored with Heading 1 and Heading 2 for the first two levels, which was all anyone at the time wanted to have in the table of contents. We'd like the level 3 sections in our online table of contents as well, but they have not been marked with Heading 3. Instead, manual formatting was consistently applied to them, for example, Arial 16 Bold. Here's a description of the source document as seen through Hyper.Net's eyes:
Level 1: Match style Heading 1
Level 2: Match style Heading 2
Level 3: Match format Arial 16 Bold
Let's now throw in one more wrinkle. After reviewing a few of the documents of this content type, we've noticed that a large number of them were not authored quite correctly. Sometimes the Heading 2 style was not applied, and manual formatting was applied instead: Arial 20 Bold. We'd like to detect our level 2 headings by looking for both the Heading 2 style and the manually applied formatting. Hyper.Net would describe this in the following way:
Level 1: Match style Heading 1
Level 2: Match style Heading 2, Match format Arial 20 Bold
Level 3: Match format Arial 16 Bold
If all of the documents having this content type conform to this structural specification, you can proceed to setting this up in Hyper.Net. The rest happens automatically.
| |
|
|
Note
|
If your documents are structurally inconsistent, you can't expect to publish structurally consistent results.
For example, if Heading 2 was used sometimes to mark level 2 sections and sometimes for level 3 sections because the author was only looking at the formatting that resulted from application of the style, you have a problem. You have a few options when inconsistency is present:
Limited inconsistency: resolve serious inconsistency manually before publishing, resolve minor inconsistency over time as documents are updated
Widespread inconsistency: resolve the inconsistency manually before publishing, attempt automating the cleanup using Word macros (they can even be applied to large batches of documents when properly implemented)
|