TechWhirl (TECHWR-L) is a resource for technical writing and technical communications professionals of all experience levels and in all industries to share their experiences and acquire information.
For two decades, technical communicators have turned to TechWhirl to ask and answer questions about the always-changing world of technical communications, such as tools, skills, career paths, methodologies, and emerging industries. The TechWhirl Archives and magazine, created for, by and about technical writers, offer a wealth of knowledge to everyone with an interest in any aspect of technical communications.
>
> The old Word docs are now in decent shape, but still far from perfectly
> structured. I'm looking at these hundreds of pages and starting to picture
> myself going through them, line-by-line, and copying the text into XML.
>
huh?
cut-and-paste makes no sense at all.
Surely you can get a good first cut by using styles to delineate topics
and such?
Once the styles are set, either
1) a VBA script to grab the text a block at a time per styles and put
the chunks into individual topic files with the DITA markup
or
2) save to some flavor of HTML and use your favorite scripting language
or
3) a transform from /Word's XML (probably the bare/stripped kind) into
topics
wouldn't be all that hard?
regards
Jay
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Create HTML or Microsoft Word content and convert to Help file formats or
printed documentation. Features include support for Windows Vista & 2007
Microsoft Office, team authoring, plus more. http://www.DocToHelp.com/TechwrlList
True single source, conditional content, PDF export, modular help.
Help & Manual is the most powerful authoring tool for technical
documentation. Boost your productivity! http://www.helpandmanual.com
---
You are currently subscribed to TECHWR-L as archive -at- web -dot- techwr-l -dot- com -dot-