Yes, this is the idea. To have plain unformatted text to import to a wordprocessor.
But to make it more complicated I want to preserve tables and program listings formats...
At he same time it would convert Unix and Mac end of lines to DOS, and do character set conversion. For example ASCII to Windows, or Windows to HTML.
A thing I want to do and I don't know how is to remove page headers, footers and page numbers. Any idea?
An example. The [after] is the output of what i have so far (except for the CODE tag):
[before]
Double precision MBF numbers use only eight bits for an exponent
rather than eleven, trading a reduced absolute range for increased
resolution. That is, there are fewer exponent bits than the IEEE method
uses, which means that extremely large and extremely small numbers cannot
be represented. However, the additional mantissa bits offer more absolute
digits of precision.
The IEEE format:
ÚÄÄÄÄÄÄÄÄÂÄÄÄÄÄÄÄÄÂÄÄÄÄÄÄÄÄÂÄÄÄÄÄÄÄÄ¿
³SEEEEEEE³EMMMMMMM³MMMMMMMM³MMMMMMMM³
ÀÄÄÄÄÄÄÄÄÁÄÄÄÄÄÄÄÄÁÄÄÄÄÄÄÄÄÁÄÄÄÄÄÄÄÄÙ
[after]
Double precision MBF numbers use only eight bits for an exponent rather than eleven, trading a reduced absolute range for increased resolution. That is, there are fewer exponent bits than the IEEE methoduses, which means that extremely large and extremely small numbers cannotbe represented. However, the additional mantissa bits offer more absolute digits of precision.
The IEEE format:
++++++++++++++++++++++++++++++++++++|
³SEEEEEEE³EMMMMMMM³MMMMMMMM³MMMMMMMM³
+++++++++++++++++++++++++++++++++++++