Document format for a DOS word processor where control commands begin with a dot

Question

I successfully achieved the reading of several 5.25″ old floppies. They contain old documents from a retired lawyer.

For the memories, we want to read them correctly, nevertheless, I don't find which software was used neither he remember which one.

The text files are similar, starting in the first five lines with the following characters, then follows the text:

.op
.mt1
.hm1
.pl88
..
†††††----!----------!------------------------------------------------
†††††    DÆ† ArmandÔ Sanche˙ D°az¨† mayoÚ dÂ edad¨† casado¨† GraduadÔ ç
ç
†††††Social¨† vecinÔ dÂ est· Ciuda‰ coÓ domiciliÔ · efectÔ dÂ notifiç
ç

Does anyone know how I can find out what software to use to read these files?

Kind regards, David.

What kind of answer do you expect? Do you just want to know the used word processor? Or do you want to extract the contents? If so, with or without the formatting? Please edit your question and clarify. — the busybee, Commented Apr 20, 2023 at 5:56
@DavidSosa: You can try to install WordStar itself and see if the files open properly. On this site winworldpc.com/product/wordstar you can find various versions of the program. I did not use any of those files, and I do not plan to do it - simply because I do not have Wordstar files. I have the link from the Wikipedia site: justsolve.archiveteam.org/wiki/WordStar. I got there by searching "import from wordstar" - which returns quite a lot of results. — virolino, Commented Apr 20, 2023 at 12:55

scruss · Accepted Answer · 2023-04-20 18:36:04Z

19

WordTsar – A Wordstar clone might be able to read the files. It is open-source and cross-platform. Compatibility depends on the version of WordStar used to write the files.

It does implement an impressive number of WS's Dot Commands.

answered Apr 20, 2023 at 18:36

scruss

21.8k1 gold badge47 silver badges116 bronze badges

3

LOL, what a great name for a clone!
– Jörg W Mittag
Commented Apr 21, 2023 at 12:20
1

Awesome, thank you very much everyone for your valuable help. I am impressed by the power of the community of this forum. And many thanks to you, @scruss for giving me the direct solution. Have a lovely weekend =)
– David Sosa
Commented Apr 21, 2023 at 20:52
2

I'm really glad it worked for you, @DavidSosa. WordStar is one of the few legacy file formats that OpenOffice will not convert, so it's good to know that WordTsar can fill a gap
– scruss
Commented Apr 22, 2023 at 20:50

Add a comment |

manassehkatz-Moving 2 Codidact · Accepted Answer · 2023-04-20 02:19:06Z

33

Almost definitely WordStar or a compatible program such as NewWord.

dot commands at the top. From the WordStar 3.3 manual:
- .op - omit page numbers
- .mt1 - margin top 1 line
- .hm1 - heading margin 1 line
- .pl88 - page length 88 lines (likely 11" x 8 lines/inch)
- .. - next line is a comment line and not printed
8th bit of last letter of each word set high - just clear the bit to get regular text

The only thing that doesn't make sense to me is the 5 characters at the beginning of each regular text line. That may be an artifact of the copy/paste process - i.e., actually some other character.

answered Apr 20, 2023 at 2:19

manassehkatz-Moving 2 Codidact

15k2 gold badges41 silver badges67 bronze badges

1

Maybe they are "let's center the table using tabs, spaces or some other character"...
– virolino
Commented Apr 20, 2023 at 6:20
21

The † is unicode 0x2020 which is the same as two spaces run together as a 16 bit character. My guess is there has been some incorrect conversion between character sets along the way.
– JeremyP
Commented Apr 20, 2023 at 10:43
@virolino according to other answers/comments, that seems to be how Wordstar worked
– Esther
Commented Apr 20, 2023 at 16:47

Add a comment |

camelccc · Accepted Answer · 2023-04-19 16:20:20Z

19

Looks to me like a set of dot commands found at the beginning of a Wordstar file .mt1 margin at top, .hm heading margin. Try Wordstar 3

Unfortunately, modern word processors that I know of don't have import filters for Wordstar.

edited Apr 19, 2023 at 16:20

answered Apr 19, 2023 at 16:14

camelccc

6934 silver badges10 bronze badges

9

Also evident from the use of high ASCII characters as the last character of each word. WordStar only supports 7-bit ASCII; the 8th bit was used as a flag on the last character of words, presumably to indicate that the space(s) following the word could be adjusted (add or remove spaces, add or remove line breaks. WordStar didn’t format the paragraph on the fly; it literally changed spaces/line breaks, on the file’s content)
– Euro Micelli
Commented Apr 19, 2023 at 16:33
2

Although there is something weird about the specific high-Ascii characters being shown in the post. They don’t match what you would expect, even accounting for typical codepage conversion snafus between 437, 1252, 850, or even 860 for a stretch (none of the snafus I tried anyway)
– Euro Micelli
Commented Apr 19, 2023 at 16:55
1

Not for the faint-of-heart but I did find this on converting WS to MS Word: ataridogdaze.com/tech/….
– jwh20
Commented Apr 19, 2023 at 20:39
2

In reference to @jwh20’s suggestion: if the OP doesn’t have Perl, installing Perl just for this purpose might be a bit overkill. But the task is ultimately simple: “Set the 8th bit of every character to 0”. This can be easily reimplemented on just about any language amenable to the OP.
– Euro Micelli
Commented Apr 19, 2023 at 22:19
6

Retail versions of WordPerfect (up to at least version 11) should be able to import WordStar files and then re-save them in something that most modern word processors can read (plain text, RTF, WordPerfect, Word). In really old DOS versions the function isn't in the WordPerfect executable itself but rather in a standalone converter called CONVERT.EXE.
– Psychonaut
Commented Apr 20, 2023 at 13:05

| Show 6 more comments

Tanner Swett · Accepted Answer · 2023-04-21 11:18:48Z

10

manassehkatz's answer mentions that the 8th bit of the last letter of each word has been set high, so we should be able to clear the high bit in order to get the original text. However, it's not quite obvious what the character encoding of the text in the question is.

Investigation reveals that the text has been decoded as Mac OS Roman or a similar encoding. If we encode the text as Mac OS Roman, clear the high bits, and then decode it again, most of the characters now make sense.

The character † (dagger) changes to a space, and the character ç (c with cedilla) changes to a carriage return, so I've just left those as-is. If I replace all of the other characters, the result is:

††††† D.† Armando Sanchez D!az,† mayor de edad,† casado,† Graduado ç ç
†††††Social,† vecino de esta Ciudad con domicilio a efecto de notifiç ç

I have no explanation for why we end up with "D!az" instead of "Diaz" or "Díaz." Aside from that, this seems like perfectly normal Spanish text: "Mr. Armando Sánchez Díaz, of legal age, married, Graduado Social, resident of this City with address for purposes of notifi..."

edited Apr 21, 2023 at 11:18

answered Apr 20, 2023 at 22:07

Tanner Swett

2091 silver badge7 bronze badges

1

If this is truly a WordStar file then there is no real concept of character sets - it is simply 7-bit ASCII.
– manassehkatz-Moving 2 Codidact
Commented Apr 20, 2023 at 22:49
3

Yes, but the route from 7-bit ASCII “with a sometimes-set 8th-bit “ to Unicode, necessarily via one or more encoding conversion(s), can be very tortuous indeed.
– Euro Micelli
Commented Apr 21, 2023 at 1:15
@manassehkatz-Moving2Codidact a Spaniard or Mexican using WordStar in their own country would have their own character set, no?
– RonJohn
Commented Apr 21, 2023 at 4:37
2

Why do you guess that this is Mac OS Roman? WordStar was never released on a Macintosh system, so why would that be a plausible guess?
– Schmuddi
Commented Apr 21, 2023 at 6:28
7

@Schmuddi It's not just plausible, it's nearly certain. Note that I'm not claiming that WordStar did anything related to Mac OS Roman; what I'm saying happened is that, in the year 2023, when the asker opened up this file to view it, they viewed it using either Mac OS Roman or a similar encoding. And given the way that WordStar uses the high bit, and the letters that those Spanish words end with, the encoding must be one where the characters ÆÔ˙ÚÂ·‰Ó are all encoded the way that they are in Mac OS Roman.
– Tanner Swett
Commented Apr 21, 2023 at 11:12

| Show 3 more comments

Stack Exchange Network

Document format for a DOS word processor where control commands begin with a dot

4 Answers 4

You must log in to answer this question.

Not the answer you're looking for? Browse other questions tagged
ms-dos
file-format
identify-this-software
.

Linked

Hot Network Questions

Document format for a DOS word processor where control commands begin with a dot

4 Answers 4

You must log in to answer this question.

Not the answer you're looking for? Browse other questions tagged ms-dosfile-formatidentify-this-software.

Linked

Related

Hot Network Questions

Not the answer you're looking for? Browse other questions tagged
ms-dos
file-format
identify-this-software
.