Bug 141971

Summary: FILEOPEN: Add support for MIK character set
Product: LibreOffice Reporter: ianev
Component: CalcAssignee: Not Assigned <libreoffice-bugs>
Status: UNCONFIRMED ---    
Severity: enhancement CC: 79045_79045, erack, mikekaganski, sberg.fun
Priority: medium    
Version: unspecified   
Hardware: All   
OS: All   
See Also: https://bugs.documentfoundation.org/show_bug.cgi?id=146429
Whiteboard:
Crash report or crash signature: Regression By:
Attachments: Cyrillic letters (small and capital)

Description ianev 2021-04-29 13:30:39 UTC
Created attachment 171507 [details]
Cyrillic letters (small and capital)

The MIK character set was the most widely used encoding of Cyrillic letters in Bulgaria during the DOS era. There is a substantial amount of Lotus 1-2-3 files written in MIK. Because they are in a binary format (WK1), pure text converters are not able to convert the encoding in them without destroying the file format.

Reference: https://en.wikipedia.org/wiki/MIK_(character_set)
Code page 866 is a close partial substitute.
Comment 1 Roman Kuznetsov 2022-10-05 19:16:04 UTC
DOS era is ended around 25 years ago, and today all software shoud work using UTF-8.

I disagree with the idea.

Mike, what do you think about this?
Comment 2 Mike Kaganski 2022-10-05 21:07:58 UTC
I don't quite see why we couldn't have another table for MIK. IMO, that would align with "document liberation" goal. Stephan, Eike: any blockers or something that would make it undesirable?
Comment 3 Eike Rathke 2022-10-06 10:43:08 UTC
Never heard of MIK charset, but why not support it if it was actually used and would be needed to import legacy documents.

From a "DOS era is ended around 25 years ago, and today all software shoud work using UTF-8" standpoint we could throw away at least 4/5 of all encodings we support.