Bug 143763 - Aragonese (an_ES) spellchecking dictionary has extra unneeded slashes
Summary: Aragonese (an_ES) spellchecking dictionary has extra unneeded slashes
Status: NEW
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Linguistic (show other bugs)
Version:
(earliest affected)
unspecified
Hardware: All All
: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2021-08-06 19:54 UTC by Dimitrij Mijoski
Modified: 2021-08-14 11:34 UTC (History)
1 user (show)

See Also:
Crash report or crash signature:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Dimitrij Mijoski 2021-08-06 19:54:43 UTC
Description:
Here is the dictionary https://cgit.freedesktop.org/libreoffice/dictionaries/tree/an_ES/an_ES.dic

If we do run:

grep -P '/\s*$' an_ES.dic

we can find the problematic lines. There are 2315 problematic lines. A short example:

doi/
voi/
vas/
va/
vamos/
vatz/
van/
iba/
ibas/
ibanos/

Those slashes should be removed. That will reduce the files size and speedup the loading of the dictionary. The fix can be done with sed to replace the slashes and then with uniq to remove some adjacent duplicate lines.

This bug should be fixed here as this dictionary has no author's upstream, this is its best upstream. 

Steps to Reproduce:
1. cd to this directory https://cgit.freedesktop.org/libreoffice/dictionaries/tree/an_ES
2. run grep -P '/\s*$' an_ES.dic

Actual Results:
Grep returns 2315 lines.

Expected Results:
Grep should return nothing.


Reproducible: Always


User Profile Reset: No



Additional Info:
No other information.