Bug 158770 - German "Umlaut" from a PDF cannot be pasted
Summary: German "Umlaut" from a PDF cannot be pasted
Status: NEEDINFO
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: LibreOffice (show other bugs)
Version:
(earliest affected)
7.5.9.2 release
Hardware: All Linux (All)
: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2023-12-19 07:32 UTC by documentfoundation.ytpqh
Modified: 2024-01-02 17:30 UTC (History)
2 users (show)

See Also:
Crash report or crash signature:


Attachments
Screenshot of LibreOffice writer showing the error with the German "Umlaut" Symbol (47.12 KB, image/png)
2023-12-19 07:49 UTC, documentfoundation.ytpqh
Details

Note You need to log in before you can comment on or make changes to this bug.
Description documentfoundation.ytpqh 2023-12-19 07:32:05 UTC
Description:
German "Umlaut" symbols (ä,ö,ü) cannot be pasted correctly from the clipboard when they are taken from a PDF file.

The same symbols can be copied/pasted correctly in any other application that I tested. For example: Kate, VSCodium, Joplin, etc.
 
When the PDF file is opened in the browser (Firefox) the German "Umlaut" symbols can be copied correctly, but if the PDF is opened on "Okular", the symbols cannot be pasted into LibreOffice.

Steps to Reproduce:
1. Download the following PDF File > https://www.amboss.com/media/de/m2-lernplan-f24
2. Open the PDF in Okular
3. Go to page 3 and copy some words with the German "Umlaut" (ä,ö,ü). For example: "Dyslipidämie"
4. Paste the word into LibreOffice (Calc, Writer, etc).
5. See the error (The German Umlaut is not shown properly).

Actual Results:
The German Umlaut (ä,ö,ü) is not shown correctly.

Expected Results:
The German Umlaut should be pasted correctly in LibreOffice, as in any other software.


Reproducible: Always


User Profile Reset: Yes

Additional Info:
Version: 7.5.9.2 (X86_64) / LibreOffice Community
Build ID: 50(Build:2)
CPU threads: 8; OS: Linux 6.6; UI render: default; VCL: kf5 (cairo+xcb)
Locale: en-US (en_US); UI: en-US
7.5.9-2
Calc: threaded

-----
The error also occurs on "LibreOffice Fresh" installed from the Official AUR Repository on Arch Linux (Endeavour OS - KDE).

The error also occurs when LibreOffice is in German (with the German package installed from Arch Repository). Moreover, if the OS is in German, the same bug appears. 

It seems a problem specific to LibreOffice because any other software can paste the Umlaut correctly from that PDF file.
Comment 1 documentfoundation.ytpqh 2023-12-19 07:49:21 UTC
Created attachment 191498 [details]
Screenshot of LibreOffice writer showing the error with the German "Umlaut" Symbol
Comment 2 Stéphane Guillou (stragu) 2024-01-02 17:30:17 UTC
(In reply to documentfoundation.ytpqh from comment #0)
> When the PDF file is opened in the browser (Firefox) the German "Umlaut"
> symbols can be copied correctly, but if the PDF is opened on "Okular", the
> symbols cannot be pasted into LibreOffice.

No repro copying from page 3:

Dyslipidämie

Using Okular 1.9.3's Tools > Text Selection, and pasting into LO Writer:

Version: 7.6.4.1 (X86_64) / LibreOffice Community
Build ID: e19e193f88cd6c0525a17fb7a176ed8e6a3e2aa1
CPU threads: 8; OS: Linux 5.15; UI render: default; VCL: gtk3
Locale: en-AU (en_AU.UTF-8); UI: en-US
Calc: threaded

Version: 7.5.9.2 (X86_64) / LibreOffice Community
Build ID: cdeefe45c17511d326101eed8008ac4092f278a9
CPU threads: 8; OS: Linux 5.15; UI render: default; VCL: kf5 (cairo+xcb)
Locale: en-AU (en_AU.UTF-8); UI: en-US
Calc: threaded

Which version of Okular?
Does it happen in a brand new Writer document as well?
Do you use any clipboard manager?