Bug 99737

Summary: limitation of the HTML filter to handle multiple adjacent tags
Product: LibreOffice
Reporter: Ramyani Ghosh <rmghosh>
Component: Writer
Assignee: Not Assigned <libreoffice-bugs>
Status: NEW
Severity: minor
CC: buzea.bogdan, ilmari.lauhakangas
Priority: medium
Keywords: filter:docx
Version: 4.2.8.2 release
Hardware: All
OS: All
Bug Blocks: 108799
Attachments: test.html

Description Ramyani Ghosh 2016-05-09 05:24:27 UTC
`<strong>` and `<em>` do not work together when converting from HTML to DOCX with LibreOffice's "docx:Office Open XML Text" filter.

I tried to convert the HTML file to DOCX with LibreOffice using the following command:

    soffice --headless --convert-to "docx:Office Open XML Text" 'test.html'

Contents of test.html:

    <html>
      <head>
      </head>
      <body>
        <strong><em>Apply em then strong</em></strong>
        <em><strong>Apply strong then em</strong></em>
      </body>
    </html>

When I convert this test.html to DOCX, `<strong><em>Apply em then strong</em></strong>` ends up with only the `<em>` formatting applied, and `<em><strong>Apply strong then em</strong></em>` ends up with only the `<strong>` formatting applied. In both cases only the innermost tag takes effect.
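
To check the result without opening the file in a word processor, the run properties in the converted DOCX can be inspected directly. The following is a minimal sketch, not part of the original report. The helper names `run_formatting` and `_is_on` are hypothetical, and the sketch assumes the output is a standard DOCX package with its body in `word/document.xml`. It is not confirmed here whether the export writes direct bold/italic properties (`w:b`, `w:i`) or a character style reference (`w:rStyle`), so the sketch reports both and does not resolve what a referenced style looks like.

```python
import sys
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML main namespace used in word/document.xml.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
NS = {"w": W_NS}


def _is_on(rpr, tag):
    """Return True if the toggle property (e.g. 'b' or 'i') is enabled in rPr."""
    if rpr is None:
        return False
    el = rpr.find(f"w:{tag}", NS)
    if el is None:
        return False
    # A toggle element without w:val counts as enabled.
    return el.get(f"{{{W_NS}}}val") not in ("0", "false", "off")


def run_formatting(docx_file):
    """Return (text, bold, italic, char_style) for each non-empty text run.

    Only direct run formatting is examined; formatting inherited from a
    paragraph or character style is reported by style name only.
    """
    with zipfile.ZipFile(docx_file) as zf:
        root = ET.fromstring(zf.read("word/document.xml"))
    result = []
    for run in root.iter(f"{{{W_NS}}}r"):
        text = "".join(t.text or "" for t in run.findall("w:t", NS))
        if not text.strip():
            continue
        rpr = run.find("w:rPr", NS)
        style_el = rpr.find("w:rStyle", NS) if rpr is not None else None
        style = style_el.get(f"{{{W_NS}}}val") if style_el is not None else None
        result.append((text, _is_on(rpr, "b"), _is_on(rpr, "i"), style))
    return result


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python check_runs.py test.docx
    for text, bold, italic, style in run_formatting(sys.argv[1]):
        print(f"{text!r}: bold={bold} italic={italic} style={style}")
```

Run against the `test.docx` produced by the command above, this lists one entry per text run, which shows whether each line of the test file kept both kinds of formatting or only one.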
Comment 1 Ramyani Ghosh 2016-05-16 12:12:01 UTC
I found this problem in LibreOffice 4.2.8.2. I have since updated LibreOffice to version 5.1.3.2, and it has the same issue.
Comment 2 Buovjaga 2016-05-18 07:17:07 UTC
Repro.

Win 7 Pro 64-bit Version: 5.2.0.0.alpha1+
Build ID: f688acfdae00ebdd891737e533d54368810185e1
CPU Threads: 4; OS Version: Windows 6.1; UI Render: default; 
TinderBox: Win-x86@62-merge-TDF, Branch:MASTER, Time: 2016-05-18_00:11:31
Locale: fi-FI (fi_FI)
Comment 3 Buovjaga 2016-05-18 07:17:22 UTC
Created attachment 125136
test.html