Hello,
I use Aspose.PDF Cloud to convert PDF documents into HTML. We found that in one particular situation the rendered HTML comes out wrong. Here is an example.
We have this as text in a PDF (in the PDF, it is centered):
VIVAMUS PRETIUM ULTRICES
morbi accumsan turpis ante, in suscipit lectus venenatis hendrerit
suspendisse eget dictum tortor, nec ultricies odio
nam arcu neque, dictum vel velit sit amet, sagittis bibendum odio. In volutpat ornare mauris
ut interdum libero eu vestibulum feugiat. Nunc et ornare libero, sed euismod lectus
duis ante sem, accumsan vitae eros non, pretium sodales nibh
ut molestie aliquet augue, ut lacinia purus interdum non phasellus
quisque convallis augue vitae luctus.
______________________________
LAOREET
for
Nullam Tristique
Ipsum Luctus
Tortor Venenatis
Diam Dignissim
Gravida Diam
23 November 2015
Sed ligula sem, ullamcorper id est at, sollicitudin aliquet augue. Etiam eget elit dolor. Pellentesque ut ipsum leo. Proin ultrices nulla at scelerisque varius.
I had to obfuscate the text, but the only parts that matter are the line of underscores (____) and the date (23 November 2015).
In the HTML produced, I get this:
VIVAMUS PRETIUM ULTRICES
morbi accumsan turpis ante, in suscipit lectus venenatis hendrerit
suspendisse eget dictum tortor, nec ultricies odio
nam arcu neque, dictum vel velit sit amet, sagittis bibendum odio. In volutpat ornare mauris
ut interdum libero eu vestibulum feugiat. Nunc et ornare libero, sed euismod lectus
duis ante sem, accumsan vitae eros non, pretium sodales nibh
ut molestie aliquet augue, ut lacinia purus interdum non phasellus
quisque convallis augue vitae luctus.
_
_____________________________
LAOREET
for
Nullam Tristique
Ipsum Luctus
Tortor Venenatis
Diam Dignissim
2
3 November 2015
Sed ligula sem, ullamcorper id est at, sollicitudin aliquet augue. Etiam eget elit dolor. Pellentesque ut ipsum leo. Proin ultrices nulla at scelerisque varius.
To improve readability, I changed the indentation and highlighted the underscore line and the date in green/red.
As you can see, the conversion splits the first character off from the rest of these two lines. When I open the HTML file, the first underscore (_) sits on its own line above the remaining underscores, and likewise the 2 sits one line above "3 November 2015".
We also noticed that for each isolated character (_ and 2), the line does not end with , whereas all the other lines, which have no isolated character, do end with .
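In case it helps to reproduce this, here is a minimal sketch of how I check for the pattern on my side. It only compares plain lists of text lines (the text as it appears in the PDF versus the text lines seen in the HTML) and does not call any Aspose API. The function name `find_split_lines` and the sample data are my own and purely illustrative.

```python
def find_split_lines(source_lines, html_lines):
    """Return (index, first_char, rest) for each place where a source line
    shows up in the HTML text as a lone first character followed by the rest."""
    expected = {line.strip() for line in source_lines}
    hits = []
    for i in range(len(html_lines) - 1):
        head = html_lines[i].strip()
        rest = html_lines[i + 1].strip()
        # A one-character line that, glued to the next line, rebuilds a source line.
        if len(head) == 1 and head + rest in expected:
            hits.append((i, head, rest))
    return hits


# Illustrative data modelled on the example above (not real converter output).
source = ["LAOREET", "_" * 30, "Diam Dignissim", "23 November 2015"]
html = ["LAOREET", "_", "_" * 29, "Diam Dignissim", "2", "3 November 2015"]

for index, head, rest in find_split_lines(source, html):
    print(index, repr(head), repr(rest))
```

This only flags the affected lines; it does not repair the HTML.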
Are you aware of this issue? Is there a workaround? Will it be fixed?
Thanks