Channel: MobileRead Forums - Reading and Management

Removing line breaks with regex when converting a PDF

I have a PDF file that ends up with unnecessary line breaks when I convert it to EPUB. Heuristic processing doesn't remove them, even when I set it to 1. So I thought of using a regex to replace those breaks with "blank".

Example 1
paying their own</p>
<p class="calibre1">money

Example 2
wrong.</p>
<p class="calibre1">“Who did this....

I can write a regex that finds the breaks that don't follow a '.':
[^\.]</p>\n<p class="calibre1">

but the match also includes the last character before the break (i.e. the "n" from "own" in the first example), so that character gets replaced too.

Is there any way to match the break without also removing that last character?
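
One possible approach, sketched here in Python on the two examples above: either use a negative lookbehind, which checks the preceding character without making it part of the match, or capture that character in a group and put it back in the replacement. This is only a sketch; it assumes the converter accepts Python-style regular expressions, that the break is a single "\n", and that a single space is the wanted replacement. The names `example_1`, `example_2`, `join_with_lookbehind` and `join_with_group` are made up for illustration.

```python
import re

# The two samples from the post.
example_1 = 'paying their own</p>\n<p class="calibre1">money'
example_2 = 'wrong.</p>\n<p class="calibre1">“Who did this....'

# Option A: negative lookbehind. (?<!\.) asserts that the character before
# </p> is not a full stop, but that character is not part of the match.
LOOKBEHIND = re.compile(r'(?<!\.)</p>\n<p class="calibre1">')

# Option B: capture the preceding character and re-insert it with \1.
GROUP = re.compile(r'([^.])</p>\n<p class="calibre1">')


def join_with_lookbehind(text):
    """Replace breaks that do not follow a '.' with a single space."""
    return LOOKBEHIND.sub(' ', text)


def join_with_group(text):
    """Same result, but the matched character is restored via the group."""
    return GROUP.sub(r'\1 ', text)


print(join_with_lookbehind(example_1))
print(join_with_group(example_1))
print(join_with_lookbehind(example_2))
```

In a search-and-replace field, this would correspond to searching for `(?<!\.)</p>\n<p class="calibre1">` and replacing with a space, or searching for `([^.])</p>\n<p class="calibre1">` and replacing with `\1` followed by a space. Breaks that follow a '.', as in Example 2, are left alone by both patterns.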
