Repeated word out of place ruins text  #4042

@brucenielson

Karl Popper A World of Propensities

I'm using version 1.24.13.

Attached is a PDF I'm trying to load as markdown. The first page reads as follows:

I shall begin with some personal memories...

It
was 54 years ago, in Prague in August 1934, that I first
attended an International Congress of Philosophy. I found it
uninspiring. But the Congress was preceded by another meeting
in Prague, organized by Otto Neurath, who had kindly invited
me to attend a 'Preliminary Conference' ('Vorkonferenz' as he
called it) which he organized on behalf of the Vienna Circle.

I came to Prague with the corrected page proofs of my book,
It
Logik der Forschung. was published three months later...essentially an Aristotelian theory at which, it appears,
Tarski and Godel arrived, independently at almost the same
It
time. was first published by Tarski in 1930, whereupon
It
Godel, of course, accepted Tarski's priority.

Note the stray line break after the first "It". From then on, the word "It" reappears as an unwanted artifact on its own line, breaking up the text several more times. The same problem recurs on the next page with a different word.
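To make the symptom easier to spot in a long extraction, here is a minimal sketch that flags short words sitting alone on a line more than once. It is only a diagnostic heuristic, not a fix, and it assumes the stray word always lands on its own line as in the excerpt above. The function name `find_stray_single_word_lines` and its thresholds are hypothetical.

```python
from __future__ import annotations


def find_stray_single_word_lines(
    text: str, min_repeats: int = 2, max_len: int = 4
) -> dict[str, list[int]]:
    """Map each short word that appears alone on a line to its line numbers.

    Only words that occur alone on at least `min_repeats` lines are returned.
    Heuristic only: a legitimate one-word line repeated in the text would
    also be reported.
    """
    hits: dict[str, list[int]] = {}
    for lineno, line in enumerate(text.splitlines(), start=1):
        token = line.strip()
        # Candidate artifact: the whole line is one short alphabetic token.
        if token.isalpha() and len(token) <= max_len:
            hits.setdefault(token, []).append(lineno)
    return {word: nums for word, nums in hits.items() if len(nums) >= min_repeats}
```

Running it on the extracted text of the first page should report "It" together with the line numbers where it appears alone.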

When I load the same PDF with PyPDF, the problem goes away. (But obviously PyPDF isn't trying to convert to markdown.)
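One way to pin down exactly which words are misplaced is to diff the two extractions word by word. The sketch below uses only the standard library and assumes both outputs are already available as plain strings, with the PyPDF text as the reference and the markdown text as the candidate. The function name `extra_tokens` is hypothetical.

```python
import difflib


def extra_tokens(reference: str, candidate: str) -> list[str]:
    """Return words inserted in `candidate` relative to `reference`.

    Both texts are split on whitespace, so differences in line breaks are
    ignored and only the word sequence is compared.
    """
    ref_words = reference.split()
    cand_words = candidate.split()
    matcher = difflib.SequenceMatcher(a=ref_words, b=cand_words, autojunk=False)
    extras: list[str] = []
    for tag, _i1, _i2, j1, j2 in matcher.get_opcodes():
        # "insert" marks words present only in the candidate at this position.
        if tag == "insert":
            extras.extend(cand_words[j1:j2])
    return extras
```

Markdown syntax in the candidate (headings, emphasis markers) would also show up as differences, so the result needs a manual look before drawing conclusions.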

See: https://bugs.ghostscript.com/show_bug.cgi?id=708129

I can get you the file I used if desired.

Labels: not a bug / user error / unable to reproduce
