I am trying to convert a PDF to a tag file. It worked perfectly fine in Python 2. Tried th



convert to tag does not work in python 3 (pdfminer.six, closed)

johnsonice commented on May 18, 2024

convert to tag does not work in python 3

from pdfminer.six.

Comments (5)

goulu commented on May 18, 2024

can anyone add a test to reproduce this please ? thanks !


bittner commented on May 18, 2024

@johnsonice It looks like the error message is incomplete. Can you add the missing pieces, and fix the markup using ``` python (which starts a multiline code block)?


marine9357 commented on May 18, 2024

I'm trying to do this as well. Here's the error I get:

File "C:\Users\Marine\Anaconda3\lib\site-packages\pdfminer\tools\pdf2txt.py", line 114, in main 
    interpreter.process_page(page)
File "C:\Users\Marine\Anaconda3\lib\site-packages\pdfminer\pdfinterp.py", line 851, in process_page 
    self.device.begin_page(page, ctm)
File "C:\Users\Marine\Anaconda3\lib\site-packages\pdfminer\pdfdevice.py", line 155, in begin_page 
    self.outfp.write(utils.make_compat_bytes(output))
TypeError: write() argument must be str, not bytes

I ran the command like this:

pdf2txt.py -c UTF-8 -o artscient -t tag structure_article_scientifique.pdf

Could it be a problem with the codec value? I also tried removing the codec parameter, but the same error appears.
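
For what it's worth, the last frame of the traceback suggests the failure is at the write call itself rather than in the codec: `outfp.write(...)` is handed bytes, and the `TypeError` is the one Python 3 raises when a file opened in text mode receives bytes. This has not been confirmed against the pdfminer.six source in this thread, so treat it as one plausible reading. The sketch below reproduces the same kind of error with the standard library only; `try_write` and the sample tag string are hypothetical names made up for illustration.

```python
import os
import tempfile


def try_write(mode, payload):
    """Write payload to a temp file opened in `mode`.

    Returns None on success, or the TypeError message on failure.
    (Hypothetical helper, not part of pdfminer.)
    """
    fd, path = tempfile.mkstemp()
    os.close(fd)
    try:
        with open(path, mode) as fp:
            fp.write(payload)
        return None
    except TypeError as exc:
        return str(exc)
    finally:
        os.remove(path)


# Stand-in for encoded tag output (made-up sample, not real pdf2txt output).
tag_output = '<page id="1">'.encode("utf-8")

text_mode_error = try_write("w", tag_output)     # text-mode file, bytes payload
binary_mode_error = try_write("wb", tag_output)  # binary-mode file, bytes payload

print("text mode:  ", text_mode_error)
print("binary mode:", binary_mode_error)
```

If this reading is right, changing `-c` would not help, because the mismatch is between the mode the output file is opened in and the type of the data written to it.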


TamasNeumer commented on May 18, 2024

I'm having the same issue on both Python 2 and Python 3.6.


pietermarsman commented on May 18, 2024

Closing this because there is nothing reproducible. Feel free to reopen if the problem still exists.

I've tested whether the function behaves correctly on Python 3.7, and it does.

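# Imports are assumed, since the original snippet omitted them:
# nose's assert_equal and make_compat_str from pdfminer.utils.
from nose.tools import assert_equal

from pdfminer.utils import make_compat_str


class TestCompatStr:
    def test_bytes(self):
        # bytes input should come back as the equivalent str
        s = 'Hello World!'
        b = s.encode()
        ret = make_compat_str(b)
        assert_equal(s, ret)

    def test_str(self):
        # str input should pass through unchanged
        s = 'Hello World!'
        ret = make_compat_str(s)
        assert_equal(s, ret)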