Comments (17)

MaliParag commented on June 21, 2024

Did you try other PDFs?

from tfd-icdar2019.

VladimirKalachikhin commented on June 21, 2024

Yes, I downloaded these files:
http://aif.centre-mersenne.org/article/AIF_1970__20_1_493_0.pdf ,AIF_1970_493_498.pdf
http://aif.centre-mersenne.org/article/AIF_1999__49_2_375_0.pdf ,AIF_1999_375_404.pdf
http://www.numdam.org/article/ASENS_1970_4_3_3_273_0.pdf ,ASENS_1970_273_284.pdf
http://www.numdam.org/article/ASENS_1997_4_30_3_367_0.pdf ,ASENS_1997_367_384.pdf
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC323452/pdf/pnas00314-0027.pdf ,Borcherds86.pdf
http://www.numdam.org/article/BSMF_1970__98__165_0.pdf ,BSMF_1970_165_192.pdf
http://www.numdam.org/article/BSMF_1998__126_2_245_0.pdf ,BSMF_1998_245_271.pdf
http://people.virginia.edu/~lls2l/finite_dimensional.pdf ,Cline88.pdf

Other files are unavailable.

The bounding boxes are placed correctly on the math regions only for Borcherds86.pdf and Cline88.pdf. For the other files, the bounding boxes are completely displaced.

BigPandaCPU commented on June 21, 2024

Dear sir,
I got the same error too. Nine PDF files are displaced:
AIF_1970_493_498, AIF_1999_375_404, ASENS_1970_273_284,
Bergweiler83, BSMF_1970_165_192, BSMF_1998_245_271,
InvM_1970_121_134, MA_1970_26_38, MA_1977_275_292.
The others match the labels well.
The attached image is 1.png from AIF_1999_375_404.pdf.

MaliParag commented on June 21, 2024

Which version of pdf2image are you using?

I think I used the following version -

Name: pdf2image
Version: 1.5.4
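
For anyone comparing setups, here is a minimal sketch of how the installed version could be read from Python. It relies only on the standard-library `importlib.metadata`; the helper name `installed_version` is hypothetical and is not part of pdf2image.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version string of `package`, or None if it is not installed."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Prints the pdf2image version, or None when the package is missing.
    print("pdf2image:", installed_version("pdf2image"))
```

Posting this value together with the page image size makes it easier to see whether two people rendered the same PDF differently.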

macqueen09 commented on June 21, 2024

Many of the PDF links are not available.
Does anyone have a package of all the PDF files? Could you share a link via Google Drive, Baidu, or something else? Thanks.

VladimirKalachikhin commented on June 21, 2024

The answer to these questions:
https://github.com/VladimirKalachikhin/marmot-to-ICDAR

humeme commented on June 21, 2024

@MaliParag I got the same problem on AIF_1999_375_404.pdf, 2.png, with pdf2image version 1.5.4.
(images attached)

Jeozhao commented on June 21, 2024

Hi @VladimirKalachikhin,
I have the same problem as you. I found that some images do not match their corresponding ground truth (GT). Have you solved this problem now?
Thank you!

Jeozhao commented on June 21, 2024

Hi @MaliParag,

Could you please share your image dataset with us?
I found that different download sources and different versions of the PDF-to-PNG conversion tool can cause the images not to match the GT. We would be very grateful if you could share your dataset with us.

VladimirKalachikhin commented on June 21, 2024

> Have you solved this problem now?

I used the MARMOT dataset; see above.

Jeozhao commented on June 21, 2024

> > Have you solved this problem now?
>
> I used the MARMOT dataset; see above.

Hi @VladimirKalachikhin,
Can this data be converted to match TFD-ICDAR2019?
Or can only the format be made consistent, while the content stays different?
Thanks!

VladimirKalachikhin commented on June 21, 2024

I don't quite understand you. MARMOT is just another dataset. I created a simple tool to convert MARMOT to an ICDAR-compatible format so that it can be used with the ICDAR tools.

Jeozhao commented on June 21, 2024

> I don't quite understand you. MARMOT is just another dataset. I created a simple tool to convert MARMOT to an ICDAR-compatible format so that it can be used with the ICDAR tools.

Thank you for your reply. I understand what you mean.

ducMNSD commented on June 21, 2024

> Dear sir,
> I got the same error too. Nine PDF files are displaced:
> AIF_1970_493_498, AIF_1999_375_404, ASENS_1970_273_284,
> Bergweiler83, BSMF_1970_165_192, BSMF_1998_245_271,
> InvM_1970_121_134, MA_1970_26_38, MA_1977_275_292.
> The others match the labels well.
> The attached image is 1.png from AIF_1999_375_404.pdf.

Could you share with me all the image datasets that you created? Thank you very much!

BigPandaCPU commented on June 21, 2024

> > Dear sir,
> > I got the same error too. Nine PDF files are displaced:
> > AIF_1970_493_498, AIF_1999_375_404, ASENS_1970_273_284,
> > Bergweiler83, BSMF_1970_165_192, BSMF_1998_245_271,
> > InvM_1970_121_134, MA_1970_26_38, MA_1977_275_292.
> > The others match the labels well.
> > The attached image is 1.png from AIF_1999_375_404.pdf.
>
> Could you share with me all the image datasets that you created? Thank you very much!

I got the data from here:
https://github.com/MaliParag/TFD-ICDAR2019#download-instructions
(screenshot attached)

The download link file:
(image attached)

MingchangLi commented on June 21, 2024

NOTE: If you find that the bounding boxes are displaced from the math regions, it is because the document image you rendered has a different size from the one used for annotation. datasetV2 provides file sizes for each image. Resize the image that you have rendered to the size provided in datasetV2 and you should be able to use the annotations.
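
The size correction described in this note can be sketched in Python as follows. Instead of resizing the rendered image, this variant scales the annotation boxes to the rendered image size, which is the same correction applied in the opposite direction. The box layout `(x_min, y_min, x_max, y_max)` in pixels, the function name `scale_boxes`, and the example numbers are assumptions for illustration, not the dataset's documented format.

```python
from typing import List, Tuple

# Assumed box layout: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


def scale_boxes(boxes: List[Box],
                annotated_size: Tuple[int, int],
                rendered_size: Tuple[int, int]) -> List[Box]:
    """Map boxes from the annotated image size to the rendered image size.

    Both sizes are (width, height) in pixels.
    """
    sx = rendered_size[0] / annotated_size[0]  # horizontal scale factor
    sy = rendered_size[1] / annotated_size[1]  # vertical scale factor
    return [(x0 * sx, y0 * sy, x1 * sx, y1 * sy) for x0, y0, x1, y1 in boxes]


if __name__ == "__main__":
    # Hypothetical numbers: annotated on a 1000x1500 page, rendered at 2000x3000.
    print(scale_boxes([(100, 200, 300, 400)], (1000, 1500), (2000, 3000)))
```

To follow the note literally and resize the image instead, Pillow's `Image.resize((width, height))` with the size listed in datasetV2 should have the same effect. Either way, this only corrects a pure size mismatch; it assumes the page was not cropped or shifted during rendering.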

VladimirKalachikhin commented on June 21, 2024

> datasetV2 provides file sizes for each image.

I know.
