Rate limit to build package (quilt, closed)

quiltdata commented on September 3, 2024
Rate limit to build package

from quilt.

Comments (5)

kevinemoore commented on September 3, 2024

@AliferSales, you might want to try adding the encoding as a kwarg like this:
  rj_bu_2014_1:
    file: rj_bu_2014_1.csv
    kwargs:
      encoding: ISO-8859-1

If that doesn't work, you can use transform: id as @akarve suggested, to build a package with the raw csv file. You can then push that package with quilt push --public alifersales/<packagename> so we'll be able to reproduce the error. Thanks!

akarve commented on September 3, 2024

OK, temporary solution. Put transform: id underneath, and at the same level of indentation as, file: rj.... That will copy the file, and you can parse it in pandas.

There is probably a datetime or format/type change later in the full file that the Parquet serializer doesn't like. Later today I'll provide an upload link for the full file so we can see what's up. Thanks for reporting this.

kevinemoore commented on September 3, 2024

This looks like quilt build is failing to parse your CSV. Build uses pandas read_csv to read CSV files into a DataFrame, then serializes the DataFrame. You might need to pass some additional parameters to pandas, which you can do with a build file. Are you building from a build file (e.g. build.yml), or straight from a directory path?
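
As a rough illustration of why an extra parameter can matter, here is a minimal sketch of calling pandas read_csv with and without an explicit encoding. The sample rows and the file name sample.csv are made up for the example, and it assumes the kwargs in a build file reach read_csv as described above; it is not quilt's actual build code.

```python
import os
import tempfile

import pandas as pd

# Hypothetical sample data: "São Paulo" contains a non-ASCII character.
rows = "city,count\nRio de Janeiro,1\nSão Paulo,2\n"

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "sample.csv")  # made-up file name
    with open(path, "w", encoding="ISO-8859-1", newline="") as f:
        f.write(rows)

    # read_csv assumes UTF-8 by default, so the Latin-1 byte for "ã"
    # cannot be decoded and the parse fails.
    try:
        pd.read_csv(path)
        default_ok = True
    except UnicodeDecodeError:
        default_ok = False

    # Passing the encoding explicitly lets pandas decode the file.
    df = pd.read_csv(path, encoding="ISO-8859-1")

print(default_ok)
print(df["city"].tolist())
```

If the direct pandas call only succeeds with a given keyword argument, the same argument is the one to try under kwargs in the build file.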

AliferSales commented on September 3, 2024

Yes. The build.yaml that I'm using is:

$cat build.yaml
contents:
  README:
    file: README.md
  rj_bu_2014_1:
    file: rj_bu_2014_1.csv

I can open the file with pandas, and the only non-default setting is the encoding, "ISO-8859-1".

However, I created another file from the head of my CSV:

$head rj_bu_2014_1.csv >> new_file.csv

Then I tried to build a package from this new file, and the error did not occur. That is why I asked about a rate limit.
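
One possible explanation for this difference, offered as an assumption rather than something verified against the real file: if the first lines are pure ASCII, they decode fine as UTF-8, and only a later line with an accented character triggers the failure. A minimal sketch with made-up contents:

```python
# Hypothetical file contents: early lines are ASCII, a later line is not.
lines = ["id,name"] + [f"{i},row{i}" for i in range(1, 12)] + ["12,São Paulo"]
raw = "\n".join(lines).encode("ISO-8859-1")

# Roughly what `head` keeps by default: the first 10 lines.
head = b"\n".join(raw.split(b"\n")[:10])


def decodes_as_utf8(data: bytes) -> bool:
    """Return True if the bytes are valid UTF-8."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


print(decodes_as_utf8(head))  # ASCII-only prefix
print(decodes_as_utf8(raw))   # full contents, including the Latin-1 byte
print(raw.decode("ISO-8859-1").splitlines()[-1])
```

Under that assumption, the truncated file builds because it never reaches the non-UTF-8 bytes, not because of any size or rate limit.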

AliferSales commented on September 3, 2024

Solved!

Really, the problem was with the pandas parsing. I didn't know that the parsing is done by pandas. I just needed to make the encoding explicit (in this case, ISO-8859-1) and bingo, as @kevinemoore suggested!

Thanks, @kevinemoore and @akarve. Quiltdata is a very good idea and looks very practical. I'm beginning to use it and I'm very excited.

:D
