kieferk commented on August 19, 2024

Good catch! This is in fact a bug. It was happening because I was using the original dataframe's index to sort, then re-indexing with the sorted indices. When there were duplicate indices it would duplicate the rows.

It should be fixed now. I switched to positional indexing with .iloc instead.
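For anyone curious why duplicate index labels cause this, here is a minimal sketch in plain pandas. It is not the actual dfply code: the variable names and the exact shape of the old label-based lookup are assumptions made for illustration. It only shows that selecting by label on a non-unique index returns every matching row for each label, while selecting by position does not.

```python
import numpy as np
import pandas as pd

# Toy frame with a non-unique index, similar to the result of set_index("u").
df = pd.DataFrame(
    {"eventTime": ["01:04", "02:07", "01:09", "01:10"]},
    index=[1, 1, 1, 1],
)

# Label-based approach (assumed shape of the old behavior): sort the column,
# take the resulting index labels, then look rows up by label.
# Every label 1 matches all four rows, so rows get duplicated.
sorted_labels = df["eventTime"].sort_values().index
by_label = df.loc[sorted_labels]

# Positional approach: compute the sort order as integer positions and
# select with .iloc, which ignores the index labels entirely.
order = np.argsort(df["eventTime"].to_numpy(), kind="stable")
by_position = df.iloc[order]

print(len(df), len(by_label), len(by_position))
print(by_position)
```

The label-based result has more rows than the input, while the .iloc result keeps one row per original row and leaves the duplicate index labels untouched.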

I tried the same on my machine with the new master branch:

import pandas as pd
from dfply import *
utime = pd.DataFrame({"u":1,"eventTime":["01-01-1971 01:04:00","01-01-1971 02:07:00","01-01-1971 01:09:00","01-01-1971 01:10:00"]})

print(utime >> arrange(X.eventTime))
             eventTime  u
0  01-01-1971 01:04:00  1
2  01-01-1971 01:09:00  1
3  01-01-1971 01:10:00  1
1  01-01-1971 02:07:00  1

utime = utime.set_index("u")

print(utime >> arrange(X.eventTime))
             eventTime
u                     
1  01-01-1971 01:04:00
1  01-01-1971 01:09:00
1  01-01-1971 01:10:00
1  01-01-1971 02:07:00

That is the behavior you expected. If you pull the master branch and reinstall, it should work.
