Coder Social home page Coder Social logo

woe's People

Contributors

boredbird avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

woe's Issues

Documentation

Hi, I am looking for documentation about your package : what can be done with it and how. Is there such a document somewhere? I have not been able to locate one so far. Thanks for your help.

check_point 函数在合并分段的时候有一个错误

check_point 函数在合并分段的时候有一个错误,
feature_process.py 112行这里:
pdf = df[(df[var] > split[i]) & (df[var] <= split[i+1])]
没有考虑到相邻的split中的sample数量都少于min_sample,但是合起来又大于min_sample的情况。
可以改为:

pre_left_position = float('-inf') # a value to remember the left point of the segmentation
for i in range(-1,split.__len__()-1):
    pdf = df[(df[var] > pre_left_position) & (df[var] <= split[i+1])]
    if (pdf.shape[0] < min_sample) or (np.unique(pdf['target']).__len__()<=1):
        #print(var, pre_left_position, i, "continue")
        continue
    else:
        new_split.append(split[i+1])
        pre_left_position = split[i+1]

离散型变量调用bin_data_split

离散型变量为什么要使用bin_data_split,因为bin_data_split中如果np.unique(var)<=8时候,只调用了下check_point()就直接retrun了,外面又调用check_point(),如果离散变量的个数>8怎么办?就按照百分位做了吗?

Too many debug logs

When I use proc_woe_continuous and eval_feature_detail to get iv results, there are too many msgs printed to terminal. Can it be quiet? Or can you use a debug parameter to control standard output?

How to make transformations for test values

Greetings and congrats for your work here.

I was wondering how can I make transformations on unseen data, I mean data for which we dont have a target variable. Is it possible? and then use the new woed features as input for other machine learning algorithm?

Thanks!

could not import woe.config as config

File "", line 8, in
import woe.config as config

File "apps/python/anaconda3/lib/python3.6/site-packages/woe/config.py", line 51
raise KeyError,"Invalid dtype specified! "
^
SyntaxError: invalid syntax

could not import woe.config as config
how to support python3 ?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.