Feature generation that can be used for time series: the trick from the London talk.

feature request: RBFRepeater about scikit-lego HOT 11 CLOSED

koaning commented on September 24, 2024

feature request: RBFRepeater

from scikit-lego.

Comments (11)

MaxHalford commented on September 24, 2024

I just did :)

MaxHalford commented on September 24, 2024

Hey @koaning,

I stumbled upon your talk a few days ago and really enjoyed many of your talking points. I was curious about the RBF kernel trick, so I decided to implement it in an online learning library some friends and I are working on. From what I understand, the idea is simply to compute the distance between, say, a month and each of the 12 months of the year using an RBF. This way September is closer to August than it is to March, which isn't taken into account if one simply one-hot encodes the month. Is this correct? If you're interested, I coded it at the end of this notebook.
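
To make the idea concrete, here is a minimal sketch of that encoding in plain Python. It is not the code from the notebook: the function name `rbf_month_features`, the `width` parameter, and the choice to let the distance wrap around the year (so December sits next to January) are all assumptions made for illustration.

```python
import math


def rbf_month_features(month, n_months=12, width=1.0):
    """Return one RBF activation per month of the year for a month in 1..n_months."""
    features = []
    for center in range(1, n_months + 1):
        diff = abs(month - center)
        # Assumption: the distance is circular, so month 12 and month 1 are neighbours.
        dist = min(diff, n_months - diff)
        # Gaussian RBF: 1.0 at the center, decaying with squared distance.
        features.append(math.exp(-(dist ** 2) / (2 * width ** 2)))
    return features


september = rbf_month_features(9)
```

In `september`, the entry for August is much larger than the entry for March, which is exactly the closeness that a one-hot encoding throws away.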

koaning commented on September 24, 2024

That creme stuff sounds cool beans. I'll give it a spin. Also: PyData Amsterdam has a CFP open at the moment. I'm still on the committee and that creme library sounds like something we'd love to host.
The goal here is to make an sklearn-compatible transformer that is general. Your example is good, but we want to be able to supply, for instance, a date column and the number of RBFs you'd like per year, or a column you specify that denotes the time window. There's going to be a sprint this Wednesday, so I'll keep this thread up to date.
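
As a rough sketch of what such a general transformer could look like (not the implementation that came out of the sprint), the class below takes one numeric column, such as day of year, and returns a configurable number of periodic RBF columns. The class name `PeriodicRBFSketch` and the parameters `n_rbf`, `period` and `width` are hypothetical; only `BaseEstimator` and `TransformerMixin` are real scikit-learn classes.

```python
import numpy as np
from sklearn.base import BaseEstimator, TransformerMixin


class PeriodicRBFSketch(BaseEstimator, TransformerMixin):
    """Hypothetical transformer: one numeric column in, n_rbf periodic RBF columns out."""

    def __init__(self, n_rbf=12, period=365.0, width=None):
        self.n_rbf = n_rbf
        self.period = period
        self.width = width

    def fit(self, X, y=None):
        # Centers are spread evenly over one period, expressed on a [0, 1) circle.
        self.centers_ = np.linspace(0.0, 1.0, self.n_rbf, endpoint=False)
        # Defaulting the width to one center spacing is an arbitrary choice for this sketch.
        self.width_ = self.width if self.width is not None else 1.0 / self.n_rbf
        return self

    def transform(self, X):
        x = np.asarray(X, dtype=float).reshape(-1, 1)
        # Position within the period, wrapped onto [0, 1).
        pos = (x % self.period) / self.period
        diff = np.abs(pos - self.centers_)
        # Circular distance, so the end of the period is close to its start.
        dist = np.minimum(diff, 1.0 - diff)
        return np.exp(-((dist / self.width_) ** 2))


days = np.array([[1], [182], [365]])
features = PeriodicRBFSketch(n_rbf=4).fit_transform(days)
```

Because it follows the `fit`/`transform` convention, something like this could sit behind a step that extracts the day of year from a date column in an sklearn `Pipeline`.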

MaxHalford commented on September 24, 2024

Sounds great! I've started making some slides (written in English) for an upcoming edition of the data science Meetup back here in Toulouse, so maybe I can reuse them.
Okay, good to know. I just wanted to make sure the maths were right. Indeed, I think that having a transformer to extract date features would be nice because it could then be pipelined into an RBFTransformer.

Good stuff!

Edit: if you're going to try creme I suggest you install the latest version from GitHub using pip install git+https://github.com/creme-ml/creme as there is a lot of stuff that isn't on PyPI yet.

koaning commented on September 24, 2024

Question about creme: for most of the learning that occurs, is it just a small SGD step per datapoint, or is there something more happening? sklearn has some passive-aggressive estimators in its API, but creme is not doing that atm?

I like the idea of doing a rolling mean on an intercept by the way.

MaxHalford commented on September 24, 2024

I'm not 100% sure what you mean but here goes: you can provide an optimizer to LinearRegression and LogisticRegression. The default optimizer for both is called VanillaSGD and simply performs textbook online gradient descent. There are many optimizers you can use, such as PassiveAggressiveI, PassiveAggressiveII, Adam, etc. sklearn's SGDClassifier and SGDRegressor can only use plain gradient descent because they use a special trick for the intercept that isn't generic. Because we use a running statistic to compute the intercept, we're "allowed" to use any optimizer we wish.

I hope I'm clear! I'm going to write an explanatory notebook when I get some time!
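
Until that notebook exists, here is a small pure-Python sketch of the idea as described above: the intercept is a running mean of the targets, and the weights are updated by whatever optimizer object is passed in. This is not creme's actual code. The class names `PlainSGD` and `OnlineLinearRegressionSketch` and the `step` method are hypothetical, and the loss is assumed to be squared error.

```python
class PlainSGD:
    """Textbook online gradient descent; any object with the same `step` method could replace it."""

    def __init__(self, lr=0.1):
        self.lr = lr

    def step(self, weights, gradient):
        for name, g in gradient.items():
            weights[name] = weights.get(name, 0.0) - self.lr * g
        return weights


class OnlineLinearRegressionSketch:
    def __init__(self, optimizer):
        self.optimizer = optimizer
        self.weights = {}
        self.n = 0
        self.intercept = 0.0  # running mean of the targets seen so far

    def predict_one(self, x):
        return self.intercept + sum(self.weights.get(k, 0.0) * v for k, v in x.items())

    def learn_one(self, x, y):
        error = self.predict_one(x) - y
        # Gradient of 0.5 * error**2 with respect to each weight.
        gradient = {k: error * v for k, v in x.items()}
        self.weights = self.optimizer.step(self.weights, gradient)
        # The intercept is updated as a running mean, independently of the optimizer.
        self.n += 1
        self.intercept += (y - self.intercept) / self.n
        return self


# Stream of points from y = 2x + 1, one observation at a time.
model = OnlineLinearRegressionSketch(optimizer=PlainSGD(lr=0.1))
for _ in range(300):
    for x_value in (-1.0, 0.0, 1.0):
        model.learn_one({"x": x_value}, 2.0 * x_value + 1.0)
```

Since the intercept never goes through the optimizer, swapping `PlainSGD` for another update rule only changes how the weights move.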

koaning commented on September 24, 2024

Yep. This is all I wanted to know. Thanks!

Do consider sending that CFP tho: https://pydata.org/amsterdam2019/cfp/

MaxHalford commented on September 24, 2024

@koaning when are the speakers for PyData Amsterdam announced? I have to book a plane ticket early if I come.

MBrouns commented on September 24, 2024

@MaxHalford tomorrow, but you're in! We're looking forward to seeing your talk!

MaxHalford commented on September 24, 2024

Cheers @MBrouns, I'm really excited! I'll book my ticket ASAP :)

koaning commented on September 24, 2024

This feature has now been implemented. Documentation will follow.
