Describe the bug Hey folks, I am new to <code class="notranslate"

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

Closing in favour of <a class="issue-link js-issue-link" data-error-text="Failed to lo

`evaluate` errors about mlj.jl HOT 3 CLOSED

vboussange commented on June 12, 2024

`evaluate` errors

from mlj.jl.

Comments (3)

ablaom commented on June 12, 2024 1

@vboussange Thanks for putting MLJ through it's paces and for the positive feedback.

The issue here is that you are using TunedModel to wrap two models with different prediction type. One predicts probability distributions, while the other predicts point values:

julia> prediction_type(LinearRegressor)
:probabilistic

julia> prediction_type(NN)
:deterministic

This should be disallowed, but isn't, and we can see the wrapped model decides, without any good reason, to be :deterministic:

julia> prediction_type(multi_model)
:deterministic

So training the TunedModel tries to apply rms directly to probabilistic (Poisson) distributions and so fails.

Below is a workaround. The changes I've made are marked A and B:

A force predictions of the linear model to be deterministic by applying mean to the probabilistic predictions
B makes sure that NN receives Continuous data, instead of Count data, to suppress the scitype warning you have been getting (not a critical correction)

using Distributions
using LinearAlgebra
using DataFrames
using UnPack
using MLJ


## Generating synthetic data
n_features = 5
datasize = 1500
T = Float64
TI = Int64

# covariance matrix, needs to be symmetric
Σ = rand(T, n_features, n_features) * 2 .- 1
Σ = Σ*Σ'

μ = randn(T, n_features)
X = DataFrame(rand(MvNormal(μ, Σ), datasize)',:auto);

a = randn(T, n_features)

y = TI[]
for i in 1:datasize
    Xi = X[i,:] |> Vector
    push!(y, rand(Poisson(exp.(a' * Xi))))
end

# building a GLM
LinearRegressor = MLJ.@load LinearCountRegressor pkg=GLM
linearregressor = LinearRegressor()
linearregressor = linearregressor |> (y -> mean.(y)) # <--------- A
mach = machine(linearregressor, X, y)
fit!(mach)
# works

# building a neural network regressor
using Flux, MLJFlux
mutable struct MyBuilder <: MLJFlux.Builder
        nhidden # number of neurons in hidden layers
        σ1 #hidden layers activation function
    σ2 #output activation function
end

function MLJFlux.build(nn::MyBuilder, rng, n_in, n_out)
        init = Flux.glorot_uniform(rng)
    @unpack nhidden, σ1, σ2 = nn
        return Chain(Dense(n_in, nhidden, σ1, init=init),
                BatchNorm(nhidden),
                                 Dense(nhidden, nhidden, σ1, init=init),
                 BatchNorm(nhidden),
                                 Dense(nhidden, n_out, σ1, init=init),
                 σ2)
end

NN = MLJ.@load NeuralNetworkRegressor pkg=MLJFlux
nnflux = NN(builder = MyBuilder(64, relu, softplus),
            batch_size=100,
            epochs=200,
            loss = Flux.poisson_loss)
nnflux = ContinuousEncoder() |> nnflux # <------------ B
mach = machine(nnflux, X, y)
fit!(mach) # works


# comparing multiple models
mymodels = [nnflux, linearregressor]

multi_model = TunedModel(
    models=mymodels,
    resampling = CV(nfolds=3),
    measure = rms,
    check_measure = false,
)

e = MLJ.evaluate(multi_model, X, y, resampling = CV(nfolds=2),
                measure=rms,
                verbosity=6,
                # acceleration = CPUThreads()
                )

# PerformanceEvaluation object with these fields:
#   model, measure, operation, measurement, per_fold,
#   per_observation, fitted_params_per_fold,
#   report_per_fold, train_test_rows, resampling, repeats
# Extract:
# ┌────────────────────────┬───────────┬─────────────┬─────────┬──────────────┐
# │ measure                │ operation │ measurement │ 1.96*SE │ per_fold     │
# ├────────────────────────┼───────────┼─────────────┼─────────┼──────────────┤
# │ RootMeanSquaredError() │ predict   │ 30.2        │ 34.5    │ [39.9, 15.0] │
# └────────────────────────┴───────────┴─────────────┴─────────┴──────────────┘

from mlj.jl.

ablaom commented on June 12, 2024

Closing in favour of JuliaAI/MLJTuning.jl#200

from mlj.jl.

vboussange commented on June 12, 2024

Sweet, thanks for the inputs!

from mlj.jl.

`evaluate` errors about mlj.jl HOT 3 CLOSED

Comments (3)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent