Coder Social home page Coder Social logo

Comments (6)

acampove avatar acampove commented on July 20, 2024 1

@acampove do you maybe have sources for

(from studies done by other people)

I would, AFAIU, second the argument of @arogozhnikov that, in general, you can do that

Hi Jonas,

This is what I remember Christoph Langenbruch said once in a meeting the measurement of the Bs-> phi mumu branching ratio. I would check their note or just talk to him, maybe you know him better than I do :)

Cheers.

from hep_ml.

arogozhnikov avatar arogozhnikov commented on July 20, 2024

Estimating correlation is a dead thing for that many numbers.
Better just model that correlation with training multiple models (e.g. by different cross-val splits), and estimate error of downstream processing by having multiple replicas

val1 = f(x_i, w_1i)
val2 = f(x_i, w_2i)
val2 = f(x_i, w_3i)
etc.

from hep_ml.

acampove avatar acampove commented on July 20, 2024

That approach would mean doing probably 200 trainings for the MVA. The real data usually is background subtracted using the sPlot technique. Apparently (from studies done by other people) we cannot bootstrap the sweighted sample, we have to:

  1. Bootstrap the unweighted data + simulation
  2. Obtain the sweights doing the fit.
  3. Train the MVA on the bootstrapped data and simulation

Many times, which seems very challenging computationally. In case of 2D or even 3D reweighting, I think this just means that we should not use hep_ml, given that obtaining uncertainties is highly non trivial and a number without uncertainties is pretty useless. Hep_ml would only be an alternative once you start thinking of reweighting in higher dimensions.

from hep_ml.

acampove avatar acampove commented on July 20, 2024

By the way, the bootstrapping argument also applies to k-Folding.

from hep_ml.

arogozhnikov avatar arogozhnikov commented on July 20, 2024

You surely can subsample sweighted samples. Opposite would mean your fitting is unstable (and hence probably wrong).

from hep_ml.

jonas-eschle avatar jonas-eschle commented on July 20, 2024

@acampove do you maybe have sources for

(from studies done by other people)

I would, AFAIU, second the argument of @arogozhnikov that, in general, you can do that

from hep_ml.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.