Comments (5)
Hi @qingyun-wu, yes, you right, it is not very common to use f1 score for multiclass tasks. However, sometimes looking to the f1-micro / f1-macro could be required. By very good, I mean that if the dataset is unbalanced a micro f1 should be used more tough than a macro f1, however it could be an unnecessary complication.
Instead, I think the solution proposed by @sonichi should be great. If the task is multiclass f1 shouldn't be a valid option, while micro_f1/macro_f1 yes.
from flaml.
or maybe we should ask users to specify 'micro_f1', 'macro_f1', etc. as metrics.
from flaml.
@qingyun-wu do you agree? If so, I'd encourage @gianpdomiziani to create a PR. Thanks.
from flaml.
I agree. Thanks!
from flaml.
Hi @gianpdomiziani,
Thanks for reporting this issue. It seems f1 score is not a commonly used metric for multiclass classification tasks. So it is not clear to me how we should properly chose the 'best' metric in this case.
Among the several average options ['micro', 'macro', 'weighted', 'samples'], 'micro', 'macro', 'weighted' all work well. Do you think it is reasonable to just set the average option to be one of these three in the case of the multiclass classification task?
Thank you!
from flaml.
Related Issues (20)
- Should we add isort to the pre-commit? HOT 11
- Regression with Grouped kFold CV HOT 1
- Question About tuning PyTorch with FLAML HOT 2
- Error in flaml.tune.run
- Question about the custom metric HOT 2
- Cannot predict with numpy inputs
- Need estimators like support vector machine, Decision Tree, Naïve Bayes Classifier.
- How to change value of max_iter in Logistic Regression in AutoML ?
- Custom Objective Function for LGBM
- Where can I get the corresponding model for each cross validation
- Parameter tuning in final_estimator with automl ensemble
- couldn't get the same results with FLAML when using the original ML model HOT 1
- Represent "best error" in runtime logger instance using scientific notation?
- custom_metric() function error
- Preprocessing Data is missng ,raise key error. automl self was created data that is not repreat behaviour. HOT 1
- Typo or misnaming in automl/timeseries/ts_model.py
- Update Optuna
- Shifting of Time Series data
- BlendSearch in UDF mode
- Official example: Time Series Forecast TypeError: 'NoneType' object is not callable HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from flaml.