I've been trying to use GNOVA in my project. For some of the sumstats files I'm using, I get the following error:
Traceback (most recent call last):
File "gnova.py", line 86, in <module>
pipeline(parser.parse_args())
File "gnova.py", line 47, in pipeline
out = calculate(gwas_snps, ld_scores, annots, N1, N2)
File "/Volumes/BD/GNOVA/calculate.py", line 72, in calculate
m1 = linear_model.LinearRegression().fit(ld_scores, pd.DataFrame((Z_x) ** 2), sample_weight=w1)
File "/Volumes/Users/Library/Python/2.7/lib/python/site-packages/sklearn/linear_model/base.py", line 458, in fit
y_numeric=True, multi_output=True)
File "/Volumes/Users/Library/Python/2.7/lib/python/site-packages/sklearn/utils/validation.py", line 750, in check_X_y
dtype=None)
File "/Volumes/Users/Library/Python/2.7/lib/python/site-packages/sklearn/utils/validation.py", line 568, in check_array
allow_nan=force_all_finite == 'allow-nan')
File "/Volumes/Users/Library/Python/2.7/lib/python/site-packages/sklearn/utils/validation.py", line 56, in _assert_all_finite
raise ValueError(msg_err.format(type_err, X.dtype))
ValueError: Input contains NaN, infinity or a value too large for dtype('float64').
produces "NaN". I think this shouldn't be the case. Could this be a matter of the pandas version used?
My workaround was to introduce a
but I don't think that's how it's meant to be.