hastic-zzz / hastic-server
Hastic data management server for analyzing patterns and anomalies from Grafana
License: GNU General Public License v3.0
The current release archive is ~55 MB.
We should find a way to reduce it.
We should send the stack trace and some additional context somewhere on errors, for easier debugging.
None of the models (drops, peaks, ...) handle deletion at all.
I think we should rethink this part: https://github.com/hastic/hastic-server#known-bugs--issues
Let's check which of those are still relevant and file issues here on GitHub for the ones that are actual bugs.
https://nodejs.org/api/child_process.html#child_process_options_detached
I'm not sure we close it properly, @rozetko.
Let's check it.
It is a simpler version of https://github.com/hastic/hastic-server/issues/44:
instead of proper classification, we just select the pattern with the best fit.
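The "best fit" selection could be as simple as an argmax over per-pattern fit scores. A minimal sketch; the function name and score values are illustrative, not the actual hastic-server API:

```python
def select_best_pattern(scores):
    """Pick the pattern whose model reports the highest fit score.

    `scores` maps pattern name -> fit score (higher is better).
    Returns None when no scores are available.
    """
    if not scores:
        return None
    return max(scores, key=scores.get)

# Example: three hypothetical pattern models scored the same segment
print(select_best_pattern({"drop": 0.31, "peak": 0.87, "jump": 0.54}))  # peak
```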
Currently, the dataset doesn't get updated after the first download.
We should update it on every learning run.
Steps to reproduce:
analytics/
pip3 install -r requirements.txt
python3 server.py
You will see this error:
Traceback (most recent call last):
File "server.py", line 7, in <module>
from worker import Worker
File "/mnt/c/Users/rozetko/git/hastic-server/analytics/worker.py", line 2, in <module>
from anomaly_model import AnomalyModel
File "/mnt/c/Users/rozetko/git/hastic-server/analytics/anomaly_model.py", line 2, in <module>
from data_provider import DataProvider
File "/mnt/c/Users/rozetko/git/hastic-server/analytics/data_provider.py", line 1, in <module>
import pandas as pd
File "/home/rozetko/.local/lib/python3.5/site-packages/pandas/__init__.py", line 42, in <module>
from pandas.core.api import *
File "/home/rozetko/.local/lib/python3.5/site-packages/pandas/core/api.py", line 10, in <module>
from pandas.core.groupby import Grouper
File "/home/rozetko/.local/lib/python3.5/site-packages/pandas/core/groupby/__init__.py", line 2, in <module>
from pandas.core.groupby.groupby import (
File "/home/rozetko/.local/lib/python3.5/site-packages/pandas/core/groupby/groupby.py", line 42, in <module>
from pandas.core.dtypes.missing import isna, isnull, notna, _maybe_fill
ImportError: cannot import name 'isna'
The anomaly name is currently case-sensitive.
If you create an anomaly with capital letter(s) in its name, you get a "Not found" or "Internal error" alert.
We should convert all names to lower case.
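A sketch of the proposed lower-casing, assuming names are normalized both at creation and at lookup time (the helper name is hypothetical):

```python
def normalize_anomaly_name(name: str) -> str:
    """Normalize an anomaly name so lookups are case-insensitive.

    Lower-casing (and trimming whitespace) in both the create and the
    lookup paths avoids the "Not found" / "Internal error" alerts that
    capitalized names currently trigger.
    """
    return name.strip().lower()

assert normalize_anomaly_name("  CPU Drop ") == "cpu drop"
```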
I like how Grafana does it:
https://grafana.com/grafana/download?platform=linux
just install from source and everything works.
I guess we don't need two types of builds, because a Node 6.14 build would work on any 6.14+ version.
Also, documentation describing different build types looks confusing.
@jonyrock what do you think?
If you create a new pattern named "drop" while a "drop" already exists in the database, the learning state starts anyway.
We want to send a data source and have correlated metrics found in it automatically; the analytics then just reports which metrics it found.
Sounds futuristic, but let's break it into subtasks.
I believe analytics shouldn't have to know how to extract data; that is the server's responsibility.
We need to send the data for initial learning, and then incremental updates as new data arrives from the datasource.
There are many benefits to this.
@rozetko, let's stop querying Grafana from Python.
For example, we need to detect patterns across several metrics together, merging them like in merge sort.
Another example is https://github.com/hastic/hastic-server/issues/44, where we apply a model to predict a class.
So we need to keep models for several pattern predictors in memory.
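Keeping several predictors in memory could look like a small registry keyed by pattern type. A sketch under the assumptions above; the class and the lambda "models" are illustrative, not the actual analytics code:

```python
class PredictorRegistry:
    """Keep one trained model per pattern in memory, so several
    pattern predictors (drops, peaks, a classifier as in #44, ...)
    can be applied to the same data without reloading models."""

    def __init__(self):
        self._models = {}

    def register(self, pattern, model):
        self._models[pattern] = model

    def get(self, pattern):
        return self._models.get(pattern)

    def predict_all(self, data):
        """Run every registered predictor on the same series."""
        return {p: m(data) for p, m in self._models.items()}

registry = PredictorRegistry()
registry.register("drop", lambda d: min(d) < -1)   # toy stand-in models
registry.register("peak", lambda d: max(d) > 1)
print(registry.predict_all([0.2, 1.5, -0.3]))  # {'drop': False, 'peak': True}
```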
We haven't defined how the system should behave when there is no data. We need to understand the requirements for the system first.
It would be nice to see a proper error message in panel alert instead of just "Internal error"
We want to support old versions of Node.js
I think we'd better set up CI for this and update it in Docker.
We still use anomaly names in some cases instead of IDs
It is really confusing
We should use only IDs
We want to make the Node.js part the service responsible for data processing,
so that Python doesn't write anywhere.
@rozetko
I think our Dockerfile is out of date: at the least, we now use requirements.txt.
We can't scale with files, obviously.
We update alerts.json to send "notifications" about newly detected alerts; that's crazy.
I think it's not the only such place.
It has both transport and logic in one piece of code; let's separate them.
It would also be nice to have a way to debug the Python code.
We want to use ZeroMQ for messaging between Node.js and Python.
It will let us separate the processes and debug them more easily.
Any behavior, even if it is defined, should be rethought. I think this is a case where the user should define which strategy to choose.
We use any-like definitions for tasks, but we could type them more precisely:
let task = {
type: 'learn',
anomaly_id: anomalyId,
pattern,
segments: segments
};
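On the receiving (Python) side, one way to make such loosely-typed tasks stricter is a TypedDict plus a small validator. The field names follow the snippet above; everything else here is an illustrative sketch, not the actual analytics code:

```python
from typing import List, TypedDict

class LearnTask(TypedDict):
    """Schema mirroring the ad-hoc task object the server sends."""
    type: str            # e.g. 'learn'
    anomaly_id: str
    pattern: str
    segments: List[dict]

def validate_learn_task(task: dict) -> LearnTask:
    """Reject tasks missing required fields before they reach a worker."""
    missing = {"type", "anomaly_id", "pattern", "segments"} - task.keys()
    if missing:
        raise ValueError(f"task is missing fields: {sorted(missing)}")
    return task  # type: ignore[return-value]
```

TypedDict gives static checkers something to verify, while the runtime check catches malformed messages from an older server version.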
I'm sorry to say this, but I needed to google for instructions on how to install Python and pip.
I just want to start using the software. Is there a nice way to install Python/pip on my machine without googling? :)
For example, with Node.js you only need to link to https://nodejs.org/en/download/package-manager/#debian-and-ubuntu-based-linux-distributions and that's it: just one command.
@rozetko what do you think?
We can face a problem where a user has an older version of hastic-panel, so we need a way to return the hastic-server version. It is also necessary for bug reports.
So we need to
We first hit this problem in https://github.com/hastic/hastic-server/pull/65.
We now want to have patterns for multiple series.
We need to make
We should get HASTIC_API_KEY from the config if it exists; otherwise, use the environment variable.
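A minimal sketch of that lookup order. The JSON config format and file name here are assumptions, not the actual hastic-server config:

```python
import json
import os

def get_api_key(config_path="config.json"):
    """Prefer HASTIC_API_KEY from the config file; fall back to the
    environment variable when the file or the key is absent."""
    try:
        with open(config_path) as f:
            key = json.load(f).get("HASTIC_API_KEY")
            if key:
                return key
    except (FileNotFoundError, ValueError):
        pass  # no config file, or unreadable JSON: fall through to env
    return os.environ.get("HASTIC_API_KEY")
```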
Imagine a log which looks like this:
[ANALYTICS] something
[ANALYTICS] something more
[SERVER] oh oh
[SERVER] more logs
[ANALYTICS] learning
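Such interleaved logs could be produced by tagging each process's output lines before printing them. A sketch with a hypothetical helper name:

```python
def tag_lines(source, text):
    """Prefix every line of a process's output with its source tag,
    so interleaved server and analytics logs stay distinguishable."""
    return "\n".join(f"[{source}] {line}" for line in text.splitlines())

print(tag_lines("ANALYTICS", "learning\ndone"))
# [ANALYTICS] learning
# [ANALYTICS] done
```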
We have entities like:
Currently only the InfluxDB datasource is supported.
We should find a way to support Grafana's other datasources.
We need to wrap the task result payload in a response with metadata (the taskId) and the result of execution, so it looks like:
{
"taskId": 123123,
"status": "ok",
"data": {
... payload data ...
}
}
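A small helper producing that envelope might look like this; the error branch and its field name are an assumption beyond the taskId/status/data shown above:

```python
def wrap_result(task_id, data=None, error=None):
    """Wrap a task's result in the response envelope described above:
    metadata (taskId, status) plus the execution payload."""
    if error is not None:
        return {"taskId": task_id, "status": "error", "error": str(error)}
    return {"taskId": task_id, "status": "ok", "data": data}

print(wrap_result(123123, {"segments": []}))
# {'taskId': 123123, 'status': 'ok', 'data': {'segments': []}}
```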
It would be nice to have, for example, a Kalman filter applied before we run analytics.
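For a single metric, even a minimal scalar Kalman filter can pre-smooth the series before analytics. A sketch; q (process noise) and r (measurement noise) are assumed parameters that would need tuning per metric:

```python
def kalman_1d(values, q=1e-3, r=0.1):
    """Minimal scalar Kalman filter: smooth a metric series before
    pattern detection. Returns the filtered series."""
    x, p = values[0], 1.0          # initial state estimate and covariance
    out = []
    for z in values:
        p = p + q                  # predict: covariance grows by process noise
        k = p / (p + r)            # Kalman gain
        x = x + k * (z - x)        # update estimate with measurement z
        p = (1 - k) * p            # update covariance
        out.append(x)
    return out
```

Because each estimate is a convex combination of the previous estimate and the new measurement, the output stays within the range of the input while damping single-sample spikes.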
Steps to reproduce:
data/anomalies/all_anomalies.json
Now only the new anomaly is left in all_anomalies.json.
It happens because the loadAnomaliesMap() method doesn't get executed on anomaly insert.
We need to add documentation on how to build a release version.
Similar to drops. I just want to see it working.
When the server has already started a learning / prediction task and you delete the anomaly, it doesn't know about it and keeps running the task, so you have to wait for it to finish before it starts analyzing another anomaly.