Coder Social home page Coder Social logo

paddlepaddle / visualdl Goto Github PK

View Code? Open in Web Editor NEW
4.7K 147.0 625.0 186.12 MB

Deep Learning Visualization Toolkit(『飞桨』深度学习可视化工具 )

Home Page: https://www.paddlepaddle.org.cn/paddle/visualdl

License: Apache License 2.0

Shell 0.06% JavaScript 12.79% Python 9.32% TypeScript 20.41% Rust 0.21% Dockerfile 0.02% HTML 57.03% SCSS 0.17% CSS 0.01%
visualization deep-learning paddlepaddle onnx caffe

visualdl's Introduction

中文

Build Status PyPI Downloads License

QQ Group

Introduction

VisualDL, a visualization analysis tool of PaddlePaddle, provides a variety of charts to show the trends of parameters, and visualizes model structures, data samples, histograms of tensors, PR curves , ROC curves and high-dimensional data distributions. It enables users to understand the training process and the model structure more clearly and intuitively so as to optimize models efficiently.

VisualDL provides various visualization functions, including tracking metrics in real-time, visualizing the model structure, displaying the data sample, visualizing the relationship between hyperparameters and model metrics, presenting the changes of distributions of tensors, showing the pr curves, projecting high-dimensional data to a lower dimensional space and more. Additionally, VisualDL provides VDL.service, which enables developers easily to save, track and share visualization results of experiments. For specific guidelines of each function, please refer to VisualDL User Guide. For up-to-date experience, please feel free to try our Online Demo. Currently, VisualDL iterates rapidly and new functions will be continuously added.

Browsers supported by VisualDL are:

  • Google Chrome ≥ 79
  • Firefox ≥ 67
  • Microsoft Edge ≥ 79
  • Safari ≥ 11.1

VisualDL natively supports the use of Python. Developers can retrieve plentiful visualization results by simply adding a few lines of Python code into the model before training.

Contents

Key Highlights

Easy to Use

The high-level design of API makes it easy to use. Only one click can initiate the visualization of model structures.

Various Functions

The function contains the visualization of training parameters, data samples, graph structures, histograms of tensors, PR curves and high-dimensional data distributions.

High Compatibility

VisualDL provides the visualization of the mainstream model structures such as Paddle, ONNX, Caffe, widely supporting visual analysis for diverse users.

Fully Support

By Integrating into PaddlePaddle and related modules, VisualDL allows developers to use different components without obstructions, and thus to have the best experience in the PaddlePaddle ecosystem.

Installation

Install by PiP

python -m pip install visualdl -i https://mirror.baidu.com/pypi/simple

Install by Code

git clone https://github.com/PaddlePaddle/VisualDL.git
cd VisualDL

python setup.py bdist_wheel
pip install --upgrade dist/visualdl-*.whl

Please note that Python 2 is no longer maintained officially since January 1, 2020. VisualDL now only supports Python 3 in order to ensure the usability of codes.

Usage Guideline

VisualDL stores the data, parameters and other information of the training process in a log file. Users can launch the panel to observe the visualization results.

1. Log

The Python SDK is provided at the back end of VisualDL, and a logger can be customized through LogWriter. The interface description is shown as follows:

class LogWriter(logdir=None,
                max_queue=10,
                flush_secs=120,
                filename_suffix='',
                **kwargs)

Interface Parameters

parameters type meaning
logdir string The path location of log file. VisualDL will create a log file under this path to record information generated by the training process. If not specified, the path will be runs/${CURRENT_TIME}as default.
max_queue int The maximum capacity of the data generated before recording in a log file. Default value is 10. If the capacity is reached, the data are immediately written into the log file.
flush_secs int The maximum cache time of the data generated before recording in a log file. Default value is 120. When this time is reached, the data are immediately written to the log file. (When the log message queue reaches the maximum cache time or maximum capacity, it will be written to the log file immediately)
filename_suffix string Add a suffix to the default log file name.
display_name string This parameter is displayed in the location of Select Data Stream in the panel. If not set, the default name is logdir.(When logdir is too long or needed to be hidden).
file_name string Set the name of the log file. If the file_name already exists, setting the file_name will be new records in the same log file, which will continue to be used. Note that the name should include 'vdlrecords'.

Example

Create a log file and record scalar values:

from visualdl import LogWriter

# create a log file under `./log/scalar_test/train`
with LogWriter(logdir="./log/scalar_test/train") as writer:
    # use `add_scalar` to record scalar values
    writer.add_scalar(tag="acc", step=1, value=0.5678)
    writer.add_scalar(tag="acc", step=2, value=0.6878)
    writer.add_scalar(tag="acc", step=3, value=0.9878)
# you can also use the following method without using context manager `with`:
"""
writer = LogWriter(logdir="./log/scalar_test/train")

writer.add_scalar(tag="acc", step=1, value=0.5678)
writer.add_scalar(tag="acc", step=2, value=0.6878)
writer.add_scalar(tag="acc", step=3, value=0.9878)

writer.close()
"""

2. Launch Panel

In the above example, the log has recorded three sets of scalar values. Developers can view the visualization results of the log file through launching the visualDL panel. There are two ways to launch the log file:

Launch by Command Line

Use the command line to launch the VisualDL panel:

visualdl --logdir <dir_1, dir_2, ... , dir_n> --model <model_file> --host <host> --port <port> --cache-timeout <cache_timeout> --language <language> --public-path <public_path> --api-only --component_tabs <tab_name1, tab_name2, ...>

Parameter details:

parameters meaning
--logdir Set one or more directories of the log. All the logs in the paths or subdirectories will be displayed on the VisualDL Board indepentently.
--model Set a path to the model file (not a directory). VisualDL will visualize the model file in Graph page. PaddlePaddle、ONNX、Keras、Core ML、Caffe and other model formats are supported. Please refer to Graph - Functional Instructions.
--host Specify IP address. The default value is 127.0.0.1. Specify it as 0.0.0.0 or public IP address so that other machines can visit VisualDL Board.
--port Set the port. The default value is 8040.
--cache-timeout Cache time of the backend. During the cache time, the front end requests the same URL multiple times, and then the returned data are obtained from the cache. The default cache time is 20 seconds.
--language The language of the VisualDL panel. Language can be specified as 'en' or 'zh', and the default is the language used by the browser.
--public-path The URL path of the VisualDL panel. The default path is '/app', meaning that the access address is 'http://<host>:<port>/app'.
--api-only Decide whether or not to provide only API. If this parameter is set, VisualDL will only provides API service without displaying the web page, and the API address is 'http://<host>:<port>/<public_path>/api'. Additionally, If the public_path parameter is not specified, the default address is 'http://<host>:<port>/api'.
--component_tabs Decide which components are presented in page, currently support 15 components, i.e. 'scalar', 'image', 'text', 'embeddings', 'audio', 'histogram', 'hyper_parameters', 'static_graph', 'dynamic_graph', 'pr_curve', 'roc_curve', 'profiler', 'x2paddle', 'fastdeploy_server', 'fastdeploy_client'. If this parameter is set, only specified components will be presented. If not set, and specify --logdir parameter, only components with data in vdlrecords log are presented. If both --component_tabs and --logdir are not set, only present 'static_graph', 'x2paddle', 'fastdeploy_server', 'fastdeploy_client' components by default

To visualize the log file generated in the previous step, developers can launch the panel through the command:

visualdl --logdir ./log

Launch in Python Script

Developers can start the VisualDL panel in Python script as follows:

visualdl.server.app.run(logdir,
                        model="path/to/model",
                        host="127.0.0.1",
                        port=8080,
                        cache_timeout=20,
                        language=None,
                        public_path=None,
                        api_only=False,
                        open_browser=False)

Please note: since all parameters are indefinite except logdir, developers should specify parameter names when using them.

The interface parameters are as follows:

parameters type meaning
logdir string or list[string_1, string_2, ... , string_n] Set one or more directories of the log. All the logs in the paths or subdirectories will be displayed on the VisualDL Board indepentently.
model string Set a path to the model file (not a directory). VisualDL will visualize the model file in Graph page.
host string Specify IP address. The default value is 127.0.0.1. Specify it as 0.0.0.0 or public IP address so that other machines can visit VisualDL Board.
port int Set the port. The default value is 8040.
cache_timeout int Cache time of the backend. During the cache time, the front end requests the same URL multiple times, and then the returned data are obtained from the cache. The default cache time is 20 seconds.
language string The language of the VisualDL panel. Language can be specified as 'en' or 'zh', and the default is the language used by the browser.
public_path string The URL path of the VisualDL panel. The default path is '/app', meaning that the access address is 'http://<host>:<port>/app'.
api_only boolean Decide whether or not to provide only API. If this parameter is set, VisualDL will only provides API service without displaying the web page, and the API address is 'http://<host>:<port>/<public_path>/api'. Additionally, If the parameter public_path is not specified, the default address is 'http://<host>:<port>/api'.
open_browser boolean Whether or not to open the browser. If this parameter is set as True, the browser will be opened automatically and VisualDL panel will be launched at the same time. If parameter api_only is specified as True, parameter open_browser can be ignored.
component_tabs string or list[string_1, string_2, ... , string_n] Decide which components are presented in page, currently support 15 components, i.e. 'scalar', 'image', 'text', 'embeddings', 'audio', 'histogram', 'hyper_parameters', 'static_graph', 'dynamic_graph', 'pr_curve', 'roc_curve', 'profiler', 'x2paddle', 'fastdeploy_server', 'fastdeploy_client'. If this parameter is set, only specified components will be presented. If not set, and specify --logdir parameter, only components with data in vdlrecords log are presented. If both --component_tabs and --logdir are not set, only present 'static_graph', 'x2paddle', 'fastdeploy_server', 'fastdeploy_client' components by default

To visualize the log file generated in the previous step, developers can launch the panel through the command:

from visualdl.server import app

app.run(logdir="./log")

After launching the panel by one of the above methods, developers can see the visualization results on the browser shown as blow:

3. Read data in log files using LogReader

VisualDL also provides LogReader interface to read any data from log files.

class LogReader(file_path='')

Interface Parameters

parameters type meaning
file_path string File path of the log file. Required.

Example

If there is a log file named vdlrecords.1605533348.log in the directory of ./log, we can retrieve the data under the 'loss' tag in the scalar by:

from visualdl import LogReader
reader = LogReader(file_path='./vdlrecords.1605533348.log')
data = reader.get_data('scalar', 'loss')
print(data)

The result will be a list shown as below:

...
id: 5
tag: "Metrics/Training(Step): loss"
timestamp: 1605533356039
value: 3.1297709941864014
...

For more information of LogReader, please refer to LogReader.

Function Preview

Scalar

Scalar makes use of various charts to display how the parameters, such as accuracy, loss and learning rate, changes during the training process. In this case, developers can observe not only the single but also the multiple groups of parameters in order to understand the training process and thus speed up the process of model tuning.

Dynamic Display

After the launch of VisualDL Board, the LogReader will continuously record the data to display in the front-end. Hence, the changes of parameters can be visualized in real-time, as shown below:

Comparison of Multiple Experiments

Developers can compare multiple experiments by specifying and uploading the path of each experiment at the same time so as to visualize the same parameters in the same chart.

Image

Image provides real-time visualizations of the image data during the training process, allowing developers to observe the changes of images at different training stages and to deeply understand the effects of the training process.

Audio

Audio aims to allow developers to listen to the audio data in real-time during the training process, helping developers to monitor the process of speech recognition and text-to-speech.

Text

Text visualizes the text output of NLP models within any stage, aiding developers to compare the changes of outputs so as to deeply understand the training process and simply evaluate the performance of the model.

Graph

Graph enables developers to visualize model structures by only one click. Moreover, Graph allows developers to explore model attributes, node information, node input and output. aiding them analyze model structures quickly and understand the direction of data flow easily. Additionally, Graph supports the visualization of dynamic and static model graph respectively.

  • dynamic graph

  • static graph

Histogram

Histogram displays how the trend of tensor (weight, bias, gradient, etc.) changes during the training process in the form of histogram. Developers can adjust the model structures accurately by having an in-depth understanding of the effect of each layer.

  • Offset Mode

  • Overlay Mode

PR Curve

PR Curve displays the precision and recall values under different thresholds, helping developers to find the best threshold efficiently.

ROC Curve

ROC Curve shows the performance of a classification model at all classification thresholds; the larger the area under the curve, the better the model performs, aiding developers in evaluating the model performance and choosing an appropriate threshold.

High Dimensional

High Dimensional provides three approaches--T-SNE, PCA and UMAP--to do the dimensionality reduction, allowing developers to have an in-depth analysis of the relationship between high-dimensional data and to optimize algorithms based on the analysis.

Hyper Parameters

Hyper Parameters visualize the relationship between hyperparameters and model metrics (such as accuracy and loss) in a rich view, helping you identify the best hyperparameters in an efficient way.

Performance Analysis

Performance Analysis(Profiler) visualize the profiling data collected during your program runs, helping you identify program bottlenecks and optimize performance. Please refer to VisualDL Profiler Guide

Performance Analysis

Performance Analysis(Profiler) visualize the profiling data collected during your program runs, helping you identify program bottlenecks and optimize performance. Please refer to VisualDL Profiler Guide.

X2Paddle

The X2Paddle component provides the functions of onnx model format visualization and transformation to paddle format.

FastDeployServer

The FastDeployServer component provides the functions of loading and editing the model repository, fastdeployserver service management and monitoring, and providing the client to test service. Please refer to use VisualDL for fastdeploy serving deployment visualization.

FastDeployClient

The FastDeployClient component is mainly used to quickly access the fastdeployserver service, to help users visualize prediction requests and results. Please refer to use VisualDL as fastdeploy client for request visualization.

VDL.service

VDL.service enables developers to easily save, track and share visualization results with anyone for free.

Frequently Asked Questions

If you are confronted with some problems when using VisualDL, please refer to our FAQs.

Contribution

VisualDL, in which Graph is powered by Netron, is an open source project supported by PaddlePaddle and ECharts.

Developers are warmly welcomed to use, comment and contribute.

More Details

For more details related to the use of VisualDL, please refer to VisualDL User Guide, VisualDL Profiler Guide, Use VisualDL for fastdeploy serving deployment visualization, Use VisualDL as fastdeploy client for request visualization.

Technical Communication

Welcome to join the official QQ group 1045783368 to communicate with PaddlePaddle team and other developers.

visualdl's People

Contributors

bingblog avatar cs2be avatar daming-lu avatar dependabot[bot] avatar deqingli avatar difers avatar findmyway avatar foreverseer avatar hsen-dev avatar jacquesqiao avatar jetfuel avatar kyanaww avatar larastustu avatar makefile avatar mohamadmansourx avatar monika19950721 avatar nepeplwu avatar niandalu avatar nickyfantasy avatar oraoto avatar peterpanzh avatar rainyfly avatar shenyuhan avatar superjomn avatar wlbksy avatar xiaoyixin-cmd avatar yixinkristy avatar zeyuchen avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

visualdl's Issues

server can not work after move position

from visualdl.python.storage import LogReader
from visualdl.server import lib
-from visualdl.server import log as logger
+from visualdl.server.log import logger
from visualdl.server.mock import data as mock_data
from visualdl.server.mock import data as mock_tags

the static file work directory also need to fix

protobuf might be too large

When store image to Protobuf message, it might easily out of the maximum storage.

some solutions:

  • just sample several steps
  • define maximum image shape

Scalars needs to be simpled

目前Scalars中,如果图的数据量过大,会出现卡顿。
可以在sdk里面加一些自动的设置。

Cannot compile on MacOS

In file included from /Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/include/c++/v1/iostream:40:
In file included from /Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/include/c++/v1/istream:163:
/Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/include/c++/v1/ostream:1038:10: error: use of overloaded operator '<<' is ambiguous (with operand types 'visualdl::logging::LogStreamFatal' and 'const int')
__os << __x;
~~~~ ^ ~~~
/Users/qiaolongfei/project/VisualDL/visualdl/logic/sdk.h:175:5: note: in instantiation of function template specialization 'std::__1::operator<<<visualdl::logging::LogStreamFatal, int>' requested here
CHECK_GT(step_cycle, 0);
^
/Users/qiaolongfei/project/VisualDL/visualdl/utils/logging.h:112:26: note: expanded from macro 'CHECK_GT'
#define CHECK_GT(v0, v1) CHECK_BINARY(v0, v1, >)
^
/Users/qiaolongfei/project/VisualDL/visualdl/utils/logging.h:116:51: note: expanded from macro 'CHECK_BINARY'
if (!(v0 op v1)) LOG_FATAL << " Check failed: " << v0 << #op << v1 << " "
^
/Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/include/c++/v1/ostream:790:1: note: candidate function [with _Traits = std::__1::char_traits]
operator<<(basic_ostream<char, _Traits>& __os, char __c)
^
/Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/include/c++/v1/ostream:757:1: note: candidate function [with _CharT = char, _Traits = std::__1::char_traits]
operator<<(basic_ostream<_CharT, _Traits>& __os, char __cn)
^
/Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/include/c++/v1/ostream:797:1: note: candidate function [with _Traits = std::__1::char_traits]
operator<<(basic_ostream<char, _Traits>& __os, signed char __c)
^
/Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/include/c++/v1/ostream:804:1: note: candidate function [with _Traits = std::__1::char_traits]
operator<<(basic_ostream<char, _Traits>& __os, unsigned char __c)
^
/Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefaul

Graph Design

  • add onnx to VisualDL #34
  • add edge in Graph data format design #32
  • impl edge in Graph API

material needed by Marketing

  • demo gif/vedio
    • scalar
    • image
    • histgram
    • graph
    • 没有Visual-DL之前,纯代码滚屏的那种效果,体现一下无法可视化的时候深度学习的状态
    • Graph那个部分,也可以录一个从缩在一点的状态突然散开的效果,做个单独的Gif
    • easy to use, diff of integrate visualdl to other platform

need more documentations

Tutorial

  • quick start @ChunweiYan
  • introduction for scalar
  • introduction for histogram
  • introduction for image
  • introduction for graph

Site

  • content of home page
    • project aims
    • our advantages
    • team infomation
  • quick start(same with Tutorial)
  • demos (with basic documentation those can be transformed to reading-sutible HTML pages)
    • mxnet demo
    • pytorch demo
    • paddle demo
  • how to contribute

TODO

  • scalar
    • 默认关闭 outlier
    • 数据刷新失败,则停止刷新,保持原状,避免空白控件的出现 -[1级]
    • 多个线条颜色(等美研小伙伴给配色)
    • legend 要用富文本
  • 页面布局
    • 根据设计修改
    • margin 修改
    • 浏览器tab里边的VisualDL加一个空格,变成Visual DL
    • 拉到最上边的时候,banner那一栏看不到了
  • graph
    • 下午ready
    • 配色
    • 图片映射到画布大小-[1 级]
    • 缩略图位置固定 - [2级]
    • 用鼠标拖动 graph,松开鼠标图片会弹回去
  • demo
  • histogram
    • histogram 尖头部分超出容器,自适应一下
    • histogram 底部的 放大缩小按钮点了无效,并且报 offsetWidth 的错 - [0级]
    • 加最值的线
    • mouseout, circle dot 没清掉
    • 鼠标hover,每个 polygon 对应的 circle 没生效
    • 搜索筛选
    • 优化下每个多边形的间距
    • 优化下渐变色
    • 修改overlay 的问题 -[0级]
  • image
    • 能不能加个判断,每次只 update 用户打开的图片
  • frontend 代码优化
    • 优化下代码的结构
    • 添加代码注释
  • backend 代码整理
    • sdk 添加注释 issue pr
    • refactor command line args #76
    • pip 打包
    • docker
    • 添加版权信息
  • release
    • online demo #69
    • documents #83

Use AWS to download test models if Travis is running

We found Baidu BOS is slow while being accessed from Travis. cURLing the test models (26MB and 6MB) could timeout. So we need a logic so that if it is Travis, switch the cURL to download models from AWS (overseas service)

Static "real-time" demo webpage

We need an online demo to display some real-time effect, some requirements are as follows:

  • quite light-weight and stand large amount visits
    • can the single frontend simulate this effect?
  • look like real-time, the components should be updated dynamically.

build need enhancement

  • bin dependencies:

    • cmake
    • nodejs -> node
  • python

    • wheel

add some document or refine the scripts.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.