Comments (6)
What's the shape of your original dataset like? How you tried different batch sizes?
from ydata-synthetic.
Thank you for your response.
The (2, 1500) samples are cut out of long sequences of EEG recordings. Hours, with frequcy=512hz. So the real shape of the data is much larger.
And yes, I have tried larger batches and they helped. Currently, I'm fixated on 1024.
from ydata-synthetic.
So you have 2 input data features and 1500 data rows correct?
The batch size may improve execution time but can degrade the performance of your GAN. I suggest you run TimeGAN changing the batch size only with each run and monitor the discrimination and prediction scores. You might also want to add some loss graphs to monitor which part of the training is collapsing.
TimeGAN is a complex architecture with a lot of hyperparameters. I've done some experiments with TimeGAN with financial time series data using price series of many stocks together, changing the batch size and the sequence length.
Balancing data quality with performance is not an easy task, you just have to keep on experimenting with a selection of parameters. You might also want to change the optimisers from Adam to RMSProp and play around a bit with the learning rate.
Hope this helps.
from ydata-synthetic.
Thank you so much. I'll try to follow your advice.
Indeed this is not an easy task.
from ydata-synthetic.
Hi @Rane90,
TimeGAN architecture can be applicable to a variety of TimeSeries datasets, nevertheless the hyper parameter tuning for such an architecture will vary a lot from dataset to dataset.
The architecture is ready to be used, but you need to tweak and play around with the parameters. Also, bear in mind the architecture was build to work for time sequences/windows. Have a look into the the supplementary materials available for TimeGan: https://www.vanderschaar-lab.com/papers/NIPS2019_TGAN_Supplementary.pdf.
To generate longer sequences, I would advise you to use a different architecture.
Number of iterations, Learning rate an applied data preparation are usually the parameters that have an higher effect on the results you will get.
from ydata-synthetic.
I'll be happy to follow-up any other questions in our community slack.
from ydata-synthetic.
Related Issues (20)
- [SUGGESTION] - Replace comments to clear the confusion HOT 1
- Issues install on MacOS HOT 5
- related to ydata-sdk Synthesizer model
- [BUG] Cannot import name 'TimeGAN' from 'ydata_synthetic.synthesizers.timeseries' HOT 2
- Install error HOT 1
- How can synthetic data be sorted by time? HOT 1
- [BUG] ydata fails to import on fresh Docker container HOT 2
- [BUG] unable to import module TimeSeriesSynthesizer when running mba_doppelganger.py example HOT 1
- Runninging TimeGAN initializer stalls the program indefinitely in example file
- [BUG]ModuleNotFoundError: No module named 'ydata' HOT 1
- [FEAT]Remove generator vars from gradient of the Supervised loss in TimeGAN (as loss does not depend on it)
- [BUG] Cannot install version 1.3.1
- How can I save the data generated by timeGAN?
- Installation Dependancies Failure[BUG] HOT 1
- [FEAT] How can i set min and max bounds for columns with numberic data? HOT 1
- [BUG]cannot import name 'streamlit_app' from 'ydata_synthetic'
- Model training debugging capabilities
- [BUG] Creating synthetic data via CGAN
- [FEAT] GPU no longer accessible in Google Collab
- [FEAT] Telemetry
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ydata-synthetic.