Comments (2)
Thanks for updating the issue @louisedry-quinten and glad you were able to get going.
While primary_key
is allowed in the PARSynthesizer, it cannot be the same as sequence_key
. The reason is that primary_key
is assumed to contain uniquely identifiable IDs for every row (such as an index that always increments). Meanwhile, a sequence_key
repeats because it is the same for every row inside of a sequence (eg. the stock market symbol in the demo).
from sdv.
Found my own mistake after checking the metadata from the tutorial (https://colab.research.google.com/drive/1YLk2uwn8yrSRPy0soEeJwu8Hdk_tGTlE?usp=sharing#scrollTo=D19y1UOOJ-Pv).
"primary_key"should be removed from the metadata before fitting the model.
from sdv.
Related Issues (20)
- '<Synthesizer>' object has no attribute '_model'
- Allow the ability to easily remove primary keys
- Constraint should not be set on columns inside a gps relationship
- Set the default transformer for GPS column relationship
- Column relationship warning should be raised during synthesizer initialization only
- PARSynthesizer creates limited ranges (and is unable to forecast past the max date) HOT 2
- Improving Multi-Table Synthetic Data (Healthcare dataset) -- NaN values getting created HOT 32
- Make the `get_parameters` function consistent between synthesizers
- Reinstate `get_table_parameters` for the multi-table synthesizers
- Validate condition and provide user-friendly messages for NaN/missing values (currently unsupported)
- What is the license of sdv-dev (DataCebo) SDV? HOT 2
- Improve quality of `sequence_index`: Move the start dates into the context model
- Add a `version` module to align with SDV Enterprise
- Warn users to save their metadata file after auto-detecting/updating it
- Support the `'category'` dtype in SDV (currently `'object'` representation is supported)
- Set the GPSNoiser as default transformer for GPS column relationship
- Add-ons warning is raised twice for multi table synthesizers.
- Revisit `extended_columns` abstraction
- Improved error message if a column is already present in a relationship
- Datatime columns as context_columns in PARsynthizer HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from sdv.