You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The PAR model needs to be migrated to the new synthesizer structure.
Expected behavior
There should be a new module called sequential in sdv (the same level as single_table)
There should be a class called PARSynthesizer
The PARSynthesizer should have the following init parameters
metadata: A SingleTableMetadata object
enforce_min_max_values: A boolean describing whether or not to strictly enforce the observed min/max values in numerical columns
enforce_rounding: A boolean describing whether or not to round the synthetic data based on the input data
context_columns: A list of strings, representing the columns that do not vary in a sequence
context_synthesizer: A string with the name of the model to use the context values
(default) 'GaussianCopulaSynthesizer'
Available options: 'GaussianCopulaSynthesizer', 'CTGANSynthesizer', 'CopulaGANSynthesizer', 'TVAESynthesizer'
context_synthesizer_parameters: A dictionary that maps each parameter name to a parameter values. Refer to the context model for the parameters that are allowed.
segment_size
epochs
cuda
sample_size
verbose
Additional context
There are a few main changes from the current PAR model implementation
The context_synthesizer now only accepts a string, but will work with any single-table synthesizer. It is now accompanied by the new context_synthesizer_parameters parameter which allows users to specify the configuration for the context_model.
The PARSynthesizer inherits directly from the BaseSynthesizer class.
If there end up being too many differences, another option could be to just make PARSynthesizer not inherit from anything. It will have different sampling methods from the other synthesizers. Configuring the DataProcessor for it might also be challenging since it has to only transform the columns for the context model.
A third option could be to move the sampling methods to a BaseSingleTableSynthesizer class that inherits from the BaseSynthesizer
The current BaseTimeseriesModel and PAR model can be combined into one class now since there aren't any other sequential models.
The text was updated successfully, but these errors were encountered:
Problem Description
The
PAR
model needs to be migrated to the new synthesizer structure.Expected behavior
sequential
in sdv (the same level assingle_table
)PARSynthesizer
PARSynthesizer
should have the following init parameterscontext_columns
: A list of strings, representing the columns that do not vary in a sequencecontext_synthesizer
: A string with the name of the model to use the context valuescontext_synthesizer_parameters
: A dictionary that maps each parameter name to a parameter values. Refer to the context model for the parameters that are allowed.segment_size
epochs
cuda
sample_size
verbose
Additional context
There are a few main changes from the current
PAR
model implementationcontext_synthesizer
now only accepts a string, but will work with any single-table synthesizer. It is now accompanied by the newcontext_synthesizer_parameters
parameter which allows users to specify the configuration for thecontext_model
.PARSynthesizer
inherits directly from theBaseSynthesizer
class.PARSynthesizer
not inherit from anything. It will have different sampling methods from the other synthesizers. Configuring theDataProcessor
for it might also be challenging since it has to only transform the columns for the context model.BaseSingleTableSynthesizer
class that inherits from theBaseSynthesizer
PAR
model can be combined into one class now since there aren't any other sequential models.The text was updated successfully, but these errors were encountered: