Create PARSynthesizer #1055

amontanez24 · 2022-10-05T06:49:02Z

The PAR model needs to be migrated to the new synthesizer structure.

There should be a new module called sequential in sdv (the same level as single_table)
There should be a class called PARSynthesizer
The PARSynthesizer should have the following init parameters
- metadata: A SingleTableMetadata object
- enforce_min_max_values: A boolean describing whether or not to strictly enforce the observed min/max values in numerical columns
- enforce_rounding: A boolean describing whether or not to round the synthetic data based on the input data
- context_columns: A list of strings, representing the columns that do not vary in a sequence
- context_synthesizer: A string with the name of the model to use the context values
  - (default) 'GaussianCopulaSynthesizer'
  - Available options: 'GaussianCopulaSynthesizer', 'CTGANSynthesizer', 'CopulaGANSynthesizer', 'TVAESynthesizer'
- context_synthesizer_parameters: A dictionary that maps each parameter name to a parameter values. Refer to the context model for the parameters that are allowed.
- segment_size
- epochs
- cuda
- sample_size
- verbose

There are a few main changes from the current PAR model implementation

The context_synthesizer now only accepts a string, but will work with any single-table synthesizer. It is now accompanied by the new context_synthesizer_parameters parameter which allows users to specify the configuration for the context_model.
The PARSynthesizer inherits directly from the BaseSynthesizer class.
- If there end up being too many differences, another option could be to just make PARSynthesizer not inherit from anything. It will have different sampling methods from the other synthesizers. Configuring the DataProcessor for it might also be challenging since it has to only transform the columns for the context model.
- A third option could be to move the sampling methods to a BaseSingleTableSynthesizer class that inherits from the BaseSynthesizer
The current BaseTimeseriesModel and PAR model can be combined into one class now since there aren't any other sequential models.

The text was updated successfully, but these errors were encountered:

amontanez24 added the feature request Request for a new feature label Oct 5, 2022

amontanez24 added this to the 1.0.0 milestone Oct 5, 2022

amontanez24 mentioned this issue Oct 18, 2022

Create PARSynthesizer #1068

Merged

amontanez24 closed this as completed Oct 20, 2022

amontanez24 self-assigned this Mar 24, 2023

Provide feedback