scaling

Functionality for scaling data like SIMCA.

friendly_mvda.scaling.scale_data(data: DataFrame, centering_type: str, scaling_type: str, primary_id: str, secondary_ids: list[str] | None = None) → tuple[list[str], list[float], DataFrame]

Scale the data using the specified centering and scaling types.

parameters: data: DataFrame

The data to be scaled.

centering_type: str: The centering type to be used. Options are “Ctr” for mean centering or any other value for no centering.
scaling_type: str: The scaling type to be used. Options are “UV” for unit variance scaling, “Pareto” for Pareto scaling, or any other value for no scaling.
primary_id: str: The name of the primary observation ID column.
secondary_ids: list[str] | None: The names of the secondary observation ID columns.

Returns the scale type and scale weights.

friendly_mvda.scaling.scale_with_reference(data: DataFrame, reference_data: DataFrame, centering_type: str, scaling_type: str, primary_id: str, secondary_ids: list[str] | None) → tuple[list[str], list[float], DataFrame]

Scale the data using the specified centering and scaling types with reference data.

works like scale_data but uses reference data for calculating the scaling weights.

Returns the scale type and scale weights.

friendly_mvda.scaling.update_transformed_data_names(transformed_data: DataFrame, original_data: DataFrame, primary_id: str, secondary_ids: list[str] | None = None) → DataFrame: Update the names of the transformed data to match the original data. This function ensures that the transformed data has the same column names as the original data,