How are the thresholds specified for each stage of the data cleaning pipeline? #2

@Jasonxu1225

Thanks for the good work!

Could you please provide more insight into how the thresholds are specified for each stage of the data cleaning pipeline? Since these predefined thresholds can have a substantial impact on the final processed dataset, it would be helpful to understand the rationale behind their selection and whether they are task-dependent, empirically tuned, or based on some general statistical criterion.
