Skip to content

[Question] Parallelizing Across Multiple 'learn_one' calls #1729

@gcverissimo

Description

@gcverissimo

Hi, and thanks for the great library!

I’m wondering whether it’s possible to call model.learn_one() in parallel from multiple workers or threads in order to speed up the processing of incoming data. Specifically, I’d like to know:

  1. Is it safe or supported to call model.learn_one() concurrently on the same model instance?
  2. If not, is there a recommended approach for handling data updates in parallel? (e.g., merging models)
  3. My target components are:
  • preprocessing.RobustScaler
  • Regression models such as HoeffdingTreeRegressor, etc.

My use case involves multiple producers generating data samples simultaneously, and I’d like each to update the model as those samples arrive.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions