Wednesday, February 2, 2022

[FIXED] How to perform a preprocessing method on the data with Perceptron in GridSearchCV?

February 02, 2022 python, python-3.x, scikit-learn No comments

Issue

I have already checked this question but the answers didn't help.

I am trying to use a preprocessing method such as StandardScaler and Normalizer with Perceptron in GridSearchCV:

from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler, Normalizer
from sklearn.linear_model import Perceptron

param_grid = [{
    'tol': [1e-1, 1e-3, 1e-5],
    'penalty': ['l2', 'l1', 'elasticnet'],
    'eta0': [0.0001, 0.001, 0.01, 0.1, 1.0]
}]

scoring = {
    'AUC-ROC': 'roc_auc',
    'Accuracy': 'accuracy',
    'AUC-PR': 'average_precision'
}

pipe = Pipeline([('scale', StandardScaler()), ('clf', Perceptron())])

search = GridSearchCV(pipe,
                      param_grid,
                      scoring=scoring,
                      refit='AUC-ROC',
                      cv=skf,
                      return_train_score=True)

results = search.fit(Xtrain, ytrain)

When I run the code I get:

ValueError: Invalid parameter class_weight for estimator Pipeline(steps=[('scale', StandardScaler()), ('clf', Perceptron())]). Check the list of available parameters with `estimator.get_params().keys()`.

I think this error is raised as the param_grid provided is not applicable to StandardScaler(). In addition, when I print search.get_params().keys() I get:

dict_keys(['cv', 'error_score', 'estimator__memory', 'estimator__steps', 'estimator__verbose', 'estimator__scale', 'estimator__clf', 'estimator__scale__copy', 'estimator__scale__with_mean', 'estimator__scale__with_std', 'estimator__clf__alpha', 'estimator__clf__class_weight', 'estimator__clf__early_stopping', 'estimator__clf__eta0', 'estimator__clf__fit_intercept', 'estimator__clf__l1_ratio', 'estimator__clf__max_iter', 'estimator__clf__n_iter_no_change', 'estimator__clf__n_jobs', 'estimator__clf__penalty', 'estimator__clf__random_state', 'estimator__clf__shuffle', 'estimator__clf__tol', 'estimator__clf__validation_fraction', 'estimator__clf__verbose', 'estimator__clf__warm_start', 'estimator', 'n_jobs', 'param_grid', 'pre_dispatch', 'refit', 'return_train_score', 'scoring', 'verbose'])

How do I fix it?

Solution

You should specify to which transform in the pipeline the param_grid parameters should be applied:

param_grid = [{
    'clf__tol': [1e-1, 1e-3, 1e-5],
    'clf__penalty': ['l2', 'l1', 'elasticnet'],
    'clf__eta0': [0.0001, 0.001, 0.01, 0.1, 1.0]
}]

Answered By - David M.

This Answer collected from stackoverflow and tested by PythonFixing community admins, is licensed under cc by-sa 2.5 , cc by-sa 3.0 and cc by-sa 4.0

Wednesday, February 2, 2022

[FIXED] How to perform a preprocessing method on the data with Perceptron in GridSearchCV?

Issue

Solution

0 comments:

Post a Comment

Popular Posts

Labels