objective
Impute missing values by replacing them with a per-column statistic (mean, median, or mode) using SimpleImputer.
minimal code
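from sklearn.impute import SimpleImputer
import numpy as np
# Fit on the training data: each column's mean is computed ignoring NaN.
X = np.array([[1.0, np.nan], [2.0, 3.0]])
imp = SimpleImputer(strategy="mean").fit(X)
# The NaN in column 0 is replaced by that column's learned mean (1.5).
print(imp.transform([[np.nan, 5.0]]).shape)  # (1, 2)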
usage
from sklearn.impute import SimpleImputer
import numpy as np
X = [[np.nan], [1.0], [2.0]]
# fit_transform learns the median (1.5) and fills the NaN in one step.
print(SimpleImputer(strategy="median").fit_transform(X).tolist())  # [[1.5], [1.0], [2.0]]
useful variant(s)
from sklearn.impute import SimpleImputer
# strategy="most_frequent" fills each column with its mode; it also accepts string columns.
print(SimpleImputer(strategy="most_frequent") is not None)  # True
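The variant above only constructs the imputer. As a minimal sketch of what the mode-based strategy does on data, and of the related strategy="constant" option, the following uses a small made-up column (the values are illustrative, not from the original):

```python
import numpy as np
from sklearn.impute import SimpleImputer

# Toy column (illustrative only): 1.0 appears twice, so it is the mode.
X = np.array([[1.0], [1.0], [2.0], [np.nan]])

# most_frequent: the NaN is replaced by the column's mode.
filled_mode = SimpleImputer(strategy="most_frequent").fit_transform(X)
print(filled_mode.tolist())

# constant: the NaN is replaced by a fixed value chosen by the caller.
filled_const = SimpleImputer(strategy="constant", fill_value=0.0).fit_transform(X)
print(filled_const.tolist())
```

most_frequent is usually the choice for categorical columns, while constant is handy when a missing value should stay visible as its own sentinel.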
notes
- Place the imputer in a Pipeline before the model, so the imputation statistics are learned from the training data only.
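
A minimal sketch of the Pipeline placement described in the note. The toy data, the step names "impute" and "model", and the choice of LogisticRegression are assumptions made for illustration; any estimator could take its place.

```python
import numpy as np
from sklearn.impute import SimpleImputer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline

# Toy data (illustrative only): one missing value in each feature column.
X = np.array([[1.0, np.nan], [2.0, 3.0], [np.nan, 4.0], [4.0, 5.0]])
y = np.array([0, 0, 1, 1])

# The imputer runs first, so the model never sees NaN.
pipe = Pipeline([
    ("impute", SimpleImputer(strategy="median")),
    ("model", LogisticRegression()),
])
pipe.fit(X, y)

# Missing values in new rows are filled with the medians learned during fit.
pred = pipe.predict(np.array([[np.nan, 4.5]]))
print(pred.shape)
print(pipe.named_steps["impute"].statistics_)
```

Because the imputer is fitted inside the pipeline, cross-validation helpers refit it on each training fold, which avoids leaking statistics from held-out rows.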