objectif
Discrétiser des variables continues en bacs.
code minimal
import pandas as pd
s = pd.Series([1,2,3,4,5,6,7,8])
print(pd.cut(s, bins=4, labels=False).tolist())
utilisation
import pandas as pd
s = pd.Series([1,2,3,4,5,6,7,8])
print(pd.qcut(s, q=4, labels=False).tolist())
variante(s) utile(s)
import pandas as pd
s = pd.Series([1,2,3,4])
bins = pd.cut(s, bins=[0,2,4], include_lowest=True)
print(bins.astype(str).tolist())
notes
qcutéquilibre les effectifs par quantile.