← retour aux snippets

pandas: cut et qcut (binning)

Discrétiser des variables continues en bacs.

python pandas #pandas#binning#cut

objectif

Discrétiser des variables continues en bacs.

code minimal

import pandas as pd

s = pd.Series([1,2,3,4,5,6,7,8])
print(pd.cut(s, bins=4, labels=False).tolist())

utilisation

import pandas as pd

s = pd.Series([1,2,3,4,5,6,7,8])
print(pd.qcut(s, q=4, labels=False).tolist())

variante(s) utile(s)

import pandas as pd

s = pd.Series([1,2,3,4])
bins = pd.cut(s, bins=[0,2,4], include_lowest=True)
print(bins.astype(str).tolist())

notes

  • qcut équilibre les effectifs par quantile.