← retour aux snippets

pandas: str.split et extract multiples

Extraire plusieurs champs via split/regex extract.

python pandas #pandas#string#regex

objectif

Extraire plusieurs champs via split/regex extract.

code minimal

import pandas as pd

s = pd.Series(["a:1","b:2"])
df = s.str.split(":", expand=True).rename(columns={0:"k",1:"v"})
print(df.to_dict(orient="list"))

utilisation

import pandas as pd

s = pd.Series(["x=10;y=20"])
ex = s.str.extract(r"x=(\d+);y=(\d+)")
print(ex.astype(int).to_dict(orient="list"))

variante(s) utile(s)

import pandas as pd

s = pd.Series(["a|b|c"])
print(s.str.split("\\|").explode().tolist())

notes

  • expand=True renvoie un DataFrame directement.