Slide 26
Slide 26 text
3.3. Pandas → Python in preprocessing code
Code Styles/
Preprocessing
Functions
Pandas Python
Filter
dataframe.where(.query)
dataframe.groupby()
dataframe[[“”, “”, ‘“]]
dataframe.loc[]
dataframe.iloc[]
if - else + for +.append()
[[v1, v2, v3] for value in values]
Replace
dataframe.fillna()
dic = {“key1”: value1, “key2”: value, …}
dataframe['column1'].replace(dic, inplace=True)
dic = {“key1”: value1, “key2”: value, …}
[[dic.get(v, v) for v in value] for value in
values]
De-duplicate
/Be unique
duplicated() / drop_duplicates()
dataframe['column1'].unique()
(outuput: array([v1, v2, v3]))
set(list)
list({v1, v2, v2, …})
list({value[0] for value in values})
Delete/Drop
dataframe.dropna()
dataframe.drop()
dataframe.drop(index=index list)
if - else + for +.append()
[[v1, v2, v3] for value in values]