Preprocessing utilities

class bamt.preprocessors.BasePreprocessor[source]

Bases: object

Base for Preprocessor

static get_nodes_types(data)[source]
get_nodes_signs(data)[source]
code_categories(data: DataFrame, encoder) Tuple[DataFrame, Dict[str, Dict]][source]

Encoding categorical parameters

Parameters:
  • data (DataFrame) – input dataset

  • encoder – any object with fit_transform method

Returns:

output dataset with encoded parameters dict: dictionary with values and codes

Return type:

pd.DataFrame

discretize(data: DataFrame, discretizer) tuple[source]
decode()[source]
class bamt.preprocessors.Preprocessor(pipeline: list)[source]

Bases: BasePreprocessor

property info
scan(data: DataFrame)[source]

Function to scan data. If something is wrong, it will be send to log file

apply(data: DataFrame) Tuple[DataFrame, Dict][source]

Apply pipeline data: data to apply on