SubstanceParameter

class baybe.parameters.substance.SubstanceParameter[source]

Bases: DiscreteParameter

Generic substances that are treated with cheminformatics descriptors.

Only a decorrelated subset of descriptors should be used as otherwise this can result in a large number of features. For a handful of molecules, keeping only descriptors that have a maximum correlation of 0.7 reduces the number of descriptors to about 5-20. The number might be substantially higher with more labels given.

Public methods

__init__(name, data[, decorrelate, encoding])

Method generated by attrs for class SubstanceParameter.

from_dict(dictionary)

Create an object from its dictionary representation.

from_json(string)

Create an object from its JSON representation.

is_in_range(item)

Return whether an item is within the parameter range.

summary()

Return a custom summarization of the parameter.

to_dict()

Create an object's dictionary representation.

to_json()

Create an object's JSON representation.

transform_rep_exp2comp(data)

Transform data from experimental to computational representation.

Public attributes and properties

comp_df

Return the computational representation of the parameter.

is_numeric

Class variable encoding whether this parameter is numeric.

values

Returns the labels of the given set of molecules.

data

A mapping that provides the SMILES strings for all available parameter values.

decorrelate

Specifies the used decorrelation mode for the parameter encoding.

encoding

An optional encoding for the parameter.

name

The name of the parameter

__init__(name: str, data: dict[str, str], decorrelate: bool | float = True, encoding=SubstanceEncoding.MORDRED)

Method generated by attrs for class SubstanceParameter.

For details on the parameters, see Public attributes and properties.

classmethod from_dict(dictionary: dict)

Create an object from its dictionary representation.

Parameters:

dictionary (dict) – The dictionary representation.

Return type:

TypeVar(_T)

Returns:

The reconstructed object.

classmethod from_json(string: str)

Create an object from its JSON representation.

Parameters:

string (str) – The JSON representation of the object.

Return type:

TypeVar(_T)

Returns:

The reconstructed object.

is_in_range(item: Any)

Return whether an item is within the parameter range.

Parameters:

item (Any) – The item to be checked.

Return type:

bool

Returns:

True if the item is within the parameter range, False otherwise.

summary()

Return a custom summarization of the parameter.

Return type:

dict

to_dict()

Create an object’s dictionary representation.

Return type:

dict

to_json()

Create an object’s JSON representation.

Return type:

str

Returns:

The JSON representation as a string.

transform_rep_exp2comp(data: Series)

Transform data from experimental to computational representation.

Parameters:

data (Series) – Data to be transformed.

Return type:

DataFrame

Returns:

The transformed version of the data.

property comp_df: DataFrame

Return the computational representation of the parameter.

data: dict[str, str]

A mapping that provides the SMILES strings for all available parameter values.

decorrelate: Union[bool, float]

Specifies the used decorrelation mode for the parameter encoding.

  • False: The encoding is used as is.

  • True: The encoding is decorrelated using a default correlation threshold.

  • float in (0, 1): The encoding is decorrelated using the specified threshold.

encoding: SubstanceEncoding

An optional encoding for the parameter.

is_numeric: ClassVar[bool] = False

Class variable encoding whether this parameter is numeric.

name: str

The name of the parameter

property values: tuple

Returns the labels of the given set of molecules.