pyrolite.util.missing
- pyrolite.util.missing.md_pattern(Y)[source]
Get the missing data patterns from an array.
- Parameters
Y (
numpy.ndarray
|pandas.DataFrame
) – Input dataset.- Returns
pattern_ids (
numpy.ndarray
) – Pattern ID array.pattern_dict (
dict
) – Dictionary of patterns indexed by pattern IDs. Contains a pattern and count for each pattern ID.
- pyrolite.util.missing.cooccurence_pattern(Y, normalize=False, log=False)[source]
Get the co-occurence patterns from an array.
- Parameters
Y (
numpy.ndarray
|pandas.DataFrame
) – Input dataset.normalize (
bool
) – Whether to normalize the cooccurence to compare disparate variables.log (
bool
) – Whether to take the log of the cooccurence.- Returns
co_occur – Cooccurence frequency array.
- Return type