medcat.utils.ner.helpers

Module Contents

Functions

_deid_text(cat, text[, redact])

De-identify text.

deid_text(*args, **kwargs)

make_or_update_cdb(json_path[, cdb, min_count])

Creates a new CDB or updates an existing one with new concepts.

medcat.utils.ner.helpers._deid_text(cat, text, redact=False)

De-identify text.

Returns the de-identified text. If redaction is enabled, identifiable entities are replaced with stars (e.g. *****). Otherwise, each entity is replaced with its CUI, in other words the type of information that was hidden (e.g. [PATIENT]).

Parameters:
  • cat (CAT) – The CAT object to use for deid.

  • text (str) – The input document.

  • redact (bool, optional) – Whether to redact. Defaults to False.

Returns:

str – The de-identified document.

Return type:

str
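The two replacement modes described above can be illustrated with a small standalone sketch. It does not call MedCAT and is not the library's implementation: the helper name replace_spans, the entity dictionaries with start, end and cui keys, and the choice of one star per replaced character are all assumptions made for illustration.

```python
from typing import Dict, List


def replace_spans(text: str, entities: List[Dict], redact: bool = False) -> str:
    """Replace each entity span with stars or with a bracketed label.

    Hypothetical helper: `entities` is an assumed list of dicts with
    character offsets ("start", "end") and a label ("cui").
    """
    new_text = text
    # Process spans from last to first so earlier offsets remain valid.
    for ent in sorted(entities, key=lambda e: e["start"], reverse=True):
        if redact:
            # Assumption: one star per hidden character.
            replacement = "*" * (ent["end"] - ent["start"])
        else:
            replacement = f"[{ent['cui']}]"
        new_text = new_text[: ent["start"]] + replacement + new_text[ent["end"]:]
    return new_text


doc = "John Smith was seen on 2020-01-01."
ents = [
    {"start": 0, "end": 10, "cui": "PATIENT"},
    {"start": 23, "end": 33, "cui": "DATE"},
]
labelled = replace_spans(doc, ents)               # labels kept
redacted = replace_spans(doc, ents, redact=True)  # stars only
```

Replacing from the end of the document backwards avoids recomputing offsets after each substitution, since a replacement of a different length only shifts the text that follows it.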

medcat.utils.ner.helpers.deid_text(*args, **kwargs)

Return type:

str

medcat.utils.ner.helpers.make_or_update_cdb(json_path, cdb=None, min_count=0)

Creates a new CDB or, if the cdb argument is provided, updates the existing one with new concepts. Concepts that occur fewer than min_count times are ignored.
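The effect of min_count can be sketched without MedCAT. The layout of the JSON file is not described here, so the input below is a hypothetical list of (cui, name) pairs, and frequent_concepts is an illustrative helper rather than part of the library. It keeps only the pairs seen at least min_count times.

```python
from collections import Counter
from typing import Dict, Iterable, Tuple

Concept = Tuple[str, str]  # assumed shape: (cui, name)


def frequent_concepts(annotations: Iterable[Concept], min_count: int = 0) -> Dict[Concept, int]:
    """Count each (cui, name) pair and drop those seen fewer than min_count times."""
    counts = Counter(annotations)
    return {pair: n for pair, n in counts.items() if n >= min_count}


# Hypothetical annotations, as they might be read from an export file.
annotations = [
    ("PATIENT", "john smith"),
    ("PATIENT", "john smith"),
    ("DATE", "2020-01-01"),
]
kept_all = frequent_concepts(annotations)                 # default min_count=0 keeps everything
kept_some = frequent_concepts(annotations, min_count=2)   # drops the single DATE mention
```

With the default of 0 nothing is filtered, which matches the signature above; raising the threshold is a simple way to exclude rare and possibly noisy annotations.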