:py:mod:`medcat.datasets.patient_concept_stream` ================================================ .. py:module:: medcat.datasets.patient_concept_stream Module Contents --------------- Classes ~~~~~~~ .. autoapisummary:: medcat.datasets.patient_concept_stream.PatientConceptStreamConfig medcat.datasets.patient_concept_stream.PatientConceptStream Attributes ~~~~~~~~~~ .. autoapisummary:: medcat.datasets.patient_concept_stream._CITATION medcat.datasets.patient_concept_stream._DESCRIPTION .. py:data:: _CITATION :value: Multiline-String .. raw:: html
Show Value .. code-block:: python """@misc{kraljevic2020multidomain, title={Multi-domain Clinical Natural Language Processing with MedCAT: the Medical Concept Annotation Toolkit}, author={Zeljko Kraljevic and Thomas Searle and Anthony Shek and Lukasz Roguski and Kawsar Noor and Daniel Bean and Aurelie Mascio and Leilei Zhu and Amos A Folarin and Angus Roberts and Rebecca Bendayan and Mark P Richardson and Robert Stewart and Anoop D Shah and Wai Keong Wong and Zina Ibrahim and James T Teo and Richard JB Dobson}, year={2020}, eprint={2010.01165}, archivePrefix={arXiv}, primaryClass={cs.CL} } """ .. raw:: html
.. py:data:: _DESCRIPTION :value: Multiline-String .. raw:: html
Show Value .. code-block:: python """Takes as input a pickled dict of pt2stream. The format should be: {'patient_id': (concept_cui, concept_count_for_patient, timestamp_of_first_occurrence_for_patient), ...} """ .. raw:: html
.. py:class:: PatientConceptStreamConfig Bases: :py:obj:`datasets.BuilderConfig` BuilderConfig for PatientConceptStream. :param \*\*kwargs: keyword arguments forwarded to super. .. py:class:: PatientConceptStream(cache_dir = None, dataset_name = None, config_name = None, hash = None, base_path = None, info = None, features = None, token = None, use_auth_token='deprecated', repo_id = None, data_files = None, data_dir = None, storage_options = None, writer_batch_size = None, name='deprecated', **config_kwargs) Bases: :py:obj:`datasets.GeneratorBasedBuilder` PatientConceptStream: as input takes the patient to stream of concepts. TODO: Move the preparations scripts out of notebooks .. py:attribute:: BUILDER_CONFIGS .. py:method:: _info() Construct the DatasetInfo object. See `DatasetInfo` for details. Warning: This function is only called once and the result is cached for all following .info() calls. :Returns: **info** -- (DatasetInfo) The dataset information .. py:method:: _split_generators(dl_manager) Returns SplitGenerators. .. py:method:: _generate_examples(filepath) This function returns the examples in the raw (text) form.