Data Generation & Transfer

The basic principle is that we:

  1. fetch data from the EPIC system/team
  2. format the data in flat files desired by Plexus' team
  3. transfer csv files from EPIC1 network share
  4. zip the files,
  5. PGP encrypt the zip file
  6. uploading the encrypted zip file to an SFTP location.

Windows Server Based Approach (current approach)

Data Lake Based Approach (future approach)

Still working on the details. Coming soon!

Execution Details & Requirements

Data Generation Requirements

The Plexus team currently expects 20 text files broken down into the following sections:

LZ_STD_ENCOUNTER
LZ_STD_ENCOUNTER_NOTES
LZ_STD_ENC_DIAGNOSIS
LZ_STD_ENC_LABS
LZ_STD_ENC_LABS_NOTES
LZ_STD_ENC_MEDICATION
LZ_STD_ENC_PROCEDURE
LZ_STD_ENC_PROCEDURE_NOTES
LZ_STD_ENC_PROFESSIONAL
LZ_STD_ENC_PROVIDER
LZ_STD_PATIENT
LZ_STD_PATIENT_ALLERGY
LZ_STD_PATIENT_HISTORY
LZ_STD_PATIENT_HISTORY_SOCIAL
LZ_STD_PATIENT_INSURANCE
LZ_STD_PATIENT_OBSERVATION
LZ_STD_PATIENT_PROBLEM
LZ_STD_PATIENT_VACCINATION
LZ_STD_PROVIDER
LZ_STD_SMARTFORM

Transfer Submission Requirements

The Plexus team, expects 2 files in the /uploads directory on the SFTP server:

  1. WUSTL_extracts_{mmddyyyy}.zip.gpg

    This file contains the EMR data files zipped and encrypted.

  2. COMPLETE_WUSTL_extracts_{mmddyyyy}.txt

    This is a blank or empty file which act as trigger file for Plexus' automation system. The presence of this file indicates the uploads are completed. On receiving the trigger file, their automated process searches for the extracts to process.

Testing

When uploading a test extract file to the uploads directory, please add prefix ‘TEST’ so we can differentiate between production and test extract. For example, TEST_WUSTL_extracts_{mmddyyyy}.zip.gpg.

Windows (winsrvr) to Databricks Transition Considerations

(discussion points from a 2025-01-09 group meeting)

  1. how do we transition from the old windows approach to the new databricks approach?
  2. have a "soak-in" period.
  3. ensure we've transferred any necessary old stuff archival
  4. decomissioning the of the VM and notification with Epic1 to remove storage

Prior Work


Updated on August 7, 2025