Our goal is to create a site akin to the Johns Hopkins PMAP site (https://pm.jh.edu/). The website will highlight various aspects of our data lake, providing the WashU community with crucial information about the tools, governance, and resources available.

Here is an outline of the sections we intend to include:

Data Sources and Tools:
   - A section describing the available data sources and tools, similar to https://pm.jh.edu/how-it-works/.

WUSM Data Lake Overview:
   - A detailed description of the WUSM Data Lake's purpose, modeled after the description of PMAP (https://pm.jh.edu/researchers/).

Access and Usage:
   - Sections for each type of access/usage:
     - Information about our BYOB program and its requirements/benefits, as seen here: https://pm.jh.edu/centers-of-excellence/.
     - Research project-based usage
     - Operational and clinical use
     - Student/Educational use

Governance Information:
   - Details about data access approvals
   - Links to request access

Pricing Information:
   - Potentially a pricing calculator for customers to estimate project costs.

Metrics and Dashboards:
   - A page displaying metrics derived from the data lake, such as:
     - Total number of users and projects
     - Current data sources available
     - Number of records processed each day

Help and Resources:
   - How-to guides and other resources for data lake users, potentially reusing content from the i2dbdocs site.

Updated on August 7, 2025