What Do You Mean By SDTM (Study Data Tabulation Model)?

  • What Do You Mean By SDTM (Study Data Tabulation Model)?

    Posted by Garry Jones on October 25, 2022 at 1:30 pm

    What is SDTM?

    SDTM (Study Data Tabulation Model) defines a standard structure for human clinical trial (study) data tabulations and for nonclinical study data tabulations that are to be submitted as part of a product application to a regulatory authority such as the United States Food and Drug Administration (FDA). The Submission Data Standards team of Clinical Data Interchange Standards Consortium (CDISC) defines SDTM. On July 21, 2004, SDTM was selected as the standard specification for submitting tabulation data to the FDA for clinical trials and on July 5, 2011 for nonclinical studies. Eventually, all data submissions will be expected to conform to this format. As a result, clinical and nonclinical Data Managers will need to become proficient in the SDTM to prepare submissions and apply the SDTM structures, where appropriate, for operational data management.

    Background

    SDTM is built around the concept of observations collected about subjects who participated in a clinical study. Each observation can be described by a series of variables, corresponding to a row in a dataset or table. Each variable can be classified according to its Role. A Role determines the type of information conveyed by the variable about each distinct observation and how it can be used. Variables can be classified into four major roles:

    1. Identifier variables, which identify the study, subject of the observation, the domain, and the sequence number of the record
    2. Topic variables, which specify the focus of the observation (such as the name of a lab test)
    3. Timing variables, which describe the timing of the observation (such as start date and end date)
    4. Qualifier variables, which include additional illustrative text, or numeric values that describe the results or additional traits of the observation (such as units or descriptive adjectives).

    Datasets and domains

    Observations are normally collected for all subjects in a series of domains. A domain is defined as a collection of logically-related observations with a topic-specific commonality about the subjects in the trial. The logic of the relationship may relate to the scientific matter of the data, or to its role in the trial.

    Typically, each domain is represented by a dataset, but it is possible to have information relevant to the same topicality spread among multiple datasets. Each dataset is distinguished by a unique, two-character DOMAIN code that should be used consistently throughout the submission. This DOMAIN code is used in the dataset name, the value of the DOMAIN variable within that dataset, and as a prefix for most variable names in the dataset.

    The dataset structure for observations is a flat file representing a table with one or more rows and columns. Normally, one dataset is submitted for each domain. Each row of the dataset represents a single observation and each column represents one of the variables. Each dataset or table is accompanied by metadata definitions that provide information about the variables used in the dataset. The metadata are described in a data definition document named ‘Define’ that is submitted along with the data to regulatory authorities.

    Special-purpose domains

    The CDISC Version 3.x Submission Data Domain Models include special-purpose domains with a specific structure and cannot be extended with any additional qualifier or timing variables other than those specified.

    1. Demographics includes a set of standard variables that describe each subject in a clinical study
    2. Comments describes a fixed structure for recording free-text comments on a subject, or comments related to records or groups of records in other domains.

    Additional fixed structure, non-extensible special-purpose domains are discussed in the Trial Design model.

    The general domain classes

    Most observations collected during the study (other than those represented in special purpose domains) should be divided among three general observation classes: Interventions, Events, or Findings:

    1. The Interventions class captures investigational treatments, therapeutic treatments, and surgical procedures that are intentionally administered to the subject (with some actual or expected physiological effect) either as specified by the study protocol (e.g., “exposure”), coincident with the study assessment period (e.g., “concomitant medications”), or other substances self-administered by the subject (such as alcohol, tobacco, or caffeine)
    2. The Events class captures occurrences or incidents independent of planned study evaluations occurring during the trial (e.g., ‘adverse events’ or ‘disposition’) or prior to the trial (e.g., ‘medical history’).
    3. The Findings class captures the observations resulting from planned evaluations to address specific questions such as observations made during a physical examination, laboratory tests, ECG testing, and sets of individual questions listed on questionnaires.

    In most cases, the identification of the general class appropriate to a specific collection of data by topicality is straightforward. Often the Findings general class is the best choice for general observational data collected as measurements or responses to questions. In cases when the topicality may not be as clear, the choice of class may be based more on the scientific intent of the protocol or analysis plan or the data structure.

    The CDISC standard domain models (SDTMIG 3.2)

    Special-Purpose Domains:

    1. Comments (CO)
    2. Demographics (DM)
    3. Subject Elements (SE)
    4. Subject Visits (SV)

    Interventions General Observation Class:

    1. Concomitant Medications (CM)
    2. Exposure as Collected (EC)
    3. Exposure (EX)
    4. Substance Use (SU)
    5. Procedures (PR)

    Events General Observation Class:

    1. Adverse Events (AE)
    2. Clinical Events (CE)
    3. Disposition (DS)
    4. Protocol Deviations (DV)
    5. Medical History (MH)
    6. Healthcare Encounters (HO)

    Findings General Observation Class:

    1. Drug Accountability (DA)
    2. Death Details (DD)
    3. ECG Test Results (EG)
    4. Inclusion/Exclusion Criterion Not Met (IE)
    5. Immunogenicity Specimen Assessments (IS)
    6. Laboratory Test Results (LB)
    7. Microbiology Specimen (MB)
    8. Microscopic Findings (MI)
    9. Morphology (MO)
    10. Microbiology Susceptibility Test (MS)
    11. PK Concentrations (PC)
    12. PK Parameters (PP)
    13. Physical Examination (PE)
    14. Questionnaires (QS)
    15. Reproductive System Findings (RP)
    16. Disease Response (RS)
    17. Subject Characteristics (SC)
    18. Subject Status (SS)
    19. Tumor Identification (TU)
    20. Tumor Results (TR)
    21. Vital Signs (VS)

    Findings About :

    1. Findings About Events or Interventions (FA)
    2. Skin Response (SR)

    Trial Design Domains:

    1. Trial Arms (TA)
    2. Trial Disease Assessment (TD)
    3. Trial Elements (TE)
    4. Trial Visits (TV)
    5. Trial Inclusion/Exclusion Criteria (TI)
    6. Trial Summary (TS)

    Special-Purpose Relationship Datasets:

    1. Supplemental Qualifiers – SUPPQUAL
    2. Relate Records – RELREC

    Limitations and criticism of standards

    One criticism of the SDTM standards is that they are continually changing, with new versions released frequently. CDISC claims that SDTM standards are backward compatible. But the claim is unreliable. It is not possible to map the data from EDC DBMS to SDTM standards until the clinical trial completes. New domains, for example the exposure as collected (EC) domain, were added recently. However, backward compatibility with earlier domains is not always possible. The standards are not reliable, and well evolved. The controlled terminology is a very small subset of National Cancer institute terminology.

    Garry Jones replied 1 year, 6 months ago 1 Member · 0 Replies
  • 0 Replies

Sorry, there were no replies found.

error

Enjoy this site? Please spread the word :)

LinkedIn
Share