How many 3 digit numbers can be formed with the digits 12345 if there can be repetition
Ex 7.1, 1 How many 3-digit numbers can be formed from the digits 1, 2, 3, 4 and 5 assuming that (i) repetition of the digits is allowed? 3 digit number : Number of 3 digit numbers with repetition = 5 × 5 × 5 = 125 Show
Solution : (i) When repetition of digits is allowed: This is the implementation guide for human clinical trials corresponding to version 1.7 of the CDISC Study Data Tabulation Model. Revision History DateVersion2018-11-203.3 Final2013-11-263.2 Final2012-07-163.1.3 Final2008-11-123.1.2 Final2005-08-263.1.1 Final2004-07-143.1 © 2018 Clinical Data Interchange Standards Consortium, Inc. All rights reserved. Contents
1 Introduction 1.1 PurposeThis document comprises the CDISC Version 3.3 (v3.3) Study Data Tabulation Model Implementation Guide for Human Clinical Trials (SDTMIG), which has been prepared by the Submissions Data Standards (SDS) team of the Clinical Data Interchange Standards Consortium (CDISC). Like its predecessors, v3.3 is intended to guide the organization, structure, and format of standard clinical trial tabulation datasets submitted to a regulatory authority. Version 3.3 supersedes all prior versions of the SDTMIG. The SDTMIG should be used in close concert with the version 1.7 of the CDISC Study Data Tabulation Model (SDTM, available at http://www.cdisc.org/sdtm), which describes the general conceptual model for representing clinical study data that is submitted to regulatory authorities and should be read prior to reading the SDTMIG. Version 3.3 provides specific domain models, assumptions, business rules, and examples for preparing standard tabulation datasets that are based on the SDTM. This document is intended for companies and individuals involved in the collection, preparation, and analysis of clinical data that will be submitted to regulatory authorities. 1.2 Organization of this DocumentThis document is organized into the following sections:
1.3 Relationship to Prior CDISC DocumentsThis document, together with the SDTM, represents the most recent version of the CDISC Submission Data Domain Models. Since all updates are intended to be backward compatible, the term "v3.x" is used to refer to Version 3.3 and all subsequent versions. The most significant changes since the prior version, v3.2, include:
A detailed list of changes between versions is provided in Appendix E, Revision History. Version 3.1 was the first fully implementation-ready version of the CDISC Submission Data Standards that was directly referenced by the FDA for use in human clinical studies involving drug products. However, future improvements and enhancements will continue to be made as sponsors gain more experience submitting data in this format. Therefore, CDISC will be preparing regular updates to the implementation guide to provide corrections, clarifications, additional domain models, examples, business rules, and conventions for using the standard domain models. CDISC will produce further documentation for controlled terminology as separate publications, so sponsors are encouraged to check the CDISC website (http://www.cdisc.org/terminology) frequently for additional information. See Section 4.3, Coding and Controlled Terminology Assumptions, for the most up-to-date information on applying Controlled Terminology. 1.4 How to Read this Implementation GuideThis SDTM Implementation Guide (SDTMIG) is best read online, so the reader can benefit from the many hyperlinks included to both internal and external references. The following guidelines may be helpful in reading this document:
This implementation guide covers most data collected in human clinical trials, but separate implementation guides provide information about certain data, and should be consulted when needed.
1.4.1 How to Read a Domain SpecificationA domain specification table includes rows for all required and expected variables for a domain and for a set of permissible variables. The permissible variables do not include all the variables that are allowed for the domain; they are a set of variables that the SDS team considered likely to be included. The columns of the table:
2.1 Observations and VariablesThe SDTMIG for Human Clinical Trials is based on the SDTM's general framework for organizing clinical trials information that is to be submitted to regulatory authorities. The SDTM is built around the concept of observations collected about subjects who participated in a clinical study. Each observation can be described by a series of variables, corresponding to a row in a dataset. Each variable can be classified according to its Role. A Role determines the type of information conveyed by the variable about each distinct observation and how it can be used. Variables can be classified into five major roles:
The set of Qualifier variables can be further categorized into five sub-classes:
For example, in the observation, "Subject 101 had mild nausea starting on Study Day 6," the Topic variable value is the term for the adverse event, "NAUSEA". The Identifier variable is the subject identifier, "101". The Timing variable is the study day of the start of the event, which captures the information, "starting on Study Day 6", while an example of a Record Qualifier is the severity, the value for which is "MILD". Additional Timing and Qualifier variables could be included to provide the necessary detail to adequately describe an observation. 2.2 Datasets and DomainsObservations about study subjects are normally collected for all subjects in a series of domains. A domain is defined as a collection of logically related observations with a common topic. The logic of the relationship may pertain to the scientific subject matter of the data or to its role in the trial. Each domain is represented by a single dataset. Each domain dataset is distinguished by a unique, two-character code that should be used consistently throughout the submission. This code, which is stored in the SDTM variable named DOMAIN, is used in four ways: as the dataset name, the value of the DOMAIN variable in that dataset; as a prefix for most variable names in that dataset; and as a value in the RDOMAIN variable in relationship tables Section 8, Representing Relationships and Data. All datasets are structured as flat files with rows representing observations and columns representing variables. Each dataset is described by metadata definitions that provide information about the variables used in the dataset. The metadata are described in a data definition document, a Define-XML document, that is submitted with the data to regulatory authorities. The Define-XML standard, available at https://www.cdisc.org/standards/transport/define-xml, specifies metadata attributes to describe SDTM data. Data stored in SDTM datasets include both raw (as originally collected) and derived values (e.g., converted into standard units, or computed on the basis of multiple values, such as an average). The SDTM lists only the name, label, and type, with a set of brief CDISC guidelines that provide a general description for each variable. The domain dataset models included in Section 5, Models for Special Purpose Domains and Section 6, Domain Models Based on the General Observation Classes of this document provide additional information about Controlled Terms or Format, notes on proper usage, and examples. See Section 1.4.1, How to Read a Domain Specification. 2.3 The General Observation ClassesMost subject-level observations collected during the study should be represented according to one of the three SDTM general observation classes: Interventions, Events, or Findings. The lists of variables allowed to be used in each of these can be found in the SDTM.
In most cases, the choice of observation class appropriate to a specific collection of data can be easily determined according to the descriptions provided above. The majority of data, which typically consists of measurements or responses to questions, usually at specific visits or time points, will fit the Findings general observation class. Additional guidance on choosing the appropriate general observation class is provided in Section 8.6.1, Guidelines for Determining the General Observation Class. General assumptions for use with all domain models and custom domains based on the general observation classes are described in Section 4, Assumptions for Domain Models; specific assumptions for individual domains are included with the domain models. 2.4 Datasets Other Than General Observation Class DomainsThe SDTM includes four types of datasets other than those based on the general observation classes:
[1] SE and SV were included as part of the Trial Design Model in SDTMIG v3.1.1, but were moved in SDTMIG v3.1.2. 2.5 The SDTM Standard Domain ModelsA sponsor should only submit domain datasets that were actually collected (or directly derived from the collected data) for a given study. Decisions on what data to collect should be based on the scientific objectives of the study, rather than the SDTM. Note that any data collected that will be submitted in an analysis dataset must also appear in a tabulation dataset. The collected data for a given study may use standard domains from this and other SDTM Implementation Guides as well as additional custom domains based on the three general observation classes. A list of standard domains is provided in Section 3.2.1, Dataset-Level Metadata. Final domains will be published only in an SDTM Implementation Guide (the SDTMIG for human clinical trials or another implementation guide, such as the SDTMIG for Medical Devices). Therapeutic area standards projects and other projects may develop proposals for additional domains. Draft versions of these domains may be made available in the CDISC wiki in the SDTM Draft Domains (https://wiki.cdisc.org/x/s4Iv) area. Starting with SDTMIG v3.3:
What constitutes a change for the purposes of deciding a domain version will be developed further, but for SDTMIG v3.3, a domain was assigned a version of v3.3 if there was a change to the specification and/or the assumptions from the domain as it appeared in SDTMIG v3.2. These general rules apply when determining which variables to include in a domain:
2.6 Creating a New DomainThis section describes the overall process for creating a custom domain, which must be based on one of the three SDTM general observation classes. The number of domains submitted should be based on the specific requirements of the study. Follow the process below to create a custom domain:
Figure 2.6: Creating a New Domain 2.7 SDTM Variables Not Allowed in SDTMIGThis section identifies those SDTM variables that either 1) should not be used in SDTM-compliant data tabulations of clinical trials data or 2) have not yet been evaluated for use in human clinical trials. The following SDTM variables, defined for use in non-clinical studies (SEND), must NEVER be used in the submission of SDTM-based data for human clinical trials:
The following variables can be used for non-clinical studies (SEND) but must NEVER be used in the Demographics domain for human clinical trials, where all subjects are human. See Section 9.2, Non-host Organism Identifiers (OI), for information about representing taxonomic information for non-host organisms such as bacteria and viruses.
The following variables have not been evaluated for use in human clinical trials and must therefore be used with extreme caution:
The following identifier variable can be used for non-clinical studies (SEND), and may be used in human clinical trials when appropriate:
Other variables defined in the SDTM are allowed for use as defined in this SDTMIG except when explicitly stated. Custom domains, created following the guidance in Section 2.6, Creating a New Domain, may utilize any appropriate Qualifier variables from the selected general observation class. 3 Submitting Data in Standard Format3.1 Standard Metadata for Dataset Contents and AttributesThe SDTMIG provides standard descriptions of some of the most commonly used data domains, with metadata attributes. These include descriptive metadata attributes that should be included in a Define-XML document. In addition, the CDISC domain models include two shaded columns that are not sent to the FDA. These columns assist sponsors in preparing their datasets:
The domain models in Section 6, Domain Models Based on the General Observation Classes illustrate how to apply the SDTM when creating a specific domain dataset. In particular, these models illustrate the selection of a subset of the variables offered in one of the general observation classes, along with applicable timing variables. The models also show how a standard variable from a general observation class should be adjusted to meet the specific content needs of a particular domain, including making the label more meaningful, specifying controlled terminology, and creating domain-specific notes and examples. Thus the domain models not only demonstrate how to apply the model for the most common domains, but also give insight on how to apply general model concepts to other domains not yet defined by CDISC. 3.2 Using the CDISC Domain Models in Regulatory Submissions — Dataset MetadataThe Define-XML document that accompanies a submission should also describe each dataset that is included in the submission and describe the natural key structure of each dataset. While most studies will include DM and a set of safety domains based on the three general observation classes (typically including EX, CM, AE, DS, MH, LB, and VS), the actual choice of which data to submit will depend on the protocol and the needs of the regulatory reviewer. Dataset definition metadata should include the dataset filenames, descriptions, locations, structures, class, purpose, and keys, as shown in Section 3.2.1, Dataset-Level Metadata. In addition, comments can also be provided where needed. In the event that no records are present in a dataset (e.g., a small PK study where no subjects took concomitant medications), the empty dataset should not be submitted and should not be described in the Define-XML document. The annotated CRF will show the data that would have been submitted had data been received; it need not be re-annotated to indicate that no records exist. 3.2.1 Dataset-Level MetadataNote that the key variables shown in this table are examples only. A sponsor's actual key structure may be different. DatasetDescriptionClassStructurePurposeKeysLocationCOCommentsSpecial PurposeOne record per comment per subjectTabulationSTUDYID, USUBJID, IDVAR, COREF, CODTCco.xptDMDemographicsSpecial PurposeOne record per subjectTabulationSTUDYID, USUBJIDdm.xptSESubject ElementsSpecial PurposeOne record per actual Element per subjectTabulationSTUDYID, USUBJID, ETCD, SESTDTCse.xptSMSubject Disease MilestonesSpecial PurposeOne record per Disease Milestone per subjectTabulationSTUDYID, USUBJID, MIDSsm.xptSVSubject VisitsSpecial PurposeOne record per subject per actual visitTabulationSTUDYID, USUBJID, VISITNUMsv.xptAGProcedure AgentsInterventionsOne record per recorded intervention occurrence per subjectTabulationSTUDYID, USUBJID, AGTRT, AGSTDTCag.xptCMConcomitant/Prior MedicationsInterventionsOne record per recorded intervention occurrence or constant-dosing interval per subjectTabulationSTUDYID, USUBJID, CMTRT, CMSTDTCcm.xptECExposure as CollectedInterventionsOne record per protocol-specified study treatment, collected-dosing interval, per subject, per moodTabulationSTUDYID, USUBJID, ECTRT, ECSTDTC, ECMOODec.xptEXExposureInterventionsOne record per protocol-specified study treatment, constant-dosing interval, per subjectTabulationSTUDYID, USUBJID, EXTRT, EXSTDTCex.xptMLMeal DataInterventionsOne record per food product occurrence or constant intake interval per subjectTabulationSTUDYID, USUBJID, MLTRT, MLSTDTCml.xptPRProceduresInterventionsOne record per recorded procedure per occurrence per subjectTabulationSTUDYID, USUBJID, PRTRT, PRSTDTCpr.xptSUSubstance UseInterventionsOne record per substance type per reported occurrence per subjectTabulationSTUDYID, USUBJID, SUTRT, SUSTDTCsu.xptAEAdverse EventsEventsOne record per adverse event per subjectTabulationSTUDYID, USUBJID, AEDECOD, AESTDTCae.xptCEClinical EventsEventsOne record per event per subjectTabulationSTUDYID, USUBJID, CETERM, CESTDTCce.xptDSDispositionEventsOne record per disposition status or protocol milestone per subjectTabulationSTUDYID, USUBJID, DSDECOD, DSSTDTCds.xptDVProtocol DeviationsEventsOne record per protocol deviation per subjectTabulationSTUDYID, USUBJID, DVTERM, DVSTDTCdv.xptHOHealthcare EncountersEventsOne record per healthcare encounter per subjectTabulationSTUDYID, USUBJID, HOTERM, HOSTDTCho.xptMHMedical HistoryEventsOne record per medical history event per subjectTabulationSTUDYID, USUBJID, MHDECODmh.xptCVCardiovascular System FindingsFindingsOne record per finding or result per time point per visit per subjectTabulationSTUDYID, USUBJID, VISITNUM, CVTESTCD, CVTPTREF, CVTPTNUMcv.xptDADrug AccountabilityFindingsOne record per drug accountability finding per subjectTabulationSTUDYID, USUBJID, DATESTCD, DADTCda.xptDDDeath DetailsFindingsOne record per finding per subjectTabulationSTUDYID, USUBJID, DDTESTCD, DDDTCdd.xptEGECG Test ResultsFindingsOne record per ECG observation per replicate per time point or one record per ECG observation per beat per visit per subjectTabulationSTUDYID, USUBJID, EGTESTCD, VISITNUM, EGTPTREF, EGTPTNUMeg.xptFAFindings About Events or InterventionsFindingsOne record per finding, per object, per time point, per visit per subjectTabulationSTUDYID, USUBJID, FATESTCD, FAOBJ, VISITNUM, FATPTREF, FATPTNUMfa.xptFTFunctional TestsFindingsOne record per Functional Test finding per time point per visit per subjectTabulationSTUDYID, USUBJID, TESTCD, VISITNUM, FTTPTREF, FTTPTNUMft.xptIEInclusion/Exclusion Criteria Not MetFindingsOne record per inclusion/exclusion criterion not met per subjectTabulationSTUDYID, USUBJID, IETESTCDie.xptISImmunogenicity Specimen AssessmentsFindingsOne record per test per visit per subjectTabulationSTUDYID, USUBJID, ISTESTCD, VISITNUMis.xptLBLaboratory Test ResultsFindingsOne record per lab test per time point per visit per subjectTabulationSTUDYID, USUBJID, LBTESTCD, LBSPEC, VISITNUM, LBTPTREF, LBTPTNUMlb.xptMBMicrobiology SpecimenFindingsOne record per microbiology specimen finding per time point per visit per subjectTabulationSTUDYID, USUBJID, MBTESTCD, VISITNUM, MBTPTREF, MBTPTNUMmb.xptMIMicroscopic FindingsFindingsOne record per finding per specimen per subjectTabulationSTUDYID, USUBJID, MISPEC, MITESTCDmi.xptMKMusculoskeletal System FindingsFindingsOne record per assessment per visit per subjectTabulationSTUDYID, USUBJID, VISITNUM, MKTESTCD, MKLOC, MKLATmk.xptMOMorphologyFindingsOne record per Morphology finding per location per time point per visit per subjectTabulationSTUDYID, USUBJID, VISITNUM, MOTESTCD, MOLOC, MOLATmo.xptMSMicrobiology SusceptibilityFindingsOne record per microbiology susceptibility test (or other organism-related finding) per organism found in MBTabulationSTUDYID, USUBJID, MSTESTCD, VISITNUM, MSTPTREF, MSTPTNUMms.xptNVNervous System FindingsFindingsOne record per finding per location per time point per visit per subjectTabulationSTUDYID, USUBJID, VISITNUM, CVTPTNUM, CVLOC, NVTESTCDnv.xptOEOphthalmic ExaminationsFindingsOne record per ophthalmic finding per method per location, per time point per visit per subjectTabulationSTUDYID, USUBJID, FOCID, OETESTCD, OETSTDTL, OEMETHOD, OELOC, OELAT, OEDIR, VISITNUM, OEDTC, OETPTREF, OETPTNUM, OEREPNUMoe.xptPCPharmacokinetics ConcentrationsFindingsOne record per sample characteristic or time-point concentration per reference time point or per analyte per subjectTabulationSTUDYID, USUBJID, PCTESTCD, VISITNUM, PCTPTREF, PCTPTNUMpc.xptPEPhysical ExaminationFindingsOne record per body system or abnormality per visit per subjectTabulationSTUDYID, USUBJID, PETESTCD, VISITNUMpe.xptPPPharmacokinetics ParametersFindingsOne record per PK parameter per time-concentration profile per modeling method per subjectTabulationSTUDYID, USUBJID, PPTESTCD, PPCAT, VISITNUM, PPTPTREFpp.xptQSQuestionnairesFindingsOne record per questionnaire per question per time point per visit per subjectTabulationSTUDYID, USUBJID, QSCAT, QSSCAT, VISITNUM, QSTESTCDqs.xptRERespiratory System FindingsFindingsOne record per finding or result per time point per visit per subjectTabulationSTUDYID, USUBJID, VISITNUM, RETESTCD, RETPTNUM, REREPNUMre.xptRPReproductive System FindingsFindingsOne record per finding or result per time point per visit per subjectTabulationSTUDYID, DOMAIN, USUBJID, RPTESTCD, VISITNUMrp.xptRSDisease Response and Clin ClassificationFindingsOne record per response assessment or clinical classification assessment per time point per visit per subject per assessor per medical evaluatorTabulationSTUDYID, USUBJID, RSTESTCD, VISITNUM, RSTPTREF, RSTPTNUM, RSEVAL, RSEVALIDrs.xptSCSubject CharacteristicsFindingsOne record per characteristic per subject.TabulationSTUDYID, USUBJID, SCTESTCDsc.xptSRSkin ResponseFindingsOne record per finding, per object, per time point, per visit per subjectTabulationSTUDYID, USUBJID, SRTESTCD, SROBJ, VISITNUM, SRTPTREF, SRTPTNUMsr.xptSSSubject StatusFindingsOne record per finding per visit per subjectTabulationSTUDYID, USUBJID, SSTESTCD, VISITNUMss.xptTRTumor/Lesion ResultsFindingsOne record per tumor measurement/assessment per visit per subject per assessorTabulationSTUDYID, USUBJID, TRTESTCD, EVALID, VISITNUMtr.xptTUTumor/Lesion IdentificationFindingsOne record per identified tumor per subject per assessorTabulationSTUDYID, USUBJID, EVALID, LNKIDtu.xptURUrinary System FindingsFindingsOne record per finding per location per per visit per subjectTabulationSTUDYID, USUBJID, VISITNUM, URTESTCD, URLOC, URLAT, URDIRur.xptVSVital SignsFindingsOne record per vital sign measurement per time point per visit per subjectTabulationSTUDYID, USUBJID, VSTESTCD, VISITNUM, VSTPTREF, VSTPTNUMvs.xptTATrial ArmsTrial DesignOne record per planned Element per ArmTabulationSTUDYID, ARMCD, TAETORDta.xptTDTrial Disease AssessmentsTrial DesignOne record per planned constant assessment periodTabulationSTUDYID, TDORDERtd.xptTETrial ElementsTrial DesignOne record per planned ElementTabulationSTUDYID, ETCDte.xptTITrial Inclusion/Exclusion CriteriaTrial DesignOne record per I/E crierionTabulationSTUDYID, IETESTCDti.xptTMTrial Disease MilestonesTrial DesignOne record per Disease Milestone typeTabulationSTUDYID, MIDSTYPEtm.xptTSTrial Summary InformationTrial DesignOne record per trial summary parameter valueTabulationSTUDYID, TSPARMCD, TSSEQts.xptTVTrial VisitsTrial DesignOne record per planned Visit per ArmTabulationSTUDYID, ARM, VISITtv.xptRELRECRelated RecordsRelationshipsOne record per related record, group of records or datasetTabulationSTUDYID, RDOMAIN, USUBJID, IDVAR, IDVARVAL, RELIDrelrec.xptRELSUBRelated SubjectsRelationshipsOne record per relationship per related subject per subjectTabulationSTUDYID, USUBJID, RSUBJID, SRELrelsub.xptSUPP--Supplemental Qualifiers for [domain name]RelationshipsOne record per IDVAR, IDVARVAL, and QNAM value per subjectTabulationSTUDYID, RDOMAIN, USUBJID, IDVAR, IDVARVAL, QNAMsupp--.xptOINon-host Organism IdentifiersStudy ReferenceOne record per taxon per non-host organismTabulationNHOID, OISEQoi.xpt Separate Supplemental Qualifier datasets of the form supp--.xpt are required. See Section 8.4, Relating Non-Standard Variables Values to a Parent Domain. 3.2.1.1 Primary KeysThe table in Section 3.2.1, Dataset-Level Metadata shows examples of what a sponsor might submit as variables that comprise the primary key for SDTM datasets. Since the purpose of this column is to aid reviewers in understanding the structure of a dataset, sponsors should list all of the natural keys (see definition below) for the dataset. These keys should define uniqueness for records within a dataset, and may define a record sort order. The identified keys for each dataset should be consistent with the description of the dataset structure as described in the Define-XML document. For all the general-observation-class domains (and for some special purpose domains), the --SEQ variable was created so that a unique record could be identified consistently across all of these domains via its use, along with STUDYID, USUBJID, DOMAIN. In most domains, --SEQ will be a surrogate key (see definition below) for a set of variables that comprise the natural key. In certain instances, a Supplemental Qualifier (SUPP--) variable might also contribute to the natural key of a record for a particular domain. See Section 4.1.9, Assigning Natural Keys in the Metadata, for how this should be represented, and for additional information on keys. A natural key is a set of data (one or more columns of an entity) that uniquely identifies that entity and distinguishes it from any other row in the table. The advantage of natural keys is that they exist already; one does not need to introduce a new, "unnatural" value to the data schema. One of the difficulties in choosing a natural key is that just about any natural key one can think of has the potential to change. Because they have business meaning, natural keys are effectively coupled to the business, and they may need to be reworked when business requirements change. An example of such a change in clinical trials data would be the addition of a position or location that becomes a key in a new study, but that wasn't collected in previous studies. A surrogate key is a single-part, artificially established identifier for a record. Surrogate key assignment is a special case of derived data, one where a portion of the primary key is derived. A surrogate key is immune to changes in business needs. In addition, the key depends on only one field, so it's compact. A common way of deriving surrogate key values is to assign integer values sequentially. The --SEQ variable in the SDTM datasets is an example of a surrogate key for most datasets; in some instances, however, --SEQ might be a part of a natural key as a replacement for what might have been a key (e.g., a repeat sequence number) in the sponsor's database. 3.2.1.2 CDISC Submission Value-Level MetadataIn general, the SDTMIG v3.x Findings data models are closely related to normalized, relational data models in a vertical structure of one record per observation. Since the v3.x data structures are fixed, sometimes information that might have appeared as columns in a more horizontal (denormalized) structure in presentations and reports will instead be represented as rows in an SDTM Findings structure. Because many different types of observations are all presented in the same structure, there is a need to provide additional metadata to describe the expected properties that differentiate, for example, hematology lab results from serum chemistry lab results in terms of data type, standard units, and other attributes. For example, the Vital Signs data domain could contain subject records related to diastolic and systolic blood pressure, height, weight, and body mass index (BMI). These data are all submitted in the normalized SDTM Findings structure of one row per vital signs measurement. This means that there could be five records per subject (one for each test or measurement) for a single visit or time point, with the parameter names stored in the Test Code/Name variables, and the parameter values stored in result variables. Since the unique Test Code/Names could have different attributes (i.e., different origins, roles, and definitions) there would be a need to provide value-level metadata for this information. The value-level metadata should be provided as a separate section of the Define-XML document. For details on the CDISC Define-XML standard, see https://www.cdisc.org/standards/transport/define-xml. 3.2.2 ConformanceConformance with the SDTMIG Domain Models is minimally indicated by:
4.1 General Domain Assumptions4.1.1 Review Study Data Tabulation and Implementation GuideReview the Study Data Tabulation Model as well as this complete Implementation Guide before attempting to use any of the individual domain models. 4.1.2 Relationship to Analysis DatasetsSpecific guidance on preparing analysis datasets can be found in the CDISC Analysis Data Model (ADaM) Implementation Guide and other ADaM documents, available at http://www.cdisc.org/adam. 4.1.3 Additional Timing VariablesAdditional Timing variables can be added as needed to a standard domain model based on the three general observation classes, except for the cases specified in Assumption 4.4.8, Date and Time Reported in a Domain Based on Findings. Timing variables can be added to special purpose domains only where specified in the SDTMIG domain model assumptions. Timing variables cannot be added to SUPPQUAL datasets or to RELREC (described in Section 8, Representing Relationships and Data). 4.1.3.1 EPOCH Variable GuidanceWhen EPOCH is included in a Findings class domain, it should be based on the --DTC variable, since this is the date/time of the test or, for tests performed on specimens, the date/time of specimen collection. For observations in Interventions or Events class domains, EPOCH should be based on the --STDTC variable, since this is the start of the Intervention or Event. A possible, though unlikely, exception would be a finding based on an interval specimen collection that started in one epoch but ended in another. --ENDTC might be a more appropriate basis for EPOCH in such a case. Sponsors should not impute EPOCH values, but should, where possible, assign EPOCH values on the basis of CRF instructions and structure, even ifEPOCH was not directly collected and date/time data was not collected with sufficient precision to permit assignment of an observation to an EPOCH on the basis of date/time data alone. If it is not possible to determine theEPOCH of an observation, then EPOCH should be null. Methods for assigning EPOCH values can be described in the Define-XML document. Since EPOCH is a study-design construct, it is not applicable to Interventions or Events that started before the subject's participation in the study, nor to Findings performed before their participation in the study. For such records, EPOCH should be null. Note that a subject's participation in a study includes screening, which generally occurs before the reference start date, RFSTDTC, in the DM domain. 4.1.4 Order of the VariablesThe order of variables in the Define-XML document must reflect the order of variables in the dataset. The order of variables in the CDISC domain models has been chosen to facilitate the review of the models and application of the models. Variables for the three general observation classes must be ordered with Identifiers first, followed by the Topic, Qualifier, and Timing variables. Within each role, variables must be ordered as shown in SDTM Tables 2.2.1, 2.2.2, 2.2.3, 2.2.3.1, 2.2.4, and 2.2.5. 4.1.5 SDTM Core DesignationsThree categories are specified in the "Core" column in the domain models:
4.1.6 Additional Guidance on Dataset NamingSDTM datasets are normally named to be consistent with the domain code; for example, the Demographics dataset (DM) is named dm.xpt. (See the SDTM Domain Abbreviation codelist, C66734, in CDISC Controlled Terminology (https://www.cancer.gov/research/resources/terminology/cdisc) for standard domain codes). Exceptions to this rule are described in Section 4.1.7, Splitting Domains, for general-observation-class datasets and in Section 8, Representing Relationships and Data, for the RELREC and SUPP-- datasets. In some cases, sponsors may need to define new custom domains and may be concerned that CDISC domain codes defined in the future will conflict with those they choose to use. To eliminate any risk of a sponsor using a name that CDISC later determines to have a different meaning, domain codes beginning with the letters X, Y, or Z have been reserved for the creation of custom domains. Any letter or number may be used in the second position. Note the use of codes beginning with X, Y, or Z is optional, and not required for custom domains. 4.1.7 Splitting DomainsSponsors may choose to split a domain of topically related information into physically separate datasets.
The following rules must be adhered to when splitting a domain into separate datasets to ensure they can be appended back into one domain dataset:
Note that submission of split SDTM domains may be subject to additional dataset splitting conventions as defined by regulators via technical specifications and/or as negotiated with regulatory reviewers. 4.1.7.1 Example of Splitting QuestionnairesThis example shows the QS domain data split into three datasets: Clinical Global Impression (QSCG), Cornell Scale for Depression in Dementia (QSCS) and Mini Mental State Examination (QSMM). Each dataset represents a subset of the QS domain data and has only one value of QSCAT. QS Domains Dataset for Clinical Global Impressions qscg.xpt RowSTUDYIDDOMAINUSUBJIDQSSEQQSSPIDQSTESTCDQSTESTQSCATQSORRESQSSTRESCQSSTRESNQSBLFLVISITNUMVISITVISITDYQSDTCQSDY1CDISC01QSCDISC01.1000081CGI-CGI-ICGIGLOBGlobal ImprovementClinical Global ImpressionsNo change44 Dataset for Cornell Scale for Depression in Dementia qscs.xpt RowSTUDYIDDOMAINUSUBJIDQSSEQQSSPIDQSTESTCDQSTESTQSCATQSORRESQSSTRESCQSSTRESNQSBLFLVISITNUMVISITVISITDYQSDTCQSDY1CDISC01QSCDISC01.1000083CSDD-01CSDD01AnxietyCornell Scale for Depression in DementiaSevere22 Dataset for Mini Mental State Examination qsmm.xpt RowSTUDYIDDOMAINUSUBJIDQSSEQQSSPIDQSTESTCDQSTESTQSCATQSORRESQSSTRESCQSSTRESNQSBLFLVISITNUMVISITVISITDYQSDTCQSDY1CDISC01QSCDISC01.10000881MMSE-A.1MMSEA1Orientation Time ScoreMini Mental State Examination444 SUPPQS Domains Supplemental Qualifiers for QSCG suppqscg.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIGQEVAL1CDISC01QSCDISC01.100008QSCATClinical Global ImpressionsQSLANGQuestionnaire LanguageGERMANCRF Supplemental Qualifiers for QSCS suppqscs.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIGQEVAL1CDISC01QSCDISC01.100008QSCATCornell Scale for Depression in DementiaQSLANGQuestionnaire LanguageGERMANCRF Supplemental Qualifiers for QSMM suppqsmm.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIGQEVAL1CDISC01QSCDISC01.100008QSCATMini Mental State ExaminationQSLANGQuestionnaire LanguageGERMANCRF 4.1.8 Origin Metadata4.1.8.1 Origin Metadata for VariablesThe origin element in the Define-XML document file is used to indicate where the data originated. Its purpose is to unambiguously communicate to the reviewer the origin of the data source. For example, data could be on the CRF (and thus should be traceable to an annotated CRF), derived (and thus traceable to some derivation algorithm), or assigned by some subjective process (and thus traceable to some external evaluator). The Define-XML specification is the definitive source of allowable origin values. Additional guidance and supporting examples can be referenced using the Metadata Submission Guidelines (MSG) for SDTMIG. 4.1.8.2 Origin Metadata for RecordsSponsors are cautioned to recognize that a derived origin means that all values for that variable were derived, and that collected on the CRF applies to all values as well. In some cases, both collected and derived values may be reported in the same field. For example, some records in a Findings dataset such as QS contain values collected from the CRF; other records may contain derived values, such as a total score. When both derived and collected values are reported in a variable, the origin is to be described using value-level metadata. 4.1.9 Assigning Natural Keys in the MetadataSection 3.2, Using the CDISC Domain Models in Regulatory Submissions — Dataset Metadata, indicates that a sponsor should include in the metadata the variables that contribute to the natural key for a domain. In a case where a dataset includes a mix of records with different natural keys, the natural key that provides the most granularity is the one that should be provided. The following examples are illustrations of how to do this, and include a case where a Supplemental Qualifier variable is referenced because it forms part of the natural key. Musculoskeletal System Findings (MK) domain example: Sponsor A chooses the following natural key for the MK domain: STUDYID, USUBJID, VISTNUM, MKTESTCD Sponsor B collects data in such a way that the location (MKLOC and MKLAT) and method (MKMETHOD) variables need to be included in the natural key to identify a unique row. Sponsor B then defines the following natural key for the MK domain. STUDYID, USUBJID, VISITNUM, MKTESTCD, MKLOC, MKLAT, MKMETHOD In certain instances a Supplemental Qualifier variable (i.e., a QNAM value, see Section 8.4, Relating Non-Standard Variables Values to a Parent Domain) might also contribute to the natural key of a record, and therefore needs to be referenced as part of the natural key for a domain. The important concept here is that a domain is not limited by physical structure. A domain may be comprised of more than one physical dataset, for example the main domain dataset and its associated Supplemental Qualifiers dataset. Supplemental Qualifiers variables should be referenced in the natural key by using a two-part name. The word QNAM must be used as the first part of the name to indicate that the contributing variable exists in a domain-specific SUPP-- and the second part is the value of QNAM that ultimately becomes a column reference (e.g., QNAM.XVAR when the SUPP-- record has a QNAM of "XVAR") when the SUPPQUAL records are joined on to the main domain dataset. Continuing with the MK domain example above: Sponsor B might have collected data that used different imaging methods, using imaging devices with different makes and models, and using different hand positions. The sponsor considers the make and model information and hand position to be essential data that contributes to the uniqueness of the test result, and so includes a device identifier (SPDEVID) in the data and creates a Supplemental Qualifier variable for hand position (QNAM = "MKHNDPOS"). The natural key is then defined as follows: STUDYID, USUBJID, SPDEVID, VISITNUM, MKTESTCD, MKLOC, MKLAT, MKMETHOD, QNAM.MKHNDPOS Where the notation "QNAM.MKHNDPOS" means the Supplemental Qualifier whose QNAM is "MKHNDPOS". This approach becomes very useful in a Findings domain when --TESTCD values are "generic" and rely on other variables to completely describe the test. The use of generic test codes helps to create distinct lists of manageable controlled terminology for --TESTCD. In studies where multiple repetitive tests or measurements are being made, for example in a rheumatoid arthritis study where repetitive measurements of bone erosion in the hands and wrists might be made using both X-ray and MRI equipment, the generic MKTEST "Sharp/Genant Bone Erosion Score" would be used in combination with other variables to fully identify the result. Taking just the phalanges, a sponsor might want to express the following in a test in order to make it unique:
When CDISC controlled terminology for a test is not available, and a sponsor creates --TEST and --TESTCD values, trying to encapsulate all information about a test within a unique value of a --TESTCD is not a recommended approach for the following reasons:
As a result, the preferred approach would be to use a generic (or simple) test code that requires associated qualifier variables to fully express the test detail. This approach was used in creating the CDISC controlled terminology that would be used in the above example: The MKTESTCD value "SGBESCR" is a "generic" test code, and additional information about the test is provided by separate qualifier variables. The variables that completely specify a test may include domain variables and supplemental qualifier variables. Expressing the natural key becomes very important in this situation in order to communicate the variables that contribute to the uniqueness of a test. The following variables would be used to fully describe the test. The natural key for this domain includes both parent dataset variables and a supplemental qualifier variable that contribute to the natural key of each row and to describe the uniqueness of the test. SPDEVIDMKTESTCDMKTESTMKLOCMKLATMKMETHODQNAM.MKHNDPOSACME3000SGBESCRSharp/Genant Bone Erosion ScoreMETACARPOPHALANGEAL JOINT 1LEFTX-RAYPALM UP 4.2 General Variable Assumptions4.2.1 Variable-Naming ConventionsSDTM variables are named according to a set of conventions, using fragment names (listed in Appendix D, CDISC Variable-Naming Fragments). Variables with names ending in "CD" are "short" versions of associated variables that do not include the "CD" suffix (e.g., --TESTCD is the short version of --TEST). Values of --TESTCD must be limited to eight characters and cannot start with a number, nor can they contain characters other than letters, numbers, or underscores. This is to avoid possible incompatibility with SAS v5 Transport files. This limitation will be in effect until the use of other formats (such as Dataset-XML) becomes acceptable to regulatory authorities. QNAM serves the same purpose as --TESTCD within supplemental qualifier datasets, and so values of QNAM are subject to the same restrictions as values of --TESTCD. Values of other "CD" variables are not subject to the same restrictions as --TESTCD.
Variable descriptive names (labels), up to 40 characters, should be provided as data variable labels for all variables, including Supplemental Qualifier variables. Use of variable names (other than domain prefixes), formats, decodes, terminology, and data types for the same type of data (even for custom domains and Supplemental Qualifiers) should be consistent within and across studies within a submission. 4.2.2 Two-Character Domain IdentifierIn order to minimize the risk of difficulty when merging/joining domains for reporting purposes, the two-character Domain Identifier is used as a prefix in most variable names. Variables in domain specification tables (see Section 5, Models for Special Purpose Domains, Section 6, Domain Models Based on the General Observation Classes, Section 7, Trial Design Model Datasets, Section 8, Representing Relationships and Data, and Section 9, Study References) already specify the complete variable names. When adding variables from the SDTM to standard domains or creating custom domains based on the General Observation Classes, sponsors must replace the -- (two hyphens) prefix in the SDTM tables of General Observation Class, Timing, and Identifier variables with the two-character Domain Identifier (DOMAIN) value for that domain/dataset. The two-character domain code is limited to A-Z for the first character, and A-Z, 0-9 for the second character. No other characters are allowed. This is for compatibility with SAS version 5 Transport files and with file naming for the Electronic Common Technical Document (eCTD). The following variables are exceptions to the philosophy that all variable names are prefixed with the Domain Identifier:
Required Identifiers are not prefixed because they are usually used as keys when merging/joining observations. The --SEQ and the optional Identifiers --GRPID and --REFID are prefixed because they may be used as keys when relating observations across domains. 4.2.3 Use of "Subject" and USUBJID"Subject" is used to generically refer to both "patients" and "healthy volunteers" in order to be consistent with the recommendation in FDA guidance. The term "Subject" should be used consistently in all labels and Define-XML document comments. To identify a subject uniquely across all studies for all applications or submissions involving the product, a unique identifier (USUBJID) should be assigned and included in all datasets. The unique subject identifier (USUBJID) is required in all datasets containing subject-level data. USUBJID values must be unique for each trial participant (subject) across all trials in the submission. This means that no two (or more) subjects, across all trials in the submission, may have the same USUBJID. Additionally, the same person who participates in multiple clinical trials (when this is known) must be assigned the same USUBJID value in all trials. The below dm.xpt sample rows illustrate a single subject who participates in two studies, first in ACME01 and later in ACME14. Note that this is only one example of the possible values for USUBJID. CDISC does not recommend any specific format for the values of USUBJID, only that the values need to be unique for all subjects in the submission, and across multiple submissions for the same compound. Many sponsors concatenate values for the Study, Site and Subject into USUBJID, but this is not a requirement. It is acceptable to use any format for USUBJID, as long as the values are unique across all subjects per FDA guidance. Study ACME01 dm.xpt dm.xpt RowSTUDYIDDOMAINUSUBJIDSUBJIDSITEIDINVNAM1ACME01DMACME01-05-00100105John Doe Study ACME14 dm.xpt dm.xpt RowSTUDYIDDOMAINUSUBJIDSUBJIDSITEIDINVNAM1ACME14DMACME01-05-00101714Mary Smith 4.2.4 Text Case in Submitted DataIt is recommended that text data be submitted in upper case text. Exceptions may include long text data (such as comment text) and values of --TEST in Findings datasets (which may be more readable in title case if used as labels in transposed views). Values from CDISC controlled terminology or external code systems (e.g., MedDRA) or response values for QRS instruments specified by the instrument documentation should be in the case specified by those sources, which may be mixed case. The case used in the text data must match the case used in the Controlled Terminology provided in the Define-XML document. 4.2.5 Convention for Missing ValuesMissing values for individual data items should be represented by nulls. Conventions for representing observations not done, using the SDTM --STAT and --REASND variables, are addressed in Section 4.5.1.2, Tests Not Done and the individual domain models. 4.2.6 Grouping Variables and CategorizationGrouping variables are Identifiers and Qualifiers variables, such as the --CAT (Category) and --SCAT (Subcategory), that group records in the SDTM domains/datasets and can be assigned by sponsors to categorize topic-variable values. For example, a lab record with LBTEST = "SODIUM" might have LBCAT = "CHEMISTRY" and LBSCAT = "ELECTROLYTES". Values for --CAT and --SCAT should not be redundant with the domain name or dictionary classification provided by --DECOD and --BODSYS. 1. Hierarchy of Grouping Variables STUDYID 2. How Grouping Variables Group Data A. For the subject
B. Across subjects (records with different USUBJID values)
C. Within subjects (records with the same USUBJID values)
D. Although --SPID and --REFID are Identifier variables, they may sometimes be used as grouping variables and may also have meaning across domains. E. --LNKID and --LNKGRP express values that are used to link records in separate domains. As such, these variables are often used in IDVAR in a RELREC relationship when there is a dataset-to-dataset relationship.
3. Differences between Grouping Variables The primary distinctions between --CAT/--SCAT and --GRPID are:
Therefore, data that would be the same across subjects is usually more appropriate in --CAT/--SCAT, and data that would vary across subjects is usually more appropriate in --GRPID. For example, a Concomitant Medication administered as part of a known combination therapy for all subjects ("Mayo Clinic Regimen", for example) would more appropriately use --CAT/--SCAT to identify the medication as part of that regimen. Groups of medications recorded on an SAE form as treatments for the SAE would more appropriately use --GRPID because the groupings are likely to differ across subjects. In domains based on the Findings general observation class, the --RESCAT variable can be used to categorize results after the fact. --CAT and --SCAT by contrast, are generally pre-defined by the sponsor or used by the investigator at the point of collection, not after assessing the value of Findings results. 4.2.7 Submitting Free Text from the CRFSponsors often collect free text data on a CRF to supplement a standard field. This often occurs as part of a list of choices accompanied by "Other, specify." The manner in which these data are submitted will vary based on their role. The handling of verbatim text values for the ---OBJ variable is discussed in Section 6.4.3 Variables Unique to Findings About. 4.2.7.1 "Specify" Values for Non-Result Qualifier VariablesWhen free-text information is collected to supplement a standard non-result Qualifier field, the free-text value should be placed in the SUPP-- dataset described in Section 8.4, Relating Non-Standard Variables Values to a Parent Domain. When applicable, controlled terminology should be used for SUPP-- field names (QNAM) and their associated labels (QLABEL) (see Section 8.4, Relating Non-Standard Variables Values to a Parent Domain and Appendix C2, Supplemental Qualifiers Name Codes). For example, when a description of "Other Medically Important Serious Adverse Event" category is collected on a CRF, the free text description should be stored in the SUPPAE dataset.
Another example is a CRF that collects reason for dose adjustment with additional free-text description: Reason for Dose Adjustment (EXADJ)Describe
The free text description should be stored in the SUPPEX dataset.
When the CRF includes a list of values for a qualifier field that includes "Other" and the "Other" is supplemented with a "Specify" free text field, then the manner in which the free text "Specify" value is submitted will vary based on the sponsor's coding practice and analysis requirements. For example, consider a CRF that collects the indication for an analgesic concomitant medication (CMINDC) using a list of pre-specified values and an "Other, specify" field : Indication for analgesic
An investigator has selected "OTHER" and specified "Broken arm". Several options are available for submission of this data: 1) If the sponsor wishes to maintain controlled terminology for the CMINDC field and limit the terminology to the five pre-specified choices, then the free text is placed in SUPPCM. CMINDCOTHER QNAMQLABELQVALCMINDOTHOther IndicationBROKEN ARM 2) If the sponsor wishes to maintain controlled terminology for CMINDC but will expand the terminology based on values seen in the specify field, then the value of CMINDC will reflect the sponsor's coding decision and SUPPCM could be used to store the verbatim text. CMINDCFRACTURE QNAMQLABELQVALCMINDOTHOther IndicationBROKEN ARM Note that the sponsor might choose a different value for CMINDC (e.g., "BONE FRACTURE") depending on the sponsor's coding practice and analysis requirements. 3) If the sponsor does not require that controlled terminology be maintained and wishes for all responses to be stored in a single variable, then CMINDC will be used and SUPPCM is not required. CMINDCBROKEN ARM 4.2.7.2 "Specify" Values for Result Qualifier VariablesWhen the CRF includes a list of values for a result field that includes "Other" and the "Other" is supplemented with a "Specify" free text field, then the manner in which the free text "Specify" value is submitted will vary based on the sponsor's coding practice and analysis requirements. For example, consider a CRF where the sponsor requests the subject's eye color: Eye Color
An investigator has selected "OTHER" and specified "BLUEISH GRAY". As in the above discussion for non-result Qualifier values, the sponsor has several options for submission: 1) If the sponsor wishes to maintain controlled terminology in the standard result field and limit the terminology to the five pre-specified choices, then the free text is placed in --ORRES and the controlled terminology in --STRESC. SCTESTSCORRESSCSTRESCEye ColorBLUEISH GRAYOTHER 2) If the sponsor wishes to maintain controlled terminology in the standard result field, but will expand the terminology based on values seen in the specify field, then the free text is placed in --ORRES and the value of --STRESC will reflect the sponsor's coding decision. SCTESTSCORRESSCSTRESCEye ColorBLUEISH GRAYGRAY 3) If the sponsor does not require that controlled terminology be maintained, the verbatim value will be copied to --STRESC. SCTESTSCORRESSCSTRESCEye ColorBLUEISH GRAYBLUEISH GRAY 4.2.7.3 "Specify" Values for Topic VariablesInterventions: If a list of specific treatments is provided along with "Other, Specify", --TRT should be populated with the name of the treatment found in the specified text. If the sponsor wishes to distinguish between the pre-specified list of treatments and those recorded under "Other, Specify," the --PRESP variable could be used. For example: Indicate which of the following concomitant medications
If ibuprofen and diclofenac were reported, the CM dataset would include the following: CMTRTCMPRESPIBUPROFENYDICLOFENAC Events: "Other, Specify" for Events may be handled similarly to Interventions. --TERM should be populated with the description of the event found in the specified text and --PRESP could be used to distinguish between prespecified and free text responses. Findings: "Other, Specify" for tests may be handled similarly to Interventions. --TESTCD and --TEST should be populated with the code and description of the test found in the specified text. If specific tests are not prespecified on the CRF and the investigator has the option of writing in tests, then the name of the test would have to be coded to ensure that all --TESTCD and --TEST values are consistent with the test controlled terminology. For example, a lab CRF collected values for Hemoglobin, Hematocrit and "Other, specify". The value the investigator wrote for "Other, specify" was "Prothrombin time" with an associated result and units. The sponsor would submit the controlled terminology for this test, i.e., LBTESTCD would be "PT" and LBTEST would be "Prothrombin Time", rather than the verbatim term, "Prothrombin time" supplied by the investigator. 4.2.8 Multiple Values for a Variable4.2.8.1 Multiple Values for an Intervention or Event Topic VariableIf multiple values are reported for a topic variable (i.e., --TRT in an Interventions general-observation-class dataset or --TERM in an Events general-observation-class dataset), it is expected that the sponsor will split the values into multiple records or otherwise resolve the multiplicity per the sponsor's standard data management procedures. For example, if an adverse event term of "Headache and Nausea" or a concomitant medication of "Tylenol and Benadryl" is reported, sponsors will often split the original report into separate records and/or query the site for clarification. By the time of submission, the datasets should be in conformance with the record structures described in the SDTMIG. Note that the Disposition dataset (DS) is an exception to the general rule of splitting multiple topic values into separate records. For DS, one record for each disposition or protocol milestone is permitted according to the domain structure. For cases of multiple reasons for discontinuation see Section 6.2.3, Disposition, Assumption 5 for additional information. 4.2.8.2 Multiple Values for a Findings Result VariableIf multiple result values (--ORRES) are reported for a test in a Findings class dataset, multiple records should be submitted for that --TESTCD. For example,
When a finding can have multiple results, the key structure for the findings dataset must be adequate to distinguish between the multiple results. See Section 4.1.9 Assigning Natural Keys in the Metadata. 4.2.8.3 Multiple Values for a Non-Result Qualifier VariableThe SDTM permits one value for each Qualifier variable per record. If multiple values exist (e.g., due to a "Check all that apply" instruction on a CRF), then the value for the Qualifier variable should be "MULTIPLE" and SUPP-- should be used to store the individual responses. It is recommended that the SUPP-- QNAM value reference the corresponding standard domain variable with an appended number or letter. In some cases, the standard variable name will be shortened to meet the 8-character variable name requirement, or it may be clearer to append a meaningful character string as shown in the second AE example below, where the first three characters of the drug name are appended. Likewise the QLABEL value should be similar to the standard label. The values stored in QVAL should be consistent with the controlled terminology associated with the standard variable. See Section 8.4, Relating Non-Standard Variables Values to a Parent Domain for additional guidance on maintaining appropriately unique QNAM values. The following example includes selected variables from the ae.xpt and suppae.xpt datasets for a rash whose locations are the face, neck, and chest. AE Dataset AETERMAELOCRASHMULTIPLE SUPPAE Dataset QNAMQLABELQVALAELOC1Location of the Reaction 1FACEAELOC2Location of the Reaction 2NECKAELOC3Location of the Reaction 3CHEST In some cases, values for QNAM and QLABEL more specific than those above may be needed. For example, a sponsor might conduct a study with two study drugs (e.g., open-label study of Abcicin + Xyzamin), and may require the investigator assess causality and describe action taken for each drug for the rash: AE Dataset AETERMAERELAEACNRASHMULTIPLEMULTIPLE SUPPAE Dataset QNAMQLABELQVALAERELABCCausality of AbcicinPOSSIBLY RELATEDAERELXYZCausality of XyzaminUNLIKELY RELATEDAEACNABCAction Taken with AbcicinDOSE REDUCEDAEACNXYZAction Taken with XyzaminDOSE NOT CHANGED In each of the above examples, the use of SUPPAE should be documented in the Define-XML document and the annotated CRF. The controlled terminology used should be documented as part of value-level metadata. If the sponsor has clearly documented that one response is of primary interest (e.g., in the CRF, protocol, or analysis plan), the standard domain variable may be populated with the primary response and SUPP-- may be used to store the secondary response(s). For example, if Abcicin is designated as the primary study drug in the example above: AE Dataset AETERMAERELAEACNRASHPOSSIBLY RELATEDDOSE REDUCED SUPPAE Dataset QNAMQLABELQVALAERELXCausality of XyzaminUNLIKELY RELATEDAEACNXAction Taken with XyzaminDOSE NOT CHANGED Note that in the latter case, the label for standard variables AEREL and AEACN will have no indication that they pertain to Abcicin. This association must be clearly documented in the metadata and annotated CRF. 4.2.9 Variable LengthsVery large transport files have become an issue for FDA to process. One of the main contributors to the large file sizes has been sponsors using the maximum length of 200 for character variables. To help rectify this situation:
4.3 Coding and Controlled Terminology AssumptionsExamples provided in the column "CDISC Notes" are only examples and not intended to imply controlled terminology. Check current controlled terminology at this link: http://www.cancer.gov/cancertopics/cancerlibrary/terminologyresources/cdisc. 4.3.1 Types of Controlled TerminologyAs of SDTMIG v3.3, controlled terminology is represented one of the following ways:
In addition, the "Controlled Terms, Codelist or Format" column has been used to indicate variables that use an ISO 8601 format. 4.3.2 Controlled Terminology Text CaseTerms from controlled terminology should be in the case that appears the source codelist or code system (e.g., CDISC codelist or external code system such as MedDRA). See Section 4.2.4 Text Case in Submitted Data 4.3.3 Controlled Terminology ValuesThe controlled terminology or a reference to the controlled terminology should be included in the Define-XML document file wherever applicable. All values in the permissible value set for the study should be included, whether they are represented in the submitted data or not. Note that a null value should not be included in the permissible value set. A null value is implied for any list of controlled terms unless the variable is "Required" (see Section 4.1.5, SDTM Core Designations). When a domain or datasetspecification includes a codelist for a variable, not every value in that codelist may have been part of planned data collection; only values that were part of planned data collection should be included in the Define-XML document. For example, --PRESP variables are associated with the NY codelist, but only the value "Y" is allowed in --PRESP variables. Future versions of the Define-XML Specification are expected to include information on representing subsets of controlled terminology. 4.3.4 Use of Controlled Terminology and Arbitrary Number CodesControlled terminology or human-readable text should be used instead of arbitrary number codes in order to reduce ambiguity for submission reviewers. For example, CMDECOD would contain human-readable dictionary text rather than a numeric code. Numeric code values may be submitted as Supplemental Qualifiers if necessary. 4.3.5 Storing Controlled Terminology for Synonym Qualifier Variables
The sponsor is expected to provide the dictionary name and version used to map the terms by utilizing the Define-XML external codelist attributes. 4.3.6 Storing Topic Variables for General Domain ModelsThe topic variable for the Interventions and Events general-observation-class models is often stored as verbatim text. For an Events domain, the topic variable is --TERM. For an Interventions domain, the topic variable is --TRT. For a Findings domain, the topic variable, --TESTCD, should use Controlled Terminology (e.g., "SYSBP" for Systolic Blood Pressure). If CDISC standard controlled terminology exists, it should be used; otherwise, sponsors should define their own controlled list of terms. If the verbatim topic variable in an Interventions or Event domain is modified to facilitate coding, the modified text is stored in --MODIFY. In most cases (other than PE), the dictionary-coded text is derived into --DECOD. Since the PEORRES variable is modified instead of the topic variable for PE, the dictionary-derived text would be placed in PESTRESC. The variables used in each of the defined domains are: DomainOriginal VerbatimModified VerbatimStandardized ValueAEAETERMAEMODIFYAEDECODDSDSTERM 4.3.7 Use of "Yes" and "No" ValuesVariables where the response is "Yes" or "No" ("Y" or "N") should normally be populated for both "Y" and "N" responses. This eliminates confusion regarding whether a blank response indicates "N" or is a missing value. However, some variables are collected or derived in a manner that allows only one response, such as when a single check box indicates "Yes". In situations such as these, where it is unambiguous to populate only the response of interest, it is permissible to populate only one value ("Y" or "N") and leave the alternate value blank. An example of when it would be acceptable to use only a value of "Y" would be for Last Observation Before Exposure Flag (--LOBXFL) variables, where "N" is not necessary to indicate that a value is not the last observation before exposure. Note: Permissible values for variables with controlled terms of "Y" or "N" may be extended to include "U" or "NA" if it is the sponsor's practice to explicitly collect or derive values indicating "Unknown" or "Not Applicable" for that variable. 4.4 Actual and Relative Time AssumptionsTiming variables (SDTM Table 2.2.5) are an essential component of all SDTM subject-level domain datasets. In general, all domains based on the three general observation classes should have at least one Timing variable. In the Events or Interventions general observation class, this could be the start date of the event or intervention. In the Findings observation class, where data are usually collected at multiple visits, at least one Timing variable must be used. The SDTMIG requires dates and times of day to be stored according to the international standard ISO 8601 (http://www.iso.org). ISO 8601 provides a text-based representation of dates and/or times, intervals of time, and durations of time. 4.4.1 Formats for Date/Time VariablesAn SDTM DTC variable may include data that is represented in ISO 8601 format as a complete date/time, a partial date/time, or an incomplete date/time. The SDTMIG template uses ISO 8601 for calendar dates and times of day, which are expressed as follows:
where:
Other characters defined for use within the ISO 8601 standard are:
Key aspects of the ISO 8601 standard are as follows:
Implementation of the ISO 8601 standard means that date/time variables are character/text data types. The SDTM fragment employed for date/time character variables is DTC. 4.4.2 Date/Time PrecisionThe concept of representing date/time precision is handled through use of the ISO 8601 standard. According to ISO 8601, precision (also referred to by ISO 8601 as "completeness" or "representations with reduced accuracy") can be inferred from the presence or absence of components in the date and/or time values. Missing components are represented by right truncation or a hyphen (for intermediate components that are missing). If the date and time values are completely missing, the SDTM date field should be null. Every component except year is represented as two digits. Years are represented as four digits; for all other components, one-digit numbers are always padded with a leading zero. The table below provides examples of ISO 8601 representations of complete and truncated date/time values using ISO 8601 "appropriate right truncations" of incomplete date/time representations. Note that if no time component is represented, the [T] time designator (in addition to the missing time) must be omitted in ISO 8601 representation.
This date and date/time model also provides for imprecise or estimated dates, such as those commonly seen in Medical History. To represent these intervals while applying the ISO 8601 standard, it is recommended that the sponsor concatenate the date/time values (using the most complete representation of the date/time known) that describe the beginning and the end of the interval of uncertainty and separate them with a solidus as shown in the table below:
Other uncertainty intervals may be represented by the omission of components of the date when these components are unknown or missing. As mentioned above, ISO 8601 represents missing intermediate components through the use of a hyphen where the missing component would normally be represented. This may be used in addition to "appropriate right truncations" for incomplete date/time representations. When components are omitted, the expected delimiters must still be kept in place and only a single hyphen is to be used to indicate an omitted component. Examples of this method of omitted component representation are shown in the table below: Date and Time as Originally RecordedLevel of UncertaintyISO 8601 Date/Time1December 15, 2003 13:15:17Date/time to the nearest second2003-12-15T13:15:172December 15, 2003 ??:15Unknown hour with known minutes2003-12-15T-:153December 15, 2003 13:??:17Unknown minutes with known date, hours, and seconds2003-12-15T13:-:174The 15th of some month in 2003, time not collectedUnknown month and time with known year and day2003---155December 15, but can't remember the year, time not collectedUnknown year with known month and day--12-1567:15 of some unknown dateUnknown date with known hour and minute-----T07:15 Note that Row 6 above, where a time is reported with no date information, represents a very unusual situation. Since most data is collected as part of a visit, when only a time appears on a CRF, it is expected that the date of the visit would usually be used as the date of collection. Using a character-based data type to implement the ISO 8601 date/time standard will ensure that the date/time information will be machine and human readable without the need for further manipulation, and will be platform and software independent. 4.4.3 Intervals of Time and Use of Duration for --DUR Variables4.4.3.1 Intervals of Time and Use of DurationAs defined by ISO 8601, an interval of time is the part of a time axis, limited by two time "instants" such as the times represented in SDTM by the variables --STDTC and --ENDTC. These variables represent the two instants that bound an interval of time, while the duration is the quantity of time that is equal to the difference between these time points. ISO 8601 allows an interval to be represented in multiple ways. One representation, shown below, uses two dates in the format: YYYY-MM-DDThh:mm:ss/YYYY-MM-DDThh:mm:ss While the above would represent the interval (by providing the start date/time and end date/time to bound the interval of time), it does not provide the value of the duration (the quantity of time). Duration is frequently used during a review; however, the duration timing variable (--DUR) should generally be used in a domain if it was collected in lieu of a start date/time (--STDTC) and end date/time (--ENDTC). If both --STDTC and --ENDTC are collected, durations can be calculated by the difference in these two values, and need not be in the submission dataset. Both duration and duration units can be provided in the single --DUR variable, in accordance with the ISO 8601 standard. The values provided in --DUR should follow one of the following ISO 8601 duration formats: PnYnMnDTnHnMnS - or - PnW where:
The letter "P" must precede other values in the ISO 8601 representation of duration. The "n" preceding each letter represents the number of Years, Months, Days, Hours, Minutes, Seconds, or the number of Weeks. As with the date/time format, "T" is used to separate the date components from time components. Note that weeks cannot be mixed with any other date/time components such as days or months in duration expressions. As is the case with the date/time representation in --DTC, --STDTC, or --ENDTC, only the components of duration that are known or collected need to be represented. Also, as is the case with the date/time representation, if no time component is represented, the [T] time designator (in addition to the missing time) must be omitted in ISO 8601 representation. ISO 8601 also allows that the "lowest-order components" of duration being represented may be represented in decimal format. This may be useful if data are collected in formats such as "one and one-half years", "two and one-half weeks", "one-half a week" or "one quarter of an hour" and the sponsor wishes to represent this "precision" (or lack of precision) in ISO 8601 representation. Remember that this is ONLY allowed in the lowest-order (right-most) component in any duration representation. The table below provides some examples of ISO-8601-compliant representations of durations: Duration as originally recordedISO 8601 Duration2 YearsP2Y10 weeksP10W3 Months 14 daysP3M14D3 DaysP3D6 Months 17 Days 3 HoursP6M17DT3H14 Days 7 Hours 57 MinutesP14DT7H57M42 Minutes 18 SecondsPT42M18SOne-half hourPT0.5H5 Days 12¼ HoursP5DT12.25H4 ½ WeeksP4.5W Note that a leading zero is required with decimal values less than one. 4.4.3.2 Interval with UncertaintyWhen an interval of time is an amount of time (duration) following an event whose start date/time is recorded (with some level of precision, i.e. when one knows the start date/time and the duration following the start date/time), the correct ISO 8601 usage to represent this interval is as follows: YYYY-MM-DDThh:mm:ss/PnYnMnDTnHnMnS where the start date/time is represented before the solidus [/], the "Pn…" following the solidus represents a "duration", and the entire representation is known as an "interval". Note that this is the recommended representation of elapsed time, given a start date/time and the duration elapsed. When an interval of time is an amount of time (duration) measured prior to an event whose start date/time is recorded (with some level of precision, i.e., where one knows the end date/time and the duration preceding that end date/time), the syntax is: PnYnMnDTnHnMnS/YYYY-MM-DDThh:mm:ss where the duration, "Pn…", is represented before the solidus [/], the end date/time is represented following the solidus, and the entire representation is known as an "interval". 4.4.4 Use of the "Study Day" VariablesThe permissible Study Day variables (--DY, --STDY, and --ENDY) describe the relative day of the observation starting with the reference date as Day 1. They are determined by comparing the date portion of the respective date/time variables (--DTC, --STDTC, and --ENDTC) to the date portion of the Subject Reference Start Date (RFSTDTC from the Demographics domain). The Subject Reference Start Date (RFSTDTC) is designated as Study Day 1. The Study Day value is incremented by 1 for each date following RFSTDTC. Dates prior to RFSTDTC are decreased by 1, with the date preceding RFSTDTC designated as Study Day -1 (there is no Study Day 0). This algorithm for determining Study Day is consistent with how people typically describe sequential days relative to a fixed reference point, but creates problems if used for mathematical calculations because it does not allow for a Day 0. As such, Study Day is not suited for use in subsequent numerical computations, such as calculating duration. The raw date values should be used rather than Study Day in those calculations. All Study Day values are integers. Thus, to calculate Study Day: --DY = (date portion of --DTC) - (date portion of RFSTDTC) + 1 if --DTC is on or after RFSTDTC --DY = (date portion of --DTC) - (date portion of RFSTDTC) if --DTC precedes RFSTDTC This algorithm should be used across all domains. 4.4.5 Clinical Encounters and VisitsAll domains based on the three general observation classes should have at least one timing variable. For domains in the Events or Interventions observation classes, and for domains in the Findings observation class, for which data are collected only once during the study, the most appropriate timing variable may be a date (e.g., --DTC, --STDTC) or some other timing variable. For studies that are designed with a prospectively defined schedule of visit-based activities, domains for data that are to be collected more than once per subject (e.g., Labs, ECG, Vital Signs) are expected to include VISITNUM as a timing variable. Clinical encounters are described by the CDISC Visit variables. For planned visits, values of VISIT, VISITNUM, and VISITDY must be those defined in the Trial Visits (TV) dataset (Section 7.3.1, Trial Visits). For planned visits:
Sponsor practices for populating visit variables for unplanned visits may vary across sponsors.
The following table shows an example of how the visit identifiers might be used for lab data: USUBJIDVISITVISITNUMVISITDYLBDY001Week 1277001Week 231413001Week 2 Unscheduled3.1 4.4.6 Representing Additional Study DaysThe SDTM allows to represent study days relative to the RFSTDTC reference start date variable in the DM dataset, using variables --DY, as described above in Section 4.4.4, Use of the "Study Day" Variables. The calculation of additional study days within subdivisions of time in a clinical trial may be based on one or more sponsor-defined reference dates not represented by RFSTDTC. In such cases, the sponsor may define Supplemental Qualifier variables and the Define-XML document should reflect the reference dates used to calculate such study days. If the sponsor wishes to define "day within element" or "day within epoch", the reference date/time will be an element start date/time in the Subject Elements (SE) dataset (Section 5.3, Subject Elements). 4.4.7 Use of Relative Timing Variables--STRF and --ENRF The variables --STRF and --ENRF represent the timing of an observation relative to the sponsor-defined Study Reference Period, when information such as "BEFORE", "PRIOR", "ONGOING"', or "CONTINUING" is collected in lieu of a date and this collected information is in relation to the sponsor-defined Study Reference Period. The sponsor-defined Study Reference Period is the continuous period of time defined by the discrete starting point, RFSTDTC, and the discrete ending point, RFENDTC, for each subject in the Demographics dataset. --STRF is used to identify the start of an observation relative to the sponsor-defined Study Reference Period. --ENRF is used to identify the end of an observation relative to the sponsor-defined Study Reference Period. Allowable values for --STRF are "BEFORE", "DURING", "DURING/AFTER", "AFTER", and "U" (for unknown). Although "COINCIDENT" and "ONGOING" are in the STENRF codelist, they describe timing relative to a point in time rather than an interval of time, so are not appropriate for use with --STRF variables. It would be unusual for an event or intervention to be recorded as starting "AFTER" the Study Reference Period, but could be possible, depending on how the Study Reference Period is defined in a particular study. Allowable values for --ENRF are "BEFORE", "DURING", "DURING/AFTER", "AFTER" and "U" (for unknown). If --ENRF is used, then --ENRF = "AFTER" means that the event did not end before or during the Study Reference Period. Although "COINCIDENT" and "ONGOING" are in the STENRF codelist, they describe timing relative to a point in time rather than an interval of time, so are not appropriate for use with --ENRF variables. As an example, a CRF checkbox that identifies concomitant medication use that began prior to the Study Reference Period would translate into CMSTRF = "BEFORE", if selected. Note that in this example, the information collected is with respect to the start of the concomitant medication use only, and therefore the collected data corresponds to variable CMSTRF, not CMENRF. Note also that the information collected is relative to the Study Reference Period, which meets the definition of CMSTRF. Some sponsors may wish to derive --STRF and --ENRF for analysis or reporting purposes even when dates are collected. Sponsors are cautioned that doing so in conjunction with directly collecting or mapping data such as "BEFORE", "PRIOR", "ONGOING", etc., to --STRF and --ENRF will blur the distinction between collected and derived values within the domain. Sponsors wishing to do such derivations are instead encouraged to use analysis datasets for this derived data. In general, sponsors are cautioned that representing information using variables --STRF and --ENRF may not be as precise as other methods, particularly because information is often collected relative to a point in time or to a period of time other than the one defined as the Study Reference Period. SDTMIG v3.1.2 attempted to address these limitations by the addition of four new relative timing variables, which are described in the following paragraph. Sponsors should use the set of variables that allows for accurate representation of the collected data. In many cases, this will mean using these new relative timing variables in place of --STRF and --ENRF. --STRTPT, --STTPT, --ENRTPT, and --ENTPT While the variables --STRF and --ENRF are useful in the case when relative timing assessments are made coincident with the start and end of the Study Reference Period, these may not be suitable for expressing relative timing assessments such as "Prior" or "Ongoing" that are collected at other times of the study. As a result, four new timing variables were added in v3.1.2 to express a similar concept at any point in time. The variables --STRTPT and --ENRTPT contain values similar to --STRF and --ENRF, but may be anchored with any timing description or date/time value expressed in the respective --STTPT and --ENTPT variables, and are not limited to the Study Reference Period. Unlike the variables --STRF and --ENRF, which for all domains are defined relative to one Study Reference Period, the timing variables --STRTPT, --STTPT, --ENRTPT, and --ENTPT are defined by each sponsor for each study. Allowable values for --STRTPT and --ENRTPT are as follows: If the reference time point corresponds to the date of collection or assessment:
If the reference time point is prior to the date of collection or assessment:
Although "DURING" and "DURING/AFTER" are in the STENRF codelist, they describe timing relative to an interval of time rather than a point in time, so are not allowable for use with --STRTPT and --ENRTPT variables. Examples of --STRTPT, --STTPT, --ENRTPT, and --ENTPT Example: Medical History Assumptions:
Example when "Active" is checked:
Figure 4.4.7: Example of --ENRTPT and --ENTPT for Medical History Example: Prior and Concomitant Medications Assumptions:
Example when both "Prior" and "Continuing" are checked:
Example: Adverse Events Assumptions:
Example when "Unknown" is checked:
4.4.8 Date and Time Reported in a Domain Based on FindingsWhen the date/time of collection is reported in any domain, the date/time should go into the --DTC field (e.g., EGDTC for Date/Time of ECG). For any domain based on the Findings general observation class, such as lab tests which are based on a specimen, the collection date is likely to be tied to when the specimen or source of the finding was captured, not necessarily when the data were recorded. In order to ensure that the critical timing information is always represented in the same variable, the --DTC variable is used to represent the time of specimen collection. For example, in the LB domain the LBDTC variable would be used for all single-point blood collections or spot urine collections. For timed lab collections (e.g., 24-hour urine collections) the LBDTC variable would be used for the start date/time of the collection and LBENDTC for the end date/time of the collection. This approach will allow the single-point and interval collections to use the same date/time variables consistently across all datasets for the Findings general observation class. The table below illustrates the proper use of these variables. Note that --STDTC is not used for collection dates over an interval in the Findings general observation class and is therefore blank in the following table. Collection Type--DTC--STDTC--ENDTCSingle-Point CollectionX 4.4.9 Use of Dates as Result VariablesDates are generally used only as timing variables to describe the timing of an event, intervention, or collection activity, but there may be occasions when it may be preferable to model a date as a result (--ORRES) in a Findings dataset. Note that using a date as a result to a Findings question is unusual and atypical, and should be approached with caution. This situation, however, may occasionally occur when a) a group of questions (each of which has a date response) is asked and analyzed together; or b) the Event(s) and Intervention(s) in question are not medically significant (often the case when included in questionnaires). Consider the following cases:
One approach to modeling these data would be to place the text of the question in --TEST and the response to the question, a date represented in ISO 8601 format, in --ORRES and --STRESC, as long as these date results do not contain the dates of medically significant events or interventions. Again, use extreme caution when storing dates as the results of Findings. Remember, in most cases, these dates should be timing variables associated with a record in an Intervention or Events dataset. 4.4.10 Representing Time PointsTime points can be represented using the time point variables, --TPT, --TPTNUM, --ELTM, and the time point anchors, --TPTREF (text description) and --RFTDTC (the date/time). Note that time-point data will usually have an associated --DTC value. The interrelationship of these variables is shown in Figure 4.4.10 below. Figure 4.4.10: Relationships among Time Point Variables Values for these variables for Vital Signs measurements taken at 30, 60, and 90 minutes after dosing would look like the following. VSTPTNUMVSTPTVSELTMVSTPTREFVSRFTDTCVSDTC130 MINPT30MDOSE ADMINISTRATION2006-08-01T08:002006-08-01T08:30260 MINPT1HDOSE ADMINISTRATION2006-08-01T08:002006-08-01T09:01390 MINPT1H30MDOSE ADMINISTRATION2006-08-01T08:002006-08-01T09:32 Note that VSELTM is the planned elapsed time, not the actual elapsed time. The actual elapsed time could be derived in an analysis dataset, if desired, as VSDTC-VSRFTDTC. Values for these variables for Urine Collections taken pre-dose, and from 0-12 hours and 12-24 hours after dosing would look like the following. LBTPTNUMLBTPTLBELTMLBTPTREFLBRFTDTCLBDTC115 MIN PRE-DOSE-PT15MDOSE ADMINISTRATION2006-08-01T08:002006-08-01T07:4520-12 HOURSPT12HDOSE ADMINISTRATION2006-08-01T08:002006-08-01T20:35312-24 HOURSPT24HDOSE ADMINISTRATION2006-08-01T08:002006-08-02T08:40 Note that the value in LBELTM represents the end of the specimen collection interval. When time points are used, --TPTNUM is expected. Time points may or may not have an associated --TPTREF. Sometimes, --TPTNUM may be used as a key for multiple values collected for the same test within a visit; as such, there is no dependence upon an anchor such as --TPTREF, but there will be a dependency upon the VISITNUM. In such cases, VISITNUM will be required to confer uniqueness to values of --TPTNUM. If the protocol describes the scheduling of a dose using a reference intervention or assessment, then --TPTREF should be populated, even if it does not contribute to uniqueness. The fact that time points are related to a reference time point, and what that reference time point is, are important for interpreting the data collected at the time point. Not all time points will require all three variables to provide uniqueness. In fact, in some cases a time point may be uniquely identified without the use of VISIT, or without the use of --TPTREF, or, without the use of either one. For instance:
For trials with many time points, the requirement to provide uniqueness using only VISITNUM, --TPTREF, and --TPTNUM may lead to a scheme where multiple natural keys are combined into the values of one of these variables. For instance, in a crossover trial with multiple doses on multiple days within each period, either of the following options could be used. VISITNUM might be used to designate period, --TPTREF might be used to designate the day and the dose, and --TPTNUM might be used to designate the timing relative to the reference time point. Alternatively, VISITNUM might be used to designate period and day within period, --TPTREF might be used to designate the dose within the day, and --TPTNUM might be used to designate the timing relative to the reference time point. Option 1 VISITVISITNUM--TPT--TPTNUM--TPTREFPERIOD 13PRE-DOSE1DAY 1, AM DOSE1H24H3PRE-DOSE1DAY 1, PM DOSE1H24H3PRE-DOSE1DAY 5, AM DOSE1H24H3PRE-DOSE1DAY 5, PM DOSE1H24H3PERIOD 24PRE-DOSE1DAY 1, AM DOSE1H24H3PRE-DOSE1DAY 1, PM DOSE1H24H3 Option 2 VISITVISITNUM--TPT--TPTNUM--TPTREFPERIOD 1, DAY 13PRE-DOSE1AM DOSE1H24H3PRE-DOSE1PM DOSE1H24H3PERIOD 1, DAY 54PRE-DOSE1AM DOSE1H24H3PRE-DOSE1PM DOSE1H24H3PERIOD 2, DAY 15PRE-DOSE1AM DOSE1H24H3PRE-DOSE1PM DOSE1H24H3 Within the context that defines uniqueness for a time point, which may include domain, visit, and reference time point, there must be a one-to-relationship between values of --TPT and --TPTNUM. In other words, if domain, visit, and reference time point uniquely identify subject data, then if two subjects have records with the same values of DOMAIN, VISITNUM, --TPTREF, and --TPTNUM, then these records may not have different time point descriptions in --TPT. Within the context that defines uniqueness for a time point, there is likely to be a one-to-one relationship between most values of --TPT and --ELTM. However, since --ELTM can only be populated with ISO 8601 periods of time (as described in Section 4.4.3, Intervals of Time and Use of Duration for --DUR Variables), --ELTM may not be populated for all time points. For example, --ELTM is likely to be null for time points described by text such as "pre-dose" or "before breakfast". When --ELTM is populated, if two subjects have records with the same values of DOMAIN, VISITNUM, --TPTREF, and --TPTNUM, then these records may not have different values in --ELTM. When the protocol describes a time point with text such as "4-6 hours after dose" or "12 hours +/- 2 hours after dose" the sponsor may choose whether and how to populate --ELTM. For example, a time point described as "4-6 hours after dose" might be associated with an --ELTM value of PT4H. A time point described as "12 hours +/- 2 hours after dose" might be associated with an --ELTM value of PT12H. Conventions for populating --ELTM should be consistent (the examples just given would probably not both be used in the same trial). It would be good practice to indicate the range of intended timings by some convention in the values used to populate --TPT. Sponsors may, of course, use more stringent requirements for populating --TPTNUM, --TPT, and --ELTM. For instance, a sponsor could decide that all time points with a particular --ELTM value would have the same values of --TPTNUM, and --TPT, across all visits, reference time points, and domains. 4.4.11 Disease Milestones and Disease Milestone Timing VariablesA "disease milestone" is an event or activity that can be anticipated in the course of a disease, but whose timing is not controlled by the study schedule. A disease milestone may be something that occurred pre-study, but which represents a time at which data would have been collected, such as diagnosis of the disease under study. A disease milestone may also be something which is anticipated to occur during a study and which, if it occurs, triggers the collection of related data outside the regular schedule of visits, such as an adverse event of interest. The types of Disease Milestones for a study are defined in the study-level Trial Disease Milestones (TM) dataset (Section 7.3.3, Trial Disease Milestones). The times at which disease milestones occurred for a particular subject are summarized in the special purpose Subject Disease Milestones (SM) domain (Section 5.4, Subject Disease Milestones), a domain similar in structure to the Subject Visits (SV) and Subject Elements (SE) domains. Not all studies will have disease milestones. If a study does not have disease milestones, the TM and SM domains will not be present and the disease milestones timing variables may not be included in other domains. Disease Milestone Naming Instances of disease milestones are given names at a subject level. The name of a disease milestone is composed of a character string that depends on the disease milestone type (MIDSTYPE in TM and SM) and, if the type of disease milestone is one that may occur multiple times, a chronological sequence number for this disease milestone among other instances of the same type for the subject. The character string used in the name of a disease milestone is usually a short form of the disease milestone type. For example, if the type of disease milestone was "EPISODE OF DISEASE UNDER STUDY", the values of MIDS for instances of this type of event could include "EPISODE1", "EPISODE2", etc, or "EPISODE01", "EPISODE02", etc. The association between the longer text in MIDSTYPE and the shorter text in MIDS can be seen in SM, which includes both variables. Disease Milestones Name (MIDS) If something that has been defined as a disease milestone for a particular study occurred for a particular subject, it is represented as usual, in the appropriate findings, intervention, or events class record. In addition this record will include the MIDS timing variable, populated with the name of the disease milestone. The timing of a disease milestone is also represented in the special purpose SM domain. The record that represents a disease milestone does not include values for the timing variables RELMIDS and MIDSDTC, which are used to represent the timing of other observations relative to a disease milestone. The usual timing variables in the record for a disease milestone (e.g., --DTC, --STDTC, --ENDTC) provide the needed timing for this observation and for the timing information represented in the SM domain. Timing Relative to a Disease Milestone (MIDS, RELMIDS, MIDSDTC) For an observation triggered by the occurrence of a disease milestone, the relationship of the observation to the disease milestone can be represented using the disease milestones timing variables MIDS, RELMIDS, and MIDSDTC to describe the timing of the observation.
In some cases, data collected in conjunction with a disease milestone does not include the collection of a separate date for the related observation. This is particularly common for pre-study disease milestones, but may occur with on-study disease milestones as well. In such cases, MIDSDTC provides a related date/time in records that would not otherwise contain any date. In records that do contain date/time(s) of the observation, MIDSDTC allows easy comparison of the date(s) of the observation to the (start) date of the disease milestone. In such cases, it functions much like the reference time point date/time (--RFTDTC) in observations at time points. When a disease milestone is an event or intervention, some data triggered by the disease milestone may be modeled as Findings About the disease milestone (i.e., FAOBJ is the disease milestone). In such cases, RELMIDS should be used to describe the temporal relationship between the Disease Milestone and the subject of the question being asked in the finding, rather than as describing when the question was asked.
Use of Disease Milestone Timing Variables with other Timing Variables The disease milestone timing variables provide timing relative to an activity or event that has been identified, for the particular study, as a disease milestone. Their use does not preclude the use of variables that collect actual date/times or timing relative to the study schedule.
Linking and Disease Milestones When disease milestones have been defined for a study, the MIDS variable serves to link observations associated with a disease milestone in a way similar to the way that VISITNUM links observations collected at a visit. If disease milestones were not defined for the study, it would be possible to link records associated with a disease milestone using RELREC, but the use of disease milestones has certain advantages:
4.5 Other Assumptions4.5.1 Original and Standardized Results of Findings and Tests Not Done4.5.1.1 Original and Standardized ResultsThe --ORRES variable contains the result of the measurement or finding as originally received or collected. --ORRES is an expected variable and should always be populated, with two exceptions:
Note that records for which --DRVFL = "Y" may combine data collected at more than one visit. In such a case the sponsor must define the value for VISITNUM, addressing the correct temporal sequence. If a new record is derived for a dataset, and the source is not eDT, then that new record should be flagged as derived. For example, in ECG data, if a corrected QT interval value derived in-house by the sponsor were represented in an SDTM record, then EGDRVFL would be "Y". If a corrected QT interval value was received from a vendor or was produced by the ECG machine, the derived flag would be null. When --ORRES is populated, --STRESC must also be populated, regardless of whether the data values are character or numeric. The variable, --STRESC, is populated either by the conversion of values in --ORRES to values with standard units, or by the assignment of the value of --ORRES (as in the PE Domain, where --STRESC could contain a dictionary-derived term). A further step is necessary when --STRESC contains numeric values. These are converted to numeric type and written to --STRESN. Because --STRESC may contain a mixture of numeric and character values, --STRESN may contain null values, as shown in the flowchart below. --ORRES When the original measurement or finding is a selection from a defined codelist, in general, the --ORRES and --STRESC variables contain results in decoded format, that is, the textual interpretation of whichever code was selected from the codelist. In some cases where the code values in the codelist are statistically meaningful standardized values or scores, which are defined by sponsors or by valid methodologies such as SF36 questionnaires, the --ORRES variables will contain the decoded format, whereas, the --STRESC variables as well as the --STRESN variables will contain the standardized values or scores. Occasionally data that are intended to be numeric are collected with characters attached that cause the character-to-numeric conversion to fail. For example, numeric cell counts in the source data may be specified with a greater than (>) or less than (<) sign attached (e.g. >10,000 or <1). In these cases, the value with the greater than (>) or less than (<) sign attached should be moved to the --STRESC variable, and --STRESN should be null. The rules for modifying the value for analysis purposes should be defined in the analysis plan and a numeric value should only be imputed in the ADaM datasets. If the value in --STRESC has different units, the greater than (>) or less than (<) sign should be maintained. An example is included in Section 4.5.1.3, Examples of Original and Standard Units and Test Not Done, Example 1, Rows 11 and 12. 4.5.1.2 Tests Not DoneIf the data on the CRF is missing and "Yes/No" or "Done/Not Done" was not explicitly captured, a record should not be created to indicate that the data was not collected. When an entire examination (laboratory draw, ECG, vital signs, or physical examination), or a group of tests (hematology or urinalysis), or an individual test (glucose, PR interval, blood pressure, or hearing) is not done, and this information is explicitly captured with a "Yes/No" or "Done/Not Done" question, this information should be represented in the dataset. The reason for the missing information may or may not have been collected. A sponsor has two options:
The example below illustrates the single-record approach for representing a group of tests not done. If a single record is used to represent a group of tests were not done:
For example, if urinalysis tests were not done, then:
4.5.1.3 Examples of Original and Standard Units and Test Not DoneThe following examples are meant to illustrate the use of Findings results variables, and are not meant as comprehensive domain examples. Certain required and expected variables are omitted, for example USUBJID, and the samples may represent data for more than one subject. Example Row 1:A numeric value was converted to the standard unit.Row 2:A numeric value was copied; the original unit was the standard unit so conversion was not needed.Rows 3-4:A character result was copied from the LBORRES to LBSTRESC. Since this is not a numeric result, LBSTRESN is null.Row 5:A character result was converted to a standardized format.Row 6:A result of "BLQ" was collected and copied to LBSTRESC. Note that the sponsor populated both LBORRESU and LBSTRESU with standard units, but these could have been left null.Row 7:A result was derived from multiple results, so LBDRVFL = "Y". Note that the original collected data are not shown in this example.Row 8:A result for LBTEST = "HCT" is missing for visit 2, as indicated by LBSTAT = "NOT DONE"; neither LBORRES nor LBSTRESC is populated.Row 9:Tests in the category "HEMATOLOGY" were not done at visit 3, as indicated by LBTESTCD = "LBALL" and LBSTAT = "NOT DONE".Row 10:None of the tests in the LB domain were done at visit 4, as indicated by LBTESTCD = "LBALL", a null LBCAT value, and LBSTAT = "NOT DONE".Row 11:Shows a result collected as an inequality. The unit collected was the standard unit, so the result required no conversion and was copied to LBSTRESC.Row 12:Shows a result collected as an inequality. In LBSTRESC, the numeric part of LBORRES has been converted to the standard unit, and the less than (<) sign has been retained. LBSTRESN is not populated. lb.xpt RowLBTESTCDLBCATLBORRESLBORRESULBSTRESCLBSTRESNLBSTRESULBSTATLBLOBXFLVISITNUMLBDTC1GLUCCHEMISTRY6.0mg/dL60.060.0mg/L Example Row 1:A numeric result was collected in standard units. Since no conversion was necessary, the result was copied into LBSTRESC and LBSTRESN.Rows 2-3:Numeric results were converted to standard units.Row 4:Character values were copied to EGSTRESC. EGSTRESN is null.Row 5:The overall interpretation of the ECG is represented as a separate test.Row 6:The result for EGTESTCD = "PRAG" was missing at visit 2, as indicated by EGSTAT = "NOT DONE"; neither EGORRES nor EGSTRESC is populated.Row 7:At visit 3, there were no ECG results, as indicated by EGTESTCD = "EGALL" and EGSTAT = "NOT DONE". eg.xpt RowEGTESTCDEGTESETEGORRESEGORRESUEGSTRESCEGSTRESNEGSTRESUEGSTATVISITNUMEGDTC1QRSAGPR Interval, Aggregate0.362sec0.3620.362sec Example Rows 1-2:Numeric values were converted to standard units.Row 3:A result for VSTESTCD = "HR" is missing, as indicated by VSSTAT = "NOT DONE"; neither VSORRES nor VSSTRESC is populated.Rows 4-5:Two measurements for VSTESTCD= "SYSBP" were done at visit 1.Row 6:A third measurement for VSTESTCD = "SYSBP" at visit 1 was a derived record, as indicated by VSDRVFL = "Y".Row 7:At visit 2, there were no Vital Signs results, as indicated by VSTESTCD = "VSALL" and VSSTAT = "NOT DONE". vs.xpt RowVSTESTCDVSORRESVSORRESUVSSTRESCVSSTRESNVSSTRESUVSSTATVSDRVFLVISITNUMVSDTC1HEIGHT60in152152cm 4.5.2 Linking of Multiple ObservationsSee Section 8, Representing Relationships and Data, for guidance on expressing relationships among multiple observations. 4.5.3 Text Strings That Exceed the Maximum Length for General-Observation-Class Domain Variables4.5.3.1 Test Name (--TEST) Greater than 40 CharactersSponsors may have test descriptions (--TEST) longer than 40 characters in their operational database. Since the --TEST variable is meant to serve as a label for a --TESTCD when a Findings dataset is transposed to a more horizontal format, the length of --TEST is limited to 40 characters (except as noted below) to conform to the limitations of the SAS v5 Transport format currently used for submission datasets. Therefore, sponsors have the choice to either insert the first 40 characters or a text string abbreviated to 40 characters in --TEST. Sponsors should include the full description for these variables in the study metadata in one of two ways:
The convention above should also be applied to the Qualifier Value Label (QLABEL) in Supplemental Qualifiers (SUPP--) datasets. IETEST values in IE and TI are exceptions to the above 40-character rule and are limited to 200 characters, since they are not expected to be transformed to column labels. Values of IETEST that exceed 200 characters should be described in study metadata as per the convention above. For further details see IE Assumption 3, and TI Assumption 5. 4.5.3.2 Text Strings Greater than 200 Characters in Other VariablesSome sponsors may collect data values longer than 200 characters for some variables. Because of the current requirement for the SAS v5 Transport file format, it is not possible to store the long text strings using only one variable. Therefore, the SDTMIG has defined conventions for storing long text string using multiple variables. For general-observation-class variables and supplemental qualifiers (i.e., non-standard variables), the conventions are as follows:
Example: MHTERM with 500 characters mh.xpt RowSTUDYIDDOMAINUSUBJIDMHSEQMHTERM112345MH99-12361st ~200 chars of text, split between words suppmh.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIGQEVAL112345MH99-123MHSEQ6MHTERM1Reported Term for the Medical History2nd ~200 chars of text, split between wordsCRF Example: AEACN with >200 characters In this example, the text entered for AEACNOTH was longer than 200 characters, but required only one supplemental qualifier for the text that extended beyond what could be represented in the standard variable. ae.xpt RowSTUDYIDDOMAINUSUBJIDAESEQAETERMAEACNOTH112345AE99-1234HEART FAILURE1st ~200 characters of text, split between words suppae.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIGQEVAL112345AE99-123AESEQ4AEACNOT1Other Action Takenremaining characters of textCRF Example pr.xpt RowSTUDYIDDOMAINUSUBJIDPRSEQPRTRT112345PR99-1234KIDNEY TRANSPLANT In this example, the text of the supplemental qualifier PRREAS was longer than 200 characters, but required only one additional supplemental qualifier to represent the remaining text. supppr.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIG112345PR99-123PRSEQ1PRREASReason1st ~200 characters of text, split between wordsCRF212345PR99-123PRSEQ1PRREAS1Reasonremaining characters of textCRF The following domains have specialized conventions for representing values longer than 200 characters:
The following table summarizes the conventions and notes the specializations. Text Strings >200 Char Conventions General Observation Class & Supplemental Qualifier Variables Text Strings >200 Char Conventions CO.COVAL Text Strings >200 Char Conventions TS.TSVAL Text Strings >200 Char Conventions TI.IETEST and IE.IETEST The first 200 characters of text should be stored in the variable and each additional 200 characters of text should be stored as a record in the SUPP-- datasetThe first 200 characters of text should be stored in COVAL and each additional 200 characters of text should be stored in COVAL1 to COVALn.The first 200 characters of text should be stored in TSVAL and each additional 200 characters of text should be stored in TSVAL1 to TSVALn.If the inclusion/exclusion criteria text is >200 characters, put meaningful text in IETEST and describe the full text in the study metadata.When splitting a text string into several records, the text should be split between words to improve readability.When splitting a text string into several records, the text should be split between words to improve readability.When splitting a text string into several records, the text should be split between words to improve readability.Not applicable.The value for QLABEL should be the original domain variable label.The variable labels for COVAL1 to COVALn should be "Comment".The variable labels for TSVAL1 to TSVALn should be "Parameter Value".Not applicable.4.5.4 Evaluators in the Interventions and Events Observation ClassesBecause observations may originate from more than one source (e.g., an Investigator or Independent Assessor), the observations recorded in the Findings class include the --EVAL qualifier. For the Interventions and Events observation classes, which do not include the --EVAL variable, all data are assumed to be attributed to the principal investigator. The QEVAL variable can be used to describe the evaluator for any data item in a SUPP-- dataset (Section 8.4.1, Supplemental Qualifiers – SUPP-- Datasets), but is not required when the data are objective. For observations that have primary and secondary evaluations of specific qualifier variables, sponsors should put data from the primary evaluation into the standard domain dataset and data from the secondary evaluation into the Supplemental Qualifier datasets (SUPP--). Within each SUPP-- record, the value for QNAM should be formed by appending a "1" to the corresponding standard domain variable name. In cases where the standard domain variable name is already eight characters in length, sponsors should replace the last character with a "1" (incremented for each additional attribution). This example illustrates a case where an adjudication committee evaluated an adverse event. The evaluations of the adverse event by the primary investigator were represented in the standard AE dataset. The evaluations of the adjudication committee were represented in SUPPAE. See Section 8.4, Relating Non-Standard Variables Values to a Parent Domain. Note that the QNAM for the "Relationship to Non-Study Treatment" supplemental qualifier is AERELNS1, rather than AERELNST1, since AERELNST already eight characters in length. suppae.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIGQEVAL112345AE99-123AESEQ3AESEV1Severity/ IntensityMILDCRFADJUDICATION COMMITTEE212345AE99-123AESEQ3AEREL1CausalityPOSSIBLY RELATEDCRFADJUDICATION COMMITTEE312345AE99-123AESEQ3AERELNS1Relationship to Non-Study TreatmentPossibly related to aspirin useCRFADJUDICATION COMMITTEE 4.5.5 Clinical Significance for Findings Observation Class DataFor assessments of clinical significance when the overall interpretation is a record in the domain, use a Supplemental Qualifier (SUPP--) record (with QNAM = "--CLSIG") linked to the record that contains the overall interpretation or a particular result. An example would be a QNAM value of "EGCLSIG" in SUPPEG with a value of "Y", indicating that an ECG result of "ATRIAL FIBRILLATION" was clinically significant. Separate from clinical significance are results of "NORMAL" or "ABNORMAL", or lab values that are out of normal range. Examples of the latter include the following:
4.5.6 Supplemental Reason VariablesThe SDTM general observation classes include the --REASND variable to submit the reason a response is not present (a result in a findings class or an --OCCUR value in an events or interventions variable). However, sponsors sometimes collect the reason that something wasdone. For the interventions general observation class, --INDC is available to represent the medical condition for which the intervention was given, and --ADJ is available to represent the reason for a dose adjustment. If the sponsor collects a reason for performing a test represented in a findings or an activity represented in an events domain, or a reason for an intervention other than a medical indication, the reason can be represented in the SUPP-- dataset (as described in Section 8.4.1, Supplemental Qualifiers – SUPP-- Datasets) using the supplemental qualifier with QNAM of "--REAS" listed in Appendix C2, Supplemental Qualifiers Name Codes. If multiple reasons are reported, refer to Section 4.2.8.3, Multiple Values for a Non-Result Qualifier Variable. For example, if the sponsor collected the reason that an extra lab test was done, a SUPPLB record might be populated as follows. Note that the sponsor used a label that was made more specific to the LB domain, rather than the label "Reason" in the appendix. supplb.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIG112345LB99-123LBSEQ3LBREASReason Test was PerformedORIGINAL SAMPLE LOSTCRF 4.5.7 Presence or Absence of Pre-Specified Interventions and EventsInterventions (e.g., concomitant medications) and Events (e.g., medical history) can generally be collected in two different ways, by recording either verbatim free text or the responses to a pre-specified list of treatments or terms. Since the method of solicitation for information on treatments and terms may affect the frequency at which they are reported, whether they were pre-specified may be of interest to reviewers. The --PRESP variable is used to indicate whether a specific intervention (--TRT) or event (--TERM) was solicited. The --PRESP variable has controlled terminology of "Y" (for "Yes") or a null value. It is a permissible variable, and should only be used when the topic variable values come from a pre-specified list. Questions such as "Did the subject have any concomitant medications?" or "Did the subject have any medical history?" should not have records in an SDTM domain because 1) these are not valid values for the respective topic variables of CMTRT and MHTERM, and 2) records whose sole purpose is to indicate whether or not a subject had records are not meaningful. The --OCCUR variable is used to indicate whether a pre-specified intervention or event occurred or did not occur. It has controlled terminology of "Y" and "N" (for "Yes" and "No"). It is a permissible variable and may be omitted from the dataset if no topic-variable values were pre-specified. If a study collects both pre-specified interventions and events as well as free-text events and interventions, the value of --OCCUR should be "Y" or "N" for all pre-specified interventions and events, and null for those reported as free-text. The --STAT and --REASND variables can be used to provide information about pre-specified interventions and events for which there is no response (e.g., investigator forgot to ask). As in Findings, --STAT has controlled terminology of NOT DONE. SituationValue of --PRESPValue of --OCCURValue of --STATSpontaneously reported event occurred Refer to the standard domains in the Events and Interventions General Observation Classes for additional assumptions and examples. 4.5.8 Accounting for Long-Term Follow-upStudies often include long-term follow-up assessments to monitor a subject's condition. Use cases include studies in terminally ill populations that periodically assess survival and studies involving chronic disease that include follow up to assess relapse. Long-term follow-up is often conducted via telephone calls rather than clinic visits. Regardless of the method of contact, the information should be stored in the appropriate topic-based domain. Overall study conclusion in the Disposition (DS) domain occurs once all contact with the subject ceases. If a study has a clinical treatment phase followed by a long-term follow-up phase, these two segments of the study can be represented as separate epochs within the overall study, each with its own epoch disposition record. The recommended SDTM approach to storing these data can be described by an example. Assume an oncology study encompasses two months of clinical treatment and assessments followed by once-monthly telephone contacts. The contacts continue until the subject dies. During the telephone contact, the investigator collects information on the subject's survival status and medication use. The answers to certain questions may trigger other data collection. For example, if the subject's survival status is "dead", then this indicates that the subject has ceased participation in the study, so a study discontinuation record would need to be created. In SDTM, the data related to these follow-up telephone contacts should be stored as follows:
4.5.9 Baseline ValuesThe new variable --LOBXFL has been introduced in this release to address the need for a consistent definition of a value that can serve as a reference with which to compare post-treatment values. This generic definition approximates the concept of baseline and can be used to calculate post-treatment changes. In domains where --BLFL was expected, its core value has been changed from expected to permissible, the new variable --LOBXFL, with a core value of expected, has been added to contain the consistent definition. In domains where --BLFL was permissible, the new variable --LOBXFL was added with a core value of permissible. The table below shows a set of similar flag variables and their usage across SDTM and ADaM: VariableStructure Where It Is DefinedRequirement in That StructureDefinitionIntended Use--LOBXFLSDTM FindingsExpected or PermissibleLast non-missing value prior to RFXSTDTC (Operationally derived)Consistent pre-treatment reference value baseline for use across all studies and sponsors.ABLFLADaM BDSConditionally RequiredFlags the record that is the source of the baseline value for a given parameter specified in the Statistical Analysis Plan (May differ both across and within studies and datasets)Baseline for ADaM analysis as specified in the Statistical Analysis Plan--BLFLSDTM FindingsPermissible (formerly expected in some domains)A baseline defined by the sponsor (Could be derived in the same manner as --LOBXFL or ABLFL, but is not required to be)Any sponsor-defined baseline use As shown above, each variable serves a specific need. The SDTM variable --LOBXFL (and/or --BLFL, if used) can be copied to ADaM for traceability and transparency, but only the ADaM variable ABLFL would be used to signify baseline for analysis. The content of --LOBXFL and ABLFL will be exactly the same when the Statistical Analysis Plan specifies that the baseline used for analysis is the last non-missing value prior to RFXSTDTC. 5 Models for Special Purpose DomainsSpecial Purpose Domains is an SDTM category in its own right. Special Purpose Domains provide specific, standardized structures to represent additional important information that does not fit any of the General Observation Classes. Domain CodeDomain DescriptionCO Comments A special purpose domain that contains comments that may be collected alongside other data. DMDemographics A special purpose domain that includes a set of essential standard variables that describe each subject in a clinical study. It is the parent domain for all other observations for human clinical subjects. SESubject Elements A special purpose domain that contains the actual order of elements followed by the subject, together with the start date/time and end date/time for each element. SMSubject Disease Milestones A special purpose domain that is designed to record the timing, for each subject, of disease milestones that have been defined in the Trial Disease Milestones (TM) domain. SVSubject Visits A special purpose domain that contains the actual start and end data/time for each visit of each individual subject. 5.1 CommentsCO – Description/OverviewA special purpose domain that contains comments that may be collected alongside other data. CO – Specificationco.xpt, Comments — Special Purpose, Version 3.3. One record per comment per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). CO – Assumptions
CO – ExamplesExample Row 1:Shows a comment collected on a separate comments page. Since it was unrelated to any specific domain or record, RDOMAIN, IDVAR, and IDVARVAL are null.Row 2:Shows a comment that was collected on the bottom of the PE page for Visit 7, without any indication of specific records it applied to. Since the comment related to a specific domain, RDOMAIN is populated. Since it was related to a specific visit, VISIT, COREF is "VISIT 7". However, since it does not relate to a specific record, IDVAR and IDVARVAL are null.Row 3:Shows a comment related to a single AE record having its AESEQ=7.Row 4:Shows a comment related to multiple EX records with EXGRPID = "COMBO1".Row 5:Shows a comment related to multiple VS records with VSGRPID = "VS2".Row 6:Shows one option for representing a comment collected on a visit-specific comments page not associated with a particular domain. In this case, the comment is linked to the Subject Visit record in SV (RDOMAIN = "SV") and IDVAR and IDVARVAL are populated link the comment to the particular visit.Row 7:Shows a second option for representing a comment associated only with a visit. In this case, COREF is used to show that the comment is related to the particular visit.Row 8:Shows a third option for representing a comment associated only with a visit. In this case, the VISITNUM variable was populated to indicate that the comment was associated with a particular visit. co.xpt RowSTUDYIDDOMAINRDOMAINUSUBJIDCOSEQIDVARIDVARVALCOREFCOVALCOVAL1COVAL2COEVALVISITNUMCODTC11234CO 5.2 DemographicsDM – Description/OverviewA special purpose domain that includes a set of essential standard variables that describe each subject in a clinical study. It is the parent domain for all other observations for human clinical subjects. DM – Specificationdm.xpt, Demographics — Special Purpose, Version 3.3. One record per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar ARMCD is limited to 20 characters. It is not subject to the character restrictions that apply to TESTCD.The maximum length of ARMCD is longer than for other "short" variables to accommodate the kind of values that are likely to be needed for crossover trials. For example, if ARMCD values for a seven-period crossover were constructed using two-character abbreviations for each treatment and separating hyphens, the length of ARMCD values would be 20. If the subject was not assigned to an Arm, ARMCD is null and ARMNRS is populated. With the exception of studies which use multi-stage Arm assignments, must be a value of ARMCD in the Trial Arms Dataset. ExpARMDescription of Planned ArmChar*Synonym QualifierName of the Arm to which the subject was assigned. If the subject was not assigned to an Arm, ARM is null and ARMNRS is populated. With the exception of studies which use multi-stage Arm assignments, must be a value of ARM in the Trial Arms Dataset. ExpACTARMCDActual Arm CodeChar*Record QualifierCode of actual Arm. ACTARMCD is limited to 20 characters. It is not subject to the character restrictions that apply to TESTCD. The maximum length of ACTARMCD is longer than for other short variables to accommodate the kind of values that are likely to be needed for crossover trials. With the exception of studies which use multi-stage Arm assignments, must be a value of ARMCD in the Trial Arms Dataset. If the subject was not assigned to an Arm or followed a course not described by any planned Arm, ACTARMCD is null and ARMNRS is populated. ExpACTARMDescription of Actual ArmChar*Synonym QualifierDescription of actual Arm. With the exception of studies which use multi-stage Arm assignments, must be a value of ARM in the Trial Arms Dataset. If the subject was not assigned to an Arm or followed a course not described by any planned Arm, ACTARM is null and ARMNRS is populated. ExpARMNRSReason Arm and/or Actual Arm is NullChar*Record QualifierA coded reason that Arm variables (ARM and ARMCD) and/or actual Arm variables (ACTARM and ACTARMCD) are null. Examples: "SCREEN FAILURE", "NOT ASSIGNED", "ASSIGNED, NOT TREATED", "UNPLANNED TREATMENT". It is assumed that if the Arm and actual Arm variables are null, the same reason applies to both Arm and actual Arm.ExpACTARMUDDescription of Unplanned Actual ArmCharRecord QualifierA description of actual treatment for a subject who did not receive treatment described in one of the planned trial Arms.ExpCOUNTRYCountryCharISO 3166-1 Alpha-3Record QualifierCountry of the investigational site in which the subject participated in the trial.ReqDMDTCDate/Time of CollectionCharISO 8601TimingDate/time of demographic data collection.PermDMDYStudy Day of CollectionNum TimingStudy day of collection measured as integer days.Perm ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). DM – Assumptions
DM – ExamplesExample dm.xpt RowSTUDYIDDOMAINUSUBJIDSUBJIDRFSTDTCRFENDTCRFXSTDTCRFXENDTCRFICDTCRFPENDTCSITEIDINVNAMBRTHDTCAGEAGEUSEXRACEETHNICARMCDARMACTARMCDACTARMARMNRSACTARMUDCOUNTRY1ABC123DMABC12301001010012006-01-122006-03-102006-01-122006-03-102006-01-032006-04-0101JOHNSON, M1948-12-1357YEARSMWHITEHISPANIC OR LATINOADrug AADrug A Example Sample CRF: EthnicityCheck oneHispanic or LatinoNot Hispanic or Latino RaceCheck oneAmerican Indian or Alaska NativeAsianBlack or African AmericanNative Hawaiian or Other Pacific IslanderWhite Row 1:Shows data for a subject who was "NOT HISPANIC OR LATINO" and was "ASIAN".Row 2:Shows data for a subject who was "HISPANIC OR LATINO" and "WHITE". dm.xpt RowSTUDYIDDOMAINUSUBJIDRACEETHNIC1ABCDM001ASIANNOT HISPANIC OR LATINO2ABCDM002WHITEHISPANIC OR LATINO Example In this example, the subject is permitted to check all applicable races. Sample CRF: RaceCheck all that applyAmerican Indian or Alaska NativeAsianBlack or African AmericanNative Hawaiian or Other Pacific IslanderWhiteOther, Specify: ____________________ Row 1:Subject "001" checked "Other, Specify" and entered a specify value of "Brazilian". "Brazilian" is represented in a supplemental qualifier.Row 2:Subject "002" checked three of the listed races and "Other, Specify." The RACE variable is populated with "MULTIPLE" and the individual races are represented in supplemental qualifiers.Row 3:Shows the record for a subject who refused to provide information about race.Row 4:Shows the record for a subject who checked just one race, "ASIAN". dm.xpt RowSTUDYIDDOMAINUSUBJIDRACE1ABCDM001OTHER2ABCDM002MULTIPLE3ABCDM003 Row 1:The other race specified by subject "001" was represented using the supplemental qualifier RACEOTH.Rows 2-4:The three selections made by subject "002" were represented using supplemental qualifiers RACE1, RACE2, and RACE 3. The third race checked was "Other, Specify", so the value of RACE3 is "OTHER".Row 5:The other race specified by subject "002" was represented using the supplemental qualifier RACEOTH, in the same manner as for subject "001". suppdm.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIGQEVAL1ABCDM001 Example In this example, the sponsor has chosen to map some of the predefined races to other races, specifically Japanese and Non-Japanese to Asian. Note: Sponsors may choose not to map race data, in which case the previous examples should be followed. Sample CRF: RaceCheck OneAmerican Indian or Alaska NativeAsian Row 1:Shows the record for a subject who checked "Non-Japanese", which was mapped by the sponsor to the RACE value "ASIAN".Row 2:Shows the record for a subject who checked "Japanese", which was mapped by the sponsor to the RACE value "ASIAN". dm.xpt RowSTUDYIDDOMAINUSUBJIDRACE1ABCDM001ASIAN2ABCDM002ASIAN The values captured on the CRF, "Non-Japanese" and "Japanese", were represented using the supplemental qualifier "RACEOR". suppdm.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIGQEVAL1ABCDM001 Example In this example, the sponsor has chosen to map the values entered into the "Other, Specify" field to one of the preprinted races. Note: Sponsors may choose not to map race data, in which case the first two examples should be followed. Sample CRF: RaceCheck OneAmerican Indian or Alaska NativeAsianBlack or African AmericanNative Hawaiian or Other Pacific IslanderWhiteOther, Specify: _____________________ Row 1:Shows the record for a subject who checked "Other, Specify" and entered "Japanese". Their race was was mapped to "ASIAN" by the sponsor.Row 2:Shows the record for a subject who checked "Other, Specify" and entered "Swedish". Their race was mapped to "WHITE" by the sponsor. dm.xpt RowSTUDYIDDOMAINUSUBJIDRACE1ABCDM001ASIAN2ABCDM002WHITE The text entered in the "Other, Specify" line of the CRF was represented using the Supplemental qualifier RACEOR. suppdm.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIGQEVAL1ABCDM001 Example The following example illustrates values of ARMCD for subjects in Example Trial 1, described in Section 7.2.1, Trial Arms. This study included two elements, Screen and Run-In, before subjects were randomized to treatment. For this study, the sponsor submitted data on all subjects, including screen-failure subjects. This example Demography dataset does not include all the DM required and expected variables, only those that illustrate the variables that represent arm information. Row 1:Subject "001" was randomized to Arm "Drug A". As shown in the SE dataset, this subject completed the "Drug A" element, so their actual arm was also "Drug A".Row 2:Subject "002" was randomized to Arm "Drug B". As shown in the SE dataset, their actual arm was consistent with their randomization.Row 3:Subject "003" was a screen failure, so they were not assigned to an arm or treated. The arm actual arm variables are null, and ARMNRS = "SCREEN FAILURE".Row 4:Subject "004" withdrew during the Run-in Element. Like Subject "003", they were not assigned to an arm or treated. However, they were not considered a screen failure, and ARMNRS = "NOT ASSIGNED".Row 5:Subject "005" was randomized but dropped out before being treated. Thus the actual arm variables are not populated and ARMNRS = "ASSIGNED, NOT TREATED". dm.xpt RowSTUDYIDDOMAINUSUBJIDARMCDARMACTARMCDACTARMARMNRSACTARMUD1ABCDM001ADrug AADrug A Rows 1-3:Subject "001" completed all the Elements for Arm A.Rows 4-6:Subject "002" completed all the Elements for Arm B.Row 7:Subject "003" was a screen failure, who participated only in the "Screen" element.Rows 8-9:Subject "004" withdrew during the "Run-in" Element, before they could be randomized.Rows 10-11:Subject "005" withdrew after they were randomized, but did not start treatment. se.xpt RowSTUDYIDDOMAINUSUBJIDSESEQETCDELEMENTSESTDTCSEENDTC1ABCSE0011SCRNScreen2006-06-012006-06-072ABCSE0012RIRun-In2006-06-072006-06-213ABCSE0013ADrug A2006-06-212006-07-054ABCSE0021SCRNScreen2006-05-032006-05-105ABCSE0022RIRun-In2006-05-102006-05-246ABCSE0023BDrug B2006-05-242006-06-077ABCSE0031SCRNScreen2006-06-272006-06-308ABCSE0041SCRNScreen2006-05-142006-05-219ABCSE0042RIRun-In2006-05-212006-05-2610ABCSE0051SCRNScreen2006-05-142006-05-2111ABCSE0052RIRun-In2006-05-212006-05-26 Example The following example illustrates values of ARMCD for subjects in Example Trial 3, described in Section 7.2.1, Trial Arms. Row 1:Subject "001" was randomized to Drug A. At the end of the Double Blind Treatment Epoch, they were assigned to Open Label A. Thus their ARMCD is "AA". They received the treatment to which they were assigned, so ACTRMCD is also "AA".Row 2:Subject "002" was randomized to Drug A. They were lost to follow-up during the Double Blind Treatment Epoch, so never reached the Open Label Epoch, when they would have been assigned to either the Open Drug A or the Rescue Element. Their ARMCD is "A". This case illustrates the exception to the rule that ARMCD, ARM, ACTARMCD, and ACTARM must be populated with values from the TA dataset.Row 3:Subject "003" was randomized to Drug A, but Received Drug B. At the end of the Double Blind Treatment Epoch, they were assigned to Rescue Treatment. ARMCD shows the result of their assignments, "AR", while ACTARMCD shows their actual treatment, "BR". dm.xpt RowSTUDYIDDOMAINUSUBJIDARMCDARMACTARMCDACTARMARMNRSACTARMUD1DEFDM001AAA-OPEN AAAA-OPEN A Rows 1-3:Show that the subject passed through all three Elements for the AA Arm.Rows 4-5:Show the two Elements ("Screen" and "Treatment A") the subject passed through.Rows 6-8:Show that the subject passed through the three Elements associated with the "B-Rescue" Arm. se.xpt RowSTUDYIDDOMAINUSUBJIDSESEQETCDELEMENTSESTDTCSEENDTC1DEFSE0011SCRNScreen2006-01-072006-01-122DEFSE0012DBATreatment A2006-01-122006-04-103DEFSE0013OAOpen Drug A2006-04-102006-07-054DEFSE0021SCRNScreen2006-02-032006-02-105DEFSE0022DBATreatment A2006-02-102006-03-246DEFSE0031SCRNScreen2006-02-222006-03-017DEFSE0032DBBTreatment B2006-03-012006-06-278DEFSE0033RSCRescue2006-06-272006-09-24 5.3 Subject ElementsSE – Description/OverviewA special purpose domain that contains the actual order of elements followed by the subject, together with the start date/time and end date/time for each element. The Subject Elements dataset consolidates information about the timing of each subject's progress through the Epochs and Elements of the trial. For Elements that involve study treatments, the identification of which Element the subject passed through (e.g., Drug X vs. placebo) is likely to derive from data in the Exposure domain or another Interventions domain. The dates of a subject's transition from one Element to the next will be taken from the Interventions domain(s) and from other relevant domains, according to the definitions (TESTRL values) in the Trial Elements (TE) dataset (Section 7.2.2, Trial Elements). The Subject Elements dataset is particularly useful for studies with multiple treatment periods, such as crossover studies. The Subject Elements dataset contains the date/times at which a subject moved from one Element to another, so when the Trial Arms (TA; Section 7.2.1, Trial Arms), Trial Elements (TE; Section 7.2.2, Trial Elements), and Subject Elements datasets are included in a submission, reviewers can relate all the observations made about a subject to that subject's progression through the trial.
SE – Specificationse.xpt, Subject Elements — Special Purpose, Version 3.3. One record per actual Element per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar
TimingNumber that gives the planned order of the Element within the subject's assigned Arm.PermEPOCHEpochChar(EPOCH)TimingEpoch associated with the Element in the planned sequence of Elements for the Arm to which the subject was assigned.PermSESTDTCStart Date/Time of ElementCharISO 8601TimingStart date/time for an Element for each subject.ReqSEENDTCEnd Date/Time of ElementCharISO 8601TimingEnd date/time for an Element for each subject.ExpSEUPDESDescription of Unplanned ElementChar Synonym QualifierDescription of what happened to the subject during an unplanned Element. Used only if ETCD has the value of "UNPLAN".Perm ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). SE – AssumptionsSubmission of the Subject Elements dataset is strongly recommended, as it provides information needed by reviewers to place observations in context within the study. The Trial Elements and Trial Arms datasets should also be submitted, as they define the design and the terms referenced by the Subject Elements dataset. The Subject Elements domain allows the submission of data on the timing of the trial Elements a subject actually passed through in their participation in the trial. Read Section 7.2.2, Trial Elements, on the Trial Elements (TE) dataset and Section 7.2.1, Trial Arms, on the Trial Arms (TA) dataset, as these datasets define a trial's planned Elements and describe the planned sequences of Elements for the Arms of the trial.
SE – ExamplesSTUDYID and DOMAIN, which are required in the SE and DM domains, have not been included in the following examples, to improve readability. Example This example shows data for two subjects for a crossover trial with four Epochs. STUDYID and DOMAIN, which are required in the SE and DM domains, have not been included in the following examples, to improve readability. Row 1:The record for the SCREEN Element for subject "789". Note that only the date of the start of the "SCREEN" Element was collected, while for the end of the Element, which corresponds to the start of IV dosing, both date and time were collected.Row 2:The record for the IV Element for subject "789". The IV Element started with the start of IV dosing and ended with the start of oral dosing, and full date/times were collected for both.Row 3:The record for the ORAL Element for subject "789". Only the date, and not the time, of the start of follow-up was collected.Row 4:The FOLLOWUP Element for subject "789" started and ended on the same day. Presumably, the Element had a positive duration, but no times were collected.Rows 5-8:Subject "790" was treated incorrectly, as shown by the fact that the values of SESEQ and TAETORD do not match. This subject entered the "IV" Element before the "ORAL" Element, but the planned order of Elements for this subject was "ORAL", then "IV". The sponsor has assigned EPOCH values for this subject according to the actual order of Elements, rather than the planned order. The correct order of Elements is the subject's ARMCD, shown in the DM dataset.Rows 9-10:Subject "791" was screened, randomized to the IV-ORAL arm, and received the IV treatment, but did not return to the unit for the treatment epoch or follow up. se.xpt RowUSUBJIDSESEQETCDSESTDTCSEENDTCSEUPDESTAETORDEPOCH17891SCREEN2006-06-012006-06-03T10:32 Row 1:Subject "789" was assigned to the "IV-ORAL" arm and was treated accordingly.Row 2:Subject "790" was assigned to the "ORAL-IV" arm, but their actual treatment was "IV" then "ORAL".Row 3:Subject "791" was assigned to the "IV-ORAL" arm. Although they received only the first of the two planned treatment elements, they were following their assigned treatment when they withdrew early, so the actual arm variables are populated with the values for the arm to which they were assigned. dm.xpt RowUSUBJIDSUBJIDRFSTDTCRFENDTCSITEIDINVNAMBIRTHDTCAGEAGEUSEXRACEETHNICARMCDARMACTARMCDACTARMARMNRSACTARMUDCOUNTRY17890012006-06-032006-06-1701SMITH, J1948-12-1357YEARSMWHITEHISPANIC OR LATINOIOIV-ORALIOIV-ORAL Example The data below represent two subjects enrolled in a trial in which assignment to an arm occurs in two stages. See Example Trial 3 as described in Section 7.2.1, Trial Arms. In this trial, subjects were randomized at the beginning of the blinded treatment epoch, then assigned to treatment for the open treatment epoch according to their response to treatment in the blinded treatment epoch. See Demographics domain DM Example 6 for other examples of ARM and ARMCD values for this trial. In this trial, start of dosing was recorded as dates without times, so SESTDTC values include only dates. Epochs could not be assigned to observations that occurred on epoch transition dates on the basis of the SE dataset alone, so the sponsors algorithms for dealing with this ambiguity were documented in the Define-XML document. Rows 1-2:Show data for a subject who completed only two Elements of the trial.Rows 3-6:Show data for a subject who completed the trial, but received the wrong drug for the last 2 weeks of the double-blind treatment period. This has been represented by treating the period when the subject received the wrong drug as an unplanned Element. Note that TAETORD, which represents the planned order of Elements within an Arm, has not been populated for this unplanned Element. Even though this Element was unplanned, the sponsor assigned a value of BLINDED TREATMENT to EPOCH. se.xpt RowUSUBJIDSESEQETCDSESTDTCSEENDTCSEUPDESTAETORDEPOCH11231SCRN2006-06-012006-06-03 Row 1:Shows the record for a subject who was randomized to blinded treatment A, but withdrew from the trial before the open treatment epoch and did not have a second treatment assignment. They were thus incompletely assigned to an arm. The code used to represent this incomplete assignment, "A", is not in the Trial Arms table for this trial design, but is the first part of the codes for the two arms to which they could have been assigned ("AR" or "AO").Row 2:Shows the record for a subject who was randomized to blinded treatment A, but was erroneously treated with B for part of the blinded treatment epoch. ARM and ARMCD for this subject reflect their planned treatment and are not affected by the fact that their treatment deviated from plan. Their assignment to Rescue treatment for the open treatment epoch proceeded as planned. The sponsor decided that the subject's treatment, which consisted partly of Drug A and partly of Drug B, did not match any planned arm, so ACTARMCD and ACTARM were left null. ARMNRS was populated with "UNPLANNED TREATMENT" and the way in which this treatment was unplanned was described in ACTARMUD. dm.xpt RowUSUBJIDSUBJIDRFSTDTCRFENDTCSITEIDINVNAMBIRTHDTCAGEAGEUSEXRACEETHNICARMCDARMACTARMCDACTARMARMNRSACTARMUDCOUNTRY11230122006-06-032006-06-1001JONES, D1943-12-0862YEARSMASIANHISPANIC OR LATINOAAAA 5.4 Subject Disease MilestonesSM – Description/OverviewA special purpose domain that is designed to record the timing, for each subject, of disease milestones that have been defined in the Trial Disease Milestones (TM) domain. SM – Specificationsm.xpt, Subject Disease Milestones — Special Purpose, Version 1.0. One record per Disease Milestone per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). SM – Assumptions
SM – ExamplesExample In this study, the Disease Milestones of interest were initial diagnosis and hypoglycemic events, as shown in Section 7.3.3, Trial Disease Milestones, Example 1. Row 1:Shows that this subject's initial diagnosis of diabetes occurred in October of 2005. Since this is a partial date, SMDY is not populated. No end date/time was recorded for this Milestone.Rows 2-3:Show that this subject had two hypoglycemic events. In this case, only start date/times have been collected. Since these date/times include full dates, SMSTDY has been populated in each case.Row 4:Shows that this subject's initial diagnosis of diabetes occurred on May 15, 2010. Since a full date was collected, the study day of this Milestone was populated. Since diagnosis was pre-study, the study day of the Disease Milestone is negative. No hypoglycemic events were recorded for this subject. sm.xpt RowSTUDYIDDOMAINUSUBJIDSMSEQMIDSMIDSTYPESMSTDTCSMENDTCSMSTDYSMENDY1XYZSM0011DIAGDIAGNOSIS2005-10 Information in SM is taken from records in other domains. In this study, diagnosis was represented in the MH domain, and hyypoglycemic events were represented in the CE domain. The MH records for diabetes with MHEVDTYP = "DIAGNOSIS" are the records which represent the disease milestones for the defined MIDSTYPE of "DIAGNOSIS", so these records include the MIDS variable with the value "DIAG". Since these are records for disease milestones, rather than associated records, the variables RELMIDS and MIDSDTC are not needed. mh.xpt RowSTUDYIDDOMAINUSUBJIDMHSEQMHTERMMHDECODMHEVDTYPMHPRESPMHOCCURMHDTCMHSTDTCMHENDTCMHDYMIDS1XYZMH0011TYPE 2 DIABETESType 2 diabetes mellitusDIAGNOSISYY2013-08-062005-10 In this study, information about hypoglycemic events was collected in a separate CRF module, and CE records recorded in this module were represented with CECAT = "HYPOGLYCEMIC EVENT". Each CE record for a hypoglycemic event is a disease milestone, and records for a study have distinct values of MIDS. ce.xpt RowSTUDYIDDOMAINUSUBJIDCESEQCETERMCEDECODCECATCEPRESPCEOCCURCESTDTCCEENDTCMIDS1XYZCE0011HYPOGLYCEMIC EVENTHypoglycaemiaHYPOGLYCEMIC EVENTYY2013-09-01T11:002013-09-01T2:30HYPO12XYZCE0011HYPOGLYCEMIC EVENTHypoglycaemiaHYPOGLYCEMIC EVENTYY2013-09-24T8:482013-09-24T10:00HYPO2 5.5 Subject VisitsSV – Description/OverviewA special purpose domain that contains the actual start and end data/time for each visit of each individual subject. The Subject Visits domain consolidates information about the timing of subject visits that is otherwise spread over domains that include the visit variables (VISITNUM and possibly VISIT and/or VISITDY). Unless the beginning and end of each visit is collected, populating the Subject Visits dataset will involve derivations. In a simple case, where, for each subject visit, exactly one date appears in every such domain, the Subject Visits dataset can be created easily by populating both SVSTDTC and SVENDTC with the single date for a visit. When there are multiple dates and/or date/times for a visit for a particular subject, the derivation of values for SVSTDTC and SVENDTC may be more complex. The method for deriving these values should be consistent with the visit definitions in the Trial Visits (TV) dataset (Section 7.3.1, Trial Visits). For some studies, a visit may be defined to correspond with a clinic visit that occurs within one day, while for other studies, a visit may reflect data collection over a multi-day period. The Subject Visits dataset provides reviewers with a summary of a subject's visits. Comparison of an individual subject's SV dataset with the TV dataset, which describes the planned visits for the trial, quickly identifies missed visits and "extra" visits. Comparison of the values of STVSDY and SVENDY to VISIT and/or VISITDY can often highlight departures from the planned timing of visits. SV – Specificationsv.xpt, Subject Visits — Special Purpose, Version 3.2. One record per subject per actual visit, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar
Synonym Qualifier
TimingPlanned study day of the start of the visit based upon RFSTDTC in Demographics.PermSVSTDTCStart Date/Time of VisitCharISO 8601TimingStart date/time for a Visit.ExpSVENDTCEnd Date/Time of VisitCharISO 8601TimingEnd date/time of a Visit.ExpSVSTDYStudy Day of Start of VisitNum TimingStudy day of start of visit relative to the sponsor-defined RFSTDTC.PermSVENDYStudy Day of End of VisitNum TimingStudy day of end of visit relative to the sponsor-defined RFSTDTC.PermSVUPDESDescription of Unplanned VisitChar Synonym QualifierDescription of what happened to the subject during an unplanned visit.Perm ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). SV – Assumptions
SV – ExamplesExample The data below represents the visits for a single subject. Row 1:Data for the screening visit was gathered over the course of six days.Row 2:The visit called "DAY 1" started and ended as planned, on Day 1.Row 3:The visit scheduled for Day 8 occurred one day early, on Day 7.Row 4:The visit called "WEEK 2" started and ended as planned, on Day 15.Row 5:Shows an unscheduled visit. SVUPDES provides the information that this visit dealt with evaluation of an adverse event. Since this visit was not planned, VISITDY was not populated. The sponsor chose not to populate VISIT. VISITNUM was populated, probably because the data collected at this encounter is in a Findings domain such as EG, LB, or VS, in which VISIT is treated as an important timing variable.Row 6:This subject had their last visit, a follow-up visit on study day 26, eight days after the unscheduled visit, but well before the scheduled visit day of 71. sv.xpt RowSTUDYIDDOMAINUSUBJIDVISITNUMVISITVISITDYSVSTDTCSVENDTCSVSTDYSVENDYSVUPDES1123456SV1011SCREEN-72006-01-152006-01-20-6-1 6.1 Models for Interventions DomainsMost subject-level observations collected during the study should be represented according to one of the three SDTM general observation classes. This is the list of domains corresponding to the Interventions class. Domain CodeDomain DescriptionAG Procedure Agents An interventions domain that contains the agents administered to the subject as part of a procedure or assessment, as opposed to drugs, medications and therapies administered with therapeutic intent. CMConcomitant and Prior Medications An interventions domain that contains concomitant and prior medications used by the subject, such as those given on an as needed basis or condition-appropriate medications. EC and EXExposure Domains Exposure (EX) An interventions domain that contains the details of a subject's exposure to protocol-specified study treatment. Study treatment may be any intervention that is prospectively defined as a test material within a study, and is typically but not always supplied to the subject. Exposure as Collected (EC) An interventions domain that contains information about protocol-specified study treatment administrations, as collected. MLMeal Data Information regarding the subject's meal consumption, such as fluid intake, amounts, form (solid or liquid state), frequency, etc., typically used for pharmacokinetic analysis. PRProcedures An interventions domain that contains interventional activity intended to have diagnostic, preventive, therapeutic, or palliative effects. SUSubstance Use An interventions domain that contains substance use information that may be used to assess the efficacy and/or safety of therapies that look to mitigate the effects of chronic substance use. 6.1.1 Procedure AgentsAG – Description/OverviewAn interventions domain that contains the agents administered to the subject as part of a procedure or assessment, as opposed to drugs, medications and therapies administered with therapeutic intent. AG – Specificationag.xpt, Procedure Agents — Interventions, Version 1.0. One record per recorded intervention occurrence per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar
Timing
TimingPlanned study day of the visit based upon RFSTDTC in Demographics.PermTAETORDPlanned Order of Element within ArmNum TimingNumber that gives the planned order of the Element within the Arm for the Element in which the agent administration started.PermEPOCHEpochChar(EPOCH)TimingEpoch associated with the start date/time of the agent administration started.PermAGSTDTCStart Date/Time of AgentCharISO 8601TimingThe date/time when administration of the treatment indicated by AGTRT and the dosing variables began.PermAGENDTCEnd Date/Time of AgentCharISO 8601TimingThe date/time when administration of the treatment indicated by AGTRT and the dosing variables ended.PermAGSTDYStudy Day of Start of AgentNum TimingStudy day of start of agent relative to the sponsor-defined RFSTDTC.PermAGENDYStudy Day of End of AgentNum TimingStudy day of end of agent relative to the sponsor-defined RFSTDTC.PermAGDURDuration of AgentCharISO 8601TimingCollected duration for an agent episode. Used only if collected on the CRF and not derived from start and end date/times.PermAGSTRFStart Relative to Reference PeriodChar(STENRF)TimingDescribes the start of the agent relative to sponsor-defined reference period. The sponsor-defined reference period is a continuous period of time defined by a discrete starting point and a discrete ending point (represented by RFSTDTC and RFENDTC in Demographics). If information such as "PRIOR", "ONGOING", or "CONTINUING" was collected, this information may be translated into AGSTRF.PermAGENRFEnd Relative to Reference PeriodChar(STENRF)TimingDescribes the end of the agent relative to the sponsor-defined reference period. The sponsor-defined reference period is a continuous period of time defined by a discrete starting point and a discrete ending point (represented by RFSTDTC and RFENDTC in Demographics). If information such as "PRIOR", "ONGOING", or "CONTINUING" was collected, this information may be translated into AGENRF.PermAGSTRTPTStart Relative to Reference Time PointChar(STENRF)TimingIdentifies the start of the agent as being before or after the sponsor-defined reference time point defined by variable AGSTTPT.PermAGSTTPTStart Reference Time PointChar TimingDescription or date/time in ISO 8601 character format of the reference point referred to by AGSTRTPT. Examples: "2003-12-15" or "VISIT 1".PermAGENRTPTEnd Relative to Reference Time PointChar(STENRF)TimingIdentifies the end of the agent as being before or after the reference time point defined by variable AGENTPT. Identifies the end of the agent as being before or after the sponsor-defined reference time point defined by variable AGENTPT.PermAGENTPTEnd Reference Time PointChar TimingDescription or date/time in ISO 8601 character format of the reference point referred to by AGENRTPT. Examples: "2003-12-25" or "VISIT 2".Perm ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). AG – Assumptions
AG – ExamplesExample This example captures data about the allergen administered to the subject as part of a bronchial allergen challenge (BAC) test. Prior to the BAC, the subject had a skin-prick allergen test to help identify the allergen to be used for the BAC test. It identified grass as the allergen to be used in the BAC test. Data from the allergen skin test are not shown, but the CRF for the BAC includes collection of the allergen chosen for use in the BAC. A predetermined set of ascending doses of the chosen allergen was used in the screening BAC test. The results of the screening BAC are not shown, but would be represented in the RE domain. Row 1:The first dose given in the BAC was saline.Rows 2-4:Three successively higher doses of grass allergen were given. ag.xpt RowSTUDYIDDOMAINUSUBJIDAGSEQAGTRTAGPRESPAGOCCURAGDOSEAGDOSUAGROUTEVISITAGENDTC1XYZAGXYZ-001-0011SALINEYY0SQ-u/mLRESPIRATORY (INHALATION)SCREENING2010-11-07T10:56:002XYZAGXYZ-001-0012GRASSYY250SQ-u/mLRESPIRATORY (INHALATION)SCREENING2010-11-07T11:19:003XYZAGXYZ-001-0013GRASSYY1000SQ-u/mLRESPIRATORY (INHALATION)SCREENING2010-11-07T11:43:004XYZAGXYZ-001-0014GRASSYY2000SQ-u/mLRESPIRATORY (INHALATION)SCREENING2010-11-07T12:06:00 Example In this example, first there was a check that the subject had not taken a short-acting bronchodilator in the previous 4 hours (CM domain). Then the procedure agent (AG domain) was given as part of a reversibility assessment. Spirometry measurements (RE domain) were obtained before and after agent administration. An identifier was assigned to the reversibility test and this identifier was used to be link data across the multiple SDTM domains in which the data are represented. The question as to whether a short-acting bronchodilator was administered in the 4 hours prior to the reversibility assessment is represented in the Concomitant Medication (CM) domain, since this prior administration would have been for therapeutic effect, not as part of the procedure. The question asked was about the administration of any short-acting bronchodilator, rather than a specific medication, so both CMTRT and CMCAT are populated with the "SHORT-ACTING BRONCHODILATOR", which describes a group of medications. The CMSPID value RV1 was used to indicate that this question was associated with the reversibility test. cm.xpt RowSTUDYIDDOMAINUSUBJIDCMSEQCMSPIDCMTRTCMCATCMPRESPCMOCCURCMEVLINT1XYZCMXYZ-001-0011RV1SHORT-ACTING BRONCHODILATORSHORT-ACTING BRONCHODILATORYN-PT4H The administration of albuterol as part of the reversibility procedure is represented in the Procedure Agents (AG) domain. The AGSPID value RV1 was used to indicate that this administration was associated with the reversibility test. ag.xpt RowSTUDYIDDOMAINUSUBJIDAGSEQAGSPIDAGTRTAGPRESPAGOCCURAGDOSEAGDOSUAGDOSFRMAGDOSFRQAGROUTEVISITAGSTDTC1XYZAGXYZ-001-0011RV1ALBUTEROLYY2PUFFAEROSOLONCERESPIRATORY (INHALATION)VISIT 22013-06-18T10:05 The sponsor populated REGRPID with RV1 to indicate that these pulmonary function tests were associated with the reversibility test. The spirometer used in the testing is identified in SPDEVID. See the SDTM Implementation Guide for Medical Devices (SDTMIG-MD) for information about representing device-related information. Row 1:Shows the results for the pre-bronchodilator FEV1 test performed as part of a reversibility assessment. The timing reference variables RETPT, RETPTNUM, REELTM, RETPTREF, and RERFTDTC show that this test was performed 5 minutes before the bronchodilator challenge.Row 2:Shows the results for FEV1 test performed 20 minutes after the bronchodilator challenge.Row 3:Since the percentage reversibility was collected on the CRF, it is included in the SDTM dataset. re.xpt RowSTUDYIDDOMAINUSUBJIDSPDEVIDRESEQREGRPIDRETESTCDRETESTREORRESREORRESURESTRESCRESTRESNRESTRESUVISITREDTCRETPTRETPTNUMREELTMRETPTREFRERFTDTC1XYZREXYZ-001-001ABC0011RV1FEV1Forced Expiratory Volume in 1 Second2.43L2.432.43LVISIT 22013-06-18T10:00PRE-BRONCHODILATOR ADMINISTRATION1-PT5MBRONCHODILATOR ADMINISTRATION2013-06-18T10:052XYZREXYZ-001-001ABC0012RV1FEV1Forced Expiratory Volume in 1 Second2.77L2.772.77LVISIT 22013-06-18T10:00POST-BRONCHODILATOR ADMINISTRATION2PT20MBRONCHODILATOR ADMINISTRATION2013-06-18T10:053XYZREXYZ-001-001ABC0013RV1PTCREVPercentage Reversibility13.99%13.9913.99%VISIT 22013-06-18T10:00 The identifier for the device used in the test was established in the Device Identifier (DI) domain. di.xpt RowSTUDYIDDOMAINSPDEVIDDISEQDIPARMCDDIPARMDIVAL1XYZDIABC0011TYPEDevice TypeSPIROMETER The relationship of the test agent to the spirometry measurements obtained before and after its administration and to the prior occurrence of short acting bronchodilator administration is recorded by means of a relationship in RELREC. relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1XYZAGXYZ-001-001AGSPID1 6.1.2 Concomitant and Prior MedicationsCM – Description/OverviewAn interventions domain that contains concomitant and prior medications used by the subject, such as those given on an as needed basis or condition-appropriate medications. CM – Specificationcm.xpt, Concomitant/Prior Medications — Interventions, Version 3.3. One record per recorded intervention occurrence or constant-dosing interval per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar Describes the start of the medication relative to sponsor-defined reference period. The sponsor-defined reference period is a continuous period of time defined by a discrete starting point and a discrete ending point (represented by RFSTDTC and RFENDTC in Demographics). If information such as "PRIOR" was collected, this information may be translated into CMSTRF. Not all values of the codelist are allowable for this variable. See Section 4.4.7, Use of Relative Timing Variables. PermCMENRFEnd Relative to Reference PeriodChar(STENRF)TimingDescribes the end of the medication relative to the sponsor-defined reference period. The sponsor-defined reference period is a continuous period of time defined by a discrete starting point and a discrete ending point (represented by RFSTDTC and RFENDTC in Demographics). If information such as "PRIOR", "ONGOING, or "CONTINUING" was collected, this information may be translated into CMENRF. Not all values of the codelist are allowable for this variable. See Section 4.4.7, Use of Relative Timing Variables. PermCMSTRTPTStart Relative to Reference Time PointChar(STENRF)TimingIdentifies the start of the medication as being before or after the sponsor-defined reference time point defined by variable CMSTTPT. Not all values of the codelist are allowable for this variable. See Section 4.4.7, Use of Relative Timing Variables. PermCMSTTPTStart Reference Time PointCharTimingDescription or date/time in ISO 8601 character format of the sponsor-defined reference point referred to by CMSTRTPT. Examples: "2003-12-15" or "VISIT 1".PermCMENRTPTEnd Relative to Reference Time PointChar(STENRF)Timing Identifies the end of the medication as being before or after the sponsor-defined reference time point defined by variable CMENTPT. Not all values of the codelist are allowable for this variable. See Section 4.4.7, Use of Relative Timing Variables. PermCMENTPTEnd Reference Time PointCharTimingDescription or date/time in ISO 8601 character format of the sponsor-defined reference point referred to by CMENRTPT. Examples: "2003-12-25" or "VISIT 2".Perm ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). CM – Assumptions
CM – ExamplesExample Sponsors collect the timing of concomitant medication use with varying specificity, depending on the pattern of use; the type, purpose, and importance of the medication; and the needs of the study. It is often unnecessary to record every unique instance of medication use, since the same information can be conveyed with start and end dates and frequency of use. If appropriate, medications taken as needed (intermittently or sporadically over a time period) may be reported with a start and end date and a frequency of "PRN". The example below shows three subjects who took the same medication on the same day. Rows 1-6:For this subject, each instance of aspirin use was recorded separately, and the frequency in each record is (CMDOSFRQ) is "ONCE".Rows 7-9:For a second subject, frequency was once a day ("QD") in their first and third records (where CMSEQ is "1" and "3"), but twice a day in their second record (CMSEQ = "2").Row 10:Records for the third subject are collapsed into a single entry that spans the relevant time period, with a frequency of "PRN". This is shown as an example only, not as a recommendation. This approach assumes that knowing exactly when aspirin was used is not important for evaluating safety and efficacy in this study. cm.xpt RowSTUDYIDDOMAINUSUBJIDCMSEQCMTRTCMDOSECMDOSUCMDOSFRQCMSTDTCCMENDTC1ABCCMABC-00011ASPIRIN100mgONCE2004-01-012004-01-012ABCCMABC-00012ASPIRIN100mgONCE2004-01-022004-01-023ABCCMABC-00013ASPIRIN100mgONCE2004-01-032004-01-034ABCCMABC-00014ASPIRIN100mgONCE2004-01-072004-01-075ABCCMABC-00015ASPIRIN100mgONCE2004-01-072004-01-076ABCCMABC-00016ASPIRIN100mgONCE2004-01-092004-01-097ABCCMABC-00021ASPIRIN100mgQD2004-01-012004-01-038ABCCMABC-00022ASPIRIN100mgBID2004-01-072004-01-079ABCCMABC-00023ASPIRIN100mgQD2004-01-092004-01-0910ABCCMABC-00031ASPIRIN100mgPRN2004-01-012004-01-09 Example The example below is for a study that had a particular interest in whether subjects use any anticonvulsant medications. The medication history, dosing, etc., was not of interest; the study only asked for the anticonvulsants to which subjects were exposed. cm.xpt RowSTUDYIDDOMAINUSUBJIDCMSEQCMTRTCMCAT1ABC123CM11LITHIUMANTI-CONVULSANT2ABC123CM21VPAANTI-CONVULSANT Example Sponsors often are interested in whether subjects are exposed to specific concomitant medications, and collect this information using a checklist. This example is for a study that had a particular interest in the antidepressant medications that subjects used. For the study's purposes, absence is just as important as presence of a medication. This can be clearly shown using CMOCCUR. In this example, CMPRESP shows that the subjects were specifically asked if they use any of three antidepressants (Zoloft, Prozac, and Paxil). The value of CMOCCUR indicates the response to the pre-specified medication question. CMSTAT indicates whether the response was missing for a pre-specified medication, and CMREASND shows the reason for missing response. The medication details (e.g., dose, frequency) were not of interest in this study. Row 1:Medication use was solicited and the medication was taken.Row 2:Medication use was solicited and the medication was not taken.Row 3:Medication use was solicited, but data was not collected. The reason for the lack of a response was collected and is represented in CMREASND. cm.xpt RowSTUDYIDDOMAINUSUBJIDCMSEQCMTRTCMPRESPCMOCCURCMSTATCMREASND1ABC123CM11ZOLOFTYY Example In this hepatitis C study, collection of data on prior treatments included reason for discontinuation. Since hepatitis C is usually treated with a combinations of medications, CMGRPID was used to group records into regimens. Rows 1-3:This subject's treatment consisted of the three medications grouped by means of CMGRPID = "1". The subject completed the scheduled treatment.Rows 4-6:Another subject received the same set of three medications. The medications for this subject are also grouped using CMGRPID = "1". Note, however, that the fact that the same CMGRPID value has been used for the same set of medications for subjects "ABC123-765" and "ABC123-899" is coincidence; CMGRPID groups records only within a subject. This subject stopped the regimen due to side effects. cm.xpt RowSTUDYIDDOMAINUSUBJIDCMSEQCMGRPIDCMTRTCMCATCMDOSFRMCMROUTECMRSDISC1ABC123CMABC123-76511PEGINTRONHCV TREATMENTINJECTIONSUBCUTANEOUSCOMPLETED SCHEDULED TREATMENT2ABC123CMABC123-76521RIBAVIRINHCV TREATMENTTABLETORALCOMPLETED SCHEDULED TREATMENT3ABC123CMABC123-76531BOCEPREVIRHCV TREATMENTTABLETORALCOMPLETED SCHEDULED TREATMENT4ABC123CMABC123-89911PEGINTRONHCV TREATMENTINJECTIONSUBCUTANEOUSTOXICITY/INTOLERANCE5ABC123CMABC123-89921RIBAVIRINHCV TREATMENTTABLETORALTOXICITY/INTOLERANCE6ABC123CMABC123-89931BOCEPREVIRHCV TREATMENTTABLETORALTOXICITY/INTOLERANCE 6.1.3 Exposure DomainsClinical trial study designs can range from open label (where subjects and investigators know which product each subject is receiving) to blinded (where the subject, investigator, or anyone assessing the outcome is unaware of the treatment assignment(s) to reduce potential for bias). To support standardization of various collection methods and details, as well as process differences between open-label and blinded studies, two SDTM domains based on the Interventions General Observation Class are available to represent details of subject exposure to protocol-specified study treatment(s). The two domains are introduced below. 6.1.3.1 ExposureEX – Description/OverviewAn interventions domain that contains the details of a subject's exposure to protocol-specified study treatment. Study treatment may be any intervention that is prospectively defined as a test material within a study, and is typically but not always supplied to the subject. EX – Specificationex.xpt, Exposure — Interventions, Version 3.3. One record per protocol-specified study treatment, constant-dosing interval, per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar
TimingNumerical version of EXTPT to aid in sorting.PermEXELTMPlanned Elapsed Time from Time Point RefCharISO 8601TimingPlanned elapsed time relative to the planned fixed reference (EXTPTREF). This variable is useful where there are repetitive measures. Not a clock time.PermEXTPTREFTime Point ReferenceChar TimingName of the fixed reference point referred to by EXELTM, EXTPTNUM, and EXTPT. Examples: PREVIOUS DOSE, PREVIOUS MEAL.PermEXRFTDTCDate/Time of Reference Time PointCharISO 8601TimingDate/time for a fixed reference time point defined by EXTPTREF.Perm ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). EX – Assumptions
6.1.3.2 Exposure as CollectedEC – Description/OverviewAn interventions domain that contains information about protocol-specified study treatment administrations, as collected. EC – Specificationec.xpt, Exposure as Collected — Interventions, Version 3.3. One record per protocol-specified study treatment, constant-dosing interval, per subject, per mood, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). EC – Assumptions
6.1.3.3 Exposure/Exposure as Collected ExamplesExample This is an example of a double-blind study comparing Drug X extended release (ER) (two 500-mg tablets once daily) vs. Drug Z (two 250-mg tablets once daily). Per example CRFs, Subject ABC1001 took 2 tablets from 2011-01-14 to 2011-01-28 and Subject ABC2001 took 2 tablets within the same timeframe but missed dosing on 2011-01-24. Exposure CRF: Subject: ABC1001 BottleNumber of Tablets Taken DailyReason for VariationStart DateEnd DateA2 Subject: ABC2001 BottleNumber of Tablets Taken DailyReason for VariationStart DateEnd DateeA2 Upon unmasking, it became known that Subject ABC1001 received Drug X and Subject ABC2001 received Drug Z. The EC dataset shows the administrations of study treatment as collected. Rows 1-2, 4:Show treatments administered.Row 3:Shows that the zero for Number of Tablets Taken Daily on the CRF was represented as ECOCCUR = "N". ec.xpt RowSTUDYIDDOMAINUSUBJIDECSEQECLNKIDECTRTECPRESPECOCCURECDOSEECDOSUECDOSFRQEPOCHECSTDTCECENDTCECSTDYECENDY1ABCECABC10011A2-20110114BOTTLE AYY2TABLETQDTREATMENT2011-01-142011-01-281152ABCECABC20011A2-20110114BOTTLE AYY2TABLETQDTREATMENT2011-01-142011-01-231103ABCECABC20012A0-20110124BOTTLE AYN The reason for the ECOCCUR value of "N" was represented using a supplemental qualiifier. suppec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIGQEVAL1ABCECABC2001ECSEQ2ECREASOCReason for Occur ValuePATIENT MISTAKECRF The EX dataset shows the unmasked administrations. Two tablets from Bottle A became 1000 mg of Drug X extended release for Subject ABC1001, but 500 mg of Drug Z for Subject ABC2001. Note that there is no record in the EX dataset for non-occurrence of study treatment. The non-occurrence of study drug for subject ABC2001 is reflected in the gap in time between the two EX records. ex.xpt RowSTUDYIDDOMAINUSUBJIDEXSEQEXLNKIDEXTRTEXDOSEEXDOSUEXDOSFRMEXDOSFRQEXROUTEEPOCHEXSTDTCEXENDTCEXSTDYEXENDY1ABCEXABC10011A2-20110114DRUG X1000mgTABLET, EXTENDED RELEASEQDORALTREATMENT2011-01-142011-01-281152ABCEXABC20011A2-20110114DRUG Z500mgTABLETQDORALTREATMENT2011-01-142011-01-231103ABCEXABC20012A2-20110125DRUG Z500mgTABLETQDORALTREATMENT2011-01-252011-01-281215 The relrec.xpt example reflects a one-to-one dataset-level relationship between EC and EX using --LNKID. relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABCEC Example This example shows data from an open-label study. A subject received Drug X as a 20 mg/mL solution administered across 3 injection sites to deliver a total dose of 3 mg/kg. The subject's weight was 100 kg. Exposure CRF Visit3Date2009-05-10Injection 1 The collected administration amounts, in mL, and their locations are represented in the EC dataset. ec.xpt RowSTUDYIDDOMAINUSUBJIDECSEQECSPIDECLNKIDECTRTECPRESPECOCCURECDOSEECDOSUECDOSFRMECDOSFRQECROUTEECLOCECLATVISITNUMVISITEPOCHECSTDTCECENDTCECSTDYECENDY1ABCECABC30011INJ1V3DRUG XYY5mLINJECTIONONCESUBCUTANEOUSABDOMENLEFT3VISIT 3TREATMENT2009-05-102009-05-1021212ABCECABC30012INJ2V3DRUG XYY5mLINJECTIONONCESUBCUTANEOUSABDOMENCENTER3VISIT 3TREATMENT2009-05-102009-05-1021213ABCECABC30013INJ3V3DRUG XYY5mLINJECTIONONCESUBCUTANEOUSABDOMENRIGHT3VISIT 3TREATMENT2009-05-102009-05-102121 The sponsor considered the 3 injections to constitute a single administration, so the EX dataset shows the total dose given in the protocol-specified unit, mg/kg. EXLOC = "ABDOMEN" is included since this location was common to all injections, but EXLAT was not included. If the sponsor had chosen to represent laterality in the EX record, this would have been handled as described in Section 4.2.8.3, Multiple Values for a Non-Result Qualifier Variable ex.xpt RowSTUDYIDDOMAINUSUBJIDEXSEQEXSPIDEXLNKIDEXTRTEXDOSEEXDOSUEXDOSFRMEXDOSFRQEXROUTEEXLOCVISITNUMVISITEPOCHEXSTDTCEXENDTCEXSTDYEXENDY1ABCEXABC30011 The relrec.xpt example reflects a many-to-one dataset-level relationship between EC and EX using --LNKID. relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABCEC Example The study in this example was a double-blind study comparing 10, 20, and 30 mg of Drug X once daily vs Placebo. Study treatment was given as one tablet each from Bottles A, B, and C taken together once daily. The subject in this example took:
The EC dataset shows administrations as collected, in tablets. ec.xpt RowSTUDYIDDOMAINUSUBJIDECSEQECTRTECPRESPECOCCURECDOSEECDOSUECDOSFRQEPOCHECSTDTCECENDTCECSTDYECENDY1ABCECABC40011BOTTLE AYY1TABLETQDTREATMENT2011-01-142011-01-281152ABCECABC40012BOTTLE CYY1TABLETQDTREATMENT2011-01-142011-01-281153ABCECABC40013BOTTLE BYY1TABLETQDTREATMENT2011-01-142011-01-20174ABCECABC40014BOTTLE BYN Upon unmasking, it became known that the subject was randomized to Drug X 20 mg and that:
The EX dataset shows the doses administered in the protocol-specified unit (mg). The sponsor considered an administration to consist of the total amount for Bottles A, B, and C. The derivation of EX records from multiple EC records should be shown in the Define-XML document. ex.xpt RowSTUDYIDDOMAINUSUBJIDEXSEQEXTRTEXDOSEEXDOSUEXDOSFRMEXDOSFRQEXROUTEEPOCHEXSTDTCEXENDTCEXSTDYEXENDY1ABCEXABC40011DRUG X20mgTABLETQDORALTREATMENT2011-01-142011-01-20172ABCEXABC40012DRUG X10mgTABLETQDORALTREATMENT2011-01-212011-01-21883ABCEXABC40013DRUG X30mgTABLETQDORALTREATMENT2011-01-222011-01-22994ABCEXABC40014DRUG X20mgTABLETQDORALTREATMENT2011-01-232011-01-281015 Example The study in this example was an open-label study examining the tolerability of different doses of Drug A. Study drug was taken orally, daily for three months. Dose adjustments were allowed as needed in response to tolerability or efficacy issues. The EX dataset shows administrations collected in the protocol-specified unit, mg. No EC dataset was needed since the open-label administrations were collected in the protocol-specified unit; EC would be an exact duplicate of the entire EX domain. ex.xpt RowSTUDYIDDOMAINUSUBJIDEXSEQEXTRTEXDOSEEXDOSUEXDOSFRMEXDOSFRQEXROUTEEXADJEPOCHEXSTDTCEXENDTC137841EX378410011DRUG A20mgTABLETQDORAL Example This is an example of a double-blind study design comparing 10 and 20 mg of Drug X vs Placebo taken daily, morning and evening, for a week. Subject ABC5001 BottleTime PointNumber of Tablets TakenStart DateEnd DateAAM12012-01-012012-01-08BPM12012-01-012012-01-08 Subject ABC5002 BottleTime PointNumber of Tablets TakenStart DateEnd DateAAM12012-02-012012-02-08BPM12012-02-012012-02-08 Subject ABC5003 BottleTime PointNumber of Tablets TakenStart DateEnd DateAAM12012-03-012012-03-08BPM12012-03-012012-03-08 The EC dataset shows the administrations as collected. The time point variables ECTPT and ECTPTNUM were used to describe the time of day of administration. This use of time point variables is novel, since it represents data about multiple time points, one on each day of administration, rather than data for a single time point. ec.xpt RowSTUDYIDDOMAINUSUBJIDECSEQECLNKIDECTRTECPRESPECOCCURECDOSEECDOSUECDOSFRQEPOCHECSTDTCECENDTCECSTDYECENDYECTPTECTPTNUM1ABCECABC5001120120101-20120108-AMBOTTLE AYY1TABLETQDTREATMENT2012-01-012012-01-0818AM12ABCECABC5001220120101-20120108-PMBOTTLE BYY1TABLETQDTREATMENT2012-01-012012-01-0818PM23ABCECABC5002120120201-20120208-AMBOTTLE AYY1TABLETQDTREATMENT2012-02-012012-02-0818AM14ABCECABC5002220120201-20120208-PMBOTTLE BYY1TABLETQDTREATMENT2012-02-012012-02-0818PM25ABCECABC5003120120301-20120308-AMBOTTLE AYY1TABLETQDTREATMENT2012-03-012012-03-0818AM16ABCECABC5003220120301-20120308-PMBOTTLE BYY1TABLETQDTREATMENT2012-03-012012-03-0818PM2 The EX dataset shows the unmasked administrations in the protocol specified unit, mg. Amounts of placebo was represented as 0 mg. The sponsor chose to represent the administrations at the time point level. Rows 1-2:Show administrations for a subject who was randomized to the 20 mg Drug X arm.Rows 3-4:Show administrations for a subject who was randomized to the 10 mg Drug X arm.Rows 5-6:Show administrations for a subject who was randomized to the Placebo arm. ex.xpt RowSTUDYIDDOMAINUSUBJIDEXSEQEXLNKIDEXTRTEXDOSEEXDOSUEXDOSFRMEXDOSFRQEXROUTEEPOCHEXSTDTCEXENDTCEXSTDYEXENDYEXTPTEXTPTNUM1ABCEXABC5001120120101-20120108-AMDRUG X10mgTABLETQDORALTREATMENT2012-01-012012-01-0818AM12ABCEXABC5001220120101-20120108-PMDRUG X10mgTABLETQDORALTREATMENT2012-01-012012-01-0818PM23ABCEXABC5002120120201-20120208-AMDRUG X10mgTABLETQDORALTREATMENT2012-02-012012-02-0818AM14ABCEXABC5002220120201-20120208-PMPLACEBO0mgTABLETQDORALTREATMENT2012-02-012012-02-0818PM25ABCEXABC5003120120301-20120308-AMPLACEBO0mgTABLETQDORALTREATMENT2012-03-012012-03-0818AM16ABCEXABC5003220120301-20120308-PMPLACEBO0mgTABLETQDORALTREATMENT2012-03-012012-03-0818PM2 The relrec.xpt example reflects a one-to-one dataset-level relationship between EC and EX using --LNKID. relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABCEC Example The study in this example was a single-crossover study comparing once daily oral administration of Drug A 20 mg capsules with Drug B 30 mg coated tablets. Study drug was taken for 3 consecutive mornings, 30 minutes prior to a standardized breakfast. There was a 6-day washout period between treatments. The following CRFs show data for two subjects. Subject 56789001 Period 1Period 2DayBottle 1 Subject 56789003 Period 1Period 2DayBottle 1 The EC dataset shows administrations as collected. ec.xpt RowSTUDYIDDOMAINUSUBJIDECSEQECTRTECPRESPECOCCURECDOSEECDOSUECDOSFRMECDOSFRQECROUTEEPOCHECSTDTCECENDTCECSTDYECENDYECTPTECELTMECTPTREF156789EC567890011BOTTLE 1YY1CAPSULECAPSULEQDORALTREATMENT 12002-07-01T07:302002-07-01T07:301130 MINUTES PRIOR-PT30MSTD BREAKFAST256789EC567890012BOTTLE 2YY1TABLET, COATEDTABLET, COATEDQDORALTREATMENT 12002-07-01T07:302002-07-01T07:301130 MINUTES PRIOR-PT30MSTD BREAKFAST356789EC567890013BOTTLE 1YY1CAPSULECAPSULEQDORALTREATMENT 12002-07-02T07:302002-07-02T07:302230 MINUTES PRIOR-PT30MSTD BREAKFAST456789EC567890014BOTTLE 2YY1TABLET, COATEDTABLET, COATEDQDORALTREATMENT 12002-07-02T07:302002-07-02T07:302230 MINUTES PRIOR-PT30MSTD BREAKFAST556789EC567890015BOTTLE 1YY1CAPSULECAPSULEQDORALTREATMENT 12002-07-03T07:322002-07-03T07:323330 MINUTES PRIOR-PT30MSTD BREAKFAST656789EC567890016BOTTLE 2YY1TABLET, COATEDTABLET, COATEDQDORALTREATMENT 12002-07-03T07:322002-07-03T07:323330 MINUTES PRIOR-PT30MSTD BREAKFAST756789EC567890017BOTTLE 1YY1CAPSULECAPSULEQDORALTREATMENT 22002-07-09T07:302002-07-09T07:309930 MINUTES PRIOR-PT30MSTD BREAKFAST856789EC567890018BOTTLE 2YY1TABLET, COATEDTABLET, COATEDQDORALTREATMENT 22002-07-09T07:302002-07-09T07:309930 MINUTES PRIOR-PT30MSTD BREAKFAST956789EC567890019BOTTLE 1YY1CAPSULECAPSULEQDORALTREATMENT 22002-07-10T07:302002-07-10T07:30101030 MINUTES PRIOR-PT30MSTD BREAKFAST1056789EC5678900110BOTTLE 2YY1TABLET, COATEDTABLET, COATEDQDORALTREATMENT 22002-07-10T07:302002-07-10T07:30101030 MINUTES PRIOR-PT30MSTD BREAKFAST1156789EC5678900111BOTTLE 1YY1CAPSULECAPSULEQDORALTREATMENT 22002-07-11T07:342002-07-11T07:34111130 MINUTES PRIOR-PT30MSTD BREAKFAST1256789EC5678900112BOTTLE 2YY1TABLET, COATEDTABLET, COATEDQDORALTREATMENT 22002-07-11T07:342002-07-11T07:34111130 MINUTES PRIOR-PT30MSTD BREAKFAST1356789EC567890031BOTTLE 1YY1CAPSULECAPSULEQDORALTREATMENT 12002-07-03T07:302002-07-03T07:301130 MINUTES PRIOR-PT30MSTD BREAKFAST1456789EC567890032BOTTLE 2YY1TABLET, COATEDTABLET, COATEDQDORALTREATMENT 12002-07-03T07:302002-07-03T07:301130 MINUTES PRIOR-PT30MSTD BREAKFAST1556789EC567890033BOTTLE 1YY1CAPSULECAPSULEQDORALTREATMENT 12002-07-04T07:242002-07-04T07:242230 MINUTES PRIOR-PT30MSTD BREAKFAST1656789EC567890034BOTTLE 2YY1TABLET, COATEDTABLET, COATEDQDORALTREATMENT 12002-07-04T07:242002-07-04T07:242230 MINUTES PRIOR-PT30MSTD BREAKFAST1756789EC567890035BOTTLE 1YY1CAPSULECAPSULEQDORALTREATMENT 12002-07-05T07:242002-07-05T07:243330 MINUTES PRIOR-PT30MSTD BREAKFAST1856789EC567890036BOTTLE 2YY1TABLET, COATEDTABLET, COATEDQDORALTREATMENT 12002-07-05T07:242002-07-05T07:243330 MINUTES PRIOR-PT30MSTD BREAKFAST1956789EC567890037BOTTLE 1YY1CAPSULECAPSULEQDORALTREATMENT 22002-07-11T07:302002-07-11T07:309930 MINUTES PRIOR-PT30MSTD BREAKFAST2056789EC567890038BOTTLE 2YY1TABLET, COATEDTABLET, COATEDQDORALTREATMENT 22002-07-11T07:302002-07-11T07:309930 MINUTES PRIOR-PT30MSTD BREAKFAST2156789EC567890039BOTTLE 1YY1CAPSULECAPSULEQDORALTREATMENT 22002-07-12T07:432002-07-12T07:43101030 MINUTES PRIOR-PT30MSTD BREAKFAST2256789EC5678900310BOTTLE 2YY1TABLET, COATEDTABLET, COATEDQDORALTREATMENT 22002-07-12T07:432002-07-12T07:43101030 MINUTES PRIOR-PT30MSTD BREAKFAST2356789EC5678900311BOTTLE 1YY1CAPSULECAPSULEQDORALTREATMENT 22002-07-13T07:382002-07-13T07:38111130 MINUTES PRIOR-PT30MSTD BREAKFAST2456789EC5678900312BOTTLE 2YY1TABLET, COATEDTABLET, COATEDQDORALTREATMENT 22002-07-13T07:382002-07-13T07:38111130 MINUTES PRIOR-PT30MSTD BREAKFAST The EX dataset shows the unblinded administrations. Rows 1-12:Unblinding revealed that the first subject received placebo coated tablets during the first treatment epoch and placebo capsules during the second treatment epoch.Rows 13-24:Unblinding revealed that the second subject received placebo capsules during the first treatment epoch and placebo coated tablets during the second treatment epoch. ex.xpt RowSTUDYIDDOMAINUSUBJIDEXSEQEXTRTEXDOSEEXDOSUEXDOSFRMEXDOSFRQEXROUTEEPOCHEXSTDTCEXENDTCEXSTDYEXENDYEXTPTEXELTMEXTPTREF156789EX567890011DRUG A20mgCAPSULEQDORALTREATMENT 12002-07-01T07:30 Example The study in this example involved weekly infusions of Drug Z 10 mg/kg. If a subject experienced a dose-limiting toxicity (DLT), the intended dose could be reduced to 7.5 mg/kg. The example CRF below was for Subject ABC123-0201, who weighed 55 kg. The CRF shows that:
Visit123Intended Dose
If no, give reason:
If no, give reason:
If no, give reason:
End Time (24 hour clock)10:4511:20 Amount (mL)99 mL35 mL0 mLConcentration5.5 mg/mL4.12 mg/mL4.12 mg/mLIf dose was adjusted, what was the reason:
The EC dataset shows both intended and actual doses of Drug Z, as collected. Rows 1, 3, 5:Show the collected intended dose levels (mg/kg) and ECMOOD is "SCHEDULED". Scheduled dose is represented in mg/ML.Rows 2, 4, 6:Show the collected actual administration amounts (mL) and ECMOOD is "PERFORMED". Actual doses are represented using dose in mL and concentration (pharmaceutical strength) in mg/mL. ec.xpt RowSTUDYIDDOMAINUSUBJIDECSEQECLNKIDECLNKGRPECTRTECMOODECPRESPECOCCURECDOSEECDOSUECPSTRGECPSTRGUECADJVISITNUMVISITEPOCHECSTDTCECENDTCECSTDYECENDY1ABC123ECABC123-02011 The reason that ECOCCUR was "N" was represented in a supplemental qualifier. suppec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIGQEVAL1ABCECABC123-0201ECSEQ6ECREASOCReason for Occur ValuePERSONAL REASONCRF The EX dataset shows the administrations in protocol-specified unit (mg/kg). There is no record for the intended third dose that was not given. Intended doses in EC (records with EXMOOD = "SCHEDULED") can be compared with actual doses in EX. Row 1:Shows the subject's first dose.Row 2:Shows the subject's second dose. The collected explanation for the adjusted dose amount administered at Visit 2 is in EXADJ. ex.xpt RowSTUDYIDDOMAINUSUBJIDEXSEQEXLNKIDEXLNKGRPEXTRTEXDOSEEXDOSUEXDOSFRMEXDOSFRQEXROUTEEXADJVISITNUMVISITEPOCHEXSTDTCEXENDTCEXSTDYEXENDY1ABC123EXABC123-0201120090213T1000V1DRUG Z9.9mg/kgSOLUTIONCONTINUOUSINTRAVENOUS The sponsor wished to represent the doses in mg, as well as in mg/kg. Since a dose includes both a numeric value and a unit, the data could not be represented in a supplemental qualifier, so was represented in an FA dataset. See Section 6.4.1, When to Use Findings About. fa.xpt RowSTUDYIDDOMAINUSUBJIDFASEQFALNKIDFATESTCDFATESTFAOBJFAORRESFAORRESUFASTRESCFASTRESNFASTRESUVISITNUMVISITEPOCH1ABC123FAABC123-0201120090213T1000DOSEALTDose in Alternative UnitDRUG Z522.5mg522.5522.5mg1VISIT 1TREATMENT2ABC123FAABC123-0201220090220T1100DOSEALTDose in Alternative UnitDRUG Z144.2mg144.2144.2mg2VISIT 2TREATMENT The RELREC dataset represents relationships between EC, EX, and FA. Rows 1-2:Represent the one-to-one relationship between "PERFORMED" records in EC and records in EX, using --LNKID.Rows 3-4:Represent the many-to-one relationship between records (both "SCHEDULED" and "PERFORMED") in EC and records in EX, using --LNGRP.Rows 5-6:Represent the one-to-one relationship between records in EX and records in FA, using LNKID. relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABC123EC Example In this example, a 100 mg tablet is scheduled to be taken daily. Start and end of dosing were collected,along with deviations from the planned daily dosing. Note: This method of data collection design is not consistent with current CDASH standards. First Dose DateLast Dose Date2012-01-132012-01-20 DateNumber of Doses Daily The EC dataset shows administrations as collected. Row 1:Shows the overall dosing interval from first dose date to last dose date.Row 2:Shows the missed dose on 2012-01-15, which falls within the overall dosing interval.Row 3:Shows a doubled dose on 2012-01-16, which also falls within the overall dosing interval. ec.xpt RowSTUDYIDDOMAINUSUBJIDECSEQECTRTECCATECPRESPECOCCURECDOSEECDOSUECDOSFRQEPOCHECSTDTCECENDTCECSTDYECENDY1ABCECABC70011BOTTLE AFIRST TO LAST DOSE INTERVALYY1TABLETQDTREATMENT2012-01-132012-01-20182ABCECABC70012BOTTLE AEXCEPTION DOSEYN The EX dataset shows the unmasked treatment for this subject, "DRUG X", and represents dosing in non-overlapping intervals of time. There is no EX record for the missed dose, but the missed dose is reflected in a gap between dates in the EX records. Row 1:Shows the administration from first dose date to the day before the missed dose.Row 2:Shows the doubled dose.Row 3:Shows the remaining administrations to the last dose date. ex.xpt RowSTUDYIDDOMAINUSUBJIDEXSEQEXTRTEXDOSEEXDOSUEXDOSFRMEXDOSFRQEXROUTEEPOCHEXSTDTCEXENDTCEXSTDYEXENDY1ABCEXABC70011DRUG X100mgTABLETQDORALTREATMENT2012-01-132012-01-14122ABCEXABC70012DRUG X200mgTABLETQDORALTREATMENT2012-01-162012-01-16443ABCEXABC70013DRUG X100mgTABLETQDORALTREATMENT2012-01-172012-01-2058 6.1.4 Meal DataML – Description/OverviewInformation regarding the subject's meal consumption, such as fluid intake, amounts, form (solid or liquid state), frequency, etc., typically used for pharmacokinetic analysis. ML – Specificationml.xpt, Meal Data — Interventions, Version 1.0. One record per food product occurrence or constant intake interval per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). ML – Assumptions
ML – ExamplesExample This example shows meal data collected in an effort to understand the causes of two different kinds of event.
Meal Log CRF Record the last type of meal/food consumption prior to the hypoglycemic event: TypeIf Nutritional Drink, volume (ounces)Start DateStart TimeEvent IDX SnackNutritional drinkMeal DILI Meal CRF If suspected DILI, did you consume any of the following in the past week? TypeOccurrenceIf yes, DateWild mushroomsX YesNo2015 DEC 24Ackee fruitYesX No Note that in this example MLENDTC is null. Since no end date was collected, the meal was represented as a point-in-time event, as described in Assumption 2b. Rows 1-3:Show the last meal data for three hypoglycemic events.Rows 4-6:Show the meal data collected relative to the suspected DILI. ml.xpt RowSTUDYIDDOMAINUSUBJIDMLSEQMLTRTMLCATMLPRESPMLOCCURMLDOSEMLDOSUMLDTCMLSTDTCMLENDTCMLEVLINTRELMIDSMIDSMIDSDTC1XYZMLXYZ-001-0011SNACKHYPOGLYCEMIA EVALUATIONYY Example This example describes a study that examines the impact of physical modifications in a cafeteria on selection/consumption among school students. GroupArmsDetails1ControlStudents received standard meals in a standard cafeteria environment.2Experimental: choice architectureStudents were exposed to modifications to the physical environment in the cafeteria to "nudge" students towards healthier choices. Physical modifications included:
Food-card data was collected over a 7-month period by students receiving a school meal one day week. Students who brought a lunch from home or those not eating lunch in the cafeteria on a study day were excluded. The dataset below shows the food-card data collected for the first 3 weeks for a subject. ml.xpt RowSTUDYIDDOMAINUSUBJIDMLSEQMLTRTVISITNUMVISITMLSTDTC1ABC123MLABC123-0011FRUIT ROLLUP1WEEK 12015-09-092ABC123MLABC123-0012WHTE MILK1WEEK 12015-09-093ABC123MLABC123-0013PEANUT BUTTER SANDWICH1WEEK 12015-09-094ABC123MLABC123-0014BANANA2WEEK 22015-09-175ABC123MLABC123-0015CHOCOLATE MILK2WEEK 22015-09-176ABC123MLABC123-0016PIZZA2WEEK 22015-09-177ABC123MLABC123-0017APPLE3WEEK 32015-09-228ABC123MLABC123-0018WHITE MILK3WEEK 32015-09-229ABC123MLABC123-0019SALAD3WEEK 32015-09-22 6.1.5 ProceduresPR – Description/OverviewAn interventions domain that contains interventional activity intended to have diagnostic, preventive, therapeutic, or palliative effects. PR – Specificationpr.xpt, Procedures — Interventions, Version 3.3. One record per recorded procedure per occurrence per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar Identifies the start of the observation as being before or after the sponsor-defined reference time point defined by variable PRSTTPT. Not all values of the codelist are allowable for this variable. See Section 4.4.7, Use of Relative Timing Variables. PermPRSTTPTStart Reference Time PointCharTimingDescription or date/time in ISO 8601 character format of the sponsor-defined reference point referred to by PRSTRTPT. Examples: "2003-12-15" or "VISIT 1".PermPRENRTPTEnd Relative to Reference Time PointChar(STENRF)Timing Identifies the end of the observation as being before or after the sponsor-defined reference time point defined by variable PRENTPT. Not all values of the codelist are allowable for this variable. See Section 4.4.7, Use of Relative Timing Variables. PermPRENTPTEnd Reference Time PointCharTimingDescription or date/time in ISO 8601 character format of the sponsor-defined reference point referred to by PRENRTPT. Examples: "2003-12-25" or "VISIT 2".Perm ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). PR – Assumptions
PR – ExamplesExample A procedures log CRF may collect verbatim values (procedure names) and dates performed. This example shows a subject who had five procedures collected and represented in the PR domain. pr.xpt RowSTUDYIDDOMAINUSUBJIDPRSEQPRTRTPRSTDTCPRENDTC1XYZPRXYZ789-0021Wisdom Teeth Extraction2010-06-082010-06-082XYZPRXYZ789-0022Reset Broken Arm2010-08-062010-08-063XYZPRXYZ789-0023Prostate Examination2010-12-122010-12-124XYZPRXYZ789-0024Endoscopy2010-12-122010-12-125XYZPRXYZ789-0025Heart Transplant2011-08-292011-08-29 Example This example shows data from a 24-hour Holter monitor, an ambulatory electrocardiography device that records a continuous electrocardiographic rhythm pattern. The start and end of the Holter monitoring procedure are represented in the PR domain. pr.xpt RowSTUDYIDDOMAINUSUBJIDPRSEQPRLNKIDPRTRTPRPRESPPROCCURPRSTDTCPRENDTC1ABC123PRABC123-001120110101_2011010224-HOUR HOLTER MONITORYY2011-01-01T08:002011-01-02T09:45 The heart rate findings from the procedure are represented in the EG domain. eg.xpt RowSTUDYIDDOMAINUSUBJIDEGSEQEGLNKIDEGTESTCDEGTESTEGORRESEGORRESUEGMETHODEGDTCEGENDTC1ABC123EGABC123-001120110101_20110102EGHRMINECG Minimum Heart Rate70beats/minHOLTER CONTINUOUS ECG RECORDING2011-01-01T08:002011-01-02T09:452ABC123EGABC123-001220110101_20110102EGHRMAXECG Maximum Heart Rate100beats/minHOLTER CONTINUOUS ECG RECORDING2011-01-01T08:002011-01-02T09:453ABC123EGABC123-001320110101_20110102EGHRMEANECG Mean Heart Rate75beats/minHOLTER CONTINUOUS ECG RECORDING2011-01-01T08:002011-01-02T09:45 The relrec.xpt reflects a one-to-many dataset-level relationship between PR and EG using --LNKID. relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABC123PR Example Data for three subjects who had on-study radiotherapy are below. Dose, dose unit, location, and timing are represented. pr.xpt RowSTUDYIDDOMAINUSUBJIDPRSEQPRTRTPRDOSEPRDOSUPRLOCPRLATPRSTDTCPRENDTC1ABC123PRABC123-10011External beam radiation therapy70GyBREASTRIGHT2011-06-012011-06-252ABC123PRABC123-20021Brachytherapy25GyPROSTATE 6.1.6 Substance UseSU – Description/OverviewAn interventions domain that contains substance use information that may be used to assess the efficacy and/or safety of therapies that look to mitigate the effects of chronic substance use. SU – Specificationsu.xpt, Substance Use — Interventions, Version 3.3. One record per substance type per reported occurrence per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar Describes the start of the substance use relative to the sponsor-defined reference period. The sponsor-defined reference period is a continuous period of time defined by a discrete starting point and a discrete ending point (represented by RFSTDTC and RFENDTC in Demographics). If information such as "PRIOR" was collected, this information may be translated into SUSTRF. Not all values of the codelist are allowable for this variable. See Section 4.4.7, Use of Relative Timing Variables. PermSUENRFEnd Relative to Reference PeriodChar(STENRF)TimingDescribes the end of the substance use with relative to the sponsor-defined reference period. The sponsor-defined reference period is a continuous period of time defined by a discrete starting point and a discrete ending point (represented by RFSTDTC and RFENDTC in Demographics). If information such as "PRIOR", "ONGOING", or "CONTINUING" was collected, this information may be translated into SUENRF. Not all values of the codelist are allowable for this variable. See Section 4.4.7, Use of Relative Timing Variables. PermSUSTRTPTStart Relative to Reference Time PointChar(STENRF)TimingIdentifies the start of the substance as being before or after the reference time point defined by variable SUSTTPT. Not all values of the codelist are allowable for this variable. See Section 4.4.7 , Use of Relative Timing Variables. PermSUSTTPTStart Reference Time PointCharTimingDescription or date/time in ISO 8601 character format of the reference point referred to by SUSTRTPT. Examples: "2003-12-15" or "VISIT 1".PermSUENRTPTEnd Relative to Reference Time PointChar(STENRF)Timing Identifies the end of the substance as being before or after the reference time point defined by variable SUENTPT. Not all values of the codelist are allowable for this variable. See Section 4.4.7 , Use of Relative Timing Variables. PermSUENTPTEnd Reference Time PointCharTimingDescription or date/time in ISO 8601 character format of the reference point referred to by SUENRTPT. Examples: "2003-12-25" or "VISIT 2".Perm ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). SU – Assumptions
SU – ExamplesExample The example below illustrates how typical substance use data could be populated. Here, the CRF collected:
SUCAT allows the records to be grouped into smoking-related data and caffeine-related data. In this example, the treatments are pre-specified on the CRF page, so SUTRT does not require a standardized SUDECOD equivalent. Not shown: A subject who never smoked does not have a tobacco record. Alternatively, a row for the subject could have been included with SUOCCUR = "N" and null dosing and timing fields; the interpretation would be the same. A subject who did not drink any caffeinated drinks on the day of the assessment does not have any caffeine records. A subject who never smoked and did not drink caffeinated drinks on the day of the assessment does not appear in the dataset. Row 1:This subject is a 2-pack/day current smoker. "Current" implies that smoking started sometime before the time the question was asked (SUSTTPT = "2006-01-01", SUSTRTPT = "BEFORE") and had not ended as of that date (SUENTTP = "2006-01-01", SUENRTPT = "ONGOING"). See Section 4.4.7, Use of Relative Timing Variables for the use of these variables. Both the beginning and ending reference time points for this question are the date of the assessment.Row 2:The same subject drank three cups of coffee on the day of the assessment.Row 3:A second subject is a former smoker. The date the subject began smoking is unknown, but we know that it was sometime before the assessment date. This is shown by the values of SUSTTPT and SUSTRTPT. The end date of smoking was collected, so SUENTPT and SUENRTPT are not populated. Instead, the end date is in SUENDTC.Row 4:This second subject drank tea on the day of the assessment.Row 5:This second subject drank coffee on the day of the assessment.Row 6:A third subject had missing data for the smoking questions. This is indicated by SUSTAT = "NOT DONE". The reason is in SUREASND.Row 7:This third subject also had missing data for all of the caffeine questions. su.xpt RowSTUDYIDDOMAINUSUBJIDSUSEQSUTRTSUCATSUSTATSUREASNDSUDOSESUDOSUSUDOSFRQSUSTDTCSUENDTCSUSTTPTSUSTRTPTSUENTPTSUENRTPT11234SU12340051CIGARETTESTOBACCO 6.2 Models for Events DomainsMost subject-level observations collected during the study should be represented according to one of the three SDTM general observation classes. This is the list of domains corresponding to the Events class. Domain CodeDomain DescriptionAE Adverse Events An events domain that contains data describing untoward medical occurrences in a patient or subjects that are administered a pharmaceutical product and which may not necessarily have a causal relationship with the treatment. CEClinical Events An events domain that contains clinical events of interest that would not be classified as adverse events. DSDisposition An events domain that contains information encompassing and representing data related to subject disposition. DVProtocol Deviations An events domain that contains protocol violations and deviations during the course of the study. HOHealthcare Encounters A events domain that contains data for inpatient and outpatient healthcare events (e.g., hospitalization, nursing home stay, rehabilitation facility stay, ambulatory surgery). MHMedical History The medical history dataset includes the subject's prior history at the start of the trial. Examples of subject medical history information could include general medical history, gynecological history, and primary diagnosis. 6.2.1 Adverse EventsAE – Description/OverviewAn events domain that contains data describing untoward medical occurrences in a patient or subjects that are administered a pharmaceutical product and which may not necessarily have a causal relationship with the treatment. AE – Specificationae.xpt, Adverse Events — Events, Version 3.3. One record per adverse event per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar Describes the end of the event relative to the sponsor-defined reference period. The sponsor-defined reference period is a continuous period of time defined by a discrete starting point (RFSTDTC) and a discrete ending point (RFENDTC) of the trial. Not all values of the codelist are allowable for this variable. See Section 4.4.7, Use of Relative Timing Variables. PermAEENRTPTEnd Relative to Reference Time PointChar(STENRF)TimingIdentifies the end of the event as being before or after the reference time point defined by variable AEENTPT. Not all values of the codelist are allowable for this variable. See Section 4.4.7, Use of Relative Timing Variables. PermAEENTPTEnd Reference Time PointCharTimingDescription of date/time in ISO 8601 character format of the reference point referred to by AEENRTPT. Examples: "2003-12-25" or "VISIT 2".Perm ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). AE – Assumptions
AE – ExamplesExample This example illustrates data from an AE CRF that collected AE terms as free text. AEs were coded using MedDRA, and the sponsor's procedures include the possibility of modifying the reported term to aid in coding. The CRF was structured so that seriousness category variables (e.g., AESDTH, AESHOSP) were checked only when AESER is answered "Y." In this study, the study reference period started at the start of study treatment. Three AEs were reported for this subject. Rows 1-2:Show examples of modifying the reported term for coding purposes, with the modified term in AEMODIFY. These adverse events were not serious, so the seriousness criteria variables are null. Note that for the event in row 2, AESTDY = "1". Since Day 1 was the day treatment started, the AE start and end times, as well as dates, were collected to allow comparison of the AE timing to the start of treatment.Row 3:Shows an example of the overall seriousness question AESER answered with "Y" and the relevant corresponding seriousness category variables (AESHOSP and AESLIFE) answered "Y". The other seriousness category variables are left blank. This row also shows AEENRF being populated because the AE was marked as "Continuing" as of the end of the study reference period for the subject (see Section 4.4.7, Use of Relative Timing Variables). ae.xpt RowSTUDYIDDOMAINUSUBJIDAESEQAETERMAEMODIFYAEDECODAEBODSYSAESEVAESERAEACNAERELAEOUTAESCONGAESDISABAESDTHAESHOSPAESLIFEAESMIEEPOCHAESTDTCAEENDTCAESTDYAEENDYAEENRF1ABC123AE1231011POUNDING HEADACHEHEADACHEHeadacheNervous system disordersSEVERENNOT APPLICABLEDEFINITELY NOT RELATEDRECOVERED/RESOLVED Example In this example, a CRF module included at several visits asked whether nausea, vomiting, or diarrhea occurred. The responses to the probing questions ("Yes", "No", or "Not Done") were represented in the Findings About (FA) domain (see Section 6.4, Findings About Events or Interventions). If "Yes", the investigator was instructed to complete the Adverse Event CRF. In the Adverse Events dataset, data on AEs solicited by means of pre-specified on the CRF have an AEPRESP value of "Y". For AEs solicited by a general question, AEPRESP is null. RELREC may be used to relate AE records and FA records. Rows 1-2:Show that nausea and vomiting were pre-specified on a CRF, as indicated by AEPRESP = "Y". The subject did not experience diarrhea, so no record for that term exists in the AE dataset.Row 3:Shows an example of an AE (headache) that was not pre-specified on a CRF as indicated by a null value for AEPRESP. ae.xpt RowSTUDYIDDOMAINUSUBJIDAESEQAETERMAEDECODAEPRESPAEBODSYSAESEVAESERAEACNAERELAEOUTEPOCHAESTDTCAEENDTCAESTDYAEENDY1ABC123AE1231011NAUSEANauseaYGastrointestinal disordersSEVERENDOSE REDUCEDRELATEDRECOVERED/RESOLVEDTREATMENT2005-10-122005-10-13232ABC123AE1231012VOMITINGVomitingYGastrointestinal disordersMODERATENDOSE REDUCEDRELATEDRECOVERED/RESOLVEDTREATMENT2005-10-13T13:002005-10-13T19:00333ABC123AE1231013HEADACHEHeadache Example In this example, a CRF module that asked whether or not nausea, vomiting, or diarrhea occurred was included in the study only once. In the context of this study, the conditions that occurred were reportable as Adverse Events. No additional data about these events was collected. No other adverse event information was collected via general questions. The responses to the probing questions ("Yes", "No", or "Not Done") were represented in the Findings About (FA) domain (see Section 6.4, Findings About Events or Interventions). This is an example of unusually sparse AE data collection; the AE dataset is populated with the term and the flag indicating that it was pre-specified, but timing information is limited to the date of collection, and other expected qualifiers are not available. RELREC may be used to relate AE records and FA records. The subject shown in this example experienced nausea and vomiting. The subject did not experience diarrhea, so no record for that term exists in the AE dataset. ae.xpt RowSTUDYIDDOMAINUSUBJIDAESEQAETERMAEDECODAEPRESPAEBODSYSAESERAEACNAERELAEDTCAESTDTCAEENDTCAEDY1ABC123AE1231011NAUSEANauseaYGastrointestinal disorders Example In this example, the investigator was instructed to create a new adverse-event record each time the severity of an adverse event changed. The sponsor used AEGRPID to identify the group of records related to a single event for a subject. Row 1:Shows an adverse event of nausea, whose severity was moderate.Rows 2-4:Show AEGRPID used to group records related to a single event of "VOMITING".Rows 5-6:Show AEGRPID used to group records related to a single event of "DIARRHEA". ae.xpt RowSTUDYIDDOMAINUSUBJIDAESEQAEGRPIDAETERMAEBODSYSAESEVAESERAEACNAERELAESTDTCAEENDTC1ABC123AE1231011 6.2.2 Clinical EventsCE – Description/OverviewAn events domain that contains clinical events of interest that would not be classified as adverse events. CE – Specificationce.xpt, Clinical Events — Events, Version 3.3. One record per event per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar
TimingActual study day of start of the clinical event expressed in integer days relative to the sponsor-defined RFSTDTC in Demographics.PermCEENDYStudy Day of End of EventNum TimingActual study day of end of the clinical event expressed in integer days relative to the sponsor-defined RFSTDTC in Demographics.PermCESTRFStart Relative to Reference PeriodChar(STENRF)Timing Describes the start of the clinical event relative to the sponsor-defined reference period. The sponsor-defined reference period is a continuous period of time defined by a discrete starting point and a discrete ending point (represented by RFSTDTC and RFENDTC in Demographics). Not all values of the codelist are allowable for this variable. See Section 4.4.7, Use of Relative Timing Variables. PermCEENRFEnd Relative to Reference PeriodChar(STENRF)TimingDescribes the end of the event relative to the sponsor-defined reference period. The sponsor-defined reference period is a continuous period of time defined by a discrete starting point and a discrete ending point (represented by RFSTDTC and RFENDTC in Demographics). Not all values of the codelist are allowable for this variable. See Section 4.4.7, Use of Relative Timing Variables. PermCESTRTPTStart Relative to Reference Time PointChar(STENRF)TimingIdentifies the start of the observation as being before or after the reference time point defined by variable CESTTPT. Not all values of the codelist are allowable for this variable. See Section 4.4.7, Use of Relative Timing Variables. PermCESTTPTStart Reference Time PointCharTimingDescription or date/time in ISO 8601 character format of the sponsor-defined reference point referred to by --STRTPT. Examples: "2003-12-15" or "VISIT 1".PermCEENRTPTEnd Relative to Reference Time PointChar(STENRF)Timing Identifies the end of the observation as being before or after the sponsor-defined reference time point defined by variable CEENTPT. Not all values of the codelist are allowable for this variable. See Section 4.4.7, Use of Relative Timing Variables. PermCEENTPTEnd Reference Time PointCharTimingDescription or date/time in ISO 8601 character format of the reference point referred to by CEENRTPT. Examples: "2003-12-25" or "VISIT 2".Perm ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). CE – Assumptions
CE – ExamplesExample In this example:
CRF: Record start dates of any of the following signs that occur.Clinical SignStart DateRash This example shows records for clinical events for which start dates were recorded. Since conjunctivitis was not observed, no start date was recorded and there is no CE record. ce.xpt RowSTUDYIDDOMAINUSUBJIDCESEQCETERMCEPRESPCEOCCURCESTDTC1ABC123CE1231RashYY2006-05-032ABC123CE1232WheezingYY2006-05-033ABC123CE1233EdemaYY2006-05-03 Example In this example:
CRF: EventDate StartedDate EndedSeverityNausea
(dd/mmm/yyyy)_ _ / _ _ _ / _ _ _ _ (dd/mmm/yyyy)
(dd/mmm/yyyy)_ _ / _ _ _ / _ _ _ _ (dd/mmm/yyyy)
(dd/mmm/yyyy)_ _ / _ _ _ / _ _ _ _ (dd/mmm/yyyy)
(dd/mmm/yyyy)_ _ / _ _ _ / _ _ _ _ (dd/mmm/yyyy)
Row 1:Shows a record for the pre-specified clinical event "Nausea". The CEPRESP value of "Y" indicates that there was a probing question; the response to the probe (CEOCCUR) was "Yes". The record includes additional data about the event.Row 2:Shows a record for the pre-specified clinical event "Vomit". The CEPRESP value of "Y" indicates that there was a probing question; the response to the question (CEOCCUR) was "No".Row 3:Shows a record for the pre-specified clinical event "Diarrhea." The value "Y" for CEPRESP indicates it was pre-specified. The CESTAT value of NOT DONE indicates that the probing question was not asked or that there was no answer.Row 4:Shows a record for a write-in Clinical Event recorded in the "Other, Specify" space. Because this event was not pre-specified, CEPRESP and CEOCCUR are null. See Section 4.2.7, Submitting Free Text from the CRF for further information on populating the Topic variable when "Other, Specify" is used on the CRF). ce.xpt RowSTUDYIDDOMAINUSUBJIDCESEQCETERMCEPRESPCEOCCURCESTATCESEVCESTDTCCEENDTC1ABC123CE1231NAUSEAYY 6.2.3 DispositionDS – Description/OverviewAn events domain that contains information encompassing and representing data related to subject disposition. DS – Specificationds.xpt, Disposition — Events, Version 3.3. One record per disposition status or protocol milestone per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). DS – Assumptions
DS – ExamplesExample In this example, disposition of study participation was collected for each EPOCH of a trial. Disposition of study participation is indicated by DSCAT = "DISPOSITION EVENT". EPOCH was taken from the case report form, which asked about completion of each epoch of the study. Data about disposition of study treatment was not collected, but the sponsor populated DSSCAT with "STUDY PARTICIPATION" to emphasize that these represent disposition of study participation. Data were also collected about several protocol milestones represented with DSCAT = "PROTOCOL MILESTONE". Rows 1, 2, 6, 8, 9, 12, 13, 17, 18:Show records for protocol milestones. DSTERM and DSDECOD are populated with the same value, the name of the milestone. Note that for randomization events, EPOCH = "SCREENING", since randomization occurred before the start of treatment, during the screening epoch.Rows 3-5:Show three records for a subject who completed three stages of the study, "SCREENING", "TREATMENT", and "FOLLOW-UP".Row 7:Shows disposition of a subject who was a screen failure. The verbatim reason the subject was a screen failure is represented in DSTERM. Since the subject did not complete the screening epoch, DSDECOD is not "COMPLETED" but another appropriate controlled term, "PROTOCOL VIOLATION". The date of discontinuation is in DSSTDTC. The protocol deviation event itself would be represented in the DV dataset.Rows 10-11:Show disposition of a subject who completed the screening stage but did not complete the treatment stage. For completed epochs, both DSTERM and DSDECOD are "COMPLETED". For epochs that were not completed, the verbatim reason for non-completion of the treatment epoch is in DSTERM, while the value from controlled terminology is in DSDECOD.Rows 14-16:Show disposition of a subject who completed treatment, but did not complete follow-up. Note that for final disposition event, the date of collection of the event information, DSDTC, was different from the date of the disposition event (the subject's death), DSSTDTC.Rows 19-21:Show disposition of a subject who discontinued the treatment epoch due to an AE, but who went on to complete the follow-up phase of the trial. ds.xpt RowSTUDYIDDOMAINUSUBJIDDSSEQDSTERMDSDECODDSCATDSSCATEPOCHDSDTCDSSTDTC1ABC123DS1231011INFORMED CONSENT OBTAINEDINFORMED CONSENT OBTAINEDPROTOCOL MILESTONE Example In this example, the sponsor has chosen to simply submit whether or not the subject completed the study, so there is only one record per subject. The sponsor did not collect disposition of treatment and did not include DSSCAT. EPOCH was populated as a timing variable, and represents the epoch during which the subject discontinued. Row 1:Subject who completed the study. EPOCH = "FOLLOW-UP" since that was the last epoch in the design of this study.Rows 2-3:Subjects who discontinued. Both discontinued participation during the treatment epoch. ds.xpt RowSTUDYIDDOMAINUSUBJIDDSSEQDSTERMDSDECODDSCATEPOCHDSSTDTC1ABC456DS4561011COMPLETEDCOMPLETEDDISPOSITION EVENTFOLLOW-UP2003-09-212ABC456DS4561021SUBJECT TAKING STUDY MED ERRATICALLYPROTOCOL VIOLATIONDISPOSITION EVENTTREATMENT2003-09-293ABC456DS4561031LOST TO FOLLOW-UPLOST TO FOLLOW-UPDISPOSITION EVENTTREATMENT2003-10-15 Example In this study, disposition of study participation was collected for the treatment and follow-up epochs. For these records, the value in EPOCH was taken from the CRF. Data on screen failures were not submitted for this study, so all submitted subjects completed screening; the sponsor chose not to data on disposition of the screening epoch. Data on protocol milestones were not collected, but data were collected if a subject's treatment was unblinded. For these records, EPOCH represents the epoch during which the blind was broken. Rows 1, 2:Subject completed the treatment and follow-up phase.Rows 3, 5:Subject did not complete the treatment phase but did complete the follow-up phase.Row 4:Subject's treatment is unblinded. The date of the unblinding is represented in DSSTDTC. Maintaining the blind as per protocol is not considered to be an event since there is no change in the subject's state. ds.xpt RowSTUDYIDDOMAINUSUBJIDDSSEQDSTERMDSDECODDSCATEPOCHDSSTDTC1ABC789DS7891011COMPLETEDCOMPLETEDDISPOSITION EVENTTREATMENT2004-09-122ABC789DS7891012COMPLETEDCOMPLETEDDISPOSITION EVENTFOLLOW-UP2004-12-203ABC789DS7891021SKIN RASHADVERSE EVENTDISPOSITION EVENTTREATMENT2004-09-304ABC789DS7891022SUBJECT HAD SEVERE RASHTREATMENT UNBLINDEDOTHER EVENTTREATMENT2004-10-015ABC789DS7891023COMPLETEDCOMPLETEDDISPOSITION EVENTFOLLOW-UP2004-12-28 Example In this example, the CRF included collection of an AE number when study participation was incomplete due to an adverse event. The relationship between the DS record and in the AE record was represented in a RELREC dataset. The DS domains represents the end of the subject's participation in the study, due to their death from heart failure. In this case, the disposition was collected (DSDTC) on the same day that death occurred and the subject's study participation ended. (DSDTDTC). ds.xpt RowSTUDYIDDOMAINUSUBJIDDSSEQDSTERMDSDECODDSCATEPOCHDSDTCDSSTDTC1ABC123DS1231021Heart FailureDEATHDISPOSITION EVENTTREATMENT2003-09-292003-09-29 The heart failure is represented as an adverse event. In order to save space, only two of the MedDRA coding variables for the adverse event have been included. ae.xpt RowSTUDYIDDOMAINUSUBJIDAESEQAETERMAESTDTCAEENDTCAEDECODAESOCAESEVAESERAEACNAERELAEOUTAESCANAESCONGAESDISABAESDTHAESHOSPAESLIFEAESODAESMIE1ABC123AE1231021Heart Failure2003-09-292003-09-29HEART FAILURECARDIOVASCULAR SYSTEMSEVEREYNOT APPLICABLEDEFINITELY NOT RELATEDFATALNNNYNNNN The relationship between the DS and AE records is represented in RELREC. relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABC123DS123102DSSEQ1 The subject's DM record is not shown, but included DTHFL = "Y" and the date of death. Example The example below represents a multi-drug (isoniazid and levofloxacin) investigational treatment trial for multidrug-resistant tuberculosis (MDR-TB). The protocol allows for a subject to discontinue levofloxacin and continue single treatment of isoniazid throughout the remainder of the study. Disposition of study participation and disposition of each drug was collected. Whether a record with DSCAT = "DISPOSITION EVENT" represents disposition of the subject's participation in the study or disposition of a study treatment is represented in DSSCAT. In this example, disposition of the study and of each drug a subject received for each of the study's two treatment epochs. Row 1:Indicates that the physician, per protocol, removed levofloxacin treatment due to high-level positive cultures. This record represents the treatment discontinuation for levofloxacin, for the first treatment epocch. Note that since this subject did not receive levofloxacin during the second treatment epoch, there is no record for DSSCAT = "LEVOFLOXACIN" with EPOCH = "TREATMENT 2".Rows 2, 4:Represent the treatment continuation and completion for isoniazid each treatment epoch, as indicated by DSSCAT = "ISONIAZID".Rows 3, 5:Represent the study disposition for each treatment epoch, as indicated by DSSCAT = "STUDY PARTICIPATION". ds.xpt RowSTUDYIDDOMAINUSUBJIDDSSEQDSTERMDSDECODDSCATDSSCATDSSTDTCEPOCH1XXXDSXXX-767-0011PERSISTENT HIGH-LEVEL POSITIVE CULTURES, PER PROTOCOL, LEVOFLOXACIN REMOVAL RECOMMENDEDPHYSICIAN DECISIONDISPOSITION EVENTLEVOFLOXACIN2016-02-15TREATMENT 12XXXDSXXX-767-0012COMPLETEDCOMPLETEDDISPOSITION EVENTISONIAZID2016-02-15TREATMENT 13XXXDSXXX-767-0013COMPLETEDCOMPLETEDDISPOSITION EVENTSTUDY PARTICIPATION2016-02-25TREATMENT 14XXXDSXXX-767-0014COMPLETEDCOMPLETEDDISPOSITION EVENTISONIAZID2016-03-14TREATMENT 25XXXDSXXX-767-0015COMPLETEDCOMPLETEDDISPOSITION EVENTSTUDY PARTICIPATION2016-03-24TREATMENT 2 Example This example is for a study of a multi-drug (isoniazid and levofloxacin) investigational treatment for multidrug-resistant tuberculosis (MDR-TB). The protocol allowed a subject to discontinue levofloxacin and continue single treatment of isoniazid throughout the remainder of the study. Disposition of study participation and of each study treatment was collected. For records of disposition of the subject's participation in the study DSSCAT = "STUDY PARTICIPATION", while for records of disposition of a study treatment DSSCAT is the name of the treatment. Row 1:Represents the final treatment disposition for levofloxacin, as indicated by DSSCAT = "LEVOFLOXACIN". The physician removed levofloxacin treatment due to high-level positive cultures, as allowed by the protocol.Row 2:Represents the final treatment completion of isoniazid within the trial, which is indicated by DSSCAT = "ISONIAZID".Row 3:Represents the final study completion within the trial, which is indicated by DSSCAT = "STUDY PARTICIPATION". ds.xpt RowSTUDYIDDOMAINUSUBJIDDSSEQDSTERMDSDECODDSCATDSSCATDSSTDTCEPOCH1XXXDSXXX-767-0011PERSISTENT HIGH-LEVEL POSITIVE CULTURES, PER PROTOCOL, LEVOFLOXACIN REMOVAL RECOMMENDEDPHYSICIAN DECISIONDISPOSITION EVENTLEVOFLOXACIN2016-02-15TREATMENT 12XXXDSXXX-767-0012COMPLETEDCOMPLETEDDISPOSITION EVENTISONIAZID2016-03-14TREATMENT 23XXXDSXXX-767-0013COMPLETEDCOMPLETEDDISPOSITION EVENTSTUDY PARTICIPATION2016-03-24TREATMENT 2 Example The example below is for a trial with a single investigative treatment. The sponsor used the generic DSSCAT value "STUDY TREATMENT" rather than the name of the treatment. This subject discontinued both treatment and study participation due to an adverse event. Rows 1, 3:Represent the disposition of treatment for each treatment epoch, as indicated by DSSCAT = "STUDY TREATMENT".Rows 2, 4:Represent the disposition of study participation continuation for each treatment epoch, as indicated by DSSCAT = "STUDY PARTICIPATION". ds.xpt RowSTUDYIDDOMAINUSUBJIDDSSEQDSTERMDSDECODDSCATDSSCATDSSTDTCEPOCH1XXXDSXXX-767-0011COMPLETEDCOMPLETEDDISPOSITION EVENTSTUDY TREATMENT2016-02-15TREATMENT 12XXXDSXXX-767-0012COMPLETEDCOMPLETEDDISPOSITION EVENTSTUDY PARTICIPATION2016-02-15TREATMENT 13XXXDSXXX-767-0013SKIN RASHADVERSE EVENTDISPOSITION EVENTSTUDY TREATMENT2016-03-14TREATMENT 24XXXDSXXX-767-0014SKIN RASHADVERSE EVENTDISPOSITION EVENTSTUDY PARTICIPATION2016-03-14TREATMENT 2 Example The example below represents data for an ongoing blinded study in which each subject received two treatments, identified by the sponsor as "BLINDED DRUG A" and "BLINDED DRUG B". Disposition of study participation and of each of the two blinded treatments was collected for each of the two treatment epochs in the study. The subject in this example completed study participation and both treatments for both treatment epochs. Rows 1, 2, 4, 5:Represent the disposition of the blinded treatments for each of the two treatment epochs for each of the two treatments, indicated by DSSCAT = "BLINDED DRUG A" and DSSCAT = "BLINDED DRUG B".Rows 3, 6:Represent the disposition of study participation for each of the two treatment epochs, as indicated by DSSCAT = "STUDY PARTICIPATION". ds.xpt RowSTUDYIDDOMAINUSUBJIDDSSEQDSTERMDSDECODDSCATDSSCATDSSTDTCEPOCH1XXXDSXXX-767-0011COMPLETEDCOMPLETEDDISPOSITION EVENTBLINDED DRUG A2016-02-15TREATMENT 12XXXDSXXX-767-0012COMPLETEDCOMPLETEDDISPOSITION EVENTBLINDED DRUG B2016-02-15TREATMENT 13XXXDSXXX-767-0013COMPLETEDCOMPLETEDDISPOSITION EVENTSTUDY PARTICIPATION2016-02-25TREATMENT 14XXXDSXXX-767-0014COMPLETEDCOMPLETEDDISPOSITION EVENTBLINDED DRUG A2016-03-14TREATMENT 25XXXDSXXX-767-0015COMPLETEDCOMPLETEDDISPOSITION EVENTBLINDED DRUG B2016-03-14TREATMENT 26XXXDSXXX-767-0016COMPLETEDCOMPLETEDDISPOSITION EVENTSTUDY PARTICIPATION2016-03-24TREATMENT 2 Example This example is for a study in which multiple informed consents were collected. DSTERM is populated with a full description of the informed consent; DSDECOD is populated with the standardized value "INFORMED CONSENT OBTAINED" from the codelist "Completion/Reason for Non-Completion" (NCOMPLT). For all informed consent records, DSCAT = "PROTOCOL MILESTONE". The sponsor chose to include the EPOCH timing variable, to indicate the epoch during which each protocol milestone occurred. Row 1:Shows the obtaining of the initial study informed consent.Row 2:Shows randomization, another event with DSCAT = "PROTOCOL MILESTONE".Rows 3-5:Show three additional informed consents obtained during the trial. ds.xpt RowSTUDYIDDOMAINUSUBJIDDSSEQDSTERMDSDECODDSCATEPOCHDSSTDTC1XXXDSXXX-767-0011INFORMED CONSENT FOR STUDY ENROLLMENT OBTAINEDINFORMED CONSENT OBTAINEDPROTOCOL MILESTONESCREENING2016-02-222XXXDSXXX-767-0012RANDOMIZEDRANDOMIZEDPROTOCOL MILESTONESCREENING2016-02-263XXXDSXXX-767-0013INFORMED CONSENT FOR AMENDMENT ONE OBTAINEDINFORMED CONSENT OBTAINEDPROTOCOL MILESTONETREATMENT 12016-04-124XXXDSXXX-767-0014INFORMED CONSENT FOR PHARMACOGENETIC RESEARCH OBTAINEDINFORMED CONSENT OBTAINEDPROTOCOL MILESTONETREATMENT 22016-06-085XXXDSXXX-767-0015INFORMED CONSENT FOR PK SUB-STUDY OBTAINEDINFORMED CONSENT OBTAINEDPROTOCOL MILESTONETREATMENT 22016-06-23 Example The example represents data for two subjects who participated in a study with multiple treatment periods. During the first treatment period, subjects were randomized to "DRUG1" or "DRUG2". The second treatment phase added the investigational drug to "DRUG1" and "DRUG2". Disposition of study drugs and study participation was collected at the end of each epoch. DSSCAT was used to distinguish between disposition of study drugs vs. study participation. The supporting Demographics (DM), Exposure (EX), Trial Elements (TE), Trial Arms (TA) and Subject Elements (SE) have been provided for additional context. Not all records are shown in the supporting example datasets. The elements used in the TA dataset are defined in the TE dataset. Row 1:Shows the screening element.Rows 2, 3:Show the elements for treatment with either "DRUG1" or "DRUG2". These appear in the first treatment epoch in the TA dataset.Rows 4, 5:Show the elements for treatment with either "DG1INDG" or "DG2INDG". These appear in the second treatment epoch in the TA dataset.Row 6:Shows the follow-up element. te.xpt RowSTUDYIDDOMAINETCDELEMENTTESTRLTEENRLTEDUR1XYZTESCRNScreenInformed Consent1 week after start of ElementP7D2XYZTEDRUG1Drug 1First dose of Drug 14 weeks after start of ElementP28D3XYZTEDRUG2Drug 2First dose of Drug 24 weeks after start of ElementP28D4XYZTEDG1INDGDrug 1 + Investigation DrugFirst dose of Investigational Drug, where Investigational Drug is given with Drug 1.1 week after start of ElementP7D5XYZTEDG2INDGDrug 2 + Investigation DrugFirst dose of Investigational Drug, where Investigational Drug is given with Drug 2.1 week after start of ElementP7D6XYZTEFUFollow-upOne day after last administration of study drug. The TA dataset describes the design of the study. Rows 1, 5:Screening portion of the trial arm.Rows 2, 6:Represents the planned initial treatment arm of either "DRUG1" or "DRUG2".Rows 3, 7:Represents the planned second treatment arm of either "DG1INDG" or " DG2INDG" .Rows 4, 8:Follow-up portion of the trial arm. ta.xpt RowSTUDYIDDOMAINARMCDARMTAETORDETCDELEMENTTABRANCHTATRANSEPOCH1XYZTADG1INDGDrug-1+Investigation-Drug1SCRNScreenRandomized to DG1INDG The Demographics (DM) dataset includes the arm to which the subjects were randomized, and the dates of informed consent, start of study treatment, end of study treatment, and end of study participation. dm.xpt RowSTUDYIDDOMAINUSUBJIDSUBJIDRFXSTDTCRFXENDTCRFICDTCRFPENDTCSITEIDINVNAMARMCDARMACTARMCDACTARMARMNRSACTARMUD1XYZDMXYZ-767-0010012016-02-142016-04-192016-02-022016-04-2401ADAMS, MDG1INDGDrug-1+Investigation-DrugDG1INDGDrug-1+Investigation-Drug The Exposure (EX) dataset shows the administration of study treatments. ex.xpt RowSTUDYIDDOMAINUSUBJIDEXSEQEXTRTEXDOSEEXDOSUEPOCHEXSTDTCEXENDTC1XYZEXXYZ-767-0011Drug 1500mgTREATMENT 12016-02-142016-03-132XYZEXXYZ-767-0012Drug 1500mgTREATMENT 22016-03-142016-04-193XYZEXXYZ-767-0013Investigational Drug1000mgTREATMENT 22016-03-142016-04-194XYZEXXYZ-767-0021Drug 2500mgTREATMENT 12016-02-212016-03-235XYZEXXYZ-767-0022Drug 2500mgTREATMENT 22016-03-242016-04-246XYZEXXYZ-767-0023Investigational Drug1000mgTREATMENT 22016-03-242016-04-24 The Subject Elements (SE) dataset shows the dates for the elements for each subject. Rows 1, 5:Represent the subjects' actual screening elements.Rows 2, 6:Represent the subjects' actual first treatment epochs. The two subjects were in different elements in the first treatment epoch.Rows 3, 7:Represent the subjects' actual second treatment epochs.Rows 4, 8:Represent the subjects' actual follow-up elements. se.xpt RowSTUDYIDDOMAINUSUBJIDSDSEQETCDELEMENTSESTDTCSEENDTCTAETORDEPOCH1XYZSEXYZ-767-0011SCREENScreen2016-02-022016-02-141SCREENING2XYZSEXYZ-767-0012DRUG1Drug-12016-02-142016-03-142TREATMENT 13XYZSEXYZ-767-0013DG1INDGDrug 1 + Investigational Drug2016-03-142016-04-243TREATMENT 24XYZSEXYZ-767-0014FUFollow-up2016-04-242016-04-244FOLLOW-UP5XYZSEXYZ-767-0021SCREENScreen2016-02-042016-02-211SCREENING6XYZSEXYZ-767-0022DRUG2Drug-22016-02-212016-03-242TREATMENT 17XYZSEXYZ-767-0023DG2INDGDrug 2 + Investigational Drug2016-03-242016-04-293TREATMENT 28XYZSEXYZ-767-0024FUFollow-up2016-04-292016-04-294FOLLOW-UP The Dispostion (DS) dataset shows the disposition events and protocol milestones for each subject. Rows 1, 8:Show randomization to either "DRUG1" or "DRUG2" in the study.Rows 2, 9:Represent the completion of the screening phase of the study. Note that although a form describing disposition of the screening epoch may be filled out before treatment starts, the screening epoch does not end until treatment begins.Rows 3, 5, 10, 12:Represent the completion of drug for each EPOCH, where DSSCAT has the name of the drug(s). The DSSTDTC is the end date of study treatment for the EPOCH.Rows 4, 6, 11, 13:Represent the completion of study participation for each EPOCH, where DSSCAT has the name of "STUDY PARTICIPATION". The DSSTDTC is the end date of study particaption for the EPOCH. There's a one day evaluation post treatment.Rows 7, 14:Represent the completion of study participation follow-up EPOCH, where DSSCAT has the name of "STUDY PARTICIPATION". The DSSTDTC is the end date of study particaption for the EPOCH. ds.xpt RowSTUDYIDDOMAINUSUBJIDDSSEQDSTERMDSDECODDSCATDSSCATDSSTDTCEPOCH1XYZDSXYZ-767-0011RANDOMIZEDRANDOMIZEDPROTOCOL MILESTONE Example The study in this example had four cycles of treatment within the treatment epoch, and each cycle was represented as an element. While not a general requirement that each cycle is represented as a distinct element, doing so was important for this study. The study compared a current standard treatment with Drugs A and B to treatment with Drugs A, B, and C. The protocol allowed for drug doses to be reduced under specified criteria. For Drug C, these dose modifications could include dropping the drug. When Drug C is dropped, the subject may transition to treatment with Drugs A and B or to Follow-up. The TE dataset shows the elements of the trial. te.xpt RowSTUDYIDDOMAINETCDELEMENTTESTRLTEENRLTEDUR1DS10TESCRNScreenInformed ConsentScreening assessments are complete, up to 2 weeks after start of Element The TA dataset shows the trial design. The sponsor has chosen to number elements starting with zero for the screening element. For the AB Arm, the TAETORD values match the cycle numbers. For the ABC Arm, if Drug C is dropped, the subject may transition to an AB element or Follow-up. TAETORD values are not chronological for this Arm such that elements with TAETORD values of "2" or "5" would be during "Cycle 2", elements with TAETORD values of "3" or "6" would be during "Cycle 3", and elements with TAETORD values of "4" or "7" would be during "Cycle 4". ta.xpt RowSTUDYIDDOMAINARMCDARMTAETORDETCDELEMENTTABRANCHTATRANSEPOCH1DS10TAABAB0SCRNScreenRandomized to AB This example shows data for a subject who was randomized to Treatment ABC. Drug C was dropped after Cycle 2 due to toxicity associated with Drug C. Treatment with Drugs A and B was stopped after Cycle 3 due to disease progression. The subject died during follow-up. The SE dataset records the elements this subject experienced. Rows 1-4:The subject participated in the screening epoch and three elements of the treatment epoch.Row 5:The subject's fifth element was not "ABC" or "AB", as would have been expected if they recieved all four cycles of therapy, but "FU". se.xpt RowSTUDYIDDOMAINUSUBJIDSESEQETCDSESTDTCSEENDTCSEUPDESTAETORDEPOCH1DS10SE1011SCRN2015-01-212015-02-01 In this study, disposition of each treatment was collected, and disposition of study participation was collected for each epoch of the trial. The date of disposition for study treatment was defined as the date of the last dose of that treatment. Rows 1-2:Show that informed consent was obtained and randomization occurred during the screening epoch.Row 3:Shows disposition of study participation for the screening epoch. The subject completed this epoch.Row 4:Shows that Drug C was ended during the second cycle (TAETORD = "2") of the treatment epoch.Rows 5-6:Show that Drugs A and B were ended on the same day during the third cycle (TAETORD = "6") of the treatment epoch.Row 7:Shows disposition of study participation in the treatment epoch. The subject did not complete treatment, due to disease progression. The date of disposition of the treatment epoch, DSSTDTC, is the date the subject started the follow-up epoch. For this study, that was defined as four weeks after the start of the last treatment element. This means that although the subject's last dose of treatment was "2015-04-14", the end of the treatment period was later, on "2015-04-26", when the subject started the follow-up treatment.Row 8:Shows disposition of study participation in the follow-up epoch. The subject died. ds.xpt RowSTUDYIDDOMAINUSUBJIDDSSEQDSTERMDSDECODDSCATDSSCATDSSTDTCTAETORDEPOCH1DS10DS1011INFORMED CONSENT OBTAINEDINFORMED CONSENT OBTAINEDPROTOCOL MILESTONE 6.2.4 Protocol DeviationsDV – Description/OverviewAn events domain that contains protocol violations and deviations during the course of the study. DV – Specificationdv.xpt, Protocol Deviations — Events, Version 3.3. One record per protocol deviation per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). DV – Assumptions
DV – ExamplesExample This is an example of data that was collected on a protocol-deviations CRF. The DVDECOD column is for controlled terminology, whereas the DVTERM is free text. Rows 1, 3:Show examples of a TREATMENT DEVIATION type of protocol deviation.Row 2:Shows an example of a deviation due to the subject taking a prohibited concomitant medication.Row 4:Shows an example of a medication that should not be taken during the study. dv.xpt RowSTUDYIDDOMAINUSUBJIDDVSEQDVTERMDVDECODEPOCHDVSTDTC1ABC123DV1231011IVRS PROCESS DEVIATION - NO DOSE CALL PERFORMED.TREATMENT DEVIATIONTREATMENT2003-09-212ABC123DV1231031DRUG XXX ADMINISTERED DURING STUDY TREATMENT PERIODEXCLUDED CONCOMITANT MEDICATIONTREATMENT2003-10-303ABC123DV1231032VISIT 3 DOSE <15 MGTREATMENT DEVIATIONTREATMENT2003-10-304ABC123DV1231041TOOK ASPIRINPROHIBITED MEDSTREATMENT2003-11-30 6.2.5 Healthcare EncountersHO – Description/OverviewA events domain that contains data for inpatient and outpatient healthcare events (e.g., hospitalization, nursing home stay, rehabilitation facility stay, ambulatory surgery). HO – Specificationho.xpt, Healthcare Encounters — Events, Version 3.3. One record per healthcare encounter per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar Identifies the start of the observation as being before or after the sponsor-defined reference time point defined by variable --STTPT. Not all values of the codelist are allowable for this variable. See Section 4.4.7, Use of Relative Timing Variables. PermHOSTTPTStart Reference Time PointCharTimingDescription or date/time in ISO 8601 character format of the sponsor-defined reference point referred to by STRTPT. Examples: "2003-12-15" or "VISIT 1".PermHOENRTPTEnd Relative to Reference Time PointChar(STENRF)Timing Identifies the end of the event as being before or after the reference time point defined by variable HOENTPT. Not all values of the codelist are allowable for this variable. See Section 4.4.7, Use of Relative Timing Variables. PermHOENTPTEnd Reference Time PointCharTimingDescription or date/time in ISO 8601 character format of the reference point referred to by HOENRTPT. Examples: "2003-12-25" or "VISIT 2".Perm ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). HO – Assumptions
HO – ExamplesExample In this example, a healthcare encounter CRF collects verbatim descriptions of the encounter. Rows 1-2:Subject ABC123101 was hospitalized and then moved to a nursing home.Rows 3-5:Subject ABC123102 was in a hospital in the general ward and then in the intensive care unit. This same subject was transferred to a rehabilitation facility.Rows 6-7:Subject ABC123103 has two hospitalization records.Row 8:Subject ABC123104 was seen in the cardiac catheterization laboratory.Rows 9-12:Subject ABC123105 and subject ABC123106 were each seen in the cardiac catheterization laboratory and then transferred to another hospital. ho.xpt RowSTUDYIDDOMAINUSUBJIDHOSEQHOTERMEPOCHHOSTDTCHOENDTCHODUR1ABCHOABC1231011HOSPITALTREATMENT2011-06-082011-06-13 Row 1:For the first encounter recorded for subject ABC123101, the indication/medical condition for hospitalization was recorded.Row 2:For the second encounter recorded for subject ABC123101, the reason for admission to a nursing home was for rehabilitation.Rows 3-4:For the two encounters recorded for subject ABC123103, the name of the facilities were recorded.Row 5:For the first encounter for subject ABC123105, the indication/medical condition for the hospitalization was recorded.Row 6:For the second encounter for subject ABC123105, the name of the hospital was recorded. suppho.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIGQEVAL1ABCHOABC123101HOSEQ1HOINDCIndicationCONGESTIVE HEART FAILURECRF Example In this example, the dates of an initial hospitalization are collected as well as the date/time of ICU stay. Subsequent to discharge from the initial hospitalization, follow-up healthcare encounters, including admission to a rehabilitation facility, visits with healthcare providers, and home nursing visits were collected. Repeat hospitalizations are categorized separately. ho.xpt RowSTUDYIDDOMAINUSUBJIDHOSEQHOTERMHOCATHOSTDTCHOENDTCHOENRTPTHOENTPT1ABCHOABC1231011HOSPITALINITIAL HOSPITALIZATION2011-06-082011-06-12 The indication/medical condition for subject ABC123101's repeat hospitalization was represented as a supplemental qualifier. suppho.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIGQEVAL1ABCHOABC123101HOSEQ7HOINDCIndicationSTROKECRF 6.2.6 Medical HistoryMH – Description/OverviewThe medical history dataset includes the subject's prior history at the start of the trial. Examples of subject medical history information could include general medical history, gynecological history, and primary diagnosis. MH – Specificationmh.xpt, Medical History — Events, Version 3.3. One record per medical history event per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar
Describes the end of the event relative to the sponsor-defined reference period. The sponsor-defined reference period is a continuous period of time defined by a discrete starting point and a discrete ending point (represented by RFSTDTC and RFENDTC in Demographics). Not all values of the codelist are allowable for this variable. See Section 4.4.7, Use of Relative Timing Variables. PermMHENRTPTEnd Relative to Reference Time PointChar(STENRF)TimingIdentifies the end of the event as being before or after the reference time point defined by variable MHENTPT. Not all values of the codelist are allowable for this variable. See Section 4.4.7, Use of Relative Timing Variables. PermMHENTPTEnd Reference Time PointCharTimingDescription or date/time in ISO 8601 character format of the reference point referred to by MHENRTPT. Examples: "2003-12-25" or "VISIT 2".Perm ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). MH – Assumptions
MH – ExamplesExample In this example, a General Medical History CRF collected verbatim descriptions of conditions and events by body system (e.g., Endocrine, Metabolic), did not collect start date, but asked whether or not the condition was ongoing at the time of the visit. Another CRF page was used for cardiac history events. This page asked for date of onset of symptoms and date of diagnosis, but did not include the ongoing question. Rows 1-3:MHCAT indicates that these data were collected on the General Medical History CRF, and MHSCAT indicates the body system for which the event was collected. The reported events were coded using a standard dictionary. MHDECOD and MHBODSYS display the preferred term and body system assigned through the coding process. MHENRTPT was populated based on the response to the "Ongoing" question on the General Medical History CRF. MHENTPT displays the reference date for MHENRTPT, that is, the date the information was collected. If "Yes" was specified for Ongoing, MHENRTPT = "ONGOING"; if "No" was checked, MHENRTPT = "BEFORE". See Section 4.4.7, Use of Relative Timing Variables, for further guidance.Rows 4-5:MHCAT indicates that these data were collected on the Cardiac Medical History CRF. Since two kinds of start date were collected for congestive heart failure, there are two records for this event, one with the start date for which MHEVDTYP = "SYMPTOM ONSET" and one with the start date for which MHEVDTYP = "DIAGNOSIS". The sponsor grouped these two records using the MHGRPID value "CHF". mh.xpt RowSTUDYIDDOMAINUSUBJIDMHSEQMHGRPIDMHTERMMHDECODMHEVDTYPMHCATMHSCATMHBODSYSMHSTDTCMHENRTPTMHENTPT1ABC123MH1231011 Example In this example, data from three CRF modules related to medical history were collected:
In all of the records shown below, MHCAT is populated with the CRF module (general medical history, stroke history, or risk factors) through which the data were collected. MHPRESP and MHOCCUR were populated only when the term was pre-specified, in keeping with MH Assumption 4. Rows 1-3:Show records from the general medical history CRF. MHSCAT displays the body systems specified on the CRF. The coded terms are represented in MHDECOD. MHENRF has been populated based on the response to the "Ongoing at Study Start" question on the CRF. If "Yes" was specified, MHENRF = "DURING/AFTER"; if "No" was checked, MHENRF = "BEFORE". See Section 4.4.7, Use of Relative Timing Variables, for further guidance on using --STRF and --ENRF.Row 4:Shows the record from the stroke history CRF. MHSTDTC was populated with the date and time at which the event occurred.Rows 5-8:Show records from the risk factors CRF. MHPRESP values of "Y" indicate that each risk factor was pre-specified on the CRF. MHOCCUR is populated with "Y" or "N", corresponding to the CRF response to the questions for the four pre-specified risk factors. The terms used to describe these risk factors were chosen to have associated codes in the standard dictionary. mh.xpt RowSTUDYIDDOMAINUSUBJIDMHSEQMHTERMMHDECODMHCATMHSCATMHPRESPMHOCCURMHBODSYSMHSTDTCMHENRF1ABC123MH1231011ASTHMAAsthmaGENERAL MEDICAL HISTORYRESPIRATORY Example This is an example of a medical history CRF where the history of specific (pre-specified) conditions is solicited. The conditions were not coded using a standard dictionary. The data were collected as part of the Screening visit. Rows 1-9:MHPRESP = "Y" indicates that these conditions were specifically queried. Presence or absence of the condition is represented in MHOCCUR.Row 10:There was also a specific question about ASTHMA, as indicated by MHPRESP = "Y", but this question was not asked. Since the question was not asked, MHOCCUR is null and MHSTAT = "NOT DONE". In this case, a reason for the absence of a response was collected, and this is represented in MHREASND. mh.xpt RowSTUDYIDDOMAINUSUBJIDMHSEQMHTERMMHDECODMHPRESPMHOCCURMHSTATMHREASNDVISITNUMVISITMHDTCMHDY1ABC123MH1010021HISTORY OF EARLY CORONARY ARTERY DISEASE (<55 YEARS OF AGE)Coronary Artery DiseaseYN Example This diabetes study included subjects with both Type 1 diabetes and Type 2 diabetes. Data collection included which kind of diabetes the subject had and the date of diagnosis of the condition. Rows 1-2:Show that subject XYZ-001-001 had Type 1 diabetes, and did not have Type 2 diabetes. The fact that the start date in Row 1 is the date of diagnosis is indicated by MHEVDTYP = "DIAGNOSIS". Since this subject did not have Type 2 diabetes, no start date for Type 2 diabetes was collected, so MHEVDTYP in Row 2 is blank.Rows 3-4:Show that subject XYZ-001-002 had Type 2 diabetes, and did not have Type 1 diabetes. The fact that the start date in Row 4 is the date of diagnosis is indicated by MHEVDTYP = "DIAGNOSIS". mh.xpt RowSTUDYIDDOMAINUSUBJIDMHSEQMHTERMMHDECODMHEVDTYPMHCATMHPRESPMHOCCURMHDTCMHSTDTC1XYZMHXYZ-001-0011TYPE 1 DIABETES MELLITUSType 1 diabetes mellitusDIAGNOSISDIABETESYY2010-09-262010-03-252XYZMHXYZ-001-0012TYPE 2 DIABETES MELLITUSType 2 diabetes mellitus 6.3 Models for Findings DomainsMost subject-level observations collected during the study should be represented according to one of the three SDTM general observation classes. This is the list of domains corresponding to the Findings class. Domain CodeDomain DescriptionDA Drug Accountability A findings domain that contains the accountability of study drug, such as information on the receipt, dispensing, return, and packaging. DDDeath Details A findings domain that contains the diagnosis of the cause of death for a subject. EGECG Test Results A findings domain that contains ECG data, including position of the subject, method of evaluation, all cycle measurements and all findings from the ECG including an overall interpretation if collected or derived. IEInclusion/Exclusion Criteria Not Met A findings domain that contains those criteria that cause the subject to be in violation of the inclusion/exclusion criteria. ISImmunogenicity Specimen Assessments A findings domain for assessments that determine whether a therapy induced an immune response. LBLaboratory Test Results A findings domain that contains laboratory test data such as hematology, clinical chemistry and urinalysis. This domain does not include microbiology or pharmacokinetic data, which are stored in separate domains. MB and MSMicrobiology Domains Microbiology Specimen (MB) A findings domain that represents non-host organisms identified including bacteria, viruses, parasites, protozoa and fungi. Microbiology Susceptibility (MS) A findings domain that represents drug susceptibility testing results only. This includes phenotypic testing (where drug is added directly to a culture of organisms) and genotypic tests that provide results in terms of susceptible or resistant. Drug susceptibility testing may occur on a wide variety of non-host organisms, including bacteria, viruses, fungi, protozoa and parasites. MIMicroscopic Findings A findings domain that contains histopathology findings and microscopic evaluations. MOMorphology A domain relevant to the science of the form and structure of an organism or of its parts. The MO domain was originally created to hold all macroscopic results, but is expected to be deprecated in a later version of the SDTMIG. Submissions using that later SDTMIG version would represent morphology results in the appropriate body system-based physiology/morphology domain. For data prepared using a version of the SDTMIG that includes both the MO domain and body system-based physiology/morphology domains, morphology findings may be represented in either the MO domain or in a body-system based physiology/morphology domain. Custom body system-based domains may be used if the appropriate body system-based domain is not included in the SDTMIG version being used. CV, MK, NV, OE, RP, RE and URMorphology/Physiology Domains Cardiovascular System Findings (CV) A findings domain that contains physiological and morphological findings related to the cardiovascular system, including the heart, blood vessels and lymphatic vessels. Musculoskeletal System Findings (MK) A findings domain that contains physiological and morphological findings related to the system of muscles, tendons, ligaments, bones, joints, and associated tissues. Nervous System Findings (NV) A findings domain that contains physiological and morphological findings related to the nervous system, including the brain, spinal cord, the cranial and spinal nerves, autonomic ganglia and plexuses. Ophthalmic Examinations (OE) A findings domain that contains tests that measure a person's ocular health and visual status, to detect abnormalities in the components of the visual system, and to determine how well the person can see. Reproductive System Findings (RP) A findings domain that contains physiological and morphological findings related to the male and female reproductive systems. Respiratory System Findings (RE) A findings domain that contains physiological and morphological findings related to the respiratory system, including the organs that are involved in breathing such as the nose, throat, larynx, trachea, bronchi and lungs. Urinary System Findings (UR) A findings domain that contains physiological and morphological findings related to the urinary tract, including the organs involved in the creation and excretion of urine such as the kidneys, ureters, bladder and urethra. PC and PPPharmacokinetics Pharmacokinetics Concentrations (PC) A findings domain that contains concentrations of drugs or metabolites in fluids or tissues as a function of time. Pharmacokinetics Parameters (PP) A findings domain that contains pharmacokinetic parameters derived from pharmacokinetic concentration-time (PC) data. PEPhysical Examination A findings domain that contains findings observed during a physical examination where the body is evaluated by inspection, palpation, percussion, and auscultation. FT, QS, and RSQuestionnaires, Ratings and Scales Functional Tests (FT) A findings domain that contains data for named, stand-alone, task-based evaluations designed to provide an assessment of mobility, dexterity, or cognitive ability. Questionnaires (QS) A findings domain that contains data for named, stand-alone instruments designed to provide an assessment of a concept. Questionnaires have a defined standard structure, format, and content; consist of conceptually related items that are typically scored; and have documented methods for administration and analysis. Disease Response and Clin Classification (RS) A findings domain for the assessment of disease response to therapy, or clinical classification based on published criteria. SCSubject Characteristics A findings domain that contains subject-related data not collected in other domains. SSSubject Status A findings domain that contains general subject characteristics that are evaluated periodically to determine if they have changed. TU and TRTumor/Lesion Domains Tumor/Lesion Identification (TU) A findings domain that represents data that uniquely identifies tumors or lesions under study. Tumor/Lesion Results (TR) A findings domain that represents quantitative measurements and/or qualitative assessments of the tumors or lesions identified in the tumor/lesion identification (TU) domain. VSVital Signs A findings domain that contains measurements including but not limited to blood pressure, temperature, respiration, body surface area, body mass index, height and weight. 6.3.1 Drug AccountabilityDA – Description/OverviewA findings domain that contains the accountability of study drug, such as information on the receipt, dispensing, return, and packaging. DA – Specificationda.xpt, Drug Accountability — Findings, Version 3.3. One record per drug accountability finding per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar
Timing
TimingPlanned study day of the visit, based upon RFSTDTC in Demographics.PermTAETORDPlanned Order of Element within ArmNum TimingNumber that gives the planned order of the Element within the Arm.PermEPOCHEpochChar(EPOCH)TimingEpoch associated with the start date/time of the observation, or the date/time of collection if start date/time is not collected.PermDADTCDate/Time of CollectionCharISO 8601TimingDate and time of the drug accountability assessment represented in ISO 8601 character format.ExpDADYStudy Day of Visit/Collection/ExamNum Timing
¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). DA – Assumptions
DA – ExamplesExample This example shows drug accounting for a study with two study meds and one rescue med, all of which were measured in tablets. The sponsor chose to add EPOCH from the list of timing variables and to use DASPID and DAREFID for code numbers that appeared on the label. da.xpt RowSTUDYIDDOMAINUSUBJIDDASEQDAREFIDDASPIDDATESTCDDATESTDACATDASCATDAORRESDAORRESUDASTRESCDASTRESNDASTRESUVISITNUMEPOCHDADTC1ABCDAABC-010011XBYCC-E990AA375827DISAMTDispensed AmountStudy MedicationBottle A30TABLET3030TABLET1Study Med Period 12004-06-152ABCDAABC-010012XBYCC-E990AA375827RETAMTReturned AmountStudy MedicationBottle A5TABLET55TABLET2Study Med Period 12004-07-153ABCDAABC-010013XBYCC-E990BA227588DISAMTDispensed AmountStudy MedicationBottle B15TABLET1515TABLET1Study Med Period 12004-06-154ABCDAABC-010014XBYCC-E990BA227588RETAMTReturned AmountStudy MedicationBottle B0TABLET00TABLET2Study Med Period 12004-07-155ABCDAABC-010011 Example In this study, drug containers, rather than their contents, were being accounted for and the sponsor did not track returns. In this case, the purpose of the accountability tracking is to verify that the containers dispensed were consistent with the randomization. The sponsor chose to use DASPID to record the identifying number of the container dispensed. da.xpt RowSTUDYIDDOMAINUSUBJIDDASEQDASPIDDATESTCDDATESTDACATDASCATDAORRESDAORRESUDASTRESCDASTRESNDASTRESUVISITNUMDADTC1ABCDAABC/010011AB001DISPAMTDispensed AmountStudy MedicationDrug A1CONTAINER11CONTAINER12004-06-152ABCDAABC/010011AB002DISPAMTDispensed AmountStudy MedicationDrug B1CONTAINER11CONTAINER12004-06-15 6.3.2 Death DetailsDD – Description/OverviewA findings domain that contains the diagnosis of the cause of death for a subject. The domain is designed to hold supplemental data that are typically collected when a death occurs, such as the official cause of death. It does not replace existing data such as the SAE details in AE. Furthermore, it does not introduce a new requirement to collect information that is not already indicated as Good Clinical Practice or defined in regulatory guidelines. Instead, it provides a consistent place within SDTM to hold information that previously did not have a clearly defined home. DD – Specificationdd.xpt, Death Details — Findings, Version 3.3. One record per finding per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). DD – Assumptions
DD – ExamplesExample This example shows the primary cause of death for three subjects. The CRF also collected the location of the subject's death and a secondary cause of death. Rows 1-2:Show the primary cause of death and location of death for a subject. DDDTC is the date of assessment.Rows 3-4:Show records for primary cause of death and location of death for another subject for whom the information was not known.Rows 4-6:Show primary and secondary cause of death and location of death for a third subject. dd.xpt RowSTUDYIDDOMAINUSUBJIDDDSEQDDTESTCDDDTESTDDORRESDDSTRESCDDDTC1ABC123DDABC123010011PRCDTHPrimary Cause of DeathSUDDEN CARDIAC DEATHSUDDEN CARDIAC DEATH2011-01-122ABC123DDABC123010012LOCDTHLocation of DeathHOMEHOME2011-01-123ABC123DDABC123010021PRCDTHPrimary Cause of DeathUNKNOWNUNKNOWN2011-03-154ABC123DDABC123010022LOCDTHLocation of DeathUNKNOWNUNKNOWN2011-03-155ABC123DDABC123010231PRCDTHPrimary Cause of DeathCARDIAC ARRHYTHMIACARDIAC ARRHYTHMIA2011-09-096ABC123DDABC123010232SECDTHSecondary Cause of DeathCHFCONGESTIVE HEART FAILURE2011-09-097ABC123DDABC123010233LOCDTHLocation of DeathMEMORIAL HOSPITALHOSPITAL2011-09-09 Example This example illustrates how the DD, DS, and AE data for a subject are linked using RELREC. Note that each of these domains serves a different purpose, even though the information is related. This subject had a fatal adverse event, represented in the AE domain. ae.xpt RowSTUDYIDDOMAINUSUBJIDAESEQAETERMAESTDTCAEENDTCAEDECODAEBODSYSAEOUTAESERAESDTH1ABC123AEABC123010016SUDDEN CARDIAC DEATH2011-01-102011-01-10SUDDEN CARDIAC DEATHCARDIOVASCULAR SYSTEMFATALYY The primary cause of death was collected and is represented in DD. In this case, the result for primary cause of death is the same as the term in the AE record. dd.xpt RowSTUDYIDDOMAINUSUBJIDDDSEQDDTESTCDDDTESTDDORRESDDSTRESCDDDTC1ABC123DDABC123010011PRCDTHPrimary Cause of DeathSUDDEN CARDIAC DEATHSUDDEN CARDIAC DEATH2011-01-12 The subject's death was also represented in the DS domain as the reason for their withdrawal from the study. Rows 1-3:Show typical protocol milestones and disposition events.Row 4:Shows the date the death event occurred (DSSTDTC) and was recorded (DSDTC). ds.xpt RowSTUDYIDDOMAINUSUBJIDDSSEQDSTERMDSDECODDSCATDSDTCDSSTDTC1ABC123DSABC123010011INFORMED CONSENT OBTAINEDINFORMED CONSENT OBTAINEDPROTOCOL MILESTONE2011-01-022011-01-022ABC123DSABC123010012COMPLETEDCOMPLETEDDISPOSITION EVENT2011-01-032011-01-033ABC123DSABC123010013RANDOMIZEDRANDOMIZEDPROTOCOL MILESTONE2011-01-032011-01-034ABC123DSABC123010014SUDDEN CARDIAC DEATHDEATHDISPOSITION EVENT2011-01-102011-01-10 The relationship between the DS, AE, and DD records that reflect the subject's death is represented in RELREC. relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABC123DSABC12301001DSSEQ4 6.3.3 ECG Test ResultsEG – Description/OverviewA findings domain that contains ECG data, including position of the subject, method of evaluation, all cycle measurements and all findings from the ECG including an overall interpretation if collected or derived. EG – Specificationeg.xpt, ECG Test Results — Findings, Version 3.3. One record per ECG observation per replicate per time point or one record per ECG observation per beat per visit per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar Short name of the measurement, test, or examination described in EGTEST. It can be used as a column name when converting a dataset from a vertical to a horizontal format. The value in EGTESTCD cannot be longer than 8 characters, nor can it start with a number (e.g., "1TEST" is not valid). EGTESTCD cannot contain characters other than letters, numbers, or underscores. Examples : "PRAG", "QRSAG". Test codes are in two separate codelists, one for tests based on regular 10-second ECGs (EGTESTCD) and one for tests based on Holter monitoring (HETESTCD). ReqEGTESTECG Test or Examination NameChar(EGTEST)(HETEST)Synonym QualifierVerbatim name of the test or examination used to obtain the measurement or finding. The value in EGTEST cannot be longer than 40 characters. Examples: "PR Interval, Aggregate", "QRS Duration, Aggregate". Test names are in two separate codelists, one for tests based on regular 10-second ECGs (EGTEST) and one for tests based on Holter monitoring (HETEST). ReqEGCATCategory for ECGChar*Grouping QualifierUsed to categorize ECG observations across subjects. Examples: "MEASUREMENT", "FINDING", "INTERVAL".PermEGSCATSubcategory for ECGChar*Grouping QualifierA further categorization of the ECG.PermEGPOSECG Position of SubjectChar(POSITION)Record QualifierPosition of the subject during a measurement or examination. Examples: "SUPINE", "STANDING", "SITTING".PermEGBEATNOECG Beat NumberNumVariable QualifierA sequence number that identifies the beat within an ECG.PermEGORRESResult or Finding in Original UnitsChar Result QualifierResult of the ECG measurement or finding as originally received or collected. Examples of expected values are "62" or "0.151" when the result is an interval or measurement, or "ATRIAL FIBRILLATION" or "QT PROLONGATION" when the result is a finding.ExpEGORRESUOriginal UnitsChar(UNIT)Variable QualifierOriginal units in which the data were collected. The unit for EGORRES. Examples: "sec" or "msec".PermEGSTRESCCharacter Result/Finding in Std FormatChar(EGSTRESC)(HESTRESC)Result Qualifier Contains the result value for all findings, copied or derived from EGORRES in a standard format or standard units. EGSTRESC should store all results or findings in character format; if results are numeric, they should also be stored in numeric format in EGSTRESN. For example, if a test has results of "NONE", "NEG", and "NEGATIVE" in EGORRES and these results effectively have the same meaning, they could be represented in standard format in EGSTRESC as "NEGATIVE". For other examples, see general assumptions. Additional examples of result data: "SINUS BRADYCARDIA", "ATRIAL FLUTTER", "ATRIAL FIBRILLATION". Test results are in two separate codelists, one for tests based on regular 10-second ECGs (EGSTRESC) and one for tests based on Holter monitoring (HESTRESC). ExpEGSTRESNNumeric Result/Finding in Standard UnitsNumResult QualifierUsed for continuous or numeric results or findings in standard format; copied in numeric format from EGSTRESC. EGSTRESN should store all numeric test results or findings.PermEGSTRESUStandard UnitsChar(UNIT)Variable QualifierStandardized units used for EGSTRESC and EGSTRESN.PermEGSTATCompletion StatusChar(ND)Record QualifierUsed to indicate an ECG was not done, or an ECG measurement was not taken. Should be null if a result exists in EGORRES.PermEGREASNDReason ECG Not DoneChar Record QualifierDescribes why a measurement or test was not performed. Examples: "BROKEN EQUIPMENT" or "SUBJECT REFUSED". Used in conjunction with EGSTAT when value is "NOT DONE".PermEGXFNECG External File PathChar Record QualifierFile name and path for the external ECG waveform file.PermEGNAMVendor NameChar Record QualifierName or identifier of the laboratory or vendor who provided the test results.PermEGMETHODMethod of Test or ExaminationChar(EGMETHOD)Record QualifierMethod of the ECG test. Example: "12 LEAD STANDARD".PermEGLEADLead Location Used for MeasurementChar(EGLEAD)Record QualifierThe lead used for the measurement. Examples: "LEAD 1", "LEAD 2", "LEAD 3", "LEAD rV2", "LEAD V1".PermEGLOBXFLLast Observation Before Exposure FlagChar(NY)Record QualifierOperationally-derived indicator used to identify the last non-missing value prior to RFXSTDTC. The value should be "Y" or null.ExpEGBLFLBaseline FlagChar(NY)Record QualifierIndicator used to identify a baseline value. Should be "Y" or null. Note that EGBLFL is retained for backward compatibility. The authoritative baseline for statistical analysis is in an ADaM dataset.PermEGDRVFLDerived FlagChar(NY)Record QualifierUsed to indicate a derived record. The value should be "Y" or null. Records that represent the average of other records, or that do not come from the CRF, or are not as originally collected or received are examples of records that would be derived for the submission datasets. If EGDRVFL = "Y", then EGORRES could be null, with EGSTRESC and EGSTRESN (if the result is numeric) having the derived value.PermEGEVALEvaluatorChar(EVAL)Record QualifierRole of the person who provided the evaluation. Used only for results that are subjective (e.g., assigned by a person or a group). Should be null for records that contain collected or derived data. Examples: "INVESTIGATOR", "ADJUDICATION COMMITTEE", "VENDOR".PermEGEVALIDEvaluator IdentifierChar(MEDEVAL)Variable QualifierUsed to distinguish multiple evaluators with the same role recorded in EGEVAL. Examples: "RADIOLOGIST 1" or "RADIOLOGIST 2".PermEGREPNUMRepetition NumberNum Record QualifierThe incidence number of a test that is repeated within a given timeframe for the same test. The level of granularity can vary, e.g., within a time point or within a visit. For example, multiple measurements of blood pressure or multiple analyses of a sample.PermVISITNUMVisit NumberNum Timing
Timing
TimingPlanned study day of the visit based upon RFSTDTC in Demographics.PermTAETORDPlanned Order of Element within ArmNum TimingNumber that gives the planned order of the Element within the Arm for the element in which the assessment was made.PermEPOCHEpochChar(EPOCH)TimingEpoch associated with the date/time at which the assessment was made.PermEGDTCDate/Time of ECGCharISO 8601TimingDate/Time of ECG.ExpEGDYStudy Day of ECGNum Timing
Timing
TimingNumerical version of EGTPT to aid in sorting.PermEGELTMPlanned Elapsed Time from Time Point RefCharISO 8601TimingPlanned elapsed time (in ISO 8601) relative to a fixed time point reference (EGTPTREF). Not a clock time or a date time variable. Represented as an ISO 8601 duration. Examples: "-PT15M" to represent the period of 15 minutes prior to the reference point indicated by EGTPTREF, or "PT8H" to represent the period of 8 hours after the reference point indicated by EGTPTREF.PermEGTPTREFTime Point ReferenceChar TimingName of the fixed reference point referred to by EGELTM, EGTPTNUM, and EGTPT. Examples: "PREVIOUS DOSE", "PREVIOUS MEAL".PermEGRFTDTCDate/Time of Reference Time PointCharISO 8601TimingDate/time for a fixed reference time point defined by EGTPTREF.Perm ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). EG – Assumptions
EG - ExamplesExample This example shows ECG measurements and other findings from one ECG for one subject. EGCAT has been used to group tests. Row 1:Shows a measurement of ventricular rate.Rows 2-4:These interval measurements were collected in seconds. However, in this submission, the standard unit for these tests was milliseconds, so the results have been converted in EGSTRESC and EGSTRESN.Rows 5-6:Show "QTcB Interval, Aggregate" and "QTcF Interval, Aggregate". These results were derived by the sponsor, as indicated by the "Y" in the EGDRVFL column. Note that EGORRES is null for these derived records.Rows 7-10:Show results from tests looking for certain kinds of abnormalities, which have been grouped using EGCAT = "FINDINGS".Row 11:Shows a technical problem represented as the result of the test "Technical Quality". Results of this test can be important to the overall understanding of an ECG, but are not truly findings or interpretations about the subject's heart function.Row 12:Shows the result of the TEST "Interpretation" (i.e., the interpretation of the ECG strip as a whole), which for this ECG was "ABNORMAL". eg.xpt RowSTUDYIDDOMAINUSUBJIDEGSEQEGREFIDEGTESTCDEGTESTEGCATEGPOSEGORRESEGORRESUEGSTRESCEGSTRESNEGSTRESUEGXFNEGNAMEGDRVFLEGEVALVISITNUMVISITEGDTCEGDY1XYZEGXYZ-US-701-0021334PT89EGHRMNECG Mean Heart RateMEASUREMENTSUPINE62beats/min6262beats/minPQW436789-07.xmlTest Lab For some tests, clinical significance was collected. These assessments of clinical significance were represented in supplemental qualifier records. Row 1:Shows that the record in the EG dataset with EGSEQ = "1" (the record showing a ventricular rate of 62 bpm), was assessed as having a value of "N" for the variable EGCLSIG. In other words, the result was not clinically significant.Row 2:Shows that the record in the EG dataset with EGSEQ = "2" (the record showing a PR interval of 0.15 sec), was assessed as being clinically significant. suppeg.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIGQEVAL1XYZEGXYZ-US-701-002EGSEQ1EGCLSIGClinically SignificantNCRF Example This example shows ECG results where only the overall assessment was collected. Results are for one subject across multiple visits. In addition, the ECG interpretation was provided by the investigator and, when necessary, by a cardiologist. EGGRPID is used to group the overall assessments collected on each ECG. Rows 1-3:Show interpretations performed by the principal investigation on three different occasions. The ECG at Visit "SCREEN 2" has been flagged as the last observation before start of study treatment.Rows 4-5:Show interpretations of the same ECG by both the investigator and a cardiologist. EGGRPID has been used to group these two records to emphasize their relationship. eg.xpt RowSTUDYIDDOMAINUSUBJIDEGSEQEGGRPIDEGTESTCDEGTESTEGPOSEGORRESEGSTRESCEGSTRESNEGLOBXFLEGEVALVISITNUMVISITVISITDYEGDTCEGDY1ABCEGABC-99-CA-4561 Example This example shows 10-second ECG replicates extracted from a continuous recording. The example shows one subject's (USUBJID = "2324-P0001") extracted 10-second ECG replicate results. Three replicates were extracted for planned time points "1 HR" and "2 HR"; EGREPNUM is used to identify the replicates. Summary mean measurements are reported for the 10 seconds of extracted data for each replicate. EGDTC is the date/time of the first individual beat in the extracted 10-second ECG. In order to save space, some permissible variables (EGREFID, VISITDY, EGTPTNUM, EGTPTREF, EGRFTDTC) have been omitted, as marked by ellipses. eg.xpt RowSTUDYIDDOMAINUSUBJIDEGSEQ…EGTESTCDEGTESTEGCATEGPOSEGORRESEGORRESUEGSTRESCEGSTRESNEGSTRESUEGLEADEGMETHODVISITNUMVISITEGDTCEGTPT…EGREPNUM1STUDY01EG2324-P00011…PRAGPR Interval, AggregateINTERVALSUPINE176msec176176msecLEAD II12 LEAD STANDARD2VISIT 22014-03-22T10:00:211 HR…12STUDY01EG2324-P00012…RRAGRR Interval, AggregateINTERVALSUPINE658msec658658msecLEAD II12 LEAD STANDARD2VISIT 22014-03-22T10:00:211 HR…13STUDY01EG2324-P00013…QRSAGQRS Duration, AggregateINTERVALSUPINE97msec9797msecLEAD II12 LEAD STANDARD2VISIT 22014-03-22T10:00:211 HR…14STUDY01EG2324-P00014…QTAGQT Interval, AggregateINTERVALSUPINE440msec440440msecLEAD II12 LEAD STANDARD2VISIT 22014-03-22T10:00:211 HR…15STUDY01EG2324-P00015…PRAGPR Interval, AggregateINTERVALSUPINE176msec176176msecLEAD II12 LEAD STANDARD2VISIT 22014-03-22T10:01:351 HR…26STUDY01EG2324-P00016…RRAGRR Interval, AggregateINTERVALSUPINE679msec679679msecLEAD II12 LEAD STANDARD2VISIT 22014-03-22T10:01:351 HR…27STUDY01EG2324-P00017…QRSAGQRS Duration, AggregateINTERVALSUPINE95msec9595msecLEAD II12 LEAD STANDARD2VISIT 22014-03-22T10:01:351 HR…28STUDY01EG2324-P00018…QTAGQT Interval, AggregateINTERVALSUPINE389msec389389msecLEAD II12 LEAD STANDARD2VISIT 22014-03-22T10:01:351 HR…29STUDY01EG2324-P00019…PRAGPR Interval, AggregateINTERVALSUPINE169msec169169msecLEAD II12 LEAD STANDARD2VISIT 22014-03-22T10:02:141 HR…310STUDY01EG2324-P000110…RRAGRR Interval, AggregateINTERVALSUPINE661msec661661msecLEAD II12 LEAD STANDARD2VISIT 22014-03-22T10:02:141 HR…311STUDY01EG2324-P000111…QRSAGQRS Duration, AggregateINTERVALSUPINE90msec9090msecLEAD II12 LEAD STANDARD2VISIT 22014-03-22T10:02:141 HR…312STUDY01EG2324-P000112…QTAGQT Interval, AggregateINTERVALSUPINE377msec377377msecLEAD II12 LEAD STANDARD2VISIT 22014-03-22T10:02:141 HR…313STUDY01EG2324-P000113…PRAGPR Interval, AggregateINTERVALSUPINE176msec176176msecLEAD II12 LEAD STANDARD2VISIT 22014-03-22T11:00:212 HR…114STUDY01EG2324-P000114…RRAGRR Interval, AggregateINTERVALSUPINE771msec771771msecLEAD II12 LEAD STANDARD2VISIT 22014-03-22T11:00:212 HR…115STUDY01EG2324-P000115…QRSAGQRS Duration, AggregateINTERVALSUPINE100msec100100msecLEAD II12 LEAD STANDARD2VISIT 22014-03-22T11:00:212 HR…116STUDY01EG2324-P000116…QTAGQT Interval, AggregateINTERVALSUPINE379msec379379msecLEAD II12 LEAD STANDARD2VISIT 22014-03-22T11:00:212 HR…117STUDY01EG2324-P000117…PRAGPR Interval, AggregateINTERVALSUPINE179msec179179msecLEAD II12 LEAD STANDARD2VISIT 22014-03-22T11:01:312 HR…218STUDY01EG2324-P000118…RRAGRR Interval, AggregateINTERVALSUPINE749msec749749msecLEAD II12 LEAD STANDARD2VISIT 22014-03-22T11:01:312 HR…219STUDY01EG2324-P000119…QRSAGQRS Duration, AggregateINTERVALSUPINE103msec103103msecLEAD II12 LEAD STANDARD2VISIT 22014-03-22T11:01:312 HR…220STUDY01EG2324-P000120…QTAGQT Interval, AggregateINTERVALSUPINE402msec402402msecLEAD II12 LEAD STANDARD2VISIT 22014-03-22T11:01:312 HR…221STUDY01EG2324-P000121…PRAGPR Interval, AggregateINTERVALSUPINE175msec175175msecLEAD II12 LEAD STANDARD2VISIT 22014-03-22T11:02:402 HR…322STUDY01EG2324-P000122…RRAGRR Interval, AggregateINTERVALSUPINE771msec771771msecLEAD II12 LEAD STANDARD2VISIT 22014-03-22T11:02:402 HR…323STUDY01EG2324-P000123…QRSAGQRS Duration, AggregateINTERVALSUPINE98msec9898msecLEAD II12 LEAD STANDARD2VISIT 22014-03-22T11:02:402 HR…324STUDY01EG2324-P000124…QTAGQT Interval, AggregateINTERVALSUPINE356msec356356msecLEAD II12 LEAD STANDARD2VISIT 22014-03-22T11:02:402 HR…3 Example The example shows one subject's continuous beat-to-beat EG results. Only 3 beats are shown, but there could be measurements for, as an example, 101,000 complexes in 24 hours. The actual number of complexes in 24 hours can be variable and depends on average heart rate. The results are mapped to the EG (ECG Test Results) domain using EGBEATNO. If there is no result to be reported, then the row would not be included. Rows 1-2:Show the first beat recorded. The first beat was considered to be the beat for which the recording contained a complete P-wave. It was assigned EGBEATNO = "1". There is no RR measurement for this beat because RR is measured as the duration (time) between the peak of the R-wave in the reported single beat and peak of the R-wave in the preceding single beat, and the partial recording that preceded EGBEATNO = "1" did not contain an R-wave. EGDTC was the date/time of the individual beat.Rows 3-5:EGBEATNO = "2" had an RR measurement, since the R-wave of the preceding beat (EGBEATNO = "1") was recorded.Rows 6-8:There is a 1-hour gap between beats 2 and 3 due to electrical interference or other artifacts that prevented measurements from being recorded. Note that EGBEATNO = "3" does have an RR measurement because the partial beat preceding EGBEATNO = "3" contained an R-wave. eg.xpt RowSTUDYIDDOMAINUSUBJIDEGSEQEGTESTCDEGTESTEGCATEGPOSEGBEATNOEGORRESEGORRESUEGSTRESCEGSTRESNEGSTRESUEGLEADEGMETHODVISITNUMVISITVISITDYEGDTC1STUDY01EG2324-P00011PRSBPR Interval, Single BeatINTERVALSUPINE1176msec176176msecLEAD II12 LEAD STANDARD1SCREENING-72014-02-11T14:32:12.32STUDY01EG2324-P00012QRSSBQRS Duration, Single BeatINTERVALSUPINE197msec9797msecLEAD II12 LEAD STANDARD1SCREENING-72014-02-11T14:32:12.33STUDY01EG2324-P00013PRSBPR Interval, Single BeatINTERVALSUPINE2176msec176176msecLEAD II12 LEAD STANDARD1SCREENING-72014-02-11T14:32:13.34STUDY01EG2324-P00014RRSMRR Interval, Single MeasurementINTERVALSUPINE2679msec679679msecLEAD II12 LEAD STANDARD1SCREENING-72014-02-11T14:32:13.35STUDY01EG2324-P00015QRSSBQRS Duration, Single BeatINTERVALSUPINE295msec9595msecLEAD II12 LEAD STANDARD1SCREENING-72014-02-11T14:32:13.36STUDY01EG2324-P00016PRSBPR Interval, Single BeatINTERVALSUPINE3169msec169169msecLEAD II12 LEAD STANDARD1SCREENING-72014-02-11T15:32:14.27STUDY01EG2324-P00017RRSMRR Interval, Single MeasurementINTERVALSUPINE3661msec661661msecLEAD II12 LEAD STANDARD1SCREENING-72014-02-11T15:32:14.28STUDY01EG2324-P00018QRSSBQRS Duration, Single BeatINTERVALSUPINE390msec9090msecLEAD II12 LEAD STANDARD1SCREENING-72014-02-11T15:32:14.2 6.3.4 Inclusion/Exclusion Criteria Not MetIE – Description/OverviewA findings domain that contains those criteria that cause the subject to be in violation of the inclusion/exclusion criteria. IE – Specificationie.xpt, Inclusion/Exclusion Criteria Not Met — Findings, Version 3.2. One record per inclusion/exclusion criterion not met per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar
Timing
TimingPlanned study day of the visit based upon RFSTDTC in Demographics.PermTAETORDPlanned Order of Element within ArmNum TimingNumber that gives the planned order of the Element within the Arm for the Element in which the assessment was made.PermEPOCHEpochChar(EPOCH)TimingEpoch associated with the observation date/time of the inclusion/exclusion finding.PermIEDTCDate/Time of CollectionCharISO 8601TimingCollection date and time of the inclusion/exclusion criterion represented in ISO 8601 character format.PermIEDYStudy Day of CollectionNum Timing
¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). IE – Assumptions
IE – ExamplesExample This example shows records for three subjects who failed to meet all inclusion/exclusion criteria but who were included in the study. Rows 1-2:Show data for a subject with two inclusion/exclusion exceptions.Rows 3-4:Show data for two other subjects, both of whom failed the same inclusion criterion. ie.xpt RowSTUDYIDDOMAINUSUBJIDIESEQIESPIDIETESTCDIETESTIECATIEORRESIESTRESCVISITNUMVISITVISITDYIEDTCIEDY1XYZIEXYZ-0007117EXCL17Ventricular RateEXCLUSIONYY1WEEK -8-561999-01-10-582XYZIEXYZ-000723INCL03Acceptable mammogram from local radiologist?INCLUSIONNN1WEEK -8-561999-01-10-583XYZIEXYZ-004713INCL03Acceptable mammogram from local radiologist?INCLUSIONNN1WEEK -8-561999-01-12-564XYZIEXYZ-009613INCL03Acceptable mammogram from local radiologist?INCLUSIONNN1WEEK -8-561999-01-13-55 6.3.5 Immunogenicity Specimen AssessmentsIS – Description/OverviewA findings domain for assessments that determine whether a therapy induced an immune response. IS – Specificationis.xpt, Immunogenicity Specimen Assessments — Findings, Version 3.3. One record per test per visit per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar
Timing
TimingPlanned study day of the visit based upon RFSTDTC in Demographics.PermTAETORDPlanned Order of Element within ArmNum TimingNumber that gives the planned order of the Element within the Arm.PermEPOCHEpochChar(EPOCH)TimingEpoch associated with the start date/time of the observation, or the date/time of collection if start date/time is not collected.PermISDTCDate/Time of CollectionCharISO 8601TimingCollection date and time of an observation represented in ISO 8601.ExpISDYStudy Day of Visit/Collection/ExamNum TimingActual study day of visit/collection/exam expressed in integer days relative to sponsor-defined RFSTDTC in Demographics.Perm ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). IS – Assumptions
IS – ExamplesExample In this example, subjects were dosed with a Hepatitis C vaccine. Note that information about administration of the vaccine is found in the Exposure domain, not the Immunogenicity domain, so it is not included here. is.xpt RowSTUDYIDDOMAINUSUBJIDISSEQISTESTCDISTESTISCATISORRESISORRESUISSTRESCISSTRESNISSTRESUISSPECISMETHODISLOBXFLISLLOQVISITNUMVISITISDTCISDY1ABC-123IS1234571HCABHepatitis C Virus AntibodySEROLOGY3115.016gpELISA unit/mL3115.0163115.016gpELISA unit/mLSERUMENZYME IMMUNOASSAYY1001VISIT 12008-10-1012ABC-123IS1234572HCABHepatitis C Virus AntibodySEROLOGY1772.78gpELISA unit/mL1772.781772.78gpELISA unit/mLSERUMENZYME IMMUNOASSAY Example In this example, subject was dosed with the study product consisting of 0.5 mL of varicella vaccine. The immunogenic response of the study product, as well as the pneumococcal vaccine that was given concomitantly, was measured to ensure that immunogenicity of both vaccines was sufficient to provide protection. The measurements of antibody to the vaccines were represented in the IS domain. is.xpt RowSTUDYIDDOMAINUSUBJIDISSEQISTESTCDISTESTISCATISORRESISORRESUISSTRESCISSTRESNISSTRESUISSPECISMETHODISLOBXFLISLLOQVISITNUMVISITISDYISDTC1GHJ-456IS60171PNPSAB14Pneumococcal Polysacch AB Serotype 14SEROLOGY9.715ug/mL9.7159.715ug/mLSERUMENZYME IMMUNOASSAYY2.51VISIT 112010-02-062GHJ-456IS60172VZVABVaricella-Zoster Virus AntibodySEROLOGY141.616gpELISA unit/mL141.616141.616gpELISA unit/mLSERUMENZYME IMMUNOASSAYY2.51VISIT 112010-02-063GHJ-456IS60173PNPSAB14Pneumococcal Polysacch AB Serotype 14SEROLOGY13.244ug/mL13.24413.244ug/mLSERUMENZYME IMMUNOASSAY 6.3.6 Laboratory Test ResultsLB – Description/OverviewA findings domain that contains laboratory test data such as hematology, clinical chemistry and urinalysis. This domain does not include microbiology or pharmacokinetic data, which are stored in separate domains. LB – Specificationlb.xpt, Laboratory Test Results — Findings, Version 3.3. One record per lab test per time point per visit per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar
Record QualifierDescribes why a measurement or test was not performed, e.g., "BROKEN EQUIPMENT", "SUBJECT REFUSED", or "SPECIMEN LOST". Used in conjunction with LBSTAT when value is "NOT DONE".PermLBNAMVendor NameChar Record QualifierThe name or identifier of the laboratory that performed the test.PermLBLOINCLOINC CodeChar*Synonym Qualifier
Timing
Timing
TimingPlanned study day of the visit based upon RFSTDTC in Demographics.PermTAETORDPlanned Order of Element within ArmNum TimingNumber that gives the planned order of the Element within the Arm.PermEPOCHEpochChar(EPOCH)TimingEpoch associated with the start date/time of the observation, or the date/time of collection if start date/time is not collected.PermLBDTCDate/Time of Specimen CollectionCharISO 8601TimingDate/time of specimen collection represented in ISO 8601 character format.ExpLBENDTCEnd Date/Time of Specimen CollectionCharISO 8601TimingEnd date/time of specimen collection represented in ISO 8601 character format.PermLBDYStudy Day of Specimen CollectionNum Timing
TimingActual study day of end of observation expressed in integer days relative to the sponsor-defined RFSTDTC in Demographics.PermLBTPTPlanned Time Point NameChar Timing
TimingNumerical version of LBTPT to aid in sorting.PermLBELTMPlanned Elapsed Time from Time Point RefCharISO 8601TimingPlanned elapsed time (in ISO 8601) relative to a planned fixed reference (LBTPTREF). This variable is useful where there are repetitive measures. Not a clock time or a date/time variable. Represented as ISO 8601 duration. Examples: "-PT15M" to represent the period of 15 minutes prior to the reference point indicated by LBTPTREF, or "PT8H" to represent the period of 8 hours after the reference point indicated by LBTPTREF.PermLBTPTREFTime Point ReferenceChar TimingName of the fixed reference point referred to by LBELTM, LBTPTNUM, and LBTPT. Examples: PREVIOUS DOSE, PREVIOUS MEAL.PermLBRFTDTCDate/Time of Reference Time PointCharISO 8601TimingDate/time of the reference time point, LBTPTREF.Perm ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). LB – Assumptions
LB – ExamplesExample Row 1:Shows a value collected in one unit, but converted to selected standard unit. See Section 4.5.1, Original and Standardized Results of Findings and Tests Not Done for additional examples for the population of Result Qualifiers.Rows 1, 3, 5-8:LBLOBXFL = "Y" indicates that these were last observations before exposure to study treatment.Rows 2-3:Show two records (Rows 2 and 3) for Alkaline Phosphatase done at the same visit, one day apart.Row 4:Shows a derived record (average of the records 2 and 3) and flagged derived (LBDRVFL = "Y").Rows 6-7:Show a suggested use of the LBSCAT variable. It could be used to further classify types of tests within a laboratory panel (i.e., "DIFFERENTIAL").Row 9:Shows the proper use of the LBSTAT variable to indicate "NOT DONE", where a reason was collected when a test was not done.Row 10:The subject had cholesterol measured. The normal range for this test is <200 mg/dL. Note that the sponsor has decided to make LBSTNRHI = "199", however another sponsor may have chosen a different value.Row 12:Shows use of LBSTNRC for Urine Protein that is not reported as a continuous numeric result. lb.xpt RowSTUDYIDDOMAINUSUBJIDLBSEQLBTESTCDLBTESTLBCATLBSCATLBORRESLBORRESULBORNRLOLBORNRHILBSTRESCLBSTRESNLBSTRESULBSTNRLOLBSTNRHILBSTNRCLBNRINDLBSTATLBREASNDLBLOBXFLLBFASTLBDRVFLVISITNUMVISITLBDTC1ABCLBABC-001-0011ALBAlbuminCHEMISTRY The SUPPLB dataset example shows clinical significance assigned by the investigator for test results where LBNRIND (reference range indicator) is populated. supplb.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIGQEVAL1ABCLBABC-001-001LBSEQ1LBCLSIGClinical SignificanceNCRFINVESTIGATOR2ABCLBABC-001-001LBSEQ6LBCLSIGClinical SignificanceNCRFINVESTIGATOR Example Row 1:Shows an example of a pre-dose urine collection interval (from 4 hours prior to dosing until 15 minutes prior to dosing) with a negative value for LBELTM that reflects the end of the interval in reference to the fixed reference LBTPTREF, the date of which is recorded in LBRFTDTC.Rows 2-3:Show an example of postdose urine collection intervals with values for LBELTM that reflect the end of the intervals in reference to the fixed reference LBTPTREF, the date of which is recorded in LBRFTDTC. lb.xpt RowSTUDYIDDOMAINUSUBJIDLBSEQLBTESTCDLBTESTLBCATLBORRESLBORRESULBORNRLOLBORNRHILBSTRESCLBSTRESNLBSTRESULBSTNRLOLBSTNRHILBNRINDVISITNUMVISITLBDTCLBENDTCLBTPTLBTPTNUMLBELTMLBTPTREFLBRFTDTC1ABCLBABC-001-0011GLUCGlucoseURINALYSIS7mg/dL1150.390.39mmol/L0.10.8NORMAL2INITIAL DOSING1999-06-19T04:001999-06-19T07:45Pre-dose1-PT15MDosing1999-06-19T08:002ABCLBABC-001-0012GLUCGlucoseURINALYSIS11mg/dL1150.610.61mmol/L0.10.8NORMAL2INITIAL DOSING1999-06-19T08:001999-06-19T16:000-8 hours after dosing2PT8HDosing1999-06-19T08:003ABCLBABC-001-0013GLUCGlucoseURINALYSIS9mg/dL1150.50.5mmol/L0.10.8NORMAL2INITIAL DOSING1999-06-19T16:001999-06-20T00:008-16 hours after dosing3PT16HDosing1999-06-19T08:00 Example This is an example of pregnancy test records, one with a result and one with no result because the test was not performed because the subject was male. Row 1:Shows a pregnancy test with an original result of "-" (negative sign) standardized to the text value "NEGATIVE" in LBSTRESC.Row 2:Shows a pregnancy test that was not performed because the subject was male. The sponsor felt it was necessary to include a record documenting the reason why the test was not performed, rather than simply not including a record. lb.xpt RowSTUDYIDDOMAINUSUBJIDLBSEQLBTESTCDLBTESTLBCATLBORRESLBORRESULBORNRLOLBORNRHILBSTRESCLBSTRESNLBSTRESULBSTNRLOLBSTRNHILBNRINDLBSTATLBREASNDVISITNUMVISITLBDTC1ABCLBABC-001-0011HCGChoriogonadotropin BetaCHEMISTRY- 6.3.7 Microbiology DomainsThe microbiology domains consist of Microbiology Specimen (MB) and Microbiology Susceptibility (MS). The MB domain is used for the detection, identification, quantification, and other characterizations of microorganisms in subject samples, except for drug susceptibility testing. MS is used for representing data from drug susceptibility testing on the organisms identified in MB. All non-host infectious organisms, including bacteria, viruses, fungi, and parasites are appropriate for the microbiology domains. 6.3.7.1 Microbiology SpecimenMB – Description/OverviewA findings domain that represents non-host organisms identified including bacteria, viruses, parasites, protozoa and fungi. MB – Specificationmb.xpt, Microbiology Specimen — Findings, Version 3.3. One record per microbiology specimen finding per time point per visit per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar
Timing
TimingNumeric version of MBTPT used in sorting.PermMBELTMPlanned Elapsed Time from Time Point RefCharISO 8601TimingPlanned elapsed time (in ISO 8601) relative to a planned fixed reference (MBTPTREF). This variable is useful where there are repetitive measures. Not a clock time or a date time variable. Represented as an ISO 8601 duration. Examples: "-PT15M" to represent the period of 15 minutes prior to the reference point indicated by MBTPTREF, or "PT8H" to represent the period of 8 hours after the reference point indicated by MBTPTREF.PermMBTPTREFTime Point ReferenceChar TimingName of the fixed reference point referred to by MBELTM, MBTPTNUM, and MBTPT. Example: "PREVIOUS DOSE".PermMBRFTDTCDate/Time of Reference Time PointCharISO 8601TimingDate/time for a fixed reference time point, MBTPTREF.Perm ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parentheses). MB – Assumptions
6.3.7.2 Microbiology SusceptibilityMS – Description/OverviewA findings domain that represents drug susceptibility testing results only. This includes phenotypic testing (where drug is added directly to a culture of organisms) and genotypic tests that provide results in terms of susceptible or resistant. Drug susceptibility testing may occur on a wide variety of non-host organisms, including bacteria, viruses, fungi, protozoa and parasites. MS – Specificationms.xpt, Microbiology Susceptibility — Findings, Version 3.3. One record per microbiology susceptibility test (or other organism-related finding) per organism found in MB, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parentheses). MS – Assumptions
6.3.7.3 Microbiology Specimen/Microbiology Susceptibility ExamplesExample The example below shows that a central and local lab (MBNAM) both independently identified "ENTEROCOCCUS FAECALIS " (MBORRES) in a fluid specimen (MBSPEC) taken from the skin (MBLOC) of one subject at Visit 1. The method used by both labs was a solid microbial culture (MBMETHOD). The culture was not targeted to encourage the growth of a specific organism, thus the MBTESTCD/MBTEST is "MCORGIDN"/"Microbial Organism Identification" and MBORRES represents the name of the organism identified. mb.xpt RowSTUDYIDDOMAINUSUBJIDMBSEQMBREFIDMBLNKIDMBTESTCDMBTESTMBORRESMBSTRESCMBNAMMBSPECMBLOCMBMETHODVISITNUMVISITMBDTC1ABCMBABC-001-0021SPEC011MCORGIDNMicrobial Organism IdentificationENTEROCOCCUS FAECALISENTEROCOCCUS FAECALISCENTRAL LAB ABCFLUIDSKINMICROBIAL CULTURE, SOLID1VISIT 12005-07-21T08:002ABCMBABC-001-0022SPEC012MCORGIDNMicrobial Organism IdentificationENTEROCOCCUS FAECALISENTEROCOCCUS FAECALISLOCAL LAB XYZFLUIDSKINMICROBIAL CULTURE, SOLID1VISIT 12005-07-21T08:00 After Enterococcus faecalis was identified in the subject sample, drug susceptibility testing was performed at each of the labs using both the sponsor's investigational drug as well as amoxicillin. Since an identified organism is the subject of the test, the NHOID variable is populated with "ENTEROCOCCUS FAECALIS". Between the two labs (MSNAM), a total of three susceptibility testing methods were used: epsilometer, disk diffusion, and macro broth dilution (MSMETHOD). Both epsilometer and disk diffusion both use agar diffusion methods. In this type of testing, an agar plate is inoculated with the microorganism of interest and either a strip (epsilometer) or discs (disk diffusion) containing various concentrations of the drug are placed onto the agar plate. The epsilometer test method provides both a minimum inhibitory concentration (MSTESTCD="MIC"), the lowest concentration of a drug that inhibits the growth of a microorganism, and a qualitative interpretation (MSTESTCD="MICROSUS") such as susceptible, intermediate, or resistant. The disk diffusion test method gives the diameter of the zone of inhibition (MSTESTCD="DIAZOINH") and a qualitative interpretation such as susceptible, intermediate, or resistant (MSTESTCD=" MICROSUS" ). The quantitative and qualitative results are grouped together using MSGRPID. The third method, macro broth dilution, was used to test the specimen at a pre-defined drug concentration of each of the drugs. When the drug and amount are a pre-defined part of the test, the variable MSAGENT is populated with the name of the drug being used in the susceptibility test. The variables MSCONC and MSCONCU represent the concentration and units of the drug being used. Rows 1-4:Show the minimum inhibitory concentration and the interpretation result reported from Central Lab ABC from a sample that was tested for susceptibility to the sponsor drug and amoxicillin using an epsilometer test method.Rows 5-6:Show that Local Lab XYZ found that the sample was susceptible to the sponsor drug at a concentration of 0.5 ug/dL and resistant to amoxicillin at a concentration of 0.5 ug/dL.Rows 7-10:Show the diameter of the zone of inhibition and the interpretation result reported from Local Lab XYZ from a sample that was tested for susceptibility to the sponsor drug and amoxicillin using a disk diffusion test method. ms.xpt RowSTUDYIDDOMAINUSUBJIDNHOIDMSGRPIDMSSEQMSREFIDMSLNKIDMSTESTCDMSTESTMSAGENTMSCONCMSCONCUMSORRESMSORRESUMSSTRESCMSTRESNMSSTRESUMSNAMMSMETHODMSDTC1ABCMSABC-001-002ENTEROCOCCUS FAECALIS11SPEC011MICMinimum Inhibitory ConcentrationSponsor Drug While not expected, the sponsor decided to connect the identification records in MB to the records in MS using the variables MBLNKID and MSLNKID. relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABCMB Example The following example illustrates a scenario where a subject sputum sample was collected and tested for the presence of infectious organisms over the course of three visits. The two organisms identified were also tested for susceptibility to both penicillin and a sponsor study drug (MSAGENT). The example shows that the two infecting organisms were cleared over the course of the three visits. Specimen collection was represented in the Biospecimen Events (BE) domain. The example below shows that three samples were collected from the same subject over a period of 15 days. Information about the BE domain including the specification table, assumptions, and examples can be found in the SDTMIG-PGx document. be.xpt RowSTUDYIDDOMAINUSUBJIDBESEQBEREFIDBETERMBEDTC1ABCBEABC-001-0011SP01Collecting2005-06-19T08:002ABCBEABC-001-0012SP02Collecting2005-06-26T08:003ABCBEABC-001-0013SP03Collecting2005-07-06T08:00 The SUPPBE dataset below is used to represent two non-standard variables of BE. Rows 1-3:Show that all three samples (IDVARVAL where IDVAR="BEREFID") were sputum as indicated by QVAL where QNAM = "BESPEC" and QLABEL = "Specimen Type".Rows 4-6:Show that all three sputum samples were collected via expectoration as indicated by QVAL where QNAM = "Specimen Collection Method". QVAL is populated using the CDISC controlled terminology codelist, "Specimen Collection Method". suppbe.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIG1ABCBEABC-01-101BEREFIDSP01BESPECSpecimen TypeSPUTUMCRF2ABCBEABC-01-101BEREFIDSP02BESPECSpecimen TypeSPUTUMCRF3ABCBEABC-01-101BEREFIDSP03BESPECSpecimen TypeSPUTUMCRF4ABCBEABC-01-101BEREFIDSP01BECLMETHSpecimen Collection MethodEXPECTORATIONCRF5ABCBEABC-01-101BEREFIDSP02BECLMETHSpecimen Collection MethodEXPECTORATIONCRF6ABCBEABC-01-101BEREFIDSP03BECLMETHSpecimen Collection MethodEXPECTORATIONCRF Rows 1-2:Show that a gram stain was used on a subject sputum sample to identify the presence of gram negative cocci (row 1) and to quantify the bacteria (row 2). MBORRES in row 2 represents an ordinal result, such as from a published quantification scale. This value decodes to "FEW" as shown in MBSTRESC. The quantification scale used and the result scale type are represented as Supplemental Qualifiers of MB.Rows 3-4:Show that the same gram-stained sample was used to identify and quantify the presence of gram negative rods.Rows 5-6:Show that microbial culture of the same sample was used at the same visit to identify the presence of two organisms, "STREPTOCOCCUS PNEUMONIAE" and "KLEBSIELLA PNEUMONIAE" (MBORRES).Row 7:Shows that microbial culture of a subsequent sample at a later visit indicated only the presence of "KLEBSIELLA PNEUMONIAE" (MBORRES).Row 8:Shows that microbial culture of a third subject sample at the third visit indicated "NO GROWTH" (MBORRES) of any organisms. mb.xpt RowSTUDYIDDOMAINUSUBJIDMBSEQMBREFIDMBTESTCDMBTESTMBTSTDTLMBORRESMBSTRESCMBLOCMBMETHODVISITNUMVISITMBDTC1ABCMBABC-001-0011SP01GMNCOCGram Negative CocciDETECTIONPRESENTPRESENTLUNGGRAM STAIN1VISIT 12005-06-19T08:002ABCMBABC-001-0012SP01GMNCOCGram Negative CocciCELL COUNT2+FEWLUNGGRAM STAIN1VISIT 12005-06-19T08:003ABCMBABC-001-0013SP01GMNRODGram Negative RodsDETECTIONPRESENTPRESENTLUNGGRAM STAIN1VISIT 12005-06-19T08:004ABCMBABC-001-0014SP01GMNRODGram Negative RodsCELL COUNT2+FEWLUNGGRAM STAIN1VISIT 12005-06-19T08:005ABCMBABC-001-0015SP01MCORGIDNMicrobial Organism Identification suppmb.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIG1ABCMBABC-01-101MBTSTDTLCELL COUNTMBQSCALQuantification ScaleCDC semi-quantitative score for gram stainingCRF2ABCMBABC-01-101MBTSTDTLCELL COUNTMBRRSTYPReported Result Scale TypeORDINALCRF Rows 1-2:Show that the sponsor drug (MSAGENT) was tested against "STREPTOCOCCUS PNEUMONIAE" (NHOID) from subject sample SP01 and that the drug has a minimum inhibitory concentration (MSTESTCD/MSTEST) of 0.004 mg/L (row 1). This led to the conclusion that this organism is susceptible to that drug (row 2).Rows 3-4:Show that penicillin was tested against the same organism from the same sample and was found to have a minimum inhibitory concentration of 0.023 mg/L (row 3). This led to the conclusion that "STREPTOCOCCUS PNEUMONIAE" is resistant to penicillin (row 4).Rows 5-8:Similar to rows 1-4, the sponsor drug (rows 5-6) and penicillin (rows 7-8) were tested against " KLEBSIELLA PNEUMONIAE" from an additional sample from the same subject at a later timepoint. Results from these tests indicated that the organism was susceptible to sponsor drug, yet had intermediate resistance to penicillin.Rows 9-10:A test against "KLEBSIELLA PNEUMONIAE" from an additional sample at a later timepoint showed little change in the minimum inhibitory concentration of penicillin, and that the organism was still classified as having intermediate resistance to this drug. ms.xpt RowSTUDYIDDOMAINUSUBJIDNHOIDMSSEQMSREFIDMSGRPIDMSTESTCDMSTESTMSAGENTMSORRESMSORRESUMSSTRESCMSTRESNMSSTRESUMSMETHODMSDTC1ABCMSABC-001-001STREPTOCOCCUS PNEUMONIAE1SP011MICMinimum Inhibitory ConcentrationSponsor Drug0.004mg/L0.0040.004mg/LEPSILOMETER2005-06-19T08:002ABCMSABC-001-001STREPTOCOCCUS PNEUMONIAE2SP011MICROSUSMicrobial SusceptibilitySponsor DrugSUSCEPTIBLE Example This example shows the microorganisms detected from a gastric aspirate specimen from a child with suspected TB. In this example, gastric lavage is only performed once. Three records in the Microbiology Specimen (MB) domain store detection records for two levels of detection: acid-fast bacilli, and Mycobacterium tuberculosis (Mtb). Characteristics from a culture on solid media that support the presumptive detection of Mtb are also represented in MB. The susceptibility results from both the NAAT and the solid culture are represented in the Microbiology Susceptibility (MS) domain. The example table below shows specimen processing events including sample collection, preparation and culturing events. Sample processing events are represented in the Biospecimen Events (BE) domain. For TB studies, each sample needs a separate identifier to link it to further actions or characteristics of the sample. Therefore, each aliquot is assigned a unique BEREFID value that can be traced to the BEREFID value assigned to the collected "parent" sample. BEREFID is also used to connect the BE and Biospecimen Findings (BS) domains (via BSREFID), as well as any results obtained from the sample that are in the MB or MS domains (via MBREFID and MSREFID). If the same sample is used in many tests, the use of --REFID may result in a potentially undesirable MANY to MANY merge. Users may need to make use of additional linking variables, such as --LNKID. Information about the BE and BS domains including the specification tables, assumptions, and examples can be found in the SDTMIG-PGx document. In the BE, BS, MB, and MS domains, --DTC represents the date of sample collection. --LNKID is used to link culture start and stop dates (BE) with culture results (MB and MS). Row 1:Shows the event of specimen collection. This is the genesis of the sample identified by BEREFID = "100", therefore BEDTC and BESTDTC are the same. The specimen collection setting, collection method, and specimen type are represented using Supplemental Qualifiers. Even though the variable Specimen Type is available for use in Findings domains, it is not available for use in Events domains and thus it is represented as Supplemental Qualifier.Rows 2-6:Show that the sample was aliquoted (smaller subsamples were portioned out from the parent sample). Each separate aliquot is assigned a unique BEREFID. In these cases, BEREFID is an incremented decimal value with the original sample's BEREFID (when BECAT = "COLLECTION") as the base number. This is not an explicit requirement, but makes tracking the samples easier. The definitive link between parent-child samples is defined by the PARENT variable shown in the RELSPEC dataset below.Rows 7-9:Show that three of the aliquots (100.3, 100.4, and 100.5) were cultured for detection (row 7) and drug susceptibility testing (rows 8 and 9). The inoculation and read dates of a culture should be represented in BESTDTC and BEENDTC, respectively. These dates can be linked to the culture results in MB and MS using BELNKID, MBLNKID, and MSLNKID.Row 10:Shows that sample 100.1 was concentrated. be.xpt RowSTUDYIDDOMAINUSUBJIDBESEQBEREFIDBELNKIDBETERMBECATBEDTCBESTDTCBEENDTC1ABCBEABC-01-1011100 suppbe.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIG1ABCBEABC-01-101BESEQ1BECLSETSpecimen Collection SettingHOSPITALCRF2ABCBEABC-01-101BESEQ1BECLMETHSpecimen Collection MethodGASTRIC LAVAGECRF3ABCBEABC-01-101BESEQ1BESPECSpecimen TypeLAVAGE FLUIDCRF4ABCBEABC-01-101BESEQ2BESPECSpecimen TypeLAVAGE FLUIDCRF5ABCBEABC-01-101BESEQ3BESPECSpecimen TypeLAVAGE FLUIDCRF6ABCBEABC-01-101BESEQ4BESPECSpecimen TypeLAVAGE FLUIDCRF7ABCBEABC-01-101BESEQ5BESPECSpecimen TypeLAVAGE FLUIDCRF8ABCBEABC-01-101BESEQ6BESPECSpecimen TypeLAVAGE FLUIDCRF9ABCBEABC-01-101BESEQ7BESPECSpecimen TypeLAVAGE FLUIDCRF10ABCBEABC-01-101BESEQ8BESPECSpecimen TypeLAVAGE FLUIDCRF11ABCBEABC-01-101BESEQ9BESPECSpecimen TypeLAVAGE FLUIDCRF12ABCBEABC-01-101BESEQ10BESPECSpecimen TypeLAVAGE FLUIDCRF Findings data captured about the specimen during collection, preparation, and handling are represented in the Biospecimen (BS) domain. Row 1:Shows the total volume of lavage fluid collected during the gastric lavage by using the same values for BSREFID and BEREFID. This is the parent (collected) sample from which further aliquots were generated.Rows 2-6:Show the volume of each aliquot created. bs.xpt RowSTUDYIDDOMAINUSUBJIDBSSEQBSREFIDBSTESTCDBSTESTBSORRESBSORRESUBSSTRESCBSSTRESNBSSTRESUBSSPECBSLOCBSDTC1ABCBSABC-01-1011100VOLUMEVolume20mL2020mLLAVAGE FLUIDSTOMACH2011-01-17T06:002ABCBSABC-01-1012100.1VOLUMEVolume4mL44mLLAVAGE FLUIDSTOMACH2011-01-17T06:003ABCBSABC-01-1013100.2VOLUMEVolume4mL44mLLAVAGE FLUIDSTOMACH2011-01-17T06:004ABCBSABC-01-1014100.3VOLUMEVolume4mL44mLLAVAGE FLUIDSTOMACH2011-01-17T06:005ABCBSABC-01-1015100.4VOLUMEVolume4mL44mLLAVAGE FLUIDSTOMACH2011-01-17T06:006ABCBSABC-01-1016100.5VOLUMEVolume4mL44mLLAVAGE FLUIDSTOMACH2011-01-17T06:00 The RELSPEC table shows the relationship of the "Parent" sample to its aliquots. The LEVEL variable indicates that the sample has been sub-sampled. The original "Parent" sample is always LEVEL = "1". An aliquot of the sample would be LEVEL = "2". If the aliquot was further split, that sub-sample would be LEVEL = "3". Row 1:Shows the original collected (parent) sample. The PARENT variable is left blank to indicate this is the highest level sample.Rows 2-6:Show the relationship of each aliquot in the BE domain to the parent sample. PARENT is populated with the REFID value of the parent sample, indicating that the sample with REFID = "100" is the parent of these samples. LEVEL = "2" serves to indicate that these aliquots are sub-samples of the original (LEVEL = "1") sample. relspec.xpt RowSTUDYIDUSUBJIDREFIDSPECPARENTLEVEL1ABCABC-01-101100LAVAGE FLUID Results from detection tests performed on samples are represented in the MB domain. The sputum sample was aliquoted five times. Three of these aliquots underwent detection testing using three separate tests: one for AFB, one for M. tuberculosis complex, and one for M. tuberculosis. MBTESTCD/MBTEST represents the organism being investigated. MBMETHOD represents the testing method. MBREFID represents which aliquot was tested. The variable MBTSTDTL is used to represent further description of the test performed in producing the MB result. In addition to detection, MBTSTDTL can also be used to represent specific attributes, such as quantifiable and semi-quantifiable results of the culture, as well as qualitative details about the culture such as colony color, morphology, etc. Row 1:Shows a test targeting the presence or absence of AFB using a stain. The MBSPCCND shows that the sample used in the test was concentrated. MBGRPID can be used to connect the detection record with the corresponding AFB quantification results shown in row 2.Row 2:Shows a categorical result for an AFB test using a stain. MBORRES contains a result based on a CDC AFB quantification scale. The name of the scale used is represented as a Supplemental Qualifier. MBREFID indicates which aliquot the procedure was performed upon and MBGRPID is used to connect the AFB quantification record to the detection record in row 1.Row 3:Shows a test targeting the presence or absence of M. tuberculosis complex using a genotyping method. Details about the assay can be found in the DI domain. The value in SPDEVID links the genotype result to the assay information in the DI domain. The microbial detection certainty is represented as a Supplemental Qualifier. Because genotyping was used, the detection is considered to be definitive.Row 4:Shows a test targeting the presence or absence of M. tuberculosis performed on a solid culture. The medium type and microbial detection certainty are represented as Supplemental Qualifiers. Because genotyping was not used, the detection is considered to be presumptive. The culture start and stop dates are represented in BE and are connected to the culture results via BELNKID and MBLNKID. MBGRPID is used to connect the detection record in MB with the corresponding culture characteristics shown in rows 5-7.Row 5:Shows a colony forming unit (CFU) count from a solid culture. The MBORRES value represents the actual colony count from this plate. However, the sample that was spread on this plate represented a 100-fold dilution from the original subject sample. This information is represented in the Supplemental Qualifier "DILUFACT" (Dilution Factor), whose value= 10^-2 (1/100th). In order to enable more straightforward pooling of CFU data, a simple integer result (14700) is used in MBSTRESC/N and MBSTRESU = "CFU/mL". The medium type for the solid culture is also represented as a Supplemental Qualifier.Row 6:Shows the standardized colony count category based on a CDC M. tuberculosis colony quantification scale. The quantification scale used and the medium type for the solid culture are represented as Supplemental Qualifiers. mb.xpt RowSTUDYIDDOMAINUSUBJIDSPDEVIDMBSEQMBGRPIDMBLNKIDMBREFIDMBTESTCDMBTESTMBTSTDTLMBORRESMBORRESUMBSTRESCMBSTRESNMBSTRESUMBLOCMBSPCCNDMBMETHODVISITNUMVISITMBDTC1ABCMBABC-01-101 suppmb.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIG1ABCMBABC-01-101MBSEQ2MBQSCALQuantification ScaleSmear Quantification: Centers for Disease Control Method for Carbol Fuchsin Staining (1000X)eDT2ABCMBABC-01-101MBSEQ3MBMICERTMicrobial Identification CertaintyDEFINITIVEeDT3ABCMBABC-01-101MBSEQ4MBMICERTMicrobial Identification CertaintyPRESUMPTIVEeDT4ABCMBABC-01-101MBREFID100.3MBMEDTYPMedium TypeMIDDLEBROOK 7H10 AGAReDT7ABCMBABC-01-101MBSEQ6MBQSCALQuantification ScaleSolid Media Result: Centers for Disease Control (CDC) Quantification ScaleeDT8ABCMBABC-01-101MBSEQ5MBDILFCTDilution Factor10^-2eDT9ABCMBABC-01-101MBSEQ2MBRRSTYPReported Result Scale TypeORDINALeDT10ABCMBABC-01-101MBSEQ5MBRRSTYPReported Result Scale TypeQUANTITATIVEeDT11ABCMBABC-01-101MBSEQ6MBRRSTYPReported Result Scale TypeORDINALeDT Results from drug susceptibility tests performed on samples are represented in the MS domain. This includes all phenotypic tests (where drug is added directly to the culture medium) and genotypic tests when the result is given as susceptible or resistant. Genotypic tests that give results of specific genetic polymorphisms should be represented in the Pharmacogenomics/Genetics Findings (PF) domain, even though such results may be categorized as susceptible or resistant. In the example below, the variable NHOID (Non-Host Organism ID) is populated with the name of the organism that is the subject of the test. Rows 1-2:Show phenotypic testing results on two separate culture plates: one with medium containing rifampicin (row 1) and one with medium containing isoniazid (row 2). MSAGENT is populated with the name of the drug being used in the susceptibility test. The variables MSCONC and MSCONCU represent the concentration and units of the drug being used. The culture start and stop dates are represented in BE and can be linked to MS by BELNKID and MSLNKID.Rows 3-4:Show genotypic susceptibility testing results on the same aliquot from a nucleic acid amplification test (NAAT) that looks for mutations that confer resistance to two drugs. MSAGENT should be populated with the name of the drug whose action is affected by the mutation being tested for. However, since the drug is not used in the test, MSCONC and MSCONU should be null. These results are represented in MS because the only result given is in terms of resistant/susceptible; no genetic results are reported. ms.xpt RowSTUDYIDDOMAINUSUBJIDSPDEVIDNHOIDMSSEQMSREFIDMSLNKIDMSTESTCDMSTESTMSAGENTMSCONCMSCONCUMSORRESMSSTRESCMSSPECMSLOCMSMETHODMSDTC1ABCMSABC-01-101 suppms.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIG1ABCMBABC-01-101MBREFID100.4MEDTYPEMedium TypeLOWENSTEIN-JENSENeDT2ABCMBABC-01-101MBREFID100.5MEDTYPEMedium TypeLOWENSTEIN-JENSEeDT Data about the device used (row 3 of the MB dataset example and rows 3-4 of the MS dataset example above) can be represented in the Device Identifiers (DI) domain if needed. di.xpt RowSTUDYIDDOMAINSPDEVIDDISEQDIPARMCDDIPARMDIVAL1ABCDIABC7651DEVTYPEDevice TypeNUCLEIC ACID AMPLIFICATION TEST2ABCDIABC7652TRADENAMTrade NameHAIN GENOTYPE MTBDRplus The RELREC table below shows how to link culture start and end dates from BE to the culture results in MB and MS using --LNKID. It also shows how to link the detection record (MB) to the susceptibility results (MS) from a NAAT using --REFID. relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABCBE Example When a culture has become contaminated, the sponsor may choose to report results despite the contamination. The example below showshow to flag results using a supplemental qualifier to indicate that the results are comingfrom a contaminated culture.This example also illustrates how to use Timing variables to represent an 8-hour pooled overnight sputum sample collection when the start and end times are collected.MBDTC is used to represent the start date/time of the overnight sputum collection and MBENDTC is used to represent the end date/time. Row 1:Shows a test targeting the presence or absence ofMycobacterium tuberculosisfrom a solid culture that has been contaminated (see SUPPMB).Row 2:Shows the number of colony forming units from the solid culture that has been contaminated (see SUPPMB). mb.xpt RowSTUDYIDDOMAINUSUBJIDMBSEQMBREFIDMBGPRIDMBTESTCDMBTESTMBTSTDTLMBORRESMBORRESUMBSTRESCMBTRESNMBSTRESUMBSPECMBLOCMBMETHODVISITNUMVISITMBDTCMBENDTC1ABCMBABC-01-60116001MTBMycobacterium tuberculosisDETECTIONPRESENT Below, a culture contamination indicator flag is shown in SUPPMB. An additional Supplemental Qualifier indicating that the reported result scale type is "QUANTITATIVE" is also shown. suppmb.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIG1ABCMBABC-01-601MBSEQ1MBCNMINDCulture Contamination IndicatorYeDT2ABCMBABC-01-601MBSEQ2MBCNMINDCulture Contamination IndicatorYeDT3ABCMBABC-01-601MBSEQ2MBRRSTYPReported Result Scale TypeQUANTITATIVEeDT 6.3.8 Microscopic FindingsMI – Description/OverviewA findings domain that contains histopathology findings and microscopic evaluations. The histopathology findings and microscopic evaluations recorded. The Microscopic Findings dataset provides a record for each microscopic finding observed. There may be multiple microscopic tests on a subject or specimen. MI – Specificationmi.xpt, Microscopic Findings — Findings, Version 3.3. One record per finding per specimen per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar
Timing
TimingPlanned study day of the visit based upon RFSTDTC in Demographics.PermTAETORDPlanned Order of Element within ArmNum TimingNumber that gives the planned order of the Element within the Arm for the element in which the assessment was made.PermEPOCHEpochChar(EPOCH)TimingEpoch associated with the date/time at which the specimen was collected.PermMIDTCDate/Time of Specimen CollectionCharISO 8601TimingDate/time of specimen collection, in ISO 8601 format.ExpMIDYStudy Day of Specimen CollectionNum TimingStudy day of specimen collection, in integer days. The algorithm for calculations must be relative to the sponsor-defined RFSTDTC variable in the Demographics (DM) domain.Perm ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). MI – Assumptions
MI – ExamplesExample Immunohistochemistry (IHC) is a method that involves treating tissue with a stain that adheres to very specific substances. IHC is the method most commonly used to assess the amount of HER2 receptor protein on the surface of the cancer cells. A cell with too many receptors receives too many growth signals. In this study, IHC assessment of HER2 in samples of breast cancer tissue yielded reaction scores on a scale of "0" to "3+". Reaction scores of "0" to "1+" were categorized as "NEGATIVE", while scores of "2+" and "3+" were categorized as "POSITIVE". Row 1:Shows a subject with a receptor protein stain reaction score of "0", categorized in MIRESCAT as "NEGATIVE".Row 2:Shows a subject with a receptor protein stain reaction score of "2+", categorized in MIRESCAT as "POSITIVE". mi.xpt RowSTUDYIDDOMAINUSUBJIDMISEQMITESTCDMITESTMITSTDTLMIORRESMISTRESCMIRESCATMISPECMILOCMIMETHODVISITMIDTC1ABCMIABC-10011HER2Human Epidermal Growth Factor Receptor 2Reaction Score00NEGATIVETISSUEBREASTIHCSCREENING2001-06-152ABCMIABC-20021HER2Human Epidermal Growth Factor Receptor 2Reaction Score2+2+POSITIVETISSUEBREASTIHCSCREENING2001-06-15 Example In this study, immunohistochemistry (IHC) for BRCA1 protein expression in a tissue was reported using a reaction score, a stain intensity score, and a composite score.
Row 1:Shows the reaction score.Row 2:Shows the stain intensity, which was assessed as "STRONG". The score associated with an intensity of "STRONG" is in MISTRESC and MISTRESN.Row 3:The composite score is a represented in a derived record, so MIORRES is null. mi.xpt RowSTUDYIDDOMAINUSUBJIDMISEQMIGRPIDMITESTCDMITESTMITSTDTLMIORRESMISTRESCMISTRESNMISPECMILOCMIMETHODMIDRVFLVISITMIDTC1ABCMIABC-100111BRCA1Breast Cancer Susceptibility Gene 1Reaction Score222TISSUEBREASTIHC The IHC staining results above were all for the cell nucleus. This was represented using a supplemental qualifier for subcellular location. suppmi.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIGQEVAL1ABCMIABC-1001MIGRPID1MISCELOCSubcellular LocationNUCLEUSCRF Example In this study, IHC staining for Thyroid Transcription Factor 1 was reported at a detailed level.
Rows 1-4:Show percentage of cells at each of the staining intensities.Row 5:Shows the H-Score derived from the percentages. This is a derived record, so MIORRES is blank. mi.xpt RowSTUDYIDDOMAINUSUBJIDMISEQMIGRPIDMITESTCDMITESTMITSTDTLMIORRESMIORRESUMISTRESCMISTRESNMISTRESUMISPECMILOCMIMETHODMIDRVLVISITMIDTC1ABCMIABC-100111TTF1Thyroid Transcription Factor 1The percentage of cells with 0 intensity of staining25%2525%TISSUELUNGIHC The IHC staining results above were all for the cell cytoplasm. This was represented using a supplemental qualifier for subcellular location. suppmi.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIGQEVAL1ABCMIABC-1001MIGRPID1MISCELOCSubcellular LocationCYTOPLASMCRF 6.3.9 MorphologyMO – DecomissioningWhen the Morphology domain was introduced in SDTMIG v3.2, the SDS team planned to represent morphology and physiology findings in separate domains: morphology findings in the MO domain and physiology findings in separate domains by body systems. Since then, the team found that separating morphology and physiology findings was more difficult than anticipated and provided little added value. This led to the decision to expand the body system-based domains to cover both morphology and physiology findings and to deprecate MO in a future version of the SDTMIG. Submissions using that later SDTMIG version would represent morphology results in the appropriate body system-based physiology/morphology domain. For data prepared using a version of the SDTMIG that includes both the MO domain and body system-based physiology/morphology domains, morphology findings may be represented in either the MO domain or in a body-system based physiology/morphology domain. Custom body system-based domains may be used if the appropriate body system-based domain is not included in the SDTMIG version being used. MO – Description/OverviewA domain relevant to the science of the form and structure of an organism or of its parts. Macroscopic results (e.g., size, shape, color, and abnormalities of body parts or specimens) that are seen by the naked eye or observed via procedures such as imaging modalities, endoscopy, or other technologies. Many morphology results are obtained from a procedure, although information about the procedure may or may not be collected. MO – Specificationmo.xpt, Morphology — Findings, Version 3.3. One record per Morphology finding per location per time point per visit per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar
Timing
TimingPlanned study day of the visit based upon RFSTDTC in Demographics.PermTAETORDPlanned Order of Element within ArmNum TimingNumber that gives the planned order of the Element within the Arm for the Element in which the assessment was made.PermEPOCHEpochChar(EPOCH)TimingEpoch associated with the date/time at which the observation was made.PermMODTCDate/Time of TestCharISO 8601TimingDate of test.ExpMODYStudy Day of TestNum Timing
Timing
TimingNumerical version of MOTPT to aid in sorting.PermMOELTMPlanned Elapsed Time from Time Point RefCharISO 8601TimingPlanned elapsed time (in ISO 8601) relative to a fixed time point reference (MOTPTREF). Not a clock time or a date time variable. Represented as an ISO 8601 duration. Examples: "-PT15M" to represent the period of 15 minutes prior to the reference point indicated by MOTPTREF, or "PT8H" to represent the period of 8 hours after the reference point indicated by MOTPTREF.PermMOTPTREFTime Point ReferenceChar TimingName of the fixed reference point referred to by MOELTM, MOTPTNUM, and MOTPT. Examples: "PREVIOUS DOSE", "PREVIOUS MEAL".PermMORFTDTCDate/Time of Reference Time PointCharISO 8601TimingDate/time of the reference time point, MOTPTREF.Perm ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). MO – Assumptions
MO – ExamplesExample This example shows Morphology tests related to cardiovascular assessments for one subject. Other tests of cardiovascular function would be submitted in another appropriate and associated physiology domain. mo.xpt RowSTUDYIDDOMAINUSUBJIDMOSEQMOTESTCDMOTESTMOORRESMOORRESUMOSTRESCMOSTRESNMOSTRESUMOLOCMOLATVISITNUMMODTC1XYZMOXYZ-AB-333-0091AREAArea20cm22020cm2HEART, ATRIUMLEFT12015-06-152XYZMOXYZ-AB-333-0092VOLUMEVolume22ml2222mlHEART, ATRIUMLEFT12015-06-153XYZMOXYZ-AB-333-0093NUMDVSLNumber of Diseased Vessels2 Example This example shows imaging data results from an Alzheimer's disease study. It represents seven MRI imaging tests done on the brain at the "SCREENING" visit and at first treatment, "VISIT 1", for one subject. It also shows the controlled terminology for MOTESTCD and MOTEST. MOREFID is used in RELREC to link the data with the MRI information reported in the Device-in-Use domain. MOREFID contains the identifier of the image used to determine the MO results. The Device Identifier and Device-in-Use domains are used in this example from the SDTM Implementation Guide for Medical Devices (SDTMIG-MD). mo.xpt RowSTUDYIDDOMAINUSUBJIDSPDEVIDMOSEQMOREFIDMOTESTCDMOTESTMOORRESMOORRESUMOSTRESCMOSTRESNMOSTRESUMOLOCMOLATMOMETHODMOEVALVISITNUMVISITVISITDYMODTCMODY1STUDYXMOP0001ABC17411234-5678INTPInterpretationNORMAL The example below shows a Device Identifiers (DI) domain record based on the MRI device used for the brain measurement. A prerequisite for any Device domain is that there will be at least one record in DI. The standard controlled terminology for DIPARMCD, DIPARM, and DIVAL is represented in the table. di.xpt RowSTUDYIDDOMAINSPDEVIDDISEQDIPARMCDDIPARMDIVAL1STUDYXDIABC1741TYPEDevice TypeMRI Device-in-Use (DU) data example related to MO results for the MRI device This example represents data from one subject collected at two visits regarding parameters from an MRI imaging protocol. DUGRPID is used to facilitate the creation of a RELREC relationship to the morphological result(s). (See table below.) du.xpt RowSTUDYIDDOMAINUSUBJIDSPDEVIDDUSEQDUGRPIDDUREFIDDUTESTCDDUTESTDUORRESDUORRESUDUSTRESCDUSTRESNDUSTRESUVISITNUMVISITVISITDYDUDTCDUDY1STUDYXDU2324-P0001ABC1741DUMO1222333-444555COILSTRCoil Strength1.5Tesla1.51.5Tesla1SCREENING-72011-04-19-72STUDYXDU2324-P0001ABC1742DUMO1222333-444555ANTPLANEAnatomical PlaneCORONAL Example on the use of RELREC to relate MO and DU The example represents the relationship between the MO and DU records for the "SCREENING" and "VISIT 1" visits. MOREFID was used to link the records in DU by DUGRPID. DUGRPID was assigned to all of the records for the visit for the device. relrec.xpt RowSTUDYIDUSUBJIDRDOMAINIDVARIDVARVALRELTYPERELID1STUDYX2324-P0001MOMOREFID1234-5678 Example This example is from a Polycystic Kidney Disease study where kidney, liver, and heart (left ventricle) measurements were recorded. The example represents one subject who had MO results based on a CT-SCAN image at the "BASELINE" visit. mo.xpt RowSTUDYIDDOMAINUSUBJIDMOSEQMOTESTCDMOTESTMOORRESMOORRESUMOSTRESCMOSTRESNMOSTRESUMOLOCMOLATMOMETHODMOANMETHVISITNUMVISITMODTC1STUDY01MO2324-P00011WIDTHWidth5mm55mmKIDNEYLEFTCT SCAN 6.3.10 Morphology/Physiology DomainsIndividual domains for morphology and physiology findings about specific body systems are grouped together in this section. This grouping is not meant to imply that there is a single morphology/physiology domain. Additional domains for other body systems are expected to be added in future versions. See Section 6.3.9, Morphology, for an explanation of the relationship between the morphology/physiology domains and the Morphology domain. 6.3.10.1 Generic Morphology/Physiology SpecificationThis section describes properties common to all the body system-based morphology/physiology domains.
--.xpt, Body System-Based Morphology/Physiology — Findings, Version 3.3. One record per finding per visit per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar Short character value for --TEST used as a column name when converting a dataset from a vertical format to a horizontal format. The short value can be up to 8 characters. Subject to Domain-specific test code controlled terminology. Req--TESTName of Measurement, Test or ExaminationChar*Synonym QualifierLong name for --TESTCD. Subject to Domain-specific test code controlled terminology. Req--ORRESResult or Finding in Original UnitsCharResult QualifierResult of the measurement or finding as originally received or collected.Exp--STRESCResult or Finding in Standard FormatChar Result QualifierContains the result value for all findings, copied or derived from --ORRES in a standard format or in standard units. --STRESC should store all results or findings in character format; if results are numeric, they should also be stored in numeric format in --STRESN. For example, if various tests have results "NONE", "NEG", and "NEGATIVE" in --ORRES, and these results effectively have the same meaning, they could be represented in standard format in --STRESC as "NEGATIVE".Exp--LOBXFLLast Observation Before Exposure FlagChar Record QualifierOperationally-derived indicator used to identify the last non-missing value prior to RFXSTDTC. The value should be "Y" or null.ExpVISITNUMVisit NumberNum Timing
TimingStudy day of the collection, in integer days. The algorithm for calculations must be relative to the sponsor-defined RFSTDTC variable in the Demographics (DM) domain.Exp ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). 6.3.10.2 Cardiovascular System FindingsCV – Description/OverviewA findings domain that contains physiological and morphological findings related to the cardiovascular system, including the heart, blood vessels and lymphatic vessels. CV – Specificationcv.xpt, Cardiovascular System Findings — Findings, Version 1.0. One record per finding or result per time point per visit per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar
TimingProtocol-defined description of clinical encounter. May be used in addition to VISITNUM and/or VISITDY.PermVISITDYPlanned Study Day of VisitNum TimingPlanned study day of VISIT. Should be an integer.PermTAETORDPlanned Order of Element within ArmNum TimingNumber that gives the planned order of the Element within the Arm for the element in which the assessment was made.PermEPOCHEpochChar(EPOCH)TimingEpoch associated with the date/time at which the assessment was made.PermCVDTCDate/Time of TestCharISO 8601TimingCollection date and time of an observation.ExpCVDYStudy Day of Visit/Collection/ExamNum TimingActual study day of visit/collection/exam expressed in integer days relative to the sponsor-defined RFSTDTC in Demographics.PermCVTPTPlanned Time Point NameChar TimingText description of time when a measurement or observation should be taken, as defined in the protocol. This may be represented as an elapsed time relative to a fixed reference point, such as time of last dose. See CVTPTNUM and CVTPTREF.PermCVTPTNUMPlanned Time Point NumberNum TimingNumeric version of planned time point used in sorting.PermCVELTMPlanned Elapsed Time from Time Point RefCharISO 8601TimingPlanned Elapsed time relative to a planned fixed reference (CVTPTREF) such as "PREVIOUS DOSE" or "PREVIOUS MEAL". This variable is useful where there are repetitive measures. Not a clock time or a date/time variable, but an interval, represented as ISO duration.PermCVTPTREFTime Point ReferenceChar TimingDescription of the fixed reference point referred to by CVELTM, CVTPTNUM, and CVTPT. Examples: "PREVIOUS DOSE", "PREVIOUS MEAL".PermCVRFTDTCDate/Time of Reference Time PointCharISO 8601TimingDate/time for a fixed reference time point defined by CVTPTREF.Perm ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). CV – Assumptions
CV – ExamplesExample The example below shows various findings related to the aortic artery. This example also shows the evaluation for the presence or absence of abdominal aortic aneurysms. The suprarenal, infrarenal, and thoracic sections of the aorta were examined for aneurysms. This level of anatomical location detail can be found in CVLOC. The records in Rows 1 to 3 are related assessments regarding an aneurysm in the thoracic aorta and are grouped together using the CVGRPID variable. cv.xpt RowSTUDYIDDOMAINUSUBJIDCVSEQCVGRPIDCVTESTCDCVTESTCVORRESCVSTRESCCVLOCCVMETHODVISITNUMVISITCVDTC1ABC123CV002-200412ANEURINDAneurysm IndicatorYYTHORACIC AORTATRANSTHORACIC ECHOCARDIOGRAPHY2BASELINE2015-06-09T14:202ABC123CV002-200422DISECINDDissection IndicatorYYTHORACIC AORTATRANSTHORACIC ECHOCARDIOGRAPHY2BASELINE2015-06-09T14:203ABC123CV002-200432STANFADCStanford AoD ClassificationCLASS ACLASS ATHORACIC AORTATRANSTHORACIC ECHOCARDIOGRAPHY2BASELINE2015-06-09T14:204ABC123CV002-20044 Example In this example the CVTESTs represent the structure of the aortic valve evaluated during a transthoracic echocardiography procedure. cv.xpt RowSTUDYIDDOMAINUSUBJIDCVSEQCVTESTCDCVTESTCVCATCVORRESCVORRESUCVSTRESCCVSTRESNCVSTRESUCVLOCCVMETHODVISITNUMVISITCVDTC1ABC123CV10011NCVALTYPNative Cardiac Valve Intervention TypeVALVULAR STRUCTURE, COMMONNATIVE, WITHOUT INTERVENTION 6.3.10.3 Musculoskeletal System FindingsMK – Description/OverviewA findings domain that contains physiological and morphological findings related to the system of muscles, tendons, ligaments, bones, joints, and associated tissues. MK – Specificationmk.xpt, Musculoskeletal System Findings — Findings, Version 1.0. One record per assessment per visit per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). MK – Assumptions
MK – ExamplesExample This example illustrates the data collected for the Swollen/Tender Joint Count assessment, specifically the 68-joint count. After determining whether each joint is swollen or tender, the assessment includes adding up the number of "Yes" responses for swollen joints and tender joints to obtain a total counts for swollen joints and tender joints. Total counts were not collected on the CRF since they were to be derived in ADaM datasets. Data collection included a field for marking a joint "Not Evaluable" when that joint met a condition (e.g., infection of the overlying tissue or skin, grossly edematous, fused) which precluded joint assessment. as specified by the protocol and the protocol-related joint assessor training. A field for the reason that a joint was not evaluable was not needed. Note that there was a field for marking a joint assessment as "Not Done"; this was to be used if the joint assessor overlooked or missed that joint while performing the joint assessment. The data collected are represented in the Musculoskeletal System Findings (MK) domain. Each joint location is specified in MKLOC with laterality ("RIGHT" or "LEFT") in MKLAT. Since the evaluation includes a large number of joints that would result in many records, only a subset of the data collected is shown below. Rows 1-8, 11-12, 15-16:Show the occurrence of tenderness or swelling (MKORRES/MKSTRESC = "Y" or "N") at specific joint locations, represented in MKLOC, on the right and left sides (MKLAT) of the body.Rows 9-10:Show that the assessments for tenderness and swelling of the MKLOC = "ACROMIOCLAVICULAR JOINT" on the right side of the body was not performed (MKSTAT = "NOT DONE") but a specific reason was not collected on the CRF.Rows 13-14:Show that the assessments for tenderness and swelling of the MKLOC = "SHOULDER JOINT" on the right side of the body was not performed (MKSTAT = "NOT DONE") because it wasn't evaluable (MKREASND = "JOINT NOT EVALUABLE"). mk.xpt RowSTUDYIDDOMAINUSUBJIDMKSEQMKTESTCDMKTESTMKORRESMKSTRESCMKSTRESNMKSTATMKREASNDMKLOCMKLATMKMETHODVISITNUMVISITMKDTC1DEFMKDEF-1381TNDRINDTenderness IndicatorYY Example This example illustrates the collection of scores for the joint space narrowing assessment. There are two scoring methods that may be used to evaluate the joints via a radiographic image: Sharp/Genant and Sharp/van der Heijde. In this evaluation of radiographs for joint narrowing, each joint was graded. If the joint was not assessed, a reason why it was not assessed was provided. The data collected are represented in the Musculoskeletal System Findings (MK) domain. In this example, the evaluation was done by a trained evaluator (MKEVAL = "INDEPENDENT ASSESSOR") from an X-ray using the Sharp/Genant scoring method. Each image was assessed by two readers of the same role; in this example, MKEVALID is populated with "READER 1" because these assessments were performed by the first reader. The method used to obtain the image is represented in MKMETHOD = "X-RAY". The scoring method used for the assessment is pre-coordinated into MKTESTCD and MKTEST. Each joint location is specified in MKLOC with laterality ("RIGHT" or "LEFT") in MKLAT. Since the evaluation includes a large number of joints that would result in many records, only a subset of the data collected is shown below. The total score for the assessment was not collected, so is not represented in this dataset; it was to be derived in an ADaM dataset. Rows 1-2, 4-5, 7-8, 10-11, 13-16:Show the text description of each joint space narrowing score in MKORRES and the corresponding numeric score in MKSTRESC/MKSTRESN.Rows 3, 6, 9, 12:Show data collected for joints that were not assessed, MKSTAT = "NOT DONE", with the reason collected on the CRF represented in MKREASND. mk.xpt RowSTUDYIDDOMAINUSUBJIDMKSEQMKTESTCDMKTESTMKORRESMKSTRESCMKSTRESNMKSTATMKREASNDMKLOCMKLATMKMETHODMKEVALMKEVALIDVISITNUMVISITMKDTC1XYZMKXYZ-0021SGJSNSCRSharp/Genant JSN ScoreMODERATE; 51-75% LOSS OF JOINT SPACE22 6.3.10.4 Nervous System FindingsNV – Description/OverviewA findings domain that contains physiological and morphological findings related to the nervous system, including the brain, spinal cord, the cranial and spinal nerves, autonomic ganglia and plexuses. NV – Specificationnv.xpt, Nervous System Findings — Findings, Version 1.0. One record per finding per location per time point per visit per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar
Timing
TimingPlanned study day of the visit based upon RFSTDTC in Demographics.PermTAETORDPlanned Order of Element within ArmNum TimingNumber that gives the planned order of the Element within the Arm for the Element in which the assessment was made.PermEPOCHEpochChar(EPOCH)TimingEpoch associated with the date/time at which the assessment was made.PermNVDTCDate/Time of CollectionCharISO 8601TimingDate of procedure or test.ExpNVDYStudy Day of Visit/Collection/ExamNum Timing
Timing
TimingNumerical version of NVTPT to aid in sorting.PermNVELTMPlanned Elapsed Time from Time Point RefCharISO 8601TimingPlanned elapsed time (in ISO 8601) relative to a fixed time point reference (NVTPTREF). Not a clock time or a date time variable. Represented as an ISO 8601 duration. Examples: "-PT15M" to represent the period of 15 minutes prior to the reference point indicated by NVTPTREF, or "PT8H" to represent the period of 8 hours after the reference point indicated by NVTPTREF.PermNVTPTREFTime Point ReferenceChar TimingName of the fixed reference point referred to by NVELTM, NVTPTNUM, and NVTPT. Examples: "PREVIOUS DOSE", "PREVIOUS MEAL".PermNVRFTDTCDate/Time of Reference Time PointCharISO 8601TimingDate/time for a fixed reference time point defined by --TPTREF in ISO 8601 character format.Perm ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). NV – Assumptions
NV – ExamplesExample The following example demonstrates the SDTM-based modeling of the nervous system information collected and generated (as described above) from separate PET or PET/CT procedures. This example shows measures for standard uptake value ratios taken from three PET scans. SPDEVID shows the scanner used. NVLNKID can be used to link back to the imaging procedure record in the PR domain (PRLNKID), as well as to the tracer administration record in the AG domain (AGLNKID). AGLNKID would be used to determine which tracer uptake is being measured (SUVR), and therefore to which biomarker the findings pertain. NVDTC corresponds to the date of the PET or PET/CT procedure from which these results were obtained. Rows 1-2:Show the Standard Uptake Value Ratio (SUVR) findings based on a PET/CT scan for a subject.Rows 3-4:Show the SUVR findings based on a PET/CT scan for a subject.Rows 5-6:Show the SUVR findings based on an FDG-PET scan for a subject. nv.xpt RowSTUDYIDDOMAINUSUBJIDSPDEVIDNVSEQNVREFIDNVLNKIDNVTESTCDNVTESTNVORRESNVORRESUNVSTRESCNVSTRESNNVSTRESUNVLOCNVDIRNVMETHODVISITNUMNVDTC1ABC123NVAD01-101221123603SUVRStandard Uptake Value Ratio.95RATIO.95.95RATIOPRECUNEUS The reference region used for the SUVR tests shown is represented in a Supplemental Qualifiers dataset. suppnv.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVAL1ABC123NVAD01-101NVSEQ1REFREGReference RegionCEREBELLUM2ABC123NVAD01-101NVSEQ2REFREGReference RegionCEREBELLUM3ABC123NVAD01-102NVSEQ1REFREGReference RegionCEREBELLUM4ABC123NVAD01-102NVSEQ2REFREGReference RegionCEREBELLUM5ABC123NVAD01-103NVSEQ1REFREGReference RegionPONS6ABC123NVAD01-103NVSEQ2REFREGReference RegionPONS The RELREC table below displays the dataset relationship on how a procedure is linked to multiple Nervous System (NV) domain records and how an individual Procedure Agents (AG) administration record related to a scan is linked to multiple Nervous System (NV) domain records. The RELREC table below uses --LNKID to relate the PR and AG domains to each other and to NV, and --REFID to relate NV and DU. In this example, the sponsor has maintained two sets of reference identifiers (REFID values) for the specific purpose of being able to relate records across multiple domains. Because the SDTMIG-MD advocates the use of --REFID to link a group of settings to the results obtained from the reading or interpretation of the test (see SDTMIG-MD v1.0, Section 4.2.1, Assumption 8), --LNKID has been used to establish the relationships between the procedure, the substance administered during the procedure, and the results obtained from the procedure. --LNKID is unique for each procedure for each subject, so datasets may be related to each other as a whole. Rows 1-2:Show the relationship between the scan, represented in PR, and the radiolabel tracer used, represented in AG. There is only one tracer administration for each scan, and only one scan for each tracer administration, so the relationship is ONE to ONE.Rows 3-4:Show the relationship between the scan, represented in PR, and the SUVR results obtained from the scan, represented in NV. Each scan yields two results, so the relationship is ONE to MANY.Rows 5-6:Show the relationship between the radiolabel tracer used and the SUVR results for each scan. This relationship may seem indirect, but it is not: The choice of radiolabel has the potential to affect the results obtained. Because the relationship between PR and AG is ONE to ONE and the relationship between PR and NV is ONE to MANY, the relationship between AG and NV must be ONE to MANY.Rows 7-8:Show the relationship between the SUVR results and the specific settings for the device used for each scan. There is more than one result from each scan, and more than one setting for each scan, so the relationship is MANY to MANY. This relationship is unusual and challenging to manage in a join/merge, and only represents the concept of this relationship. relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABC123PR Example The following examples show how to represent components of a pattern-reversal visual evoked potential (VEP) test elicited by checkerboard stimuli for a subject with optic neuritis. VEPs are detected via an electroencephalogram (EEG) using leads that are placed on the back of the subject's head. It is important to note that the nature of VEP testing is such that NVMETHOD should be equal to "EEG", and that NVCAT should be equal to "VISUAL EVOKED POTENTIAL". Several latencies from each eye including N75, P100, and N145, as well as the P100 peak-to-peak amplitude (75-100), are collected and should be represented in NVTESTCD/NVTEST. Details about the VEP equipment including the checkerboard size should be represented in the appropriate device domains. To interpret, each VEP component is compared against normative values established by the laboratory using healthy controls. In this example, a VEP component is considered abnormal if it falls outside of three standard deviations from the normative lab mean. These low and high values are stored in NVORNRLO and NVORNRHI respectively and the interpretation of each VEP component is represented in NVNRIND. In addition to interpreting each VEP component as normal or abnormal, the overall test for each eye may have an interpretation. In this scenario, NVTESTCD/NVTEST should be equal to INTP/Interpretation and NVORRES should represent whether the overall test in each eye is normal or abnormal. NVGRPID links the each VEP component to the overall interpretation. The NV domain should be used to represent the VEP latencies, P100 peak-to-peak amplitude, and their interpretations. SPDEVID allows the results to be related to both the VEP testing device as well as the checkerboard size. Rows 1-4:Show the VEP measurements from the right eye.Row 5:Shows that when all the components of right eye VEP are considered together (NVGRPID = 1), the overall test is interpreted as abnormal.Rows 6-9:Show the VEP measurements from the left eye.Row 10:Shows that when all the components of left eye VEP are considered together (NVGRPID = 2), the overall test is interpreted as abnormal. nv.xpt RowSTUDYIDDOMAINUSUBJIDSPDEVIDFOCIDNVSEQNVGRPIDNVTESTCDNVTESTNVCATNVORRESNVORRESUNVSTRESCNVSTRESNNVSTRESUNVORNRLONVORNRHINVNRINDNVLOCNVLATNVMETHODVISITNUMNVDTC1MS123NVMS01-01123OD11N75LATN75 LatencyVISUAL EVOKED POTENTIAL79.8msec79.879.8msec54.6894NORMALEYERIGHTEEG12013-02-082MS123NVMS01-01123OD21P100LATP100 LatencyVISUAL EVOKED POTENTIAL129msec129129msec76.75113.71ABNORMALEYERIGHTEEG12013-02-083MS123NVMS01-01123OD31N145LATN145 LatencyVISUAL EVOKED POTENTIAL181msec181181msec114.27156.03ABNORMALEYERIGHTEEG12013-02-084MS123NVMS01-01123OD41P100AMPP100 AmplitudeVISUAL EVOKED POTENTIAL5.02uV5.025.02uV5.2612.64ABNORMALEYERIGHTEEG12013-02-085MS123NVMS01-01123OD51INTPInterpretationVISUAL EVOKED POTENTIALABNORMAL Information about the VEP device is not shown. Identifying information would be represented using the Device Identifiers (DI) domain, and any properties of the device that may change between assessments would be represented and Device In-Use (DU) domains. See the SDTMIG-MD for examples of these domains. 6.3.10.5 Ophthalmic ExaminationsOE – Description/OverviewA findings domain that contains tests that measure a person's ocular health and visual status, to detect abnormalities in the components of the visual system, and to determine how well the person can see. OE – Specificationoe.xpt, Ophthalmic Examinations — Findings, Version 1.0. One record per ophthalmic finding per method per location, per time point per visit per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). OE – Assumptions
OE – ExamplesExample This example shows a general anterior segment examination performed on each eye at one visit, with the purpose of evaluating general abnormalities. Rows 1-2:Represent an overall interpretation (i.e., normal/abnormal) finding from the anterior segment examination, using the OETESTCD = 'INTP'. OELOC indicates that the assessor examined the lens and OELAT indicates which lens was examined.Row 3:Represents an abnormality observed during the anterior segment examination of the right eye. OEDIR = 'MULTIPLE' and indicates multiple directionality values are applicable. OELOC, OELAT, and the multiple OEDIR values specify the location of the abnormality represented in OEORRES and OESTRESC. oe.xpt RowSTUDYIDDOMAINUSUBJIDFOCIDOESEQOETESTCDOETESTOEORRESOESTRESCOELOCOELATOEDIROEMETHODOEEVALVISITNUMVISITOEDTC1XXXOEXXX-450-110OS1INTPInterpretationNORMALNORMALLENSLEFT Row 1:Indicates that the observed abnormality (i.e., red spot visible) was clinically significant.Rows 2-3:Represent the multiple directionality values further describing the anatomical location where the abnormality was observed. suppoe.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVAL1XXXOEXXX- 450-110OESEQ3OECLSIGClinically SignificantY2XXXOEXXX- 450-110OESEQ3OEDIR1Directionality 1SUPERIOR3XXXOEXXX- 450-110OESEQ3OEDIR2Directionality 2TEMPORAL Example This example shows:
The test for Iris Color is in the OE domain because in this use case, the medication is likely to change the result over the course of the study. Otherwise, Iris Color should be represented in the Subject Characteristics (SC) domain (Section 6.3.14, Subject Characteristics). Rows 1-2:Show assessments of the color of the iris (OELOC = "IRIS") for the right and left eyes, respectively.Rows 3-4:Show assessments of the status of the lens (OELOC = "LENS") for the right and left eyes, respectively. This status assessment is to determine whether the lens of the eye is the natural lens (OEORRES = "PHAKIC") or a replacement (OEORRES = "PSEUDOPHAKIC").Rows 5-6:Show assessments looking for the presence of Hyperemia (increased blood flow). The fact that OELOC = "CONJUNCTIVA" even for the left eye, where Hyperemia was absent suggests that this examination was specifically an examination of the conjunctiva.Rows 7-8:Show measurements of the cup-to-disc ratio for the right and left eyes, respectively. oe.xpt RowSTUDYIDDOMAINUSUBJIDFOCIDOESEQOETESTCDOETESTOEORRESOEORRESUOESTRESCOESTRESNOESTRESUOELOCOELATOEMETHODOEEVALVISITNUMVISITOEDTC1XXXOEXXX- 450-120OD1COLORColorBLUE Row 1:Indicates that the observed abnormality (i.e., Hyperemia) was clinically significant.Rows 2-3:Represent the testing condition (i.e., dilated eyes) qualifying the Cup-to-Disc Ratio tests. suppoe.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVAL1XXXOEXXX- 450-120OESEQ5OECLSIGClinically SignificantY2XXXOEXXX- 450-120OESEQ7TSTCNDTesting ConditionDILATED3XXXOEXXX- 450-120OESEQ8TSTCNDTesting ConditionDILATED Example This example shows:
Rows 1-2:Represent the assessments performed by the investigator. OEDTC represents the ophthalmoscopy exam date.Rows 3-6:Represent the assessments performed by an independent assessor. OEDTC represents the optical coherence tomography image date. oe.xpt RowSTUDYIDDOMAINUSUBJIDFOCIDOESEQOELNKIDOETESTCDOETESTOEORRESOEORRESUOESTRESCOESTRESNOESTRESUOELOCOELATOEMETHODOEEVALVISITNUMVISITOEDTC1XYZOEXYZ-100-001OS1 Rows 1-2:Indicate whether the observed abnormality was clinically significant.Rows 3-4:Represent the date when the independent assessor performed the evaluation of the optical coherence tomography image. suppoe.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVAL1XYZOEXYZ- 100-001OESEQ1OECLSIGClinically SignificantY2XYZOEXYZ- 100-001OESEQ2OECLSIGClinically SignificantN3XYZOEXYZ- 100-001OELNKID1EVLDTCEvaluation Date2012-04-304XYZOEXYZ- 100-001OELNKID2EVLDTCEvaluation Date2012-04-30 Rows 1-4:Represent optical coherence tomography procedures performed at Screening and Visit 1 on the right and left eyes. SPDEVID identifies the device used in performing these tests.Row 5:Represents an optical coherence tomography procedure that was not performed at Visit 2. pr.xpt RowSTUDYIDDOMAINUSUBJIDFOCIDSPDEVIDPRSEQPRLNKIDPRTRTPRPRESPPROCCURPRLOCPRLATPRSTDTCVISITNUMVISIT1XYZPRXYZ- 100-001OS10011OCTYYEYELEFT2012-04-25T09:30:001SCREENING2XYZPRXYZ- 100-001OD10022OCTYYEYERIGHT2012-04-25T10:10:001SCREENING3XYZPRXYZ- 100-001OS10033OCTYYEYELEFT2012-05-25T08:00:002VISIT 14XYZPRXYZ- 100-001OD10044OCTYYEYERIGHT2012-05-25T08:30:002VISIT 15XYZPRXYZ- 100-001OU The reason why the optical coherence tomography at Visit 2 was not performed is represented in a supplemental qualifier. supppr.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVAL1XYZPRXYZ- 100-001PRSEQ5OEREASOCReason for Occur ValuePatient was sick for # weeks Identifying information for the device with SPDEVID = "100" included in the PR domain above is represented in the Device Identifiers (DI) domain. di.xpt RowSTUDYIDDOMAINSPDEVIDDISEQDIPARMCDDIPARMDIVAL1XYZDI1001TYPEDevice TypeOCT2XYZDI1002MANUFManufacturerZEISS3XYZDI1003MODELModelCIRRUS4XYZDI1004SERIALSerial Numberyyyyyy The many-to one relationship between records in the Procedures (PR) and Ophthalmic Findings (OE) domains is described in RELREC. relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARALRELTYPERELID1XYZPR Example This example shows:
Row 1:Represents the subject's assessment of ocular comfort in the right eye, upon instillation of a lubricant eye drop for dry eye.Row 2:Represents the subject's assessment of ocular comfort in the right eye, 1 minute post-instillation of a lubricant eye drop for dry eye.Row 3:Represents the subject's assessment of ocular comfort in the left eye, upon instillation of a lubricant eye drop for dry eye.Row 4:Represents the subject's assessment of ocular comfort in the left eye, 1 minute post-instillation of a lubricant eye drop for dry eye. oe.xpt RowSTUDYIDDOMAINUSUBJIDFOCIDOESEQOETESTCDOETESTOECATOEORRESOESTRESCOESTRESNOELOCOELATOEMETHODOEEVALVISITNUMVISITOEDTCOETPTOETPTNUM1XYZOEXYZ-100-0001OD1DRPCMTGRDrop Comfort GradeOCCULAR COMFORT111EYERIGHTVISUAL ANALOG SCALESTUDY SUBJECT1VISIT 12011-02-11T09:00UPON INSTILLATION12XYZOEXYZ-100-0001OD2DRPCMTGRDrop Comfort GradeOCCULAR COMFORT101010EYERIGHTVISUAL ANALOG SCALESTUDY SUBJECT1VISIT 12011-02-11T09:011 MINUTE POST-INSTILLATION23XYZOEXYZ-100-0001OS1DRPCMTGRDrop Comfort GradeOCCULAR COMFORT111EYELEFTVISUAL ANALOG SCALESTUDY SUBJECT1VISIT 12011-05-01T09:00UPON INSTILLATION14XYZOEXYZ-100-0001OS2DRPCMTGRDrop Comfort GradeOCCULAR COMFORT101010EYELEFTVISUAL ANALOG SCALESTUDY SUBJECT1VISIT 12011-05-01T09:011 MINUTE POST-INSTILLATION2 The numeric scale used in grading ocular comfort was described in a supplemental qualifier. suppoe.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVAL1XYZOEXYZ-100-0001OECATOCULAR COMFORTRESCRTResult Criteria10-point VAS (1=Best, 10=Worst) Adverse events affecting the eyes are represented in the AE domain. For events that affected only one eye, the sponsor populated FOCID, an identifier variable that can be included in any domain. ae.xpt RowSTUDYIDDOMAINUSUBJIDFOCIDAESEQAESPIDAETERMAEDECODAEBODSYSAELOCAELATAESEVAESERAEACNAERELAEOUTAESTDTCAEENDTC1XYZAEXYZ-100-0001 6.3.10.6 Reproductive System FindingsRP – Description/OverviewA findings domain that contains physiological and morphological findings related to the male and female reproductive systems. RP – Specificationrp.xpt, Reproductive System Findings — Findings, Version 3.3. One record per finding or result per time point per visit per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). RP – Assumptions
RP – ExamplesExample This example represents Reproductive System Findings at the Screening Visit, Visit 1 and Visit 2 for two subjects. rp.xpt RowSTUDYIDDOMAINUSUBJIDRPSEQRPTESTCDRPTESTRPORRESRPORRESURPSTRESCRPSTRESNRPSTRESURPDURRPBLFLVISITNUMVISITVISITDYRPDTCRPDY1STUDYXRP2324-P00011SPABORTNNumber of Spontaneous Abortions1 6.3.10.7 Respiratory System FindingsRE – Description/OverviewA findings domain that contains physiological and morphological findings related to the respiratory system, including the organs that are involved in breathing such as the nose, throat, larynx, trachea, bronchi and lungs. RE – Specificationre.xpt, Respiratory System Findings — Findings, Version 1.0. One record per finding or result per time point per visit per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). RE – Assumptions
RE – ExamplesExample This example shows results from several spirometry tests using either a spirometer or a peak flow meter. When spirometry tests are performed, the subject usually makes several efforts, each of which produces results, but only the best result for each test is used in analyses. In this study, the sponsor collected only the best results. The Device Identifiers (DI) domain was submitted for device identification, and the Device In-Use (DU) domain was submitted to provide information about the use of the device. Because the original and standardized units of measure are identical in this example, RESTRESC, RESTRESN, RESTRESU, and RESTREFN are not shown. Instead, an ellipsis marks their place in the dataset. Spirometry test values are compared to a predicted value, rather than a normal range. Predicted values are represented in REORREF. Rows 1-2:Show the results for the spirometry tests FEV1 and FVC, with the predicted values in REORREF. The spirometer used in the tests is identified by the SPDEVID.Rows 3-4:Show the results for FEV1 and FVC as percentages of the predicted values. This result is output by the spirometer device, not derived by the sponsor. REORREF is null as there are no reference results for percent predicted tests.Row 5:Shows the results of the PEF test with the predicted values in REORREF. These results were obtained with a different device, a peak flow meter, identified by the SPDEVID. re.xpt RowSTUDYIDDOMAINUSUBJIDSPDEVIDRESEQRETESTCDRETESTREORRESREORRESUREORREF...VISITNUMVISITREDTC1XYZREXYZ-001-001ABC0011FEV1Forced Expiratory Volume in 1 Second2.73L3.37 The DI domain provides the information needed to distinguish among devices used in the study. In this example, the only parameter needed to establish identifiers was the device type. di.xpt RowSTUDYIDDOMAINSPDEVIDSPSEQDIPARMCDDIPARMDIVAL1XYZDIABC0011DEVTYPEDevice TypeSPIROMETER2XYZDIDEF9991DEVTYPEDevice TypePEAK FLOW METER The DU domain shows settings used on the devices with identifier "ABC001". The device was set to use the NHANES III reference equation. Since this setting was the same for all uses of the device for all subjects, USUBJID is null. du.xpt RowSTUDYIDDOMAINUSUBJIDSPDEVIDDUSEQDUTESTCDDUTESTDUORRES1XYZDU Example In this example, a subject made four attempts at the FEV1 pulmonary function test, and data about all attempts were collected. It is standard practice for multiple attempts to be made, and for the best result to be used in analyses. In this example, the spirometry report included an indicator of which was the best result. The spirometry report also included an indicator that one of the attempts was considered to have produced an inadequate result, with the reasons the result was considered inadequate. Rows 1-3:Show individual test results for FEV1 as measured by spirometry.Row 4:Shows an individual test result for FEV1 as measured by spirometry. Note that this result is much less than the others. re.xpt RowSTUDYIDDOMAINUSUBJIDSPDEVIDRESEQRETESTCDRETESTREORRESREORRESURESTRESNRESTRESUREREPNUMVISITNUMVISITREDTC1XYZREXYZ-001-001ABC0011FEV1Forced Expiratory Volume in 1 Second1.94L1.94L12VISIT 22013-04-232XYZREXYZ-001-001ABC0012FEV1Forced Expiratory Volume in 1 Second1.88L1.88L22VISIT 22013-04-233XYZREXYZ-001-001ABC0013FEV1Forced Expiratory Volume in 1 Second1.88L1.88L32VISIT 22013-04-234XYZREXYZ-001-001ABC0014FEV1Forced Expiratory Volume in 1 Second1.57L1.57L42VISIT 22013-04-23 Supplemental qualifiers were used to indicate which was the best result and to provide information on the attempt that was considered to produce inadequate results. Row 1:Shows the record with RESEQ = "1" was the best test result, indicated by BRESFL = "Y".Rows 2-4:The presence of a flag, IRESFL, indicates that the data were inadequate. The two reasons why this was the case are represented by QNAM = "IRREA1" and "IREEA2". suppre.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIGQEVAL1XYZREXYZ-001-001RESEQ1BRESFLBest Result FlagYCRF DI was used to represent the device type that was used to perform for the pulmonary function tests. di.xpt RowSTUDYIDDOMAINSPDEVIDDISEQDIPARMCDDIPARMDIVAL1XYZDIABC0011DEVTYPEDevice TypeSPIROMETER 6.3.10.8 Urinary System FindingsUR – Description/OverviewA findings domain that contains physiological and morphological findings related to the urinary tract, including the organs involved in the creation and excretion of urine such as the kidneys, ureters, bladder and urethra. UR – Specificationur.xpt, Urinary System Findings — Findings, Version 3.3. One record per finding per location per per visit per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). UR – Assumptions
UR – ExamplesExample This example shows measurements of the kidney, number of renal arteries and veins, and presence/absence results for pre-specified abnormalities of the kidneys. These findings were made using CT imaging. Row 1:Shows that the subject's left kidney was measured to be 126 mm long.Row 2:Shows that the subject's left kidney had 2 renal arteries.Row 3:Shows that the subject's left kidney had 1 renal vein.Row 4:Shows that no hematomas were found in the kidney. If a hematoma had been present, the variable URLOC (with URDIR as necessary) would have specified where within the kidney.Row 5:Shows that surgical damage was noted in the superior portion of the kidney cortex. Note that in SDTM, there is no way to clearly distinguish between the use of --LOC as a qualifier of --TEST vs. as a qualifier of results, as it is used here. ur.xpt RowSTUDYIDDOMAINUSUBJIDURSEQURTESTCDURTESTURORRESURORRESUURSTRESCURSTRESNURSTRESUURLOCURLATURDIRURMETHODURDTC1ABCURABC-001-0111LENGTHLength12.6cm126126mmKIDNEYLEFT Example This example shows a subject's renal blood flow measurement for each visit based on the subject's para-amino hippuric acid (PAH) clearance, indicated by URMETHOD = "PARA-AMINO HIPPURIC ACID CLEARANCE". ur.xpt RowSTUDYIDDOMAINUSUBJIDURSEQURTESTCDURTESTURORRESURORRESUURSTRESCURSTRESNURSTRESUURLOCURLATURMETHODVISITNUMVISITURDTC1DEFURDEF-01231RBLDFLWRenal Blood Flow20mL/min2020mL/minKIDNEYBILATERALPARA-AMINO HIPPURIC ACID CLEARANCE1VISIT 12016-03-152DEFURDEF-01232RBLDFLWRenal Blood Flow10mL/min1010mL/minKIDNEYLEFTPARA-AMINO HIPPURIC ACID CLEARANCE2VISIT 22016-03-203DEFURDEF-01233RBLDFLWRenal Blood Flow10mL/min1010mL/minKIDNEYRIGHTPARA-AMINO HIPPURIC ACID CLEARANCE3VISIT 32016-04-07 6.3.11 Pharmacokinetics Domains6.3.11.1 Pharmacokinetics ConcentrationsPC – Description/OverviewA findings domain that contains concentrations of drugs or metabolites in fluids or tissues as a function of time. PC – Specificationpc.xpt, Pharmacokinetics Concentrations — Findings, Version 3.2. One record per sample characteristic or time-point concentration per reference time point or per analyte per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar
Timing
TimingPlanned study day of the visit based upon RFSTDTC in Demographics.PermTAETORDPlanned Order of Element within ArmNum TimingNumber that gives the planned order of the Element within the Arm.PermEPOCHEpochChar(EPOCH)TimingEpoch associated with the start date/time of the observation, or the date/time of collection if start date/time is not collected.PermPCDTCDate/Time of Specimen CollectionCharISO 8601TimingDate/time of specimen collection represented in ISO 8601 character format. If there is no end time, then this will be the collection time.ExpPCENDTCEnd Date/Time of Specimen CollectionCharISO 8601TimingEnd date/time of specimen collection represented in ISO 8601 character format. If there is no end time, the collection time should be stored in PCDTC, and PCENDTC should be null.PermPCDYActual Study Day of Specimen CollectionNum Timing
TimingActual study day of end of observation expressed in integer days relative to the sponsor-defined RFSTDTC in Demographics.PermPCTPTPlanned Time Point NameChar Timing
TimingNumerical version of PCTPT to aid in sorting.PermPCELTMPlanned Elapsed Time from Time Point RefCharISO 8601TimingPlanned elapsed time (in ISO 8601) relative to a planned fixed reference (PCTPTREF) such as "PREVIOUS DOSE" or "PREVIOUS MEAL". This variable is useful where there are repetitive measures. Not a clock time or a date time variable.PermPCTPTREFTime Point ReferenceChar TimingName of the fixed reference point used as a basis for PCTPT, PCTPTNUM, and PCELTM. Example: "Most Recent Dose".PermPCRFTDTCDate/Time of Reference PointCharISO 8601TimingDate/time of the reference time point described by PCTPTREF.PermPCEVLINTEvaluation IntervalCharISO 8601TimingEvaluation Interval associated with a PCTEST record represented in ISO 8601 character format. Example: "-PT2H" to represent an interval of 2 hours prior to a PCTPT.Perm ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). PC – Assumptions
PC – ExamplesDue to space limitations, not all expected or permissible findings variables are included in examples for this domain. Example This example shows concentration data for Drug A and a metabolite of Drug A from plasma and from urine samples collected pre-dose and after dosing on two different study days, Days 1 and 11. PCTPTREF is a text value of the description of a "zero" time (e.g., time of dosing). It should be meaningful. If there are multiple PK profiles being generated, the zero time for each will be different (e.g., a different dose such as "first dose", "second dose") and, as a result, values for PCTPTREF must be different. In this example such values for PCTPTREF are required to make values of PCTPTNUM and PCTPT unique (see Section 4.4.10, Representing Time Points). Rows 1-2:Show Day 1 pre-dose drug and metabolite concentrations in plasma and urine.Rows 3-4:Show Day 1 pre-dose drug and metabolite concentrations in urine. Since urine specimens are often collected over an interval, both PCDTC and PCENDTC in have been populated with the same value to show that the urine specimens were collected at a point in time, rather than over an interval.Rows 5-6:Show specimen properties (VOLUME and PH) for the Day 1 pre-dose urine specimens. These have a PCCAT value of "SPECIMEN PROPERTY".Rows 7-12:Show Day 1 post-dose drug and metabolite concentrations in plasma.Rows 13-16:Show Day 11 drug and metabolite concentrations in plasma.Rows 17-20:Show Day 11 drug and metabolite concentrations in urine specimens collected over an interval. The elapsed times for urine samples are based upon the elapsed time (from the reference time point, PCTPTREF) for the end of the specimen collection period. Elapsed time values that are the same for urine and plasma samples have been assigned the same value for PCTPT. For the urine samples, the value in PCEVLINT describes the planned evaluation (or collection) interval relative to the time point. The actual evaluation interval can be determined by subtracting PCDTC from PCENDTC.Rows 21-30:Show additional drug and metabolite concentrations and specimen properties related to the Day 11 dose. pc.xpt RowSTUDYIDDOMAINUSUBJIDPCSEQPCGRPIDPCREFIDPCTESTCDPCTESTPCCATPCSPECPCORRESPCORRESUPCSTRESCPCSTRESNPCSTRESUPCSTATPCLLOQPCULOQVISITNUMVISITVISITDYPCDTCPCENDTCPCDYPCTPTPCTPTNUMPCTPTREFPCRFTDTCPCELTMPCEVLINT1ABC-123PC123-00011Day 1A554134-10DRGA_METDrug A MetaboliteANALYTEPLASMA<0.1ng/mL<0.1 6.3.11.2 Pharmacokinetics ParametersPP – Description/OverviewA findings domain that contains pharmacokinetic parameters derived from pharmacokinetic concentration-time (PC) data. PP – Specificationpp.xpt, Pharmacokinetics Parameters — Findings, Version 3.2. One record per PK parameter per time-concentration profile per modeling method per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). PP – Assumptions
PP – ExamplesExample This example shows PK parameters calculated from time-concentration profiles for parent drug and one metabolite in plasma and urine for one subject. Note that PPRFTDTC is populated in order to link the PP records to the respective PC records. Note that PPSPEC is null for Clearance (PPTESTCD = "CLO") records since it is calculated from multiple specimen sources (plasma and urine). Rows 1-12:Show parameters for Day 1.Rows 13-24:Show parameters for Day 8. pp.xpt RowSTUDYIDDOMAINUSUBJIDPPSEQPPGRPIDPPTESTCDPPTESTPPCATPPORRESPPORRESUPPSTRESCPPSTRESNPPSTRESUPPSPECVISITNUMVISITPPDTCPPRFTDTC1ABC-123PPABC-123-00011DAY1_PARTMAXTime of CMAXDRUG A PARENT1.87h1.871.87HPLASMA1DAY 12001-03-012001-02-01T08:002ABC-123PPABC-123-00012DAY1_PARCMAXMax ConcDRUG A PARENT44.5ug/L44.544.5ug/LPLASMA1DAY 12001-03-012001-02-01T08:003ABC-123PPABC-123-00013DAY1_PARAUCALLAUC AllDRUG A PARENT294.7h*mg/L294.7294.7h*mg/LPLASMA1DAY 12001-03-012001-02-01T08:004ABC-123PPABC-123-00014DAY1_PARLAMZHLHalf-Life Lambda zDRUG A PARENT0.75h0.750.75HPLASMA1DAY 12001-03-012001-02-01T08:005ABC-123PPABC-123-00015DAY1_PARVZOVz ObsDRUG A PARENT10.9L10.910.9LPLASMA1DAY 12001-03-012001-02-01T08:006ABC-123PPABC-123-00016DAY1_PARCLOTotal CL ObsDRUG A PARENT1.68L/h1.681.68L/h Example This example shows the use of PPSTINT and PPENINT to describe the AUC segments for the test code "AUCINT", the area under the curve from time T1 to time T2. Time T1 is represented in PPSTINT as the elapsed time since PPRFTDTC and Time T2 is represented in PPENINT as the elapsed time since PPRFTDTC. Rows 1-7:Show parameters for Day 1.Rows 8-14:Show parameters for Day 14. pp.xpt RowSTUDYIDDOMAINUSUBJIDPPSEQPPGRPIDPPTESTCDPPTESTPPCATPPORRESPPORRESUPPSTRESCPPSTRESNPPSTRESUPPSPECVISITNUMVISITPPDTCPPRFTDTCPPSTINTPPENINT1ABC-123PPABC-123-0011DRUGA_DAY1TMAXTime of CMAXDRUG A PARENT0.65h0.650.65hPLASMA1DAY 12001-02-252001-02-01T08:00 6.3.11.3 Relating PP Records to PC RecordsSponsors must document the concentrations used to calculate each parameter. This may be done in analysis dataset metadata or by documenting relationships between records in the Pharmacokinetics Parameters (PP) and Pharmacokinetics Concentrations (PC) datasets in a RELREC dataset (See Section 8.2, Relating Peer Records and Section 8.3, Relating Datasets). 6.3.11.3.1 PC-PP – Relating DatasetsIf all time-point concentrations in PC are used to calculate all parameters for all subjects, then the relationship between the two datasets can be documented as shown in the table below. relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABC-123PC Note that the reference time point and the analyte are part of the natural key (see Section 3.2.1.1, Primary Keys) for both datasets. In the relationship shown above, --GRPID is a surrogate key, and must be populated so that each combination of analyte and reference time point has a separate value of --GRPID. 6.3.11.3.2 PC-PP – Relating RecordsThis section illustrates four methods for representing relationships between PC and PP records under four different example circumstances. All these examples are based on the same PC and PP data for one drug, "Drug X". The different methods for representing relationships are based on which linking variables are used in RELREC.
The different examples illustrate situations in which different subsets of the pharmacokinetic concentration data were used in calculating the pharmacokinetic parameters. As in the example above, --GRPID values must take into account all the combinations of analytes and reference time points, since both are part of the natural key for both datasets. For each example, PCGRPID and PPGRPID were used to group related records within each respective dataset. The exclusion of some concentration values from the calculation of some parameters affects the values of PCGRPID and PPGRPID for the different situations. To conserve space, the PC and PP domains appear only once, but with four --GRPID columns, one for each of the example situations. Note that a submission dataset would contain only one --GRPID column with a set of values such as those shown in one of the four columns in the PC and PP datasets. Since the relationship between PC records and PP records for Day 8 data does not change across the examples, it is shown only for Example 1, and not repeated. Pharmacokinetic Concentrations (PC) Dataset for All Examples pc.xpt RowSTUDYIDDOMAINUSUBJIDPCSEQPCGRPID1PCGRPID2PCGRPID3PCGRPID4PCREFIDPCTESTCDPCTESTPCCATPCORRESPCORRESUPCSTRESCPCSTRESNPCSTRESUPCSPECPCBLFLPCLLOQPCDTCPCDYPCNOMDYPCTPTPCTPTNUMPCELTMPCTPTREFPCRFTDTC1ABC-123PCABC-123-00011DY1_DRGXDY1_DRGXDY1_DRGX_ADY1_DRGX_A123-0001-01DRUG XSTUDYDRUGANALYTE9ug/mL99ug/mLPLASMA Pharmacokinetic Parameters (PP) Dataset for All Examples pp.xpt RowSTUDYIDDOMAINUSUBJIDPPSEQPPGRPID1PPGRPID2PPGRPID3PPGRPID4PPTESTCDPPTESTPPCATPPORRESPPORRESUPPSTRESCPPSTRESNPPSTRESUPPSPECPPNOMDYPPRFTDTC1ABC-123PPABC-123-00011DY1DRGXDY1DRGXDY1DRGX_ATMAXTMAXTime of CMAXDRUG X1.87h1.871.87hPLASMA12001-02-01T08:352ABC-123PPABC-123-00012DY1DRGXDY1DRGXDY1DRGX_ACMAXCMAXMax ConcDRUG X44.5ng/mL44.544.5ng/mLPLASMA12001-02-01T08:353ABC-123PPABC-123-00013DY1DRGXDY1DRGXDY1DRGX_AAUCAUCALLAUC AllDRUG X294.7h*ug/mL294.7294.7h*ug/mLPLASMA12001-02-01T08:354ABC-123PPABC-123-00015DY1DRGXDY1DRGXDY1DRGX_HALFOTHERLAMZHLHalf-Life Lambda zDRUG X4.69h4.694.69hPLASMA12001-02-01T08:355ABC-123PPABC-123-00016DY1DRGXDY1DRGXDY1DRGX_AOTHERVZOVz ObsDRUG X10.9L10.910.9LPLASMA12001-02-01T08:356ABC-123PPABC-123-00017DY1DRGXDY1DRGXDY1DRGX_AOTHERCLOTotal CL ObsDRUG X1.68L/h1.681.68L/hPLASMA12001-02-01T08:357ABC-123PPABC-123-00018DY8DRGXDY8DRGXDY8DRGXDY8DRGXTMAXTime of CMAXDRUG X1.91h1.911.91hPLASMA82001-02-08T08:358ABC-123PPABC-123-00019DY8DRGXDY8DRGXDY8DRGXDY8DRGXCMAXMax ConcDRUG X46.0ng/mL46.046.0ng/mLPLASMA82001-02-08T08:359ABC-123PPABC-123-000110DY8DRGXDY8DRGXDY8DRGXDY8DRGXAUCALLAUC AllDRUG X289.0h*ug/mL289.0289.0h*ug/mLPLASMA82001-02-08T08:3510ABC-123PPABC-123-000112DY8DRGXDY8DRGXDY8DRGXDY8DRGXLAMZHLHalf-Life Lambda zDRUG X4.50h4.504.50hPLASMA82001-02-08T08:3511ABC-123PPABC-123-000113DY8DRGXDY8DRGXDY8DRGXDY8DRGXVZOVz ObsDRUG X10.7L10.710.7LPLASMA82001-02-08T08:3512ABC-123PPABC-123-000114DY8DRGXDY8DRGXDY8DRGXDY8DRGXCLOTotal CL ObsDRUG X1.75L/h1.751.75L/hPLASMA82001-02-08T08:35 Example All PC records used to calculate all pharmacokinetic parameters. This example uses --GRPID values in the columns labeled "PCGRPID1" and "PPGRPID1". Method A (Many to many, using PCGRPID and PPGRPID) Rows 1-2:The relationship with RELID "1" includes all PC records with PCGRPID = "DY1_DRGX" and all PP records with PPGRPID = "DY1DRGX".Rows 3-4:The relationship with RELID "2" includes all PC records with GRPID = "DY8_DRGX" and all PP records with GRPID = "DY8DRGX". relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABC-123PCABC-123-0001PCGRPIDDY1_DRGX Method B (One to many, using PCSEQ and PPGRPID) Rows 1-13:The relationship with RELID "1" includes the individual PC records with PCSEQ values "1" to "12" and all PP records with PPGRPID = "DY1DRGX".Rows 14-26:The relationship with RELID "2" includes the individual PC records with PCSEQ values "13" to "24" and all PP records with PPGRPID = "DY8DRGX". relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABC-123PCABC-123-0001PCSEQ1 Method C (Many to one, using PCGRPID and PPSEQ) Rows 1-8:The relationship with RELID = "1" includes all PC records with a PCGRPID = "DY1_DRGX" and PP records with PPSEQ values "1" through "7".Rows 9-16:The relationship with RELID = "2" includes all PC records with a PCGRPID = "DY8_DRGX" and PP records with PPSEQ values of "8" through "14". relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABC-123PCABC-123-0001PCGRPIDDY1_DRGX Method D (One to one, using PCSEQ and PPSEQ) Rows 1-19:The relationship with RELID "1" includes individual PC records with PCSEQ values "1" through "12" and PP records with PPSEQ values "1" through "7".Rows 20-38:The relationship with RELID "2" includes individual PC records with PCSEQ values "13" through "24" and PP records with PPSEQ values "8" through "14". relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABC-123PCABC-123-0001PCSEQ1 Example Only some records in PC were used to calculate all pharmacokinetic parameters: Time Points 8 and 9 on Day 1 were not used for any pharmacokinetic parameters. This example uses --GRPID values in the columns labeled "PCGRPID2" and "PPGRPID2". Note that for the two excluded PC records, PCGRPID = "EXCLUDE", while for other PC records, PCGRPID = "DY1_DRGX". All pharmacokinetic concentrations for Day 8 were used to calculate all pharmacokinetic parameters. Since Day 8 relationships are the same as for Example 1, they are not included here. Method A (Many to many, using PCGRPID and PPGRPID) The relationship with RELID "1" includes PC records with PCGRPID = "DY1_DRGX" and all PP records with PPGRPID = "DY1DRGX". PC records with PCGRPID = "EXCLUDE" are not included in this relationship. relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABC-123PCABC-123-0001PCGRPIDDY1_DRGX Method B (One to many, using PCSEQ and PPGRPID) The relationship with RELID "1" includes individual PC records with PCSEQ values "1" through "7" and "10" through 11", and all the PP records with PPGRPID = "DY1DRGX". relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABC-123PCABC-123-0001PCSEQ1 Method C (Many to one, using PCGRPID and PPSEQ) The relationship with RELID "1" includes all PC records with PCGRPID = "DY1_DRGX" and individual PP records with PPSEQ values "1" through "7". relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABC-123PCABC-123-0001PCGRPIDDY1_DRGX Method D (One to one, using PCSEQ and PPSEQ) The relationship with RELID "1" includes individual PC records with PCSEQ values "1" through "7" and "10" through "12" and individual PP records with PPSEQ values "1 through "7". relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABC-123PCABC-123-0001PCSEQ1 Example Only some records in PC were used to calculate some parameters: Time points 8 and 9 on Day 1 were not used for half-life calculations, but were used for other parameters. This example uses --GRPID values in the columns labeled "PCGRPID3" and "PPGRPID3". Note that the two excluded PC records have PCGRPID = "DY1_DRGX_B", while the other PC records have PCGRPID = "DY1_DRGX_A". Note also that the PP records for half-life calculations have PPGRPID = "DYDRGX_HALF", while the other PP records have PPGRPID = "DY1DRGX_A". All pharmacokinetic concentrations for Day 8 were used to calculate all pharmacokinetic parameters. Since Day 8 relationships are the same as for Example 1, they are not included here. Method A (Many to many, using PCGRPID and PPGRPID) Rows 1-3:The relationship with RELID "1" includes all PC records with PCGRPID = "DY1_DRGX_A", all PC records with PCGRPID = "DY1_DRGX_B" (which in this case is all the PP records for Day 1) and all PP records with PPGRPID = "DYIDRGX_A".Rows 4-6:The relationship with RELID "2" includes only PC records with PCGRPID = "DY1_DRGX_A" and all PP records with PPGRPID = "DYIDRGX_HALF". relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABC-123PCABC-123-0001PCGRPIDDY1_DRGX_A Method B (One to many, using PCSEQ and PPGRPID) Rows 1-13:The relationship with RELID "1" includes PP records with PCSEQ values "1" through "12" and PP records with PPGRPID = "DY1DRGX_A".Rows 14-24:The relationship with RELID "2" includes PP records with PCSEQ values "1" through "7" and "10" through "12" and PP records with PPGRPID = "DY1DRGX_HALF". relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABC-123PCABC-123-0001PCSEQ1 Method C (Many to one, using PCGRPID and PPSEQ) Rows 1-7:The relationship with RELID "1" includes all PP records with PGRPID values "DY1_DRGX_A" and "DY1_DRGX_B" and PP records with PPSEQ values "1" through "3", "6", and "7".Rows 8-10:The relationship with RELID "2" includes all PP records with PGRPID value "DY1_DRGX_A" and PP records with PPSEQ values "4" and "5". relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABC-123PCABC-123-0001PCGRPIDDY1_DRGX_A Method D (One to one, using PCSEQ and PPSEQ) Rows 1-17:The relationship with RELID "1" includes PC records with PCSEQ values of "1" through "12" and PP records with PPSEQ values "1" through "3" and "6" and "7".Rows 18-29:The relationship with RELID "2" includes PC records with PCSEQ values of "1" through "7" and "10" through "12" and PP records with PPSEQ values "4" and "5". relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABC-123PCABC-123-0001PCSEQ1 Example Only some records in PC were used to calculate parameters: Time point 5 was excluded from Tmax, 6 from Cmax, and 11 and 12 from AUC. This example uses --GRPID values in the columns labeled "PCGRPID4" and "PPGRPID4". Note that four values of PCGRPID and four values of PPGRPID were used. Because of the complexity of this example, only methods A and D are illustrated. Method A (Many to many, using PCGRPID and PPGRPID) Rows 1-4:The relationship with RELID "1" includes PC records with PCGRPID values "DY1DRGX_A", "DY1DRGX_C", and "DY1DRGX_D" and the one PP record with PPGRPID = "TMAX".Rows 5-8:The relationship with RELID "2" includes PC records with PCGRPID values "DY1DRGX_A", "DY1DRGX_B", and "DY1DRGX_D" and the one PP record with PPGRPID = "CMAX".Rows 9-12:The relationship with RELID "1" includes PC records with PCGRPID values "DY1DRGX_A", "DY1DRGX_B", and "DY1DRGX_C" and the one PP record with PPGRPID = "AUC".Rows 13-17:The relationship with RELID "1" includes PC records with PCGRPID values "DY1DRGX_A", "DY1DRGX_B", "DY1DRGX_C", and "DY1DRGX_D" (in this case, all PC records) and all PP records with PPGRPID = "OTHER". relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABC-123PPABC-123-0001PPGRPIDTMAX Note that in RELREC table for Method A, the single records in Rows 1, 3, 5, 7, and 9, represented by their PPGRPID values, could have been referenced by their PPSEQ values, since both identify the records sufficiently. At least two other hybrid approaches would also be acceptable:
Method D, shown below, uses only PCSEQ and PPSEQ values. Method D (One to one, using PCSEQ and PPSEQ) Rows 1-12:The relationship with RELID "1" includes PC records with PCSEQ values "1" through "4" and "6" through "12" and PP records with PPSEQ = "1".Rows 13-24:The relationship with RELID "2" includes PC records with PCSEQ values "1" through "5" and "7" through "12" and PP records with PPSEQ = "2".Rows 24-35:The relationship with RELID "3" includes PC records with PCSEQ values "1" through "10" and PP records with PPSEQ = "3".Rows 36-51:The relationship with RELID "4" includes PC records with PCSEQ values "1" through "12" and PP records with PPSEQ values "4" through "7". relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABC-123PCABC-123-0001PCSEQ1 6.3.11.3.3 PC-PP ConclusionsRelating the datasets (as described in Section 8, Representing Relationships and Data) is the simplest method; however, all time-point concentrations in PC must be used to calculate all parameters for all subjects. If datasets cannot be related, then individual subject records must be related. In either case, the values of PCGRPID and PPGRPID must take into account multiple analytes and multiple reference time points, if they exist. Method A is clearly the most efficient in terms of having the least number of RELREC records, but it does require the assignment of --GRPID values (which are optional) in both the PC and PP datasets. Method D, in contrast, does not require the assignment of --GRPID values, but relies instead on the required --SEQ values in both datasets to relate the records. Although Method D results in the largest number of RELREC records compared to the other methods, it may be the easiest to implement consistently across the range of complexities shown in the examples. Two additional methods, Methods B and C, are also shown for Examples 1-3. They represent hybrid approaches, using --GRPID values in only one dataset (PP and PC, respectively) and --SEQ values for the other. These methods are best suited for sponsors who want to minimize the number of RELREC records while not having to assign --GRPID values in both domains. Methods B and C would not be ideal, however, if one expected complex scenarios as shown in Example 4. Note that an attempt has been made to approximate real pharmacokinetic data; however, the example values are not intended to reflect data used for actual analysis. When certain time-point concentrations have been omitted from PP calculations in Examples 2-4, the actual parameter values in the PP dataset have not been recalculated from those in Example 1 to reflect those omissions. 6.3.11.3.4 PC-PP – Suggestions for Implementing RELREC in the Submission of PK DataDetermine which of the scenarios best reflects how PP data are related to PC data. Questions that should be considered:
6.3.12 Physical ExaminationPE – Description/OverviewA findings domain that contains findings observed during a physical examination where the body is evaluated by inspection, palpation, percussion, and auscultation. PE – Specificationpe.xpt, Physical Examination — Findings, Version 3.3. One record per body system or abnormality per visit per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar
Timing
TimingPlanned study day of VISIT. Should be an integer.PermTAETORDPlanned Order of Element within ArmNum TimingNumber that gives the planned order of the Element within the Arm for the Element in which the assessment was made.PermEPOCHEpochChar(EPOCH)TimingEpoch associated with the observation date/time of the physical exam finding.PermPEDTCDate/Time of ExaminationCharISO 8601TimingDate and time of the physical examination represented in ISO 8601 character format.ExpPEDYStudy Day of ExaminationNum Timing
¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). PE – Assumptions
PE – ExamplesExample This example shows data for one subject collected at one visit. The data come from a general physical examination. Rows 1-2, 6:Show how PESTRESC is populated if result is "NORMAL".Rows 3-5:Show how PESPID is used to show the sponsor-defined identifier, which in this case is the CRF sequence number used for identifying abnormalities within a body system. Additionally, the abnormalities were encoded and PESTRESC represents the MedDRA Preferred Term and PEBODSYS represents the MedDRA System Organ Class. pe.xpt RowSTUDYIDDOMAINUSUBJIDPESEQPESPIDPETESTCDPETESTPELOCPELATPEBODSYSPEORRESPESTRESCVISITNUMVISITVISITDYPEDTCPEDY1ABCPEABC-001-0011 6.3.13 Questionnaires, Ratings, and Scales (QRS) DomainsThis section includes three domains which are used to represent data from questionnaires, ratings, and scales.
CDISC develops controlled terminology and publishes supplements for individual questionnaires, ratings, and scales when the instrument is in the public domain or permission is granted by the copyright holder. The CDISC website pages for controlled terminology (https://www.cdisc.org/standards/semantics/terminology) and questionnaires, ratings, and scales (QRS) (https://www.cdisc.org/foundational/qrs) provide downloads as well as further information about the development processes. Each QRS supplement includes instrument-specific implementation assumptions, dataset example, SDTM mapping strategies, and a list of any applicable supplemental qualifiers. SDTM-annotated CRFs are also provided where available. 6.3.13.1 Functional TestsFT – Description/OverviewA findings domain that contains data for named, stand-alone, task-based evaluations designed to provide an assessment of mobility, dexterity, or cognitive ability. FT – Specificationft.xpt, Functional Tests — Findings, Version 3.3. One record per Functional Test finding per time point per visit per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar Short character value for FTTEST, which can be used as a column name when converting a dataset from a vertical format to a horizontal format. The value cannot be longer than 8 characters, nor can it start with a number (e.g., "1TEST" is not valid). FTTESTCD cannot contain characters other than letters, numbers, or underscores. Controlled terminology for FTTESTCD is published in separate codelists for each questionnaire. See https://www.cdisc.org/standards/semantics/terminology for values for FTTESTCD. Examples: "W250101", "W25F0102". ReqFTTESTName of TestChar*Synonym QualifierVerbatim name of the question used to obtain the finding. The value in FTTEST cannot be longer than 40 characters. Controlled terminology for FTTEST is published in separate codelists for each questionnaire. See https://www.cdisc.org/standards/semantics/terminology for values for FTTEST. Examples: "W2501-25 Foot Walk Time", "W25F-More Than Two Attempts". ReqFTCATCategoryChar(FTCAT)Grouping QualifierUsed to specify the functional test in which the functional test question identified by FTTEST and FTTESTCD was included.ReqFTSCATSubcategoryCharGrouping QualifierUsed to define a further categorization of FTCAT values.PermFTPOSPosition of Subject During ObservationChar(POSITION)Record QualifierPosition of the subject during the test. Examples: "SUPINE", "STANDING", "SITTING".PermFTORRESResult or Finding in Original UnitsChar Result QualifierResult of the measurement or finding as originally received or collected.ExpFTORRESUOriginal UnitsChar(UNIT)Variable QualifierOriginal units in which the data were collected. Unit for FTORRES.PermFTSTRESCResult or Finding in Standard FormatChar Result QualifierContains the result value for all findings, copied or derived from FTORRES in a standard format or in standard units. FTSTRESC should store all results or findings in character format; if results are numeric, they should also be stored in numeric format in FTSTRESN.ExpFTSTRESNNumeric Result/Finding in Standard UnitsNum Result QualifierUsed for continuous or numeric results or findings in standard format; copied in numeric format from FTSTRESC. FTSTRESN should store all numeric test results or findings.PermFTSTRESUStandard UnitsChar(UNIT)Variable QualifierStandardized units used for FTSTRESC and FTSTRESN.PermFTSTATCompletion StatusChar(ND)Record QualifierUsed to indicate that a test was not done, or a test was attempted but did not generate a result. Should be null or have a value of "NOT DONE".PermFTREASNDReason Not DoneChar Record QualifierDescribes why a test was not done, or a test was attempted but did not generate a result. Used in conjunction with FTSTAT when value is "NOT DONE".PermFTXFNExternal File PathChar Record QualifierFile path to an external file.PermFTNAMVendor NameChar Record QualifierName or identifier of the vendor or laboratory that provided the test results.PermFTMETHODMethod of TestChar(METHOD)Record QualifierMethod of the test.PermFTLOBXFLLast Observation Before Exposure FlagChar(NY)Record QualifierOperationally-derived indicator used to identify the last non-missing value prior to RFXSTDTC. The value should be "Y" or null.ExpFTBLFLBaseline FlagChar(NY)Record QualifierA baseline defined by the sponsor (could be derived in the same manner as FTLOBXFL or ABLFL, but is not required to be). The value should be "Y" or null. Note that FTBLFL is retained for backward compatibility. The authoritative baseline flag for statistical analysis is in an ADaM dataset.PermFTDRVFLDerived FlagChar(NY)Record QualifierUsed to indicate a derived record (e.g., a record that represents the average of other records such as a computed baseline). Should be "Y" or null.PermFTEVALEvaluatorChar(EVAL)Record QualifierRole of the person who provided the evaluation. Used only for results that are subjective (e.g., assigned by a person or a group). Examples: "ADJUDICATION COMMITTEE", "INDEPENDENT ASSESSOR", "RADIOLOGIST".PermFTREPNUMRepetition NumberNum Record QualifierThe incidence number of a test that is repeated within a given timeframe for the same test. The level of granularity can vary, e.g., within a time point or within a visit. For example, multiple measurements of blood pressure or multiple analyses of a sample.PermVISITNUMVisit NumberNum TimingClinical encounter number. Numeric version of VISIT, used for sorting.ExpVISITVisit NameChar TimingProtocol-defined description of a clinical encounter.PermVISITDYPlanned Study Day of VisitNum TimingPlanned study day of VISIT based upon RFSTDTC in Demographics. Should be an integer.PermTAETORDPlanned Order of Element within ArmNum TimingNumber that gives the planned order of the Element within the Arm for the element in which the assessment was made.PermEPOCHEpochChar(EPOCH)TimingEpoch associated with the observation date/time of the functional tests finding.PermFTDTCDate/Time of TestCharISO 8601TimingCollection date and time of functional test.ExpFTDYStudy Day of TestNum TimingActual study day of test expressed in integer days relative to the sponsor-defined RFSTDTC in Demographics.PermFTTPTPlanned Time Point NameChar TimingText description of time when a measurement or observation should be taken, as defined in the protocol. This may be represented as an elapsed time relative to a fixed reference point, such as time of last dose. See FTTPTNUM and FTTPTREF.PermFTTPTNUMPlanned Time Point NumberNum TimingNumeric version of planned time point used in sorting.PermFTELTMPlanned Elapsed Time from Time Point RefCharISO 8601TimingPlanned elapsed time relative to a planned fixed reference (FTTPTREF). Not a clock time or a date/time variable, but an interval, represented as ISO duration.PermFTTPTREFTime Point ReferenceChar TimingDescription of the fixed reference point referred to by FTELTM, FTTPTNUM, and FTTPT. Examples: "PREVIOUS DOSE", "PREVIOUS MEAL".PermFTRFTDTCDate/Time of Reference Time PointCharISO 8601TimingDate/time for a fixed reference time point defined by FTTPTREF.Perm ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). FT – Assumptions
FT – ExamplesCDISC publishes supplements for individual functional tests, available here: https://www.cdisc.org/foundational/qrs. Additional FT examples can be found in supplements on this webpage. Example The generic example below represents how the FT domain is to be populated for a fictional 40 Yard Dash functional test at 3 different visits following the QRS Naming Rules. ft.xpt RowSTUDYIDDOMAINUSUBJIDFTSEQFTTESTCDFTTESTFTCATFTORRESFTORRESUFTSTRESCFTSTRESNFTSTRESUFTLOBXFLVISITNUMFTDTC1STUDYXFTP00011FYD01001FYD01-TimeFORTY YARD DASH5.2sec5.25.2secY12012-11-162STUDYXFTP00012FYD01001FYD01-TimeFORTY YARD DASH5sec55sec 6.3.13.2 QuestionnairesQS – Description/OverviewA findings domain that contains data for named, stand-alone instruments designed to provide an assessment of a concept. Questionnaires have a defined standard structure, format, and content; consist of conceptually related items that are typically scored; and have documented methods for administration and analysis. QS – Specificationqs.xpt, Questionnaires — Findings, Version 3.3. One record per questionnaire per question per time point per visit per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar Topic variable for QS. Short name for the value in QSTEST, which can be used as a column name when converting the dataset from a vertical format to a horizontal format. The value in QSTESTCD cannot be longer than 8 characters, nor can it start with a number (e.g., "1TEST" is not valid). QSTESTCD cannot contain characters other than letters, numbers, or underscores. Controlled terminology for QSTESTCD is published in separate codelists for each questionnaire. See https://www.cdisc.org/standards/semantics/terminology for values for QSTESTCD. Examples: "ADCCMD01", "BPR0103". ReqQSTESTQuestion NameChar*Synonym QualifierVerbatim name of the question or group of questions used to obtain the measurement or finding. The value in QSTEST cannot be longer than 40 characters. Controlled terminology for QSTEST is published in separate codelists for each questionnaire. See https://www.cdisc.org/standards/semantics/terminology for vaues for QSTEST. Example: "BPR01 - Emotional Withdrawal". ReqQSCATCategory of QuestionChar(QSCAT)Grouping QualifierUsed to specify the questionnaire in which the question identified by QSTEST and QSTESTCD was included. Examples: "ADAS-COG", "MDS-UPDRS".ReqQSSCATSubcategory for QuestionChar*Grouping QualifierA further categorization of the questions within the category. Examples: "MENTAL HEALTH" , "DEPRESSION", "WORD RECALL".PermQSORRESFinding in Original UnitsCharResult QualifierFinding as originally received or collected (e.g., "RARELY", "SOMETIMES"). When sponsors apply codelist to indicate the code values are statistically meaningful standardized scores, which are defined by sponsors or by valid methodologies such as SF36 questionnaires, QSORRES will contain the decode format, and QSSTRESC and QSSTRESN may contain the standardized code values or scores.ExpQSORRESUOriginal UnitsChar(UNIT)Variable QualifierOriginal units in which the data were collected. The unit for QSORRES, such as minutes or seconds or the units associated with a visual analog scale.PermQSSTRESCCharacter Result/Finding in Std FormatChar Result QualifierContains the finding for all questions or sub-scores, copied or derived from QSORRES in a standard format or standard units. QSSTRESC should store all findings in character format; if findings are numeric, they should also be stored in numeric format in QSSTRESN. If question scores are derived from the original finding, then the standard format is the score. Examples: "0", "1". When sponsors apply codelist to indicate the code values are statistically meaningful standardized scores, which are defined by sponsors or by valid methodologies such as SF36 questionnaires, QSORRES will contain the decode format, and QSSTRESC and QSSTRESN may contain the standardized code values or scores.ExpQSSTRESNNumeric Finding in Standard UnitsNum Result QualifierUsed for continuous or numeric findings in standard format; copied in numeric format from QSSTRESC. QSSTRESN should store all numeric results or findings.PermQSSTRESUStandard UnitsChar(UNIT)Variable QualifierStandardized unit used for QSSTRESC or QSSTRESN.PermQSSTATCompletion StatusChar(ND)Record QualifierUsed to indicate that a question was not done or was not answered. Should be null if a result exists in QSORRES.PermQSREASNDReason Not PerformedChar Record QualifierDescribes why a question was not answered. Used in conjunction with QSSTAT when value is "NOT DONE". Example: "SUBJECT REFUSED".PermQSLOBXFLLast Observation Before Exposure FlagChar(NY)Record QualifierOperationally-derived indicator used to identify the last non-missing value prior to RFXSTDTC. Should be "Y" or null.PermQSBLFLBaseline FlagChar(NY)Record QualifierIndicator used to identify a baseline value. Should be "Y" or null. Note that QSBLFL is retained for backward compatibility. The authoritative baseline for statistical analysis is in an ADaM dataset.PermQSDRVFLDerived FlagChar(NY)Record QualifierUsed to indicate a derived record. The value should be "Y" or null. Records that represent the average of other records or questionnaire sub-scores that do not come from the CRF are examples of records that would be derived for the submission datasets. If QSDRVFL = "Y", then QSORRES may be null with QSSTRESC and (if numeric) QSSTRESN having the derived value.PermQSEVALEvaluatorChar(EVAL)Record QualifierRole of the person who provided the evaluation. Examples: "STUDY SUBJECT", "CAREGIVER", "INVESTIGATOR".PermVISITNUMVisit NumberNum Timing
Timing
TimingPlanned study day of the visit based upon RFSTDTC in Demographics.PermTAETORDPlanned Order of Element within ArmNum TimingNumber that gives the planned order of the Element within the Arm for the Element in which the assessment was made.PermEPOCHEpochChar(EPOCH)TimingEpoch associated with the observation date/time of the physical exam finding.PermQSDTCDate/Time of FindingCharISO 8601TimingDate of questionnaire.ExpQSDYStudy Day of FindingNum Timing
Timing
TimingNumerical version of QSTPT to aid in sorting.PermQSELTMPlanned Elapsed Time from Time Point RefCharISO 8601TimingPlanned elapsed time (in ISO 8601) relative to a planned fixed reference (QSTPTREF). This variable is useful where there are repetitive measures. Not a clock time or a date time variable. Represented as an ISO 8601 duration. Examples: "-PT15M" to represent the period of 15 minutes prior to the reference point indicated by QSTPTREF, or "PT8H" to represent the period of 8 hours after the reference point indicated by QSTPTREF.PermQSTPTREFTime Point ReferenceChar TimingName of the fixed reference point referred to by QSELTM, QSTPTNUM, and QSTPT. Examples: "PREVIOUS DOSE", "PREVIOUS MEAL".PermQSRFTDTCDate/Time of Reference Time PointCharISO 8601TimingDate/time of the reference time point, QSTPTREF.PermQSEVLINTEvaluation IntervalCharISO 8601TimingEvaluation interval associated with a QSTEST question represented in ISO 8601 character format. Example: "-P2Y" to represent an interval of 2 years in the question "Have you experienced any episodes in the past 2 years?".Perm ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). QS – Assumptions
QS – ExamplesCDISC publishes supplements for individual questionnaires, available here: https://www.cdisc.org/foundational/qrs. Additional QS examples can be found in supplements on this webpage. Example The generic example below represents how the QS domain is to be populated for a fictional Fruit Preference questionnaire following the QRS Naming Rules. The questionnaire has responses from Strongly Disagree to Strongly Agree (0-4). qs.xpt RowSTUDYIDDOMAINUSUBJIDQSSEQQSTESTCDQSTESTQSCATQSORRESQSSTRESCQSSTRESNQSLOBXFLVISITNUMQSDTC1STUDYXQSP00011FPQ01001FPQ01-I Like ApplesFRUIT PREFERENCE QUESTIONNAIREStrongly Agree44Y12012-11-162STUDYXQSP00012FPQ01002FPQ01-I Like OrangesFRUIT PREFERENCE QUESTIONNAIREDisagree11Y12012-11-163STUDYXQSP00013FPQ01003FPQ01-I Like BananasFRUIT PREFERENCE QUESTIONNAIREAgree33Y12012-11-16 6.3.13.3 Disease Response and Clin ClassificationRS – Description/OverviewA findings domain for the assessment of disease response to therapy, or clinical classification based on published criteria. RS – Specificationrs.xpt, Disease Response and Clin Classification — Findings, Version 3.3. One record per response assessment or clinical classification assessment per time point per visit per subject per assessor per medical evaluator, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar Short name of the TEST in RSTEST. RSTESTCD cannot contain characters other than letters, numbers, or underscores. Examples: "TRGRESP", "NTRGRESP", "OVRLRESP", "SYMPTDTR", "CPS0102". There are separate codelists used for RSTESTCD where the choice depends on the value of RSCAT. Codelist "ONCRTSCD" is used for oncology response criteria (when RSCAT is a term in codelist "ONCRSCAT"). Examples: TRGRESP, "NTRGRESP, "OVRLRESSP". For Clinical Classifications (when RSCAT is a term in codelist "CCCAT"), QRS Naming Rules apply. These instruments have individual dedicated terminology codelists. ReqRSTESTAssessment NameChar(ONCRTS)Synonym QualifierVerbatim name of the response assessment. The value in RSTEST cannot be longer than 40 characters. There are separate codelists used for RSTEST where the choice depends on the value of RSCAT. Codelist "ONCRTS" is used for oncology response criteria (when RSCAT is a term in codelist "ONCRSCAT"). Examples: "Target Response", "Non-target Response", "Overall Response", "Symptomatic Deterioration", "CPS01-Ascites". For Clinical Classifications, QRS Naming Rules apply. These instruments have individual dedicated terminology codelists. ReqRSCATCategory for AssessmentChar(ONCRSCAT)(CCCAT)Grouping Qualifier Used to define a category of related records across subjects. Examples: "RECIST 1.1", "CHILD-PUGH CLASSIFICATION". There are separate codelists used for RSCAT where the choice depends on whether the related records are about an oncology response criterion or another clinical classification. RSCAT is required for clinical classifications other than oncology response criteria. ExpRSSCATSubcategoryCharGrouping QualifierUsed to define a further categorization of RSCAT values.PermRSORRESResult or Finding in Original UnitsChar Result QualifierResult of the response assessment as originally received, collected, or calculated.ExpRSORRESUOriginal UnitsChar(UNIT)Variable QualifierUnit for RSORRES.PermRSSTRESCCharacter Result/Finding in Std FormatChar(ONCRSR)Result Qualifier Contains the result value for the response assessment, copied, or derived from RSORRES in a standard format or standard units. RSSTRESC should store all results or findings in character format. For Clinical Classifications, this may be a score. ExpRSSTRESNNumeric Result/Finding in Standard UnitsNumResult QualifierUsed for continuous or numeric results or findings in standard format; copied in numeric format from --STRESC. --STRESN should store all numeric test results or findings. For Clinical Classifications, this may be a score.PermRSSTRESUStandard UnitsChar(UNIT)Variable QualifierStandardized units used for RSSTRESC and RSSTRESN.PermRSSTATCompletion StatusChar(ND)Record QualifierUsed to indicate the response assessment was not performed. Should be null if a result exists in RSORRES.PermRSREASNDReason Not DoneChar Record QualifierDescribes why a response assessment was not performed. Examples: "All target tumors not evaluated", "Subject does not have non-target tumors". Used in conjunction with RSSTAT when value is "NOT DONE".PermRSNAMVendor NameChar Record QualifierThe name or identifier of the vendor that performed the response assessment. This column can be left null when the investigator provides the complete set of data in the domain.PermRSLOBXFLLast Observation Before Exposure FlagChar(NY)Record Qualifier Operationally-derived indicator used to identify the last non-missing value prior to RFXSTDTC. The value should be "Y" or null. When a clinical classification is assessed at multiple times, including baseline, RSLOBXFL should be included in the dataset. PermRSBLFLBaseline FlagChar(NY)Record QualifierIndicator used to identify a baseline value. Should be "Y" or null. Note that --BLFL is retained for backward compatibility. The authoritative baseline for statistical analysis is in an ADaM dataset.PermRSDRVFLDerived FlagChar(NY)Record QualifierUsed to indicate a derived record (e.g., a record that represents the average of other records such as a computed baseline). Should be "Y" or null.PermRSEVALEvaluatorChar(EVAL)Record QualifierRole of the person who provided the evaluation. Used only for results that are subjective (e.g., assigned by a person or a group). Examples: "ADJUDICATION COMMITTEE", "INDEPENDENT ASSESSOR", "RADIOLOGIST". RSEVAL is expected for oncology response criteria. It can be left null when the investigator provides the complete set of data in the domain. However, the column should contain no null values when data from one or more independent assessors is included, meaning that the rows attributed to the investigator should contain a value of "INVESTIGATOR". PermRSEVALIDEvaluator IdentifierChar(MEDEVAL)Variable QualifierUsed to distinguish multiple evaluators with the same role recorded in RSEVAL. Examples: "RADIOLOGIST1", "RADIOLOGIST2". See RS Assumption 9.PermRSACPTFLAccepted Record FlagChar(NY)Record QualifierIn cases where more than one independent assessor (e.g., "RADIOLOGIST 1", "RADIOLOGIST 2", "ADJUDICATOR") provides an evaluation of response, this flag identifies the record that is considered to be the accepted evaluation.PermVISITNUMVisit NumberNumTimingClinical encounter number. Numeric version of VISIT, used for sorting.ExpVISITVisit NameChar TimingProtocol-defined description of a clinical encounter.PermVISITDYPlanned Study Day of VisitNum TimingPlanned study day of the visit based upon RFSTDTC in Demographics.PermTAETORDPlanned Order of Element within ArmNum TimingNumber that gives the planned order of the Element within the Arm for the Element in which the assessment was made.PermEPOCHEpochChar(EPOCH)TimingEpoch associated with the date/time at which the assessment was made.PermRSDTCDate/Time of AssessmentCharISO 8601TimingCollection date and time of the assessment represented in ISO 8601 character format.ExpRSDYStudy Day of AssessmentNum TimingStudy day of the assessment, measured as integer days. Algorithm for calculations must be relative to the sponsor-defined RFSTDTC variable in Demographics.PermRSTPTPlanned Time Point NameChar TimingText description of time when a measurement or observation should be taken as defined in the protocol. This may be represented as an elapsed time relative to a fixed reference point, such as time of last dose. See RSTPTNUM and RSTPTREF.PermRSTPTNUMPlanned Time Point NumberNum TimingNumeric version of planned time point used in sorting.PermRSELTMPlanned Elapsed Time from Time Point RefCharISO 8601TimingPlanned elapsed time in ISO 8601 character format relative to a planned fixed reference (RSTPTREF) such as "Previous Dose" or "Previous Meal". This variable is useful where there are repetitive measures. Not a clock time or a date/time variable, but an interval, represented as ISO duration.PermRSTPTREFTime Point ReferenceChar TimingDescription of the fixed reference point referred to by RSELTM, RSTPTNUM, and RSTPT. Examples: "PREVIOUS DOSE", "PREVIOUS MEAL".PermRSRFTDTCDate/Time of Reference Time PointCharISO 8601TimingDate/time for a fixed reference time point defined by RSTPTREF in ISO 8601 character format.PermRSEVLINTEvaluation IntervalCharISO 8601TimingDuration of interval associated with an observation such as a finding RSTESTCD, represented in ISO 8601 character format. Example: "-P2M" to represent a period of the past 2 months as the evaluation interval.PermRSEVINTXEvaluation Interval TextChar TimingEvaluation interval associated with an observation, where the interval is not able to be represented in ISO 8601 format. Examples: "LIFETIME", "LAST NIGHT", "RECENTLY", "OVER THE LAST FEW WEEKS".PermRSSTRTPTStart Relative to Reference Time PointChar(STENRF)Timing Identifies the start of the observation as being before or after the sponsor-defined reference time point defined by variable RSSTTPT. Not all values of the codelist are allowable for this variable. See Section 4.4.7, Use of Relative Timing Variables. PermRSSTTPTStart Reference Time PointCharTimingDescription or date/time in ISO 8601 character format of the sponsor-defined reference point referred to by RSSTRTPT. Examples: "2003-12-15" or "VISIT 1".PermRSENRTPTEnd Relative to Reference Time PointChar(STENRF)Timing Identifies the end of the observation as being before or after the sponsor-defined reference time point defined by variable RSENTPT. Not all values of the codelist are allowable for this variable. See Section 4.4.7, Use of Relative Timing Variables. PermRSENTPTEnd Reference Time PointCharTimingDescription or date/time in ISO 8601 character format of the sponsor-defined reference point referred to by RSENRTPT. Examples: "2003-12-25" or "VISIT 2".Perm ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). RS – Assumptions
[1]Eisenhauer EA, Therasse P, Bogaerts J, et al. New response evaluation criteria in solid tumours: revised RECIST guideline (version 1.1). Eur J Cancer. 2009;45(2):228-47.[2]Cheson BD, Fisher RI, Barrington SF, et al. Recommendations for Initial Evaluation, Staging, and Response Assessment of Hodgkin and Non-Hodgkin Lymphoma: The Lugano Classification. J Clin Oncol. 2014;32(27):3059-68.[3]Hallek M, Cheson BD, Catovsky D, et al. Guidelines for the diagnosis and treatment of chronic lymphocytic leukemia: a report from the International Workshop on Chronic Lymphocytic Leukemia updating the National Cancer Institute-Working Group 1996 guidelines. Blood. 2008;111(12):5446-56. RS – ExamplesThe following are examples for oncology response criteria. Example This example shows response assessments determined from the TR domain based on RECIST 1.1 criteria and also shows a case where progressive disease due to symptomatic deterioration was determined based on a clinical assessment by the investigator. Rows 1-3:Show the target response, non-target response, and the overall response by the investigator using RECIST 1.1 at the week 6 visit.Rows 4-7:Show the target response and non-target response by the investigator using RECIST 1.1, as well as the determination of symptomatic determination (Pleural Effusion) and overall response using protocol-defined response criteria at the week 12 visit. rs.xpt RowSTUDYIDDOMAINUSUBJIDRSSEQRSLNKGRPRSTESTCDRSTESTRSCATRSORRESRSSTRESCRSEVALVISITNUMVISITRSDTCRSDY1ABCRS444441 Example This example shows response assessments determined from the TR domain based on RECIST 1.1 criteria and also shows a confirmation of an equivocal new lesion progression. Rows 1-4:Show the target response, non-target response, and the overall response by the independent assessor Radiologist 1 using RECSIT 1.1 at the week 6 visit. At this week 6 visit, an equivocal new lesion was identified.Rows 5-8:Show the target response, non-target response, and the overall response by the independent assessor Radiologist 1 using RECSIT 1.1 at the week 12 visit. At this week 12 visit, the new lesion was determined to be unequivocally a new lesion. rs.xpt RowSTUDYIDDOMAINUSUBJIDRSSEQRSLNKGRPRSTESTCDRSTESTRSCATRSORRESRSSTRESCRSNAMRSEVALRSEVALIDRSACPTFLVISITNUMVISITRSDTCRSDY1ABCRS555551 Example This example shows best response and the overall response of progression to prior therapies and follow-up therapies. Row 1:Shows disease progression on or after a prior chemotherapy regimen. The date of progression is represented in RSDTC. RSENTPT and RSENRTPT represent that the disease progression was prior to screening. RSCAT = "UNSPECIFIED" indicates that the criteria used to determine the disease progression was unknown or not collected. RSPLNKGRP = "CM1" is used to link this record in RS to the prior chemotherapy in CM where the CMLNKGRP = "CM1".Row 2:Shows best response to prior chemotherapy regimen. The date of best response is represented in RSDTC. RSENTPT and RSENRTPT represent that the best response was prior to screening. RSCAT = "UNSPECIFIED" indicates that the criteria used to determine the best response was unknown or not collected. RSPLNKGRP = "CM2" is used to link this record in RS to the prior chemotherapy in CM where the CMLNKGRP = "CM2".Row 3:Shows best response to prior radiotherapy. The date of best response is represented in RSDTC. RSENTPT and RSENRTPT represent that the best response was prior to screening. RSCAT = "UNSPECIFIED" indicates that the criteria used to determine the best response was unknown or not collected. RSPLNKGRP = "PR2" is used to link this record in RS to the prior radiotherapy in PR where the PRLNKGRP = "PR2".Rows 4-5:Show best response and progression to a follow-up anti-cancer therapy. The date of best response and date of progression are represented in RSDTC. RSSTTPT and RSSTRTPT represent that the best response and progression were after study treatment discontinuation. RSCAT = "UNSPECIFIED" indicates that the criteria used to determine the best response and progression was unknown or not collected. RSPLNKGRP = "CM3" is used to link this record in RS to the prior chemotherapy in CM where the CMLNKGRP = "CM3". rs.xpt RowSTUDYIDDOMAINUSUBJIDRSSEQRSLNKGRPRSTESTCDRSTESTRSCATRSSTTPTRSSTRTPTRSENRTPTRSENTPTRSORRESRSORRESURSSTRESCRSSTRESNRSSTRESURSEVALVISITNUMVISITRSDTCRSDY1ABCRS555551CM1OVRLRESPOverall ResponseUNSPECIFIED CDISC publishes supplements for individual clinical classifications, available here: https://www.cdisc.org/foundational/qrs. Additional RS examples can be found in supplements on this webpage. Example The generic example below represents how the RS domain is to be populated for a fictional Smith Snoring Scale clinical classification at one visit. The clinical classification captures responses for snoring extent as soft/moderate/loud, snoring extent as <25% of sleep time/25-50% of sleep time/>50% of sleep time and snoring pattern as very regular/somewhat irregular/very irregular with pauses and snorts. Each of the 3 tests are scored from 1-3. A total score is represented as captured data. As with all QRS standards, the RSORRES text values match the case of the data capture media (.ex CRF, RDC screen, etc.). rs.xpt RowSTUDYIDDOMAINUSUBJIDRSSEQRSTESTCDRSTESTRSCATRSORRESRSSTRESCRSSTRESNRSLOBXFLRSEVALVISITNUMRSDTC1STUDYXRSP00011SSS01001SSS01-Snoring VolumeSMITH SNORING SCALEloud33YSPOUSE12012-11-162STUDYXRSP00012SSS01002SSS01-Snoring ExtentSMITH SNORING SCALE25-50% of sleep time22YSPOUSE12012-11-163STUDYXRSP00013SSS01003SSS01-Snoring PatternSMITH SNORING SCALEvery regular11YSPOUSE12012-11-164STUDYXRSP00014SSS01004SSS01-Total ScoreSMITH SNORING SCALE666YSPOUSE12012-11-16 6.3.14 Subject CharacteristicsSC – Description/OverviewA findings domain that contains subject-related data not collected in other domains. SC – Specificationsc.xpt, Subject Characteristics — Findings, Version 3.3. One record per characteristic per subject., Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar
¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). SC – Assumptions
SC – ExamplesExample The example below shows data collected once per subject that does not fit into the Demographics domain. For this example, national origin and marital status were collected. sc.xpt RowSTUDYIDDOMAINUSUBJIDSCSEQSCTESTCDSCTESTSCORRESSCSTRESCSCDTC1ABCSCABC-001-0011NATORIGNational OriginUNITED STATESUSA1999-06-192ABCSCABC-001-0012MARISTATMarital StatusDIVORCEDDIVORCED1999-06-193ABCSCABC-001-0021NATORIGNational OriginCANADACAN1999-03-194ABCSCABC-001-0022MARISTATMarital StatusMARRIEDMARRIED1999-03-195ABCSCABC-001-0031NATORIGNational OriginUSAUSA1999-05-036ABCSCABC-001-0032MARISTATMarital StatusNEVER MARRIEDNEVER MARRIED1999-05-037ABCSCABC-001-2011NATORIGNational OriginJAPANJPN1999-06-148ABCSCABC-002-0012MARISTATMarital StatusWIDOWEDWIDOWED1999-06-14 6.3.15 Subject StatusSS – Description/OverviewA findings domain that contains general subject characteristics that are evaluated periodically to determine if they have changed. SS – Specificationss.xpt, Subject Status — Findings, Version 3.3. One record per finding per visit per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar
Timing
TimingPlanned study day of the visit based upon RFSTDTC in Demographics.PermTAETORDPlanned Order of Element within ArmNum TimingNumber that gives the planned order of the Element within the Arm.PermEPOCHEpochChar(EPOCH)TimingEpoch associated with the start date/time of the subject status assessment.PermSSDTCDate/Time of AssessmentCharISO 8601TimingDate and time of the subject status assessment represented in ISO 8601 character format.ExpSSDYStudy Day of AssessmentNum Timing
¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). SS – Assumptions
SS – ExamplesExample In this example, subjects complete a ten-week treatment regimen and are then contacted by phone every month for three months. The phone contact assesses the subject's survival status. If the survival status is "DEAD", additional information is collected in order to complete the subject's final disposition record in DS and to record the date of death in DM (DS and DM records are not shown here). ss.xpt RowSTUDYIDDOMAINUSUBJIDSSSEQSSTESTCDSSTESTSSORRESSSSTRESCVISITNUMVISITSSDTC1XYZSSXYZ-333-0091SURVSTATSurvival StatusALIVEALIVE10MONTH 12010-04-152XYZSSXYZ-333-0092SURVSTATSurvival StatusALIVEALIVE20MONTH 22010-05-123XYZSSXYZ-333-0093SURVSTATSurvival StatusALIVEALIVE30MONTH 32010-06-154XYZSSXYZ-428-0211SURVSTATSurvival StatusALIVEALIVE10MONTH 12010-08-035XYZSSXYZ-428-0212SURVSTATSurvival StatusDEADDEAD20MONTH 22010-09-06 6.3.16 Tumor/Lesion DomainsThe Tumor/Lesion domains (TU/TR) represent data collected in clinical trials where sites of disease (e.g. tumors/lesions, lymph nodes, or organs of interest in the assessment of the disease) are identified and then repeatedly measured/assessed at subsequent time points and often used in an evaluation of disease response(s). As such these domains would be applicable for representing data to support disease response criteria. These two domains each have a distinct purpose and are related to each other, and may also be related to assessments in the Disease Response and Clin Classification (RS) domain. 6.3.16.1 Tumor/Lesion IdentificationTU – Description/OverviewA findings domain that represents data that uniquely identifies tumors or lesions under study. The TU domain represents data that uniquely identifies tumors/lesions (i.e., malignant tumors, culprit lesions, and other sites of disease, e.g., lymph nodes). Commonly the tumors/lesions are identified by an investigator and/or independent assessor and classified according to the disease assessment criteria. For example, for an oncology study using RECIST evaluation criteria, this equates to the identification of Target, Non-Target, or New tumors. A record in the TU domain contains the following information: a unique tumor ID value; anatomical location of the tumor; method used to identify the tumor; role of the individual identifying the tumor; and timing information. TU – Specificationtu.xpt, Tumor/Lesion Identification — Findings, Version 1.0. One record per identified tumor per subject per assessor, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar Used to specify the anatomical location of the identified tumor/lesion, e.g., "LIVER" Note: When anatomical location is broken down and collected as distinct pieces of data that when combined provide the overall location information (e.g., laterality/directionality/distribution), then the additional anatomical location qualifiers should be used. See Assumption 3. ExpTULATLateralityChar(LAT)Variable QualifierQualifier for anatomical location or specimen further detailing laterality, for example, "LEFT", "RIGHT", "BILATERAL".PermTUDIRDirectionalityChar(DIR)Variable QualifierQualifier for anatomical location or specimen further detailing directionality, for example, "UPPER", "INTERIOR".PermTUPORTOTPortion or TotalityChar(PORTOT)Variable QualifierQualifier for anatomical location or specimen further detailing the distribution, which means arrangement of, or apportioning of. Examples: "ENTIRE", "SINGLE", "SEGMENT", "MULTIPLE".PermTUMETHODMethod of IdentificationChar(METHOD)Record QualifierMethod used to identify the tumor/lesion. Examples: "MRI", "CT SCAN".ExpTULOBXFLLast Observation Before Exposure FlagChar(NY)Record QualifierOperationally-derived indicator used to identify the last non-missing value prior to RFXSTDTC. Should be "Y" or null.ExpTUBLFLBaseline FlagChar(NY)Record QualifierIndicator used to identify a baseline value. Should be "Y" or null. Note that TUBLFL is retained for backward compatibility. The authoritative baseline flag for statistical analysis is in an ADaM dataset.PermTUEVALEvaluatorChar(EVAL)Record QualifierRole of the person who provided the evaluation. Examples: "ADJUDICATION COMMITTEE", "INDEPENDENT ASSESSOR". This column can be left null when the investigator provides the complete set of data in the domain. However, the column should contain no null values when data from one or more independent assessors is included. For example, the rows attributed to the investigator should contain a value of "INVESTIGATOR". ExpTUEVALIDEvaluator IdentifierChar(MEDEVAL)Variable QualifierUsed to distinguish multiple evaluators with the same role recorded in --EVAL. Examples: "RADIOLOGIST1", "RADIOLOGIST2". See TU Assumption 8.PermTUACPTFLAccepted Record FlagChar(NY)Record QualifierIn cases where more than one independent assessor (e.g., "RADIOLOGIST 1", "RADIOLOGIST 2", "ADJUDICATION COMMITTEE") provide independent assessments at the same time point, this flag identifies the record that is considered to be the accepted assessment.PermVISITNUMVisit NumberNumTimingClinical encounter number. Numeric version of VISIT, used for sorting.ExpVISITVisit NameChar TimingProtocol-defined description of a clinical encounter.PermVISITDYPlanned Study Day of VisitNum TimingPlanned study day of the visit based upon RFSTDTC in Demographics. Should be an integer.PermTAETORDPlanned Order of Element within ArmNum TimingNumber that gives the planned order of the Element within the Arm for the Element in which the assessment was made.PermEPOCHEpochChar(EPOCH)TimingEpoch associated with the date/time at which the assessment was made.PermTUDTCDate/Time of Tumor/Lesion IdentificationCharISO 8601TimingTUDTC variable represents the date of the scan/image/physical exam. TUDTC does not represent the date that the image was read to identify tumors. TUDTC also does not represent the VISIT date.ExpTUDYStudy Day of Tumor/Lesion IdentificationNum TimingStudy day of the scan/image/physical exam, measured as integer days. Algorithm for calculations must be relative to the sponsor-defined RFSTDTC variable in Demographics.Perm ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). TU – Assumptions
6.3.16.2 Tumor/Lesion ResultsTR – Description/OverviewA findings domain that represents quantitative measurements and/or qualitative assessments of the tumors or lesions identified in the tumor/lesion identification (TU) domain. The TR domain represents quantitative measurements and/or qualitative assessments of the tumors or lesions (malignant tumors, culprit lesions, and other sites of disease, e.g., lymph nodes) identified in the TU domain. These measurements may be taken at baseline and then at each subsequent assessment to support response evaluations. A typical record in the TR domain contains the following information: a unique tumor/lesion ID value; test and result; method used; role of the individual assessing the tumor/lesion; and timing information. Clinically accepted evaluation criteria expect that a tumor/lesion identified by the tumor/lesion ID is the same tumor/lesion at each subsequent assessment. The TR domain does not include anatomical location information on each measurement record, because this would be a duplication of information already represented in TU. The multi-domain approach to representing oncology assessment data was developed largely to reduce duplication of stored information. TR – Specificationtr.xpt, Tumor/Lesion Results — Findings, Version 3.3. One record per tumor measurement/assessment per visit per subject per assessor, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). TR – Assumptions
6.3.16.3 Tumor Identification/Tumor Results ExamplesExample This is an example of using the TU domain to represent non-cancerous lesions identified in the heart. Subject "40913" had a pulmonary vein isolation (PVI) procedure on February 1, 2007. A target lesion (L01) was identified in the infrarenal aorta within the aorto-iliac vessel (L01-1). During the same PVI procedure, the subject also had a target graft lesion (L01-G) identified in the left femoro-popliteal graft (L01-G1). The lesion location was noted within the graft anastomosis proximal, the type was a synthetic graft composed of Gortex, and the anastomosis was in the Left Popliteal Artery. Rows 1-2:Show the target lesion located in the infrarenal aorta and within the aorta-iliac vessel.Row 3:Shows the PVI target limb in which the graft lesion is located identified by the investigator.Rows 4-5:Show the target graft lesion located in the left femoro-popliteal graft and within the femoro-popliteal vessel. tu.xpt RowSTUDYIDDOMAINUSUBJIDTUSEQTULNKIDTUTESTCDTUTESTTUORRESTUSTRESCTULOCTULATTUMETHODTUEVALVISITNUMVISITTUDTC1STUDY01TU409131L01LESIONIDLesion IdentificationTARGETTARGETINFRARENAL AORTALEFTPERIPHERAL ANGIOGRAPHYINVESTIGATOR1SCREEN2/1/20072STUDY01TU409132L01-1VESSELIDVessel IdentificationTARGETTARGETAORTO-ILIAC Additional information about the lesion, such as the lesion location within the graft, the graft anastomosis, as well as details regarding the graft type and material is given using supplemental qualifiers. supptu.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIGQEVAL1STUDY01TU40913TUSEQ4PAGLLPeripheral Graft Lesion LocationGRAFT ANASTOMOSIS PROXIMALCRF Example This is an example of tumors identified and tracked using RECIST 1.1 criteria. TU shows the target and non-target tumors identified by an investigator at a screening visit and also shows that the investigator determined that one of the previously identified tumors had split at Week 6 visit. Rows 1-6:Show for subject "44444" the target and non-target tumors identified by the investigator at the screening visit.Rows 7-8:Show the investigator had determined that a tumor (TULNKID = "T04" at screening) had split into two separate tumors at the Week 6 visit. The two distinct pieces of the original tumor are then tracked independently from that point in the study forward. tu.xpt RowSTUDYIDDOMAINUSUBJIDTUSEQTUGRPIDTULNKIDTUTESTCDTUTESTTUORRESTUSTRESCTULOCTULATTUMETHODTUEVALVISITNUMVISITTUDTCTUDY1ABCTU444441 The supplemental qualifier dataset below shows that "T01", "T02", and "T04" were not previously irradiated and "T03" was previously irradiated and subsequent progression after irradiation. supptu.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVAL1ABCTU44444TULNKIDT01PREVIRPreviously IrradiatedN2ABCTU44444TULNKIDT02PREVIRPreviously IrradiatedN3ABCTU44444TULNKIDT03PREVIRPreviously IrradiatedY4ABCTU44444TULNKIDT03PREVIRPIrradiated then Subsequent ProgressionY5ABCTU44444TULNKIDT04PREVIRPreviously IrradiatedN TR shows measurements (i.e., short axis) of lymph nodes as well as measurements of other non-lymph node target tumors (i.e., longest diameter). In this example, when TRTEST = "Tumor State" and TRORRES = "ABSENT", it indicates that the target lymph node lesion was no longer pathological, i.e., the diameter has reduced below 10mm. The overall assessment of lymph nodes is represented with TRTEST = "Lymph Nodes State". A lymph node state of "NON-PATHOLOGICAL" means that all target lymph node lesions have a short axis less than 10mm. A lymph node state of "PATHOLOGICAL" means that at least one target lymph node lesion has a short axis greater than or equal to 10mm. Rows 1-8:Show the measurements of the target tumors and other assessments of the target and non-target tumors at the screening visit.Rows 9-21:Show the measurements of the target tumors and other assessments of the target and non-target tumors at the Week 6 visit.Rows 22-27:Show the measurements of the target tumors and other assessments of the target and non-target tumors at the Week 12 visit. tr.xpt RowSTUDYIDDOMAINUSUBJIDTRSEQTRGRPIDTRLNKGRPTRLNKIDTRTESTCDTRTESTTRORRESTRORRESUTRSTRESCTRSTRESNTRSTRESUTRSTATTRREASNDTRMETHODTREVALVISITNUMVISITTRDTCTRDY1ABCTR444441TARGETA1T01DIAMETERDiameter17mm1717mm The relationship between the TU and TR datasets is represented in RELREC. relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABCTU Example This is an example of tumors identified and tracked following RECIST 1.1 criteria, with an additional opinion provided by an independent assessor. TU shows the target and non-target tumors identified by a radiologist at a screening visit. It also shows that the radiologist identified two new tumors: one at the Week 6 visit and one at the Week 12 visit. Rows 1-5:Show the target and non-target tumors identified at screening by the independent assessor, Radiologist 1.Row 6:Shows that a new tumor was identified at Week 6 by the independent assessor, Radiologist 1.Row 7:Shows that another new tumor was identified at Week 12 by the independent assessor, Radiologist 1. tu.xpt RowSTUDYIDDOMAINUSUBJIDTUSEQTULNKIDTUTESTCDTUTESTTUORRESTUSTRESCTULOCTULATTUMETHODTUNAMTUEVALTUEVALIDVISITNUMVISITTUDTCTUDY1ABCTU555551T01TUMIDENTTumor IdentificationTARGETTARGETCERVICAL LYMPH NODELEFTMRIACE IMAGINGINDEPENDENT ASSESSORRADIOLOGIST 110SCREEN2010-01-02-22ABCTU555552T02TUMIDENTTumor IdentificationTARGETTARGETLIVER TR shows assessments provided by an independent assessor as opposed to the principal investigator. Rows 1-7:Show the measurements of the target tumors and other assessments of the target and non-target tumors at the screening visit by the independent assessor, Radiologist 1.Rows 8-19:Show the measurements of the target tumors and other assessments of the target and non-target tumors at the Week 6 visit by the independent assessor, Radiologist 1.Rows 20-32:Show the measurements of the target tumors and other assessments of the target and non-target tumors at the Week 12 visit by the independent assessor, Radiologist 1. tr.xpt RowSTUDYIDDOMAINUSUBJIDTRSEQTRGRPIDTRLNKGRPTRLNKIDTRTESTCDTRTESTTRORRESTRORRESUTRSTRESCTRSTRESNTRSTRESUTRNAMTRMETHODTREVALTREVALIDVISITNUMVISITTRDTCTRDY1ABCTR555551TARGETA1R1-T01DIAMETERDiameter20mm2020mmACE IMAGINGMRIINDEPENDENT ASSESSORRADIOLOGIST 110SCREEN2010-01-02-22ABCTR555552TARGETA1R1-T02DIAMETERDiameter15mm1515mmACE IMAGINGCT SCANINDEPENDENT ASSESSORRADIOLOGIST 110SCREEN2010-01-01-33ABCTR555553TARGETA1R1-T03DIAMETERDiameter15mm1515mmACE IMAGINGCT SCANINDEPENDENT ASSESSORRADIOLOGIST 110SCREEN2010-01-01-34ABCTR555554TARGETA1 The relationship between the TU and TR records is represented in RELREC. relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABCTU 6.3.17 Vital SignsVS – Description/OverviewA findings domain that contains measurements including but not limited to blood pressure, temperature, respiration, body surface area, body mass index, height and weight. VS – Specificationvs.xpt, Vital Signs — Findings, Version 3.3. One record per vital sign measurement per time point per visit per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar
Timing
TimingPlanned study day of the visit based upon RFSTDTC in Demographics.PermTAETORDPlanned Order of Element within ArmNum TimingNumber that gives the planned order of the Element within the Arm.PermEPOCHEpochChar(EPOCH)TimingEpoch associated with the start date/time at which the assessment was made.PermVSDTCDate/Time of MeasurementsCharISO 8601TimingDate and time of the vital signs assessment represented in ISO 8601 character format.ExpVSDYStudy Day of Vital SignsNum Timing
Timing
TimingNumerical version of VSTPT to aid in sorting.PermVSELTMPlanned Elapsed Time from Time Point RefCharISO 8601TimingPlanned elapsed time (in ISO 8601) relative to a planned fixed reference (VSTPTREF). This variable is useful where there are repetitive measures. Not a clock time or a date time variable. Represented as an ISO 8601 Duration. Examples: "-PT15M" to represent the period of 15 minutes prior to the reference point indicated by VSTPTREF, or "PT8H" to represent the period of 8 hours after the reference point indicated by VSTPTREF.PermVSTPTREFTime Point ReferenceChar TimingName of the fixed reference point referred to by VSELTM, VSTPTNUM, and VSTPT. Examples: "PREVIOUS DOSE", "PREVIOUS MEAL".PermVSRFTDTCDate/Time of Reference Time PointCharISO 8601TimingDate/time of the reference time point, VSTPTREF.Perm ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). VS – Assumptions
VS – ExamplesExample The example below shows one subject with two isits, Baseline and isit 2. Rows 1-4, 6-7:STPT and STPTNUM are populated since more than one measurement was taken at this isit.Rows 2, 4-5, 7-9:Show "Y" in SLOBXFL to indicate the obseration was used as the last obseration before exposure measurement.Row 14:Shows a alue collected in one unit, but conerted to selected standard unit.Row 15:Shows the proper use of the --STAT ariable to indicate "NOT DONE" where a reason was collected when a test was not done. vs.xpt RowSTUDYIDDOMAINUSUBJIDVSSEQVSTESTCDVSTESTVSPOSVSORRESVSORRESUVSSTRESCVSSTRESNVSSTRESUVSSTATVSREASNDVSLOCVSLATVSLOBXFLVISITNUMISITISITDYVSDTCVSDYVSTPTVSTPTNUM1ABCVSABC-001-0011SYSBPSystolic Blood PressureSitting154mmHg154154mmHg 6.4 Findings About Events or InterventionsFindings About Events or Interventions is a specialization of the Findings General Observation Class. As such, it shares all qualities and conventions of Findings observations but is specialized by the addition of the --OBJ variable. Domain CodeDomain DescriptionFA Findings About A findings domain that contains the findings about an event or intervention that cannot be represented within an events or interventions domain record or as a supplemental qualifier. SRSkin Response A findings about domain for submitting dermal responses to antigens. 6.4.1 When to Use Findings AboutIt is intended, as its name implies, to be used when collected data represent "findings about" an Event or Intervention that cannot be represented within an Event or Intervention record or as a Supplemental Qualifier to such a record. Examples include the following:
6.4.2 Naming Findings About DomainsFindings About domains are defined to store Findings About Events or Interventions. Sponsors may choose to represent Findings About data collected in the study in a single FA dataset (potentially splitting the FA domain into physically separate datasets following the guidance described in Section 4.1.6, Additional Guidance on Dataset Naming), or separate datasets assigning unique custom 2-character domain codes following the SR (Skin Response) domain example. For example, if Findings About clinical events and Findings About medical history are collected in a study, they could be represented as either:
For the naming of datasets with findings about events or interventions for associated persons, refer to the SDTM Implementation Guide for Associated Persons. 6.4.3 Variables Unique to Findings AboutThe variable, --OBJ, is unique to Findings About. In conjunction with FATESTCD, it describes what the topic of the observation is; therefore both are required to be populated for every record. FATESTCD describes the measurement/evaluation and FAOBJ describes the Event or Intervention that the measurement/evaluation is about. When collected data fit a Qualifier variable listed in SDTM: Sections 2.2.1, 2.2.2, or 2.2.3, and are represented in the Findings About domain, then the name of the variable should be used as the value of FATESTCD. For example, FATESTCDFATESTOCCUROccurrence IndicatorSEVSeverity/IntensityTOXGRToxicity Grade The use of the same names (e.g., SEV, OCCUR) for both Qualifier variables in the observation classes and FATESTCD is deliberate, but should not lead users to conclude that the collection of such data (e.g., severity/intensity, occurrence) must be stored in the Findings About domain. In fact, data should only be stored in the Findings About domain if they do not fit in the general-observation-class domain. If the data describe the underlying Event or Intervention as a whole and share its timing, then the data should be stored as a qualifier of the general-observation-class record. In general, the value in FAOBJ should match the value in --TERM or --TRT, unless the parent domain is dictionary coded or subject to controlled terminology, in which case FAOBJ should then match the value in --DECOD. Examples for the FA and SR domains include the use of RELREC to represent the relationship between a findings about domain and a parent domain. 6.4.4 Findings AboutFA – Description/OverviewA findings domain that contains the findings about an event or intervention that cannot be represented within an events or interventions domain record or as a supplemental qualifier. FA – Specificationfa.xpt, Findings About Events or Interventions — Findings, Version 3.3. One record per finding, per object, per time point, per visit per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar
Timing
TimingPlanned study day of the visit based upon RFSTDTC in Demographics.PermTAETORDPlanned Order of Element within ArmNum TimingNumber that gives the planned order of the Element within the Arm.PermEPOCHEpochChar(EPOCH)TimingEpoch associated with the date/time of the observation. Examples: "SCREENING", "TREATMENT", "FOLLOW-UP".PermFADTCDate/Time of CollectionCharISO 8601TimingCollection date and time of findings assessment represented in ISO 8601 character format.PermFADYStudy Day of CollectionNum Timing
¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). FA – Assumptions
FA – ExamplesExample The form shown below collects severity and symptoms data at multiple time points about a migraine event. Migraine Symptoms DiaryMigraine Reference NumberxxWhen did the migraine start? DD-MMM-YYYY HH:MM Answer the following 5 Minutes BEFORE DosingSeverity of Migraine○ Mild ○ Moderate ○ SevereAssociated Symptoms: Sensitivity to light
○ No ○ Yes Associated Symptoms: Sensitivity to light
○ No ○ Yes Associated Symptoms: Sensitivity to light
○ No ○ Yes The collected data below the migraine start date on the CRF meet the following Findings About criteria: 1) Data that do not describe an Event or Intervention as a whole and 2) Data that indicate the occurrence of related symptoms. In this mock scenario, the sponsor's conventions and/or reporting agreements consider migraine as a clinical event (as opposed to a reportable AE) and consider the pre-specified symptom responses as findings about the migraine, therefore the data are represented in the Findings About domain with FATESTCD = "OCCUR" and FAOBJ defined as the symptom description. Therefore, the mock datasets represent (1) The migraine event record in the CE domain, (2) The severity and symptoms data, per time point, in the Findings About domain, and (3) A dataset-level relationship in RELREC based on the sponsor ID (--SPID) value, which was populated with a system-generated identifier unique to each iteration of this form. ce.xpt RowSTUDYIDDOMAINUSUBJIDCESEQCESPIDCETERMCEDECODCESTDTC1ABCCEABC-123190567MigraineMigraine2007-05-16T10:30 fa.xpt RowSTUDYIDDOMAINUSUBJIDFASEQFASPIDFATESTCDFATESTFAOBJFACATFAORRESFASTRESCFADTCFATPTFAELTMFATPTREF1ABCFAABC-123190567SEVSeverity/IntensityMigraineMIGRAINE SYMPTOMSSEVERESEVERE2007-05-165M PRE-DOSE-PT5MDOSING2ABCFAABC-123290567OCCUROccurrenceSensitivity To LightMIGRAINE SYMPTOMSYY2007-05-165M PRE-DOSE-PT5MDOSING3ABCFAABC-123390567OCCUROccurrenceSensitivity To SoundMIGRAINE SYMPTOMSNN2007-05-165M PRE-DOSE-PT5MDOSING4ABCFAABC-123490567OCCUROccurrenceNauseaMIGRAINE SYMPTOMSYY2007-05-165M PRE-DOSE-PT5MDOSING5ABCFAABC-123690567OCCUROccurrenceAuraMIGRAINE SYMPTOMSYY2007-05-165M PRE-DOSE-PT5MDOSING6ABCFAABC-123790567SEVSeverity/IntensityMigraineMIGRAINE SYMPTOMSMODERATEMODERATE2007-05-1630M POST-DOSEPT30MDOSING7ABCFAABC-123890567OCCUROccurrenceSensitivity To LightMIGRAINE SYMPTOMSYY2007-05-1630M POST-DOSEPT30MDOSING8ABCFAABC-123990567OCCUROccurrenceSensitivity To SoundMIGRAINE SYMPTOMSNN2007-05-1630M POST-DOSEPT30MDOSING9ABCFAABC-1231090567OCCUROccurrenceNauseaMIGRAINE SYMPTOMSNN2007-05-1630M POST-DOSEPT30MDOSING10ABCFAABC-1231290567OCCUROccurrenceAuraMIGRAINE SYMPTOMSYY2007-05-1630M POST-DOSEPT30MDOSING11ABCFAABC-1231390567SEVSeverity/IntensityMigraineMIGRAINE SYMPTOMSMILDMILD2007-05-1690M POST-DOSEPT90MDOSING12ABCFAABC-1231490567OCCUROccurrenceSensitivity To LightMIGRAINE SYMPTOMSNN2007-05-1690M POST-DOSEPT90MDOSING13ABCFAABC-1231590567OCCUROccurrenceSensitivity To SoundMIGRAINE SYMPTOMSNN2007-05-1690M POST-DOSEPT90MDOSING14ABCFAABC-1231690567OCCUROccurrenceNauseaMIGRAINE SYMPTOMSNN2007-05-1690M POST-DOSEPT90MDOSING15ABCFAABC-1231890567OCCUROccurrenceAuraMIGRAINE SYMPTOMSNN2007-05-1690M POST-DOSEPT90MDOSING relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABCCE Example This CRF collects details about rash events at each visit, until resolved. Rash AssessmentDate of AssessmentDD-MMM-YYYYAssociated AE reference numberxxRash Diameter________ ○ cm ○ inLesion Type & Count The collected data meet the following Findings About criteria: 1) Data that do not describe an Event or Intervention as a whole and 2) Data ("about" an Event or Intervention) that have Qualifiers of their own that can be represented in Findings variables (e.g., units, method). In this mock scenario, the rash event is considered a reportable AE; therefore the form design collects a reference number to the AE form where the event is captured. Data points collected on the Rash Assessment form can be represented in the Findings About domain and related to the AE via RELREC. Note that in the mock datasets below, the AE started on May 10, 2007, and the rash assessment was conducted on May 12 and May 19, 2007. Certain Required or Expected variables have been omitted in consideration of space and clarity. ae.xpt RowSTUDYIDDOMAINUSUBJIDAESEQAESPIDAETERM…AEBODSYS…AELOCAELATAESEVAESERAEACNAESTDTC…1XYZAEXYZ-789478695Injection site rash…General disorders and administration site conditions…ARMLEFTMILDNNOT APPLICABLE2007-05-10… fa.xpt RowSTUDYIDDOMAINUSUBJIDFASEQFASPIDFATESTCDFATESTFAOBJFAORRESFAORRESUFASTRESCFASTRESUVISITNUMEPOCHFADTC1XYZFAXYZ-7891234515DIAMDiameterInjection Site Rash2.5IN2.5IN3TREATMENT2007-05-122XYZFAXYZ-7891234525COUNTCountMacules26 to 100 relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1XYZAE Example The form below collects information about rheumatoid arthritis. In this mock scenario, rheumatoid arthritis is a prerequisite for participation in an osteoporosis trial and was not collected as a Medical History event. Rheumatoid Arthritis HistoryDate of AssessmentDD-MMM-YYYYDuring the past 6 months, how would you rate the following:Joint stiffness○ MILD ○ MODERATE ○ SEVEREInflammation○ MILD ○ MODERATE ○ SEVEREJoint swelling○ MILD ○ MODERATE ○ SEVEREJoint pain (arthralgia)○ MILD ○ MODERATE ○ SEVEREMalaise○ MILD ○ MODERATE ○ SEVEREDuration of early morning stiffness (hours and minutes)_____Hours _____Minutes The collected data meet the following Findings About criteria: Data ("about" an Event or Intervention) for which no Event or Intervention record has been collected or created. In this mock scenario, the rheumatoid arthritis history was assessed on August 13, 2006. fa.xpt RowSTUDYIDDOMAINUSUBJIDFASEQFATESTCDFATESTFAOBJFACATFAORRESFASTRESCFADTCFAEVLINT1ABCFAABC-1231SEVSeverity/IntensityJoint StiffnessRHEUMATOID ARTHRITIS HISTORYSEVERESEVERE2006-08-13-P6M2ABCFAABC-1232SEVSeverity/IntensityInflammationRHEUMATOID ARTHRITIS HISTORYMODERATEMODERATE2006-08-13-P6M3ABCFAABC-1233SEVSeverity/IntensityJoint SwellingRHEUMATOID ARTHRITIS HISTORYMODERATEMODERATE2006-08-13-P6M4ABCFAABC-1234SEVSeverity/IntensityArthralgiaRHEUMATOID ARTHRITIS HISTORYMODERATEMODERATE2006-08-13-P6M5ABCFAABC-1235SEVSeverity/IntensityMalaiseRHEUMATOID ARTHRITIS HISTORYMILDMILD2006-08-13-P6M6ABCFAABC-1236DURDurationEarly Morning StiffnessRHEUMATOID ARTHRITIS HISTORYPT1H30MPT1H30M2006-08-13-P6M Example In this example, details about bone-fracture events are collected. This form is designed to collect multiple entries of fracture information, including an initial entry for the most recent fracture prior to study participation, as well as entry of information for fractures that occur during the study. Bone Fracture AssessmentComplete form for most recent fracture prior to study participation.Enter Fracture Event Reference Number for all ○ Normal Healing Select all that apply: □ Complication x ○ No Select all that apply □ Therapeutic measure a The collected data meet the following Findings About criteria: (1) Data ("about" an Event or Intervention) that indicate the occurrence of related symptoms or therapies and (2) Data ("about" an event/intervention) for which no Event or Intervention record has been collected or created. Determining when data further describe the parent event record either as Variable Qualifiers or Supplemental Qualifiers may be dependent on data collection design. In the above form, responses are provided for the most recent fracture, but an event record reference number was not collected. For in-study fracture events, a reference number is collected, which would allow representing the responses as part of the Event record either as Supplemental Qualifiers and/or variables like --OUT and --CONTRT. The below domains reflect responses to each Bone Fracture Assessment question. The historical-fracture responses that are without a parent record are represented in the FA domain, while the current-fracture responses are represented as Event records with Supplemental Qualifiers. Historical Fractures Having No Event Records fa.xpt RowSTUDYIDDOMAINUSUBJIDFASEQFASPIDFATESTCDFATESTFAOBJFACATFAORRESFADTC1ABCFAABC -US-701-0021798654REASReasonBone FractureBONE FRACTURE ASSESSMENT - HISTORYFALL2006-04-102ABCFAABC -US-701-0022798654OUTOutcomeBone FractureBONE FRACTURE ASSESSMENT - HISTORYCOMPLICATIONS2006-04-103ABCFAABC -US-701-0023798654OCCUROccurrenceComplicationsBONE FRACTURE ASSESSMENTY2006-04-104ABCFAABC -US-701-0024798654OCCUROccurrenceTherapeutic MeasureBONE FRACTURE ASSESSMENTY2006-04-10 suppfa.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIGQEVAL1ABCFAABC -US-701-002FASEQ1FATYPFA TypeMOST RECENTCRF Current Fractures Having Event Records ce.xpt RowSTUDYIDDOMAINUSUBJIDCESEQCESPIDCETERMCELOCCEOUTCECONTRTCESTDTC1ABCCEABC -US-701-00211FractureARMNORMAL HEALINGY2006-07-032ABCCEABC -US-701-00222FractureLEGCOMPLICATIONSN2006-10-15 suppce.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIGQEVAL1ABCCEABC -US-701-002CESPID1REASReasonFALLCRF Example In this example, three AEs are pre-specified and are scheduled to be asked at each visit. If the occurrence is "Yes", then a complete AE record is collected on the AE form. Pre-Specified Adverse Events of Clinical InterestDate of AssessmentDD-MMM-YYYYDid the following occur? If Yes, then enter a complete record in the AE CRF Headache Respiratory infection Nausea ○ No ○ Yes ○ Not Done ○ No ○ Yes ○ Not Done ○ No ○ Yes ○ Not Done The collected data meet the following Findings About criteria: Data that indicate the occurrence of pre-specified adverse events. In this mock scenario, each response to the pre-specified terms is represented in the Findings About domain. For the "Y" responses, an AE record is represented in the AE domain with its respective Qualifiers and timing details. In the example below, the AE of "Headache" encompasses multiple pre-specified "Y" responses and the AE of "Nausea", collected on October 10, was reported that to have occurred and started on October 8 and ended on October 9. Note that in the example below, no relationship was collected to link the "Yes" responses with the AE entries; therefore, no RELREC was created. fa.xpt RowSTUDYIDDOMAINUSUBJIDFASEQFATESTCDFATESTFAOBJFAORRESFASTRESCFASTATVISITNUMVISITEPOCHFADTC1QRSFA12341OCCUROccurrenceHeadacheYY ae.xpt RowSTUDYIDDOMAINUSUBJIDAESEQAETERM…AEDECOD…AEPRESPAEBODSYS…AESEV…AEACNEPOCHAESTDTCAEENDTC1QRSAE12341Headache…Headache…YNervous system disorders…MILD…NONETREATMENT2005-09-30 Example In this example, the following CRF is used to capture data about pre-specified symptoms of the disease under study on a daily basis. The date of the assessment is captured, but start and end timing of the events are not. SYMPTOMSINVESTIGATOR GERD SYMPTOM MEASUREMENT The collected data meet the following Findings About criteria: 1) data that do not describe an Event or Intervention as a whole, and 2) data ("about" an Event or Intervention) having Qualifiers that can be represented in Findings variables (e.g., units, method). The data below represent data from two visits for one subject. Records occur in blocks of three for Vomit, and in blocks of two for Diarrhea and Nausea. Rows 1-3:Show the results for the Vomiting tests at Visit 1.Rows 4-5:Show the results for the Diarrhea tests at Visit 1.Rows 6-7:Show the results for the Nausea tests at Visit 1.Rows 8-10:Show the results for the Vomiting tests at Visit 2. These indicate that Vomiting was absent at Visit 2.Rows 11-12:Show the results for the Diarrhea tests at Visit 2.Rows 13-14:Indicate that Nausea was not assessed at Visit 2. fa.xpt RowSTUDYIDDOMAINUSUBJIDFASEQFATESTCDFATESTFAOBJFACATFAORRESFAORRESUFASTRESCFASTRESUFASTATVISITNUMVISITFADTC1XYZFAXYZ-701-0021VOLVolumeVomitGERD250mL250mL Example This example is similar to the one above except that with the following CRF, which includes a separate column to collect the occurrence of symptoms, measurements are collected only for symptoms that occurred. There is a record for the occurrence test for each symptom. If Vomiting occurs, there are three additional records; for each occurrence of Diarrhea or Nausea, there are two additional records. Whether there are adverse event records related to these symptoms depends on agreements in place for the study about whether these symptoms are considered reportable adverse events. SYMPTOMS The collected data meet the following Findings About criteria: 1) data that do not describe an Event or Intervention as a whole; 2) data ("about" an Event or Intervention) having Qualifiers that can be represented in Findings variables (e.g., units, method); and 3) data ("about" an Event or Intervention) that indicate the occurrence of related symptoms or therapies. The data below represent two visits for one subject. Rows 1-4:Show the results for the Vomiting tests at Visit 1.Rows 5-7:Show the results for the Diarrhea tests at Visit 1.Rows 8-10:Show the results for the Nausea tests at Visit 1.Row 11:Shows that Vomiting was absent at Visit 2.Rows 12-14:Show the results for the Diarrhea tests at Visit 2.Row 15:Shows that Nausea was not assessed at Visit 2. fa.xpt RowSTUDYIDDOMAINUSUBJIDFASEQFATESTCDFATESTFAOBJFACATFAORRESFAORRESUFASTRESCFASTRESUFASTATVISITNUMEPOCHFADTC1XYZFAXYZ-701-0021OCCUROccurrenceVomitGERDY Example The adverse event module collects, instead of a single assessment of severity, assessments of severity at each visit, as follows: At each visit, record severity of the Adverse Event.Visit123456Severity The collected data meet the following Findings About criteria: data that do not describe an Event or Intervention as a whole. Row 1:Shows the record for a verbatim term of "Morning queasiness", for which the maximum severity over the course of the event was "Moderate".Row 2:Shows the record for a verbatim term of "Watery stools", for which "Mild" severity was collected at Visits 2 and 3 before the event ended. ae.xpt RowDOMAINUSUBJIDAESEQAETERM…AEDECOD…AESEV…AESTDTCAEENDTC1AE1231Morning queasiness…Nausea…MODERATE…2006-02-012006-02-232AE1232Watery stools…Diarrhea…MILD…2006-02-012006-02-15 Rows 1-4:Show severity data collected at the four visits that occurred between the start and end of the AE, "Morning queasiness". FAOBJ = "NAUSEA", which is the value of AEDECOD in the associated AE record.Rows 5-6:Show severity data collected at the two visits that occurred between the start and end of the AE, "Watery stools." FAOBJ = "DIARRHEA", which is the value of AEDECOD in the associated AE record. fa.xpt RowSTUDYIDDOMAINUSUBJIDFASEQFATESTCDFATESTFAOBJFAORRESVISITNUMVISITFADTC1XYZFAXYZ-US-701-0021SEVSeverity/IntensityNauseaMILD2VISIT 22006-02-022XYZFAXYZ-US-701-0022SEVSeverity/IntensityNauseaMODERATE3VISIT 32006-02-093XYZFAXYZ-US-701-0023SEVSeverity/IntensityNauseaMODERATE4VISIT 42006-02-164XYZFAXYZ-US-701-0024SEVSeverity/IntensityNauseaMILD5VISIT 52006-02-235XYZFAXYZ-US-701-0025SEVSeverity/IntensityDiarrheaMILD2VISIT 22006-02-026XYZFAXYZ-US-701-0026SEVSeverity/IntensityDiarrheaMILD3VISIT 32006-02-09 RELREC dataset Depending on how the relationships were collected, in this example, RELREC could be created with either two or six RELIDs. With two RELIDs, the sponsor is describing that the severity ratings are related to the AE as well as being related to each other. With six RELIDs, the sponsor is describing that the severity ratings are related to the AE only (and not to each other). Example with two RELIDs: relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABCAEXYZ-US-701-002AESEQ1 Example with six RELIDs: relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABCAEXYZ-US-701-002AESEQ1 6.4.5 Skin ResponseSR – Description/OverviewA findings about domain for submitting dermal responses to antigens. SR – Specificationsr.xpt, Skin Response — Findings, Version 3.3. One record per finding, per object, per time point, per visit per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar
Timing
TimingPlanned study day of the visit based upon RFSTDTC in Demographics.PermTAETORDPlanned Order of Element within ArmNum TimingNumber that gives the planned order of the Element within the Arm.PermEPOCHEpochChar(EPOCH)TimingEpoch associated with the date/time of the observation. Examples: "SCREENING", "TREATMENT", and "FOLLOW-UP".PermSRDTCDate/Time of CollectionCharISO 8601TimingCollection date and time of an observation represented in ISO 8601.ExpSRDYStudy Day of Visit/Collection/ExamNum TimingActual study day of visit/collection/exam expressed in integer days relative to sponsor- defined RFSTDTC in Demographics.PermSRTPTPlanned Time Point NameChar Timing
TimingNumerical version of SRTPT to aid in sorting.PermSRELTMPlanned Elapsed Time from Time Point RefCharISO 8601TimingPlanned elapsed time (in ISO 8601) relative to a fixed time point reference (SRTPTREF). Not a clock time or a date time variable. Represented as an ISO 8601 duration. Examples: "-PT15M" to represent the period of 15 minutes prior to the reference point indicated by EGTPTREF, or "PT8H" to represent the period of 8 hours after the reference point indicated by SRTPTREF.PermSRTPTREFTime Point ReferenceChar TimingName of the fixed reference point referred to by SRELTM, SRTPTNUM, and SRTPT. Example: "INTRADERMAL INJECTION".PermSRRFTDTCDate/Time of Reference Time PointCharISO 8601TimingDate/time of the reference time point, SRTPTREF.Perm ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). SR – Assumptions
SR – ExamplesExample In this example, the subject is dosed with increasing concentrations of Johnson Grass IgE. Rows 1-4:Show responses associated with the administration of a Histamine Control.Rows 5-8:Show responses associated with the administration of Johnson Grass IgE. These records describe the dose response to different concentrations of Johnson Grass IgE antigen, as reflected in SROBJ. All rows show a specific location on the BACK (e.g., QUADRANT1). Since Quandrant1, Quandrant2, etc., are not currently part of the SDTM terminology, the sponsor has decided to include this information in the SRSUBLOC SUPPQUAL variable. sr.xpt RowSTUDYIDDOMAINUSUBJIDSRSEQSRTESTCDSRTESTSROBJSRORRESSRORRESUSRSTRESCSRSTRESNSRSTRESUSRLOCVISITNUMVISIT1SPI-001SRSPI-001-110351FLRMDIAMFlare Mean DiameterHistamine Control 10 mg/mL5mm55mmBACK1VISIT 12SPI-001SRSPI-001-110352FLRMDIAMFlare Mean DiameterHistamine Control 10 mg/mL4mm44mmBACK1VISIT 13SPI-001SRSPI-001-110353FLRMDIAMFlare Mean DiameterHistamine Control 10 mg/mL5mm55mmBACK1VISIT 14SPI-001SRSPI-001-110354FLRMDIAMFlare Mean DiameterHistamine Control 10 mg/mL5mm55mmBACK1VISIT 15SPI-001SRSPI-001-110355FLRMDIAMFlare Mean DiameterJohnson Grass 0.05 BAU/mL10mm1010mmBACK1VISIT 16SPI-001SRSPI-001-110356FLRMDIAMFlare Mean DiameterJohnson Grass 0.10 BAU/mL11mm1111mmBACK1VISIT 17SPI-001SRSPI-001-110357FLRMDIAMFlare Mean DiameterJohnson Grass 0.15 BAU mL20mm2020mmBACK1VISIT 18SPI-001SRSPI-001-110358FLRMDIAMFlare Mean DiameterJohnson Grass 0.20 BAU/mL30mm3030mmBACK1VISIT 1 suppsr.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIG1SPI-001SRSPI-001-11035SRSEQ1SRSUBLOCAnatomical Sub-LocationQUADRANT1CRF2SPI-001SRSPI-001-11035SRSEQ2SRSUBLOCAnatomical Sub-LocationQUADRANT2CRF3SPI-001SRSPI-001-11035SRSEQ3SRSUBLOCAnatomical Sub-LocationQUADRANT3CRF4SPI-001SRSPI-001-11035SRSEQ4SRSUBLOCAnatomical Sub-LocationQUADRANT4CRF5SPI-001SRSPI-001-11035SRSEQ5SRSUBLOCAnatomical Sub-LocationQUADRANT1CRF6SPI-001SRSPI-001-11035SRSEQ6SRSUBLOCAnatomical Sub-LocationQUADRANT2CRF7SPI-001SRSPI-001-11035SRSEQ7SRSUBLOCAnatomical Sub-LocationQUADRANT3CRF8SPI-001SRSPI-001-11035SRSEQ8SRSUBLOCAnatomical Sub-LocationQUADRANT4CRF Example In this example, the study product dose, Dog Epi IgG, was administered at increasing concentrations. The size of the wheal is being measured (reaction to Dog Epi IgG ) to evaluate the efficacy of the Dog Epi IgG extract versus a Negative Control (NC) and a Positive Control (PC) in the testing of allergenic extracts. While SROBJ is populated with information about the substance administered, full details regarding the study product would be submitted in the EX dataset. The relationship between the SR records and the EX records would be represented using RELREC. Rows 1-6:Show the response (description and reaction grade) to the study product at a series of different dose levels, the latter reflected in SROBJ. The descriptions of SRORRES values are correlated to a grade and the grade values are stored in SRSTRESC.Rows 7-12:Show the results of wheal diameter measurements in response to the study product at a series of different dose levels. sr.xpt RowSTUDYIDDOMAINUSUBJIDSRSEQSRSPIDSRTESTCDSRTESTSROBJSRORRESSRORRESUSRSTRESCSRSTRESNSRSTRESUSRLOCVISITNUMVISIT1CC-001SRCC-001-10111RCTGRDEReaction GradeDog Epi 0 mgNEGATIVE ex.xpt RowSTUDYIDDOMAINUSUBJIDEXSPIDEXTRTEXDOSEEXDOSEUEXROUTEEXLOC1CC-001EX1011Dog Epi IgG0mgCUTANEOUSFOREARM2CC-001EX1012Dog Epi IgG0.1mgCUTANEOUSFOREARM3CC-001EX1013Dog Epi IgG0.5mgCUTANEOUSFOREARM4CC-001EX1014Dog Epi IgG1mgCUTANEOUSFOREARM5CC-001EX1015Dog Epi IgG1.5mgCUTANEOUSFOREARM6CC-001EX1016Dog Epi IgG2mgCUTANEOUSFOREARM The relationships between SR and EX records are represented at the record level in RELREC. relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1CC-001SRCC-001-101SRSPID1 Example This example shows the results from a tuberculin PPD skin tests administered using the Mantoux technique. The subject was given an intradermal injection of standard tuberculin purified protein derivative (PPD-S) in the left forearm at Visit 1 (See Procedure Agents record below). At Visit 2, the induration diameter and presence of blistering were recorded. Because the tuberculin PPD skin test cannot be interpreted using the induration diameter and blistering alone (e.g., risk for being infected with TB must also be considered), the interpretation of the skin test resides in its own row. The time point variables show that the planned time for reading the test was 48 hours after Mantoux administration. However, a comparison of datetime values in SRDTC and SRRFTDTC shows that in this case the test was read at 53 hours and 56 minutes after Mantoux administration. Row 1:Shows the diameter in millimeters of the induration after receiving an intradermal injection of 0.1 mL containing 5TU of PPD-S in the left forearm.Row 2:Shows the presence of blistering at the tuberculin PPD skin test site.Row 3:Shows the interpretation of the tuberculin PPD skin test. SRGRPID is used to tie together the results to the interpretation. sr.xpt RowSTUDYIDDOMAINUSUBJIDSRSEQSRGRPIDSRTESTCDSRTESTSROBJSRORRESSRORRESUSRSTRESCSRSTRESNSRSTRESUSRLOCSRLATSRMETHODVISITNUMVISITEPOCHSRDTCSRTPTSRELTMSRTPTREFSRRFTDTC1ABCSRABC-00111INDURDIAInduration DiameterTuberculin PPD-S16mm1616mmFOREARMLEFTRULER2VISIT 2OPEN LABEL TREATMENT2011-01-19T14:08:2448 HPT48HMANTOUX ADMINISTRATION2011-01-17T08:30:002ABCSRABC-00121BLISTERBlisteringTuberculin PPD-SY The Tuberculin PPD skin test administration was represented in the AG domain. ag.xpt RowSTUDYIDDOMAINUSUBJIDAGSEQAGTRTAGDOSEAGDOSUAGVAMTAGVAMTUVISITNUMVISITEPOCHAGSTDTC1ABCAGABC-0011Tuberculin PPD-S5TU0.1mL1VISIT 1OPEN LABEL TREATMENT2011-01-17T08:30:00 Relationships between SR and AG records were shown in RELREC. relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1ABCSRABC-001SRGRPID1 The table below describes how Trial Design datasets are grouped in this document. Section 7 Organization Trial Design datasets that describe the planned design of the study and provide the representation of study treatment in its most granular components: Trial Arms (TA) A trial design domain that contains each planned arm in the trial. An arm is described as an ordered sequence of elements. Trial Elements (TE) A trial design domain that contains the element code that is unique for each element, the element description, and the rules for starting and ending an element. 7.3 Schedule for AssessmentsTrial Design datasets that describe the protocol-defined planned schedule of subject encounters at the healthcare facility where the study is being conducted: Trial Visits (TV) A trial design domain that contains the planned order and number of visits in the study within each arm. Trial Disease Assessments (TD) A trial design domain that provides information on the protocol-specified disease assessment schedule, to be used for comparison with the actual occurrence of the efficacy assessments in order to determine whether there was good compliance with the schedule. TD describes the planned schedule of disease assessments, when that schedule is not necessarily visit based. Trial Disease Milestones (TM) A trial design domain that is used to describe disease milestones, which are observations or activities anticipated to occur in the course of the disease under study, and which trigger the collection of data. 7.4 Trial Summary and EligibilityTrial Design datasets that describe the characteristics of the trial: Trial Inclusion/Exclusion Criteria (TI) A trial design domain that contains one record for each of the inclusion and exclusion criteria for the trial. This domain is not subject oriented. Trial Summary (TS) A trial design domain that contains one record for each trial summary characteristic. This domain is not subject oriented. 7.5 How to Model the Design of a Clinical TrialA short guidance for how to develop the Trial Design datasets for any study.7.1 Introduction to Trial Design Model Datasets7.1.1 Purpose of Trial Design ModelICH E3, Guidance for Industry, Structure and Content of Clinical Study Reports (available at http://www.ich.org/products/guidelines/efficacy/article/efficacy-guidelines.html), Section 9.1, calls for a brief, clear description of the overall plan and design of the study, and supplies examples of charts and diagrams for this purpose in Annex IIIa and Annex IIIb. Each Annex corresponds to an example trial, and each shows a diagram describing the study design and a table showing the schedule of assessments. The Trial Design Model provides a standardized way to describe those aspects of the planned conduct of a clinical trial shown in the study design diagrams of these examples. The standard Trial Design Datasets will allow reviewers to:
Modeling a clinical trial in this standardized way requires the explicit statement of certain decision rules that may not be addressed or may be vague or ambiguous in the usual prose protocol document. Prospective modeling of the design of a clinical trial should lead to a clearer, better protocol. Retrospective modeling of the design of a clinical trial should ensure a clear description of how the trial protocol was interpreted by the sponsor. 7.1.2 Definitions of Trial Design ConceptsA clinical trial is a scientific experiment involving human subjects, which is intended to address certain scientific questions (the objectives of the trial). (See the CDISC Glossary, https://www.cdisc.org/standards/semantics/glossary, for more complete definitions of clinical trial and objective.) ConceptDefinitionTrial DesignThe design of a clinical trial is a plan for what will be done to subjects, and what data will be collected about them, in the course of the trial, to address the trial's objectives.EpochAs part of the design of a trial, the planned period of subjects' participation in the trial is divided into Epochs. Each Epoch is a period of time that serves a purpose in the trial as a whole. That purpose will be at the level of the primary objectives of the trial. Typically, the purpose of an Epoch will be to expose subjects to a treatment, or to prepare for such a treatment period (e.g., determine subject eligibility, washout previous treatments) or to gather data on subjects after a treatment has ended. Note that at this high level, a "treatment" is a treatment strategy, which may be simple (e.g., exposure to a single drug at a single dose) or complex. Complex treatment strategies could involve tapering through several doses, titrating dose according to clinical criteria, complex regimens involving multiple drugs, or strategies that involve adding or dropping drugs according to clinical criteria.ArmAn Arm is a planned path through the trial. This path covers the entire time of the trial. The group of subjects assigned to a planned path is also often colloquially called an Arm. The group of subjects assigned to an Arm is also often called a treatment group, and in this sense, an Arm is equivalent to a treatment group.Study CellSince the trial as a whole is divided into Epochs, each planned path through the trial (i.e., each Arm) is divided into pieces, one for each Epoch. Each of these pieces is called a Study Cell. Thus, there is a Study Cell for each combination of Arm and Epoch. Each Study Cell represents an implementation of the purpose of its associated Epoch. For an Epoch whose purpose is to expose subjects to treatment, each Study Cell associated with the Epoch has an associated treatment strategy. For example, a three-Arm parallel trial might have a Treatment Epoch whose purpose is to expose subjects to one of three study treatments: placebo, investigational product, or active control. There would be three Study Cells associated with the Treatment Epoch, one for each Arm. Each of these Study Cells exposes the subject to one of the three study treatments. Another example involving more complex treatment strategies: A trial compares the effects of cycles of chemotherapy drug A given alone or in combination with drug B, where drug B is given as a pre-treatment to each cycle of drug A.ElementAn Element is a basic building block in the trial design. It involves administering a planned intervention, which may be treatment or no treatment, during a period of time. Elements for which the planned intervention is "no treatment" would include Elements for screening, washout, and follow-up.Study Cells and ElementsMany, perhaps most, clinical trials involve a single, simple administration of a planned intervention within a Study Cell, but for some trials, the treatment strategy associated with a Study Cell may involve a complex series of administrations of treatment. It may be important to track the component steps in a treatment strategy both operationally and because secondary objectives and safety analyses require that data be grouped by the treatment step during which it was collected. The steps within a treatment strategy may involve different doses of drug, different drugs, or different kinds of care, as in pre-operative, operative, and post-operative periods surrounding surgery. When the treatment strategy for a Study Cell is simple, the Study Cell will contain a single Element, and for many purposes there is little value in distinguishing between the Study Cell and the Element. However, when the treatment strategy for a Study Cell consists of a complex series of treatments, a Study Cell can contain multiple Elements. There may be a fixed sequence of Elements, or a repeating cycle of Elements, or some other complex pattern. In these cases, the distinction between a Study Cell and an Element is very useful.BranchIn a trial with multiple Arms, the protocol plans for each subject to be assigned to one Arm. The time within the trial at which this assignment takes place is the point at which the Arm paths of the trial diverge, and so is called a branch point. For many trials, the assignment to an Arm happens all at one time, so the trial has one branch point. For other trials, there may be two or more branches that collectively assign a subject to an Arm. The process that makes this assignment may be a randomization, but it need not be.TreatmentsThe word "treatment" may be used in connection with Epochs, Study Cells, or Elements, but has somewhat different meanings in each context:
The distinctions between these levels are not rigid, and depend on the objectives of the trial. For example, route is generally a detail of dosing, but in a bioequivalence trial that compared IV and oral administration of Drug X, route is clearly part of Study Cell treatment. VisitA clinical encounter. The notion of a Visit derives from trials with outpatients, where subjects interact with the investigator during Visits to the investigator's clinical site. However, the term is used in other trials, where a trial Visit may not correspond to a physical Visit. For example, in a trial with inpatients, time may be subdivided into Visits, even though subjects are in hospital throughout the trial. For example, data for a screening Visit may be collected over the course of more than one physical visit. One of the main purposes of Visits is the performance of assessments, but not all assessments need take place at clinic Visits; some assessments may be performed by means of telephone contacts, electronic devices or call-in systems. The protocol should specify what contacts are considered Visits and how they are defined.7.1.3 Current and Future Contents of the Trial Design ModelThe datasets currently included in the Trial Design Model:
The Trial Inclusion/Exclusion Criteria (TI) dataset is discussed in Section 7.4.1, Trial Inclusion/Exclusion Criteria. The Inclusion/Exclusion Criteria Not Met (IE) domain described in Section 6.3.4, Inclusion/Exclusion Criteria Not Met, contains the actual exceptions to those criteria for enrolled subjects. The current Trial Design Model has limitations in representing protocols, which include the following:
The last two situations arise in dose-escalation studies where increasing doses are given until stopping criteria are met. Some dose-escalation studies enroll a new cohort of subjects for each new dose, and so, at the planning stage, have an indefinite number of Arms. Other dose-escalation studies give new doses to a continuing group of subjects, and so are planned with an indefinite number of Epochs. There may also be limitations in representing other patterns of Elements within a Study Cell that are more complex than a simple sequence. For the purpose of submissions about trials that have already completed, these limitations are not critical, so it is expected that development of the Trial Design Model to address these limitations will have a minimal impact on SDTM. 7.2 Experimental Design (TA and TE)This subsection contains the Trial Design datasets that describe the planned design of the study, and provide the representation of study treatment in its most granular components (Section 7.2.2, Trial Elements (TE)) as well as the representation of all sequences of these components (Section 7.2.1, Trial Arms (TA)) as specified by the study protocol. The TA and TE datasets are interrelated, and they provide the building block for the development of the subject-level treatment information (see Sections 5.2, Demographics (DM), and 5.3, Subject Elements (SE), for the subject's actual study treatment information). 7.2.1 Trial ArmsTA – Description/OverviewA trial design domain that contains each planned arm in the trial. This section contains:
TA – Specificationta.xpt, Trial Arms — Trial Design, Version 3.3. One record per planned Element per Arm, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). TA – Assumptions
TA – ExamplesThe core of the Trial Design Model is the Trial Arms (TA) dataset. For each Arm of the trial, it contains one record for each occurrence of an Element in the path of the Arm. Although the Trial Arms dataset has one record for each trial Element traversed by subjects assigned to the Arm, it is generally more useful to work out the overall design of the trial at the Study Cell level first, then to work out the Elements within each Study Cell, and finally to develop the definitions of the Elements that are contained in the Trial Elements table. It is generally useful to draw diagrams, like those mentioned in ICH E3, when working out the design of a trial. The protocol may include a diagram that can serve as a starting point. Such a diagram can then be converted into a Trial Design Matrix, which displays the Study Cells and which can be, in turn, converted into the Trial Arms dataset. This section uses example trials of increasing complexity, numbered 1 to 7, to illustrate the concepts of trial design. For each example trial, the process of working out the Trial Arms table is illustrated by means of a series of diagrams and tables, including the following:
Readers are advised to read the following section with Example 1 before reading other examples, since Example 1 explains the conventions used for the diagrams and tables. Example Diagrams that represent study schemas generally conceive of time as moving from left to right, using horizontal lines to represent periods of time and slanting lines to represent branches into separate treatments, convergence into a common follow-up, or cross-over to a different treatment. In this document, diagrams are drawn using "blocks" corresponding to trial Elements rather than horizontal lines. Trial Elements are the various treatment and non-treatment time periods of the trial, and we want to emphasize the separate trial Elements that might otherwise be "hidden" in a single horizontal line. See Section 7.2.2, Trial Elements (TE), for more information about defining trial Elements. In general, the Elements of a trial will be fairly clear. However, in the process of working out a trial design, alternative definitions of trial Elements may be considered, in which case diagrams for each alternative may be constructed. In the study schema diagrams in this document, the only slanting lines used are those that represent branches, the decision points where subjects are divided into separate treatment groups. One advantage of this style of diagram, which does not show convergence of separate paths into a single block, is that the number of Arms in the trial can be determined by counting the number of parallel paths at the right end of the diagram. Below is the study schema diagram for Example Trial 1, a simple parallel trial. This trial has three Arms, corresponding to the three possible left-to-right "paths" through the trial. Each path corresponds to one of the three treatment Elements at the right end of the diagram. Note that the randomization is represented by the three red arrows leading from the Run-in block. Example Trial 1, Parallel Design Study Schema The next diagram for this trial shows the Epochs of the trial, indicates the three Arms, and shows the sequence of Elements for each group of subjects in each Epoch. The arrows are at the right hand side of the diagram because it is at the end of the trial that all the separate paths through the trial can be seen. Note that, in this diagram, the randomization, which was shown using three red arrows connecting the Run-in block with the three treatment blocks in the first diagram, is now indicated by a note with an arrow pointing to the line between two Epochs. Example Trial 1, Parallel Design Prospective view The next diagram can be thought of as the "retrospective" view of a trial, the view back from a point in time when a subject's assignment to an Arm is known. In this view, the trial appears as a grid, with an Arm represented by a series of study cells, one for each Epoch, and a sequence of Elements within each study cell. In this trial, as in many trials, there is exactly one Element in each study cell, but later examples will illustrate that this is not always the case. Example Trial 1, Parallel Design Retrospective view The next diagram shows the trial from the viewpoint of blinded participants. To blinded participants in this trial, all Arms look alike. They know when a subject is in the Screen Element, or the Run-in Element, but when a subject is in the Treatment Epoch, they know only that the subject is in an Element that involves receiving a study drug, not which study drug, and therefore not which Element. Example Trial 1, Parallel Design Blinded View A trial design matrix is a table with a row for each Arm in the trial and a column for each Epoch in the trial. It is closely related to the retrospective view of the trial, and many users may find it easier to construct a table than to draw a diagram. The cells in the matrix represent the Study Cells, which are populated with trial Elements. In this trial, each Study Cell contains exactly one Element. The columns of a Trial Design Matrix are the Epochs of the trial, the rows are the Arms of the trial, and the cells of the matrix (the Study Cells) contain Elements. Note that the randomization is not represented in the Trial Design Matrix. All the diagrams above and the trial design matrix below are alternative representations of the trial design. None of them contains all the information that will be in the finished Trial Arms dataset, but users may find it useful to draw some or all of them when working out the dataset. Trial Design Matrix for Example Trial 1
For Example Trial 1, the conversion of the Trial Design Matrix into the Trial Arms dataset is straightforward. For each cell of the matrix, there is a record in the Trial Arms dataset. ARM, EPOCH, and ELEMENT can be populated directly from the matrix. TAETORD acts as a sequence number for the Elements within an Arm, so it can be populated by counting across the cells in the matrix. The randomization information, which is not represented in the Trial Design Matrix, is held in TABRANCH in the Trial Arms dataset. TABRANCH is populated only if there is a branch at the end of an Element for the Arm. When TABRANCH is populated, it describes how the decision at the branch point would result in a subject being in this Arm. ta.xpt RowSTUDYIDDOMAINARMCDARMTAETORDETCDELEMENTTABRANCHTATRANSEPOCH1EX1TAPPlacebo1SCRNScreen Example The diagram below is for a crossover trial. However, the diagram does not use the crossing slanted lines sometimes used to represent crossover trials, since the order of the blocks is sufficient to represent the design of the trial. Slanted lines are used only to represent the branch point at randomization, when a subject is assigned to a sequence of treatments. As in most crossover trials, the Arms are distinguished by the order of treatments, with the same treatments present in each Arm. Note that even though all three Arms of this trial end with the same block, the block for the follow-up Element, the diagram does not show the Arms converging into one block. Also note that the same block (the "Rest" Element) occurs twice within each Arm. Elements are conceived of as "reusable" and can appear in more than one Arm, in more than one Epoch, and more than once in an Arm. Example Trial 2, Crossover Trial Study Schema The next diagram for this crossover trial shows the prospective view of the trial, identifies the Epoch and Arms of the trial, and gives each a name. As for most crossover studies, the objectives of the trial will be addressed by comparisons between the Arms and by within-subject comparisons between treatments. The design thus depends on differentiating the periods during which the subject receives the three different treatments and so there are three different treatment Epochs. The fact that the rest periods are identified as separate Epochs suggests that these also play an important part in the design of the trial; they are probably designed to allow subjects to return to "baseline", with data collected to show that this occurred. Note that Epochs are not considered "reusable", so each Epoch has a different name, even though all the Treatment Epochs are similar and both the Rest Epochs are similar. As with the first example trial, there is a one-to-one relationship between the Epochs of the trial and the Elements in each Arm. Example Trial 2, Crossover Trial Prospective View The next diagram shows the retrospective view of the trial. Example Trial 2, Crossover Trial Retrospective View The last diagram for this trial shows the trial from the viewpoint of blinded participants. As in the simple parallel trial above, blinded participants see only one sequence of Elements, since during the treatment Epochs they do not know which of the treatment Elements a subject is in. Example Trial 2, Crossover Trial Blinded View The trial design matrix for the crossover example trial is shown below. It corresponds closely to the retrospective diagram above. Trial Design Matrix for Example Trial 2
It is straightforward to produce the Trial Arms dataset for this crossover trial from the diagram showing Arms and Epochs, or from the Trial Design Matrix. ta.xpt RowSTUDYIDDOMAINARMCDARMTAETORDETCDELEMENTTABRANCHTATRANSEPOCH1EX2TAP-5-10Placebo-5mg-10mg1SCRNScreenRandomized to Placebo - 5 mg - 10 mg Example Each of the paths for the trial shown in the diagram below goes through one branch point at randomization, and then through another branch point when response is evaluated. This results in four Arms, corresponding to the number of possible paths through the trial, and also to the number of blocks at the right end of the diagram. The fact that there are only two kinds of block at the right end ("Open DRUG X" and "Rescue") does not affect the fact that there are four "paths" and thus four Arms. Example Trial 3, Multiple Branches Study Schema The next diagram for this trial is the prospective view. It shows the Epochs of the trial and how the initial group of subjects is split into two treatment groups for the double blind treatment Epoch, and how each of those initial treatment groups is split in two at the response evaluation, resulting in the four Arms of this trial The names of the Arms have been chosen to represent the outcomes of the successive branches that, together, assign subjects to Arms. These compound names were chosen to facilitate description of subjects who may drop out of the trial after the first branch and before the second branch. See DM Example 7, which illustrates DM and SE data for such subjects. Example Trial 3, Multiple Branches Prospective View The next diagram shows the retrospective view. As with the first two example trials, there is one Element in each study cell. Example Trial 3, Multiple Branches Retrospective View The last diagram for this trial shows the trial from the viewpoint of blinded participants. Since the prospective view is the view most relevant to study participants, the blinded view shown here is a prospective view. Since blinded participants can tell which treatment a subject receives in the Open Label Epoch, they see two possible element sequences. Example Trial 3, Multiple Branches Blinded View The trial design matrix for this trial can be constructed easily from the diagram showing Arms and Epochs. Trial Design Matrix for Example Trial 3
Creating the Trial Arms dataset for Example Trial 3 is similarly straightforward. Note that because there are two branch points in this trial, TABRANCH is populated for two records in each Arm. Note also that the values of ARMCD, like the values of ARM, reflect the two separate processes that result in a subject's assignment to an Arm. ta.xpt RowSTUDYIDDOMAINARMCDARMTAETORDETCDELEMENTTABRANCHTATRANSEPOCH1EX3TAAAA-Open A1SCRNScreenRandomized to Treatment A See Section 7.2.1.1 Trial Arms Issues, Issue 1, "Distinguishing Between Branches and Transitions", for additional discussion of when a decision point in a trial design should be considered to give rise to a new Arm. Example The diagram below uses a new symbol, a large curved arrow representing the fact that the chemotherapy treatment (A or B) and the rest period that follows it are to be repeated. In this trial, the chemotherapy "cycles" are to be repeated until disease progression. Some chemotherapy trials specify a maximum number of cycles, but protocols that allow an indefinite number of repeats are not uncommon. Example Trial 4, Cyclical Chemotherapy Study Schema The next diagram shows the prospective view of this trial. Note that, in spite of the repeating element structure, this is, at its core, a two-arm parallel study, and thus has two Arms. In SDTMIG 3.1.1, there was an implicit assumption that each Element must be in a separate Epoch, and trials with cyclical chemotherapy were difficult to handle. The introduction of the concept of study cells, and the dropping of the assumption that Elements and Epochs have a one-to-one relationship resolves these difficulties. This trial is best treated as having just three Epochs, since the main objectives of the trial involve comparisons between the two treatments, and do not require data to be considered cycle by cycle. Example Trial 4, Cyclical Chemotherapy Prospective View The next diagram shows the retrospective view of this trial. Example Trial 4, Cyclical Chemotherapy Retrospective View For the purpose of developing a Trial Arms dataset for this oncology trial, the diagram must be redrawn to explicitly represent multiple treatment and rest elements. If a maximum number of cycles is not given by the protocol, then, for the purposes of constructing an SDTM Trial Arms dataset for submission, which can only take place after the trial is complete, the number of repeats included in the Trial Arms dataset should be the maximum number of repeats that occurred in the trial. The next diagram assumes that the maximum number of cycles that occurred in this trial was four. Some subjects will not have received all four cycles, because their disease progressed. The rule that directed that they receive no further cycles of chemotherapy is represented by a set of green arrows, one at the end of each Rest Epoch, that shows that a subject "skips forward" if their disease progresses. In the Trial Arms dataset, each "skip forward" instruction is a transition rule, recorded in the TATRANS variable; when TATRANS is not populated, the rule is to transition to the next Element in sequence. Example Trial 4, Cyclical Chemotherapy Retrospective View with Explicit Repeats The logistics of dosing mean that few oncology trials are blinded, if this trial is blinded, then the next diagram shows the trial from the viewpoint of blinded participant. Example Trial 4, Cyclical Chemotherapy Blinded View The Trial Design Matrix for Example Trial 4 corresponds to the diagram showing the retrospective view with explicit repeats of the treatment and Rest Elements. As noted above, the Trial Design Matrix does not include information on when randomization occurs; similarly, information corresponding to the "skip forward" rules is not represented in the Trial Design Matrix. Trial Design Matrix for Example Trial 4
The Trial Arms dataset for Example Trial 4 requires the use of the TATRANS variable in the Trial Arms dataset to represent the "repeat until disease progression" feature. In order to represent this rule in the diagrams that explicitly represent repeated elements, a green "skip forward" arrow was included at the end of each element where disease progression is assessed. In the Trial Arms dataset, TATRANS is populated for each Element with a green arrow in the diagram. In other words, if there is a possibility that a subject will, at the end of this Element, "skip forward" to a later part of the Arm, then TATRANS is populated with the rule describing the conditions under which a subject will go to a later Element. If the subject always goes to the next Element in the Arm (as was the case for the first three example trials presented here) then TATRANS is null. The Trial Arms datasets presented below corresponds to the Trial Design Matrix above. ta.xpt RowSTUDYIDDOMAINARMCDARMTAETORDETCDELEMENTTABRANCHTATRANSEPOCH1EX4TAAA1SCRNScreenRandomized to A Example Example Trial 5 is much like the last oncology trial in that the two treatments being compared are given in cycles, and the total length of the cycle is the same for both treatments. In this trial, however, Treatment A is given over longer duration than Treatment B. Because of this difference in treatment patterns, this trial cannot be blinded. Example Trial 5, Different Chemo Durations Study Schema In SDTMIG 3.1.1, the assumption of a one-to-one relationship between Elements and Epochs made this example difficult to handle. However, without that assumption, this trial is essentially the same as Trial 4. The next diagram shows the retrospective view of this trial. Example Trial 5, Cyclical Chemotherapy Retrospective View The Trial Design Matrix for this trial is almost the same as for Example Trial 4; the only difference is that the maximum number of cycles for this trial was assumed to be three. Trial Design Matrix for Example Trial 5
The Trial Arms dataset for this trial shown below corresponds to the Trial Design Matrix above. ta.xpt RowSTUDYIDDOMAINARMCDARMTAETORDETCDELEMENTTABRANCHTATRANSEPOCH1EX5TAAA1SCRNScreenRandomized to A Example Example Trial 6 is an oncology trial comparing two types of chemotherapy that are given using cycles of different lengths with different internal patterns. Treatment A is given in 3-week cycles with a longer duration of treatment and a short rest, while Treatment B is given in 4-week cycles with a short duration of treatment and a long rest. Example Trial 6, Different Cycle Durations Study Schema The design of this trial is very similar to that for Example Trials 4 and 5. The main difference is that there are two different Rest Elements, the short one used with Drug A and the long one used with Drug B. The next diagram shows the retrospective view of this trial. Example Trial 6, Cyclical Chemotherapy Retrospective View The Trial Design Matrix for this trial assumes that there was a maximum of four cycles of Drug A and a maximum of three cycles of Drug B. Trial Design Matrix for Example Trial 6
In the following Trial Arms dataset, because the Treatment Epoch for Arm A has more Elements than the Treatment Epoch for Arm B, TAETORD is 10 for the Follow-up Element in Arm A, but 8 for the Follow-up Element in Arm B. It would also be possible to assign a TAETORD value of 10 to the Follow-up Element in Arm B. The primary purpose of TAETORD is to order Elements within an Arm; leaving gaps in the series of TAETORD values does not interfere with this purpose. ta.xpt RowSTUDYIDDOMAINARMCDARMTAETORDETCDELEMENTTABRANCHTATRANSEPOCH1EX6TAAA1SCRNScreenRandomized to A Example In open trials, there is no requirement to maintain a blind, and the Arms of a trial may be quite different from each other. In such a case, changes in treatment in one Arm may differ in number and timing from changes in treatment in another Arm, so that there is nothing like a one-to-one match between the Elements in the different Arms. In such a case, Epochs are likely to be defined as broad intervals of time, spanning several Elements, and be chosen to correspond to periods of time that will be compared in analyses of the trial. Example Trial 7, RTOG 93-09, involves treatment of lung cancer with chemotherapy and radiotherapy, with or without surgery. The protocol (RTOG-93-09), which is available online at the Radiation Oncology Therapy Group (RTOG) website http://www.rtog.org, does not include a study schema diagram, but does include a text-based representation of diverging "options" to which a subject may be assigned. All subjects go through the branch point at randomization, when subjects are assigned to either Chemotherapy + Radiotherapy (CR) or Chemotherapy + Radiotherapy + Surgery (CRS). All subjects receive induction chemotherapy and radiation, with a slight difference between those randomized to the two Arms during the second cycle of chemotherapy. Those randomized to the non-surgery Arm are evaluated for disease somewhat earlier, to avoid delays in administering the radiation boost to those whose disease has not progressed. After induction chemotherapy and radiation, subjects are evaluated for disease progression, and those whose disease has progressed stop treatment, but enter follow-up. Not all subjects randomized to receive surgery who do not have disease progression will necessarily receive surgery. If they are poor candidates for surgery or do not wish to receive surgery, they will not receive surgery, but will receive further chemotherapy. The diagram below is based on the text "schema" in the protocol, with the five "options" it names. The diagram in this form might suggest that the trial has five Arms. Example Trial 7, RTOG 93-09 Study Schema with 5 "options" *Disease evaluation earlier **Disease evaluation later However, the objectives of the trial make it clear that this trial is designed to compare two treatment strategies, chemotherapy and radiation with and without surgery, so this study is better modeled as a two-Arm trial, but with major "skip forward" arrows for some subjects, as illustrated in the following diagram. This diagram also shows more detail within the blocks labeled "Induction Chemo + RT" and "Additional Chemo" than the diagram above. Both the "induction" and "additional" chemotherapy are given in two cycles. Also, the second induction cycle is different for the two Arms, since radiation therapy for those assigned to the non-surgery arm includes a "boost", which those assigned to the surgery Arm do not receive. The next diagram shows the prospective view of this trial. The protocol conceives of treatment as being divided into two parts, Induction and Continuation, so these have been treated as two different Epochs. This is also an important point in the trial operationally, the point when subjects are "registered" a second time, and when subjects are identified who will "skip forward", because of disease progression or ineligibility for surgery. Example Trial 7, RTOG-93-09 Prospective View *Disease evaluation earlier **Disease evaluation later The next diagram shows the retrospective view of this trial. The fact that the Elements in the study cell for the CR Arm in the Continuation Treatment Epoch do not fill the space in the diagram is an artifact of the diagram conventions. Those subjects who do receive surgery will in fact spend a longer time completing treatment and moving into follow-up. Although it is tempting to think of the horizontal axis of these diagrams as a timeline, this can sometimes be misleading. The diagrams are not necessarily "to scale" in the sense that the length of the block representing an Element represents its duration, and elements that line up on the same vertical line in the diagram may not occur at the same relative time within the study. Example Trial 7, RTOG 93-09 Retrospective View *Disease evaluation earlier **Disease evaluation later The Trial Design Matrix for Example Trial 7, RTOG 93-09, a two-Arm trial is shown in the following table.
The Trial Arms dataset for the trial is shown below for Example Trial 7, as a two-Arm trial. ta.xpt RowSTUDYIDDOMAINARMCDARMTAETORDETCDELEMENTTABRANCHTATRANSEPOCH1EX7TA1CR1SCRNScreenRandomized to CR 7.2.1.1 Trial Arms Issues1. Distinguishing Between Branches and Transitions Both the Branch and Transition columns contain rules, but the two columns represent two different types of rules. Branch rules represent forks in the trial flowchart, giving rise to separate Arms. The rule underlying a branch in the trial design appears in multiple records, once for each "fork" of the branch. Within any one record, there is no choice (no "if" clause) in the value of the Branch condition. For example, the value of TABRANCH for a record in Arm A is "Randomized to Arm A" because a subject in Arm A must have been randomized to Arm A. Transition rules are used for choices within an Arm. The value for TATRANS does contain a choice (an "if" clause). In Example Trial 4, subjects who receive 1, 2, 3, or 4 cycles of Treatment A are all considered to belong to Arm A. In modeling a trial, decisions may have to be made about whether a decision point in the flow chart represents the separation of paths that represent different Arms, or paths that represent variations within the same Arm, as illustrated in the discussion of Example Trial 7. This decision will depend on the comparisons of interest in the trial. Some trials refer to groups of subjects who follow a particular path through the trial as "cohorts", particularly if the groups are formed successively over time. The term "cohort" is used with different meanings in different protocols and does not always correspond to an Arm. 2. Subjects Not Assigned to an Arm Some trial subjects may drop out of the study before they reach all of the branch points in the trial design. In the Demographics domain, values of ARM and ARMCD must be supplied for such subjects, but the special values used for these subjects should not be included in the Trial Arms dataset; only complete Arm paths should be described in the Trial Arms dataset. DM Assumption 4 describes special ARM and ARMCD values used for subjects who do not reach the first branch point in a trial. When a trial design includes two or more branches, special values of ARM and ARMCD may be needed for subjects who pass through the first branch point, but drop out before the final branch point. See DM Example 7 for an example showing ARM and ARMCD values for such a trial. 3. Defining Epochs The series of examples for the Trial Arms dataset provides a variety of scenarios and guidance about how to assign Epoch in those scenarios. In general, assigning Epochs for blinded trials is easier than for unblinded trials. The blinded view of the trial will generally make the possible choices clear. For unblinded trials, the comparisons that will be made between Arms can guide the definition of Epochs. For trials that include many variant paths within an Arm, comparisons of Arms will mean that subjects on a variety of paths will be included in the comparison, and this is likely to lead to definition of broader Epochs. 4. Rule Variables The Branch and Transition columns shown in the example tables are variables with a Role of "Rule." The values of a Rule variable describe conditions under which something is planned to happen. At the moment, values of Rule variables are text. At some point in the future, it is expected that these will become executable code. Other Rule variables are present in the Trial Elements and Trial Visits datasets. 7.2.2 Trial ElementsTE – Description/OverviewA trial design domain that contains the element code that is unique for each element, the element description, and the rules for starting and ending an element. The Trial Elements (TE) dataset contains the definitions of the Elements that appear in the Trial Arms (TA) dataset. An Element may appear multiple times in the Trial Arms table because it appears either 1) in multiple Arms, 2) multiple times within an Arm, or 3) both. However, an Element will appear only once in the Trial Elements table. Each row in the TE dataset may be thought of as representing a "unique Element" in the sense of "unique" used when a case report form template page for a collecting certain type of data is often referred to as "unique page." For instance, a case report form might be described as containing 87 pages, but only 23 unique pages. By analogy, the trial design matrix for Example Trial 1 has nine Study Cells, each of which contains one Element, but the same trial design matrix contains only five unique Elements, so the trial Elements dataset for that trial has only five records. An Element is a building block for creating Study Cells and an Arm is composed of Study Cells. Or, from another point of view, an Arm is composed of Elements: That is, the trial design assigns subjects to Arms, which are comprised of a sequence of steps called Elements. Trial Elements represent an interval of time that serves a purpose in the trial and are associated with certain activities affecting the subject. "Week 2 to Week 4" is not a valid Element. A valid Element has a name that describes the purpose of the Element and includes a description of the activity or event that marks the subject's transition into the Element as well as the conditions for leaving the Element. TE – Specificationte.xpt, Trial Elements — Trial Design, Version 3.2. One record per planned Element, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). TE – Assumptions
TE – ExamplesBelow are Trial Elements datasets for Example Trials 1 (Example Trial 1) and 2 (Example Trial 2). Both of these trials are assumed to have fixed-duration Elements. The wording in TESTRL is intended to separate the description of the event that starts the Element into the part that would be visible to a blinded participant in the trial (e.g., "First dose of a treatment Epoch") from the part that is revealed when the study is unblinded (e.g., "where dose is 5 mg"). Care must be taken in choosing these descriptions to be sure that they are "Arm and Epoch neutral." For instance, in a crossover trial such as Example Trial 3 (Example Trial 3), where an Element may appear in one of multiple Epochs, the wording must be appropriate for all the possible Epochs. The wording for Example Trial 2 uses the wording "a treatment Epoch." The SDS Team is considering adding a separate variable to the Trial Elements dataset that would hold information on the treatment that is associated with an Element. This would make it clearer which Elements are "treatment Elements", and therefore, which Epochs contain treatment Elements, and thus are "treatment Epochs". Example This example shows the TE dataset for Example Trial 1. te.xpt RowSTUDYIDDOMAINETCDELEMENTTESTRLTEENRLTEDUR1EX1TESCRNScreenInformed consent1 week after start of ElementP7D2EX1TERIRun-InEligibility confirmed2 weeks after start of ElementP14D3EX1TEPPlaceboFirst dose of study drug, where drug is placebo2 weeks after start of ElementP14D4EX1TEADrug AFirst dose of study drug, where drug is Drug A2 weeks after start of ElementP14D5EX1TEBDrug BFirst dose of study drug, where drug is Drug B2 weeks after start of ElementP14D Example This example shows the TE dataset for Example Trial 2. te.xpt RowSTUDYIDDOMAINETCDELEMENTTESTRLTEENRLTEDUR1EX2TESCRNScreenInformed consent2 weeks after start of ElementP14D2EX2TEPPlaceboFirst dose of a treatment Epoch, where dose is placebo2 weeks after start of ElementP14D3EX2TE55 mgFirst dose of a treatment Epoch, where dose is 5 mg drug2 weeks after start of ElementP14D4EX2TE1010 mgFirst dose of a treatment Epoch, where dose is 10 mg drug2 weeks after start of ElementP14D5EX2TERESTRest48 hrs after last dose of preceding treatment Epoch1 week after start of ElementP7D6EX2TEFUFollow-up48 hrs after last dose of third treatment Epoch3 weeks after start of ElementP21D Example The Trial Elements dataset for Example Trial 4 illustrates Element end rules for Elements that are not all of fixed duration. The Screen Element in this study can be up to 2 weeks long, but may end earlier, so is not of fixed duration. The Rest Element has a variable length, depending on how quickly WBC recovers. Note that the start rules for the A and B Elements have been written to be suitable for a blinded study. te.xpt RowSTUDYIDDOMAINETCDELEMENTTESTRLTEENRLTEDUR1EX4TESCRNScreenInformed ConsentScreening assessments are complete, up to 2 weeks after start of Element 7.2.2.1 Trial Elements Issues1. Granularity of Trial Elements Deciding how finely to divide trial time when identifying trial Elements is a matter of judgment, as illustrated by the following examples:
2. Distinguishing Elements, Study Cells, and Epochs It is easy to confuse Elements, which are reusable trial building blocks, with Study Cells, which contain the Elements for a particular Epoch and Arm, and with Epochs, which are time periods for the trial as a whole. In part, this is because many trials have Epochs for which the same Element appears in all Arms. In other words, in the trial design matrix for many trials, there are columns (Epochs) in which all the Study Cells have the same contents. Furthermore, it is natural to use the same name (e.g., Screen or Follow-up) for both such an Epoch and the single Element that appears within it. Confusion can also arise from the fact that, in the blinded treatment portions of blinded trials, blinded participants do not know which Element a subject is in, but do know what Epoch the subject is in. In describing a trial, one way to avoid confusion between Elements and Epochs is to include "Element" or "Epoch" in the values of ELEMENT or EPOCH when these values (such as Screening or Follow-up) would otherwise be the same. It becomes tedious to do this in every case, but can be useful to resolve confusion when it arises or is likely to arise. The difference between Epoch and Element is perhaps clearest in crossover trials. In Example Trial 2, as for most crossover trials, the analysis of PK results would include both treatment and period effects in the model. "Treatment effect" derives from Element (Placebo, 5 mg, or 10 mg), while "Period effect" derives from the Epoch (1st, 2nd, or 3rd Treatment Epoch). 3. Transitions Between Elements The transition between one Element and the next can be thought of as a three-step process: Step NumberStep QuestionHow step question is answered by information in the Trial Design datasets1Should the subject leave the current Element?Criteria for ending the current Element are in TEENRL in the TE dataset.2Which Element should the subject enter next?If there is a branch point at this point in the trial, evaluate criteria described in TABRANCH (e.g., randomization results) in the TA dataset otherwise, if TATRANS in the TA dataset is populated in this Arm at this point, follow those instructions otherwise, move to the next Element in this Arm as specified by TAETORD in the TA dataset.3What does the subject do to enter the next Element?The action or event that marks the start of the next Element is specified in TESTRL in the TE dataset Note that the subject is not "in limbo" during this process. The subject remains in the current Element until Step 3, at which point the subject transitions to the new Element. There are no gaps between Elements. From this table, it is clear that executing a transition depends on information that is split between the Trial Elements and the Trial Arms datasets. It can be useful, in the process of working out the Trial Design datasets, to create a dataset that supplements the Trial Arms dataset with the TESTRL, TEENRL, and TEDUR variables, so that full information on the transitions is easily accessible. However, such a working dataset is not an SDTM dataset, and should not be submitted. The following table shows a fragment of such a table for Example Trial 4. Note that for all records that contain a particular Element, all the TE variable values are exactly the same. Also, note that when both TABRANCH and TATRANS are blank, the implicit decision in Step 2 is that the subject moves to the next Element in sequence for the Arm. ta.xpt RowARMEPOCHTAETORDELEMENTTESTRLTEENRLTEDURTABRANCHTATRANS1AScreen1ScreenInformed ConsentScreening assessments are complete, up to 2 weeks after start of Element Note that both the second and fourth rows of this dataset involve the same Element, Trt A, and so TESTRL is the same for both. The activity that marks a subject's entry into the fourth Element in Arm A is "First dose of treatment Element, where drug is Treatment A." This is not the subject's very first dose of Treatment A, but it is their first dose in this Element. 7.3 Schedule for Assessments (TV, TD, and TM)This subsection contains the Trial Design datasets that describe:
The TV and TD datasets provide the planned scheduling of assessments to which a subject's actual visits and disease assessments can be compared. 7.3.1 Trial VisitsTV – Description/OverviewA trial design domain that contains the planned order and number of visits in the study within each arm. Visits are defined as "clinical encounters" and are described using the timing variables VISIT, VISITNUM, and VISITDY. Protocols define Visits in order to describe assessments and procedures that are to be performed at the Visits. TV – Specificationtv.xpt, Trial Visits — Trial Design, Version 3.2. One record per planned Visit per Arm, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar
Synonym Qualifier
Timing
RuleRule describing when the Visit starts, in relation to the sequence of Elements.ReqTVENRLVisit End RuleChar RuleRule describing when the Visit ends, in relation to the sequence of Elements.Perm ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). TV – Assumptions
TV – ExamplesExample The diagram below shows Visits by means of numbered "flags" with Visit Numbers. Each "flag" has two supports, one at the beginning of the Visit, the other at the end of the Visit. Note that Visits 2 and 3 span Epoch transitions. In other words, the transition event that marks the beginning of the Run-in Epoch (confirmation of eligibility) occurs during Visit 2, and the transition event that marks the beginning of the Treatment Epoch (the first dose of study drug) occurs during Visit 3. Example Trial 1, Parallel Design Planned Visits Two Trial Visits datasets are shown for this trial. The first shows a somewhat idealized situation, where the protocol has given specific timings for the Visits. The second shows a more usual situation, where the timings have been described only loosely. tv.xpt RowSTUDYIDDOMAINVISITNUMTVSTRLTVENRL1EX1TV1Start of Screen Epoch1 hour after start of Visit2EX1TV230 minutes before end of Screen Epoch30 minutes after start of Run-in Epoch3EX1TV330 minutes before end of Run-in Epoch1 hour after start of Treatment Epoch4EX1TV41 week after start of Treatment Epoch1 hour after start of Visit5EX1TV52 weeks after start of Treatment Epoch1 hour after start of Visit tv.xpt RowSTUDYIDDOMAINVISITNUMTVSTRLTVENRL1EX1TV1Start of Screen Epoch Although the start and end rules in this example reference the starts and ends of Epochs, the start and end rules of some Visits for trials with Epochs that span multiple Elements will need to reference Elements rather than Epochs. When an Arm includes repetitions of the same Element, it may be necessary to use TAETORD as well as an Element name to specify when a Visit is to occur. 7.3.1.1 Trial Visits Issues1. Identifying Trial Visits In general, a trial's Visits are defined in its protocol. The term "Visit" reflects the fact that data in outpatient studies is usually collected during a physical visit by the subject to a clinic. Sometimes a Trial Visit defined by the protocol may not correspond to a physical visit. It may span multiple physical visits, as when screening data may be collected over several clinic visits but recorded under one Trial Visit name (VISIT) and number (VISITNUM). A Trial Visit may also represent only a portion of an extended physical visit, as when a trial of in-patients collects data under multiple Trial Visits for a single hospital admission. Diary data and other data collected outside a clinic may not fit the usual concept of a Trial Visit, but the planned times of collection of such data may be described as "Visits" in the Trial Visits dataset if desired. 2. Trial Visit Rules Visit start rules are different from Element start rules because they usually describe when a Visit should occur, while Element start rules describe the moment at which an Element is considered to start. There are usually gaps between Visits, periods of time that do not belong to any Visit, so it is usually not necessary to identify the moment when one Visit stops and another starts. However, some trials of hospitalized subjects may divide time into Visits in a manner more like that used for Elements, and a transition event may need to be defined in such cases. Visit start rules are usually expressed relative to the start or end of an Element or Epoch, e.g., "1-2 hours before end of First Wash-out" or "8 weeks after end of 2nd Treatment Epoch". Note that the Visit may or may not occur during the Element used as the reference for Visit start rule. For example, a trial with Elements based on treatment of disease episodes might plan a Visit 6 months after the start of the first treatment period, regardless of how many disease episodes have occurred. Visit end rules are similar to Element end rules, describing when a Visit should end. They may be expressed relative to the start or end of an Element or Epoch, or relative to the start of the Visit. The timings of Visits relative to Elements may be expressed in terms that cannot be easily quantified. For instance, a protocol might instruct that at a baseline Visit the subject be randomized, given the study drug, and instructed to take the first dose of study Drug X at bedtime that night. This baseline Visit is thus started and ended before the start of the treatment Epoch, but we don't know how long before the start of the treatment Epoch the Visit will occur. The trial start rule might contain the value, "On the day of, but before, the start of the Treatment Epoch." 3. Visit Schedules Expressed with Ranges Ranges may be used to describe the planned timing of Visits (e.g., 12-16 days after the start of 2nd Element), but this is different from the "windows" that may be used in selecting data points to be included in an analysis associated with that Visit. For example, although Visit 2 was planned for 12-16 days after the start of treatment, data collected 10-18 days after the start of treatment might be included in a "Visit 1" analysis. The two ranges serve different purposes. 4. Contingent Visits Some data collection is contingent on the occurrence of a "trigger" event, or disease milestone (see the Trial Disease Milestones (TM) dataset under Section 7.3, Schedule for Assessments (TV, TD, and TM)). When such planned data collection involves an additional clinic visit, a "contingent" Visit may be included in the trial visits table, with start a rule that describes the circumstances under which it will take place. Since values of VISITNUM must be assigned to all records in the Trial Visits dataset, a contingent Visit included in the Trial Visits dataset must have a VISITNUM, but the VISITNUM value may not be a "chronological" value, due to the uncertain timing of a contingent Visit. If contingent visits are not included in the TV dataset, then they would be treated as unplanned visits in the Subject Visits (SV) domain. 7.3.2 Trial Disease AssessmentsTD – Description/OverviewA trial design domain that provides information on the protocol-specified disease assessment schedule, to be used for comparison with the actual occurrence of the efficacy assessments in order to determine whether there was good compliance with the schedule. TD – Specificationtd.xpt, Trial Disease Assessments — Trial Design, Version 3.2. One record per planned constant assessment period, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). TD – Assumptions
TD – ExamplesExample This example shows a study where the disease assessment schedule changes over the course of the study. In this example, there are three distinct disease-assessment schedule patterns. A single anchor date variable (TDANCVAR) provides the anchor date for each pattern. The offset variable (TDSTOFF) used in conjunction with the anchor date variable provides the start point of each pattern of assessments..
td.xpt RowSTUDYIDDOMAINTDORDERTDANCVARTDSTOFFTDTGTPAITDMINPAITDMAXPAITDNUMRPT1ABC123TD1ANCH1DTP0DP8WP53DP9W62ABC123TD2ANCH1DTP60WP12WP11WP13W43ABC123TD3ANCH1DTP120WP24WP23WP25W12 Example This example is the same as Example 1, except that there is a rest period of 14 days prior to the start of the second disease-assessment schedule. This example also shows how three different reference/anchor dates can be used.
td.xpt RowSTUDYIDDOMAINTDORDERTDANCVARTDSTOFFTDTGTPAITDMINPAITDMAXPAITDNUMRPT1ABC123TD1ANCH1DTP0DP8WP53DP9W62ABC123TD2ANCH2DTP0DP12WP11WP13W43ABC123TD3ANCH3DTP0DP24WP23WP25W17 Example This example shows a study where subjects are randomized to standard treatment or an experimental treatment. The subjects who are randomized to standard treatment are given the option to receive experimental treatment after the end of the standard treatment (e.g., disease progression on standard treatment). In the randomized treatment Epoch, the disease assessment schedule changes over the course of the study. At the start of the extension treatment Epoch, subjects are re-baselined, i.e., an extension baseline disease assessment is performed and the disease assessment schedule is restarted. In this example, there are three distinct disease-assessment schedule patterns.
For open-ended patterns, the total number of repeats can be identified when the data analysis is performed; the highest number of repeat assessments for any subject in that pattern must be recorded in the TDNUMRPT variable on the final pattern record. td.xpt RowSTUDYIDDOMAINTDORDERTDANCVARTDSTOFFTDTGPAITDMINPAITDMAXPAITDNUMRPT1ABC123TD1ANCH1DTP0DP8WP53DP9W62ABC123TD2ANCH1DTP60WP12WP11WP13W173ABC123TD3ANCH2DTP0DP12WP11WP13W17 7.3.3 Trial Disease MilestonesTM – Description/OverviewA trial design domain that is used to describe disease milestones, which are observations or activities anticipated to occur in the course of the disease under study, and which trigger the collection of data. TM – Specificationtm.xpt, Trial Disease Milestones — Trial Design, Version 1.0. One record per Disease Milestone type, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). TM – Assumptions
TM – ExamplesExample In this diabetes study, initial diagnosis of diabetes and the hypoglycemic events that occur during the trial have been identified as Disease Milestones of interest. Row 1:Shows that the initial diagnosis is given the MIDSTYPE of "DIAGNOSIS" and is defined in TMDEF. It is not repeating (occurs only once).Row 2:Shows that hypoglycemic events are given the MIDSTYPE of "HYPOGLYCEMIC EVENT", and a definition in TMDEF. For an actual study, the definition would be expected to include a particular threshold level, rather than the text "threshold level" used in this example. A subject may experience multiple hypoglycemic events as indicated by TMRPT = "Y". tm.xpt RowSTUDYIDDOMAINMIDSTYPETMDEFTMRPT1XYZTMDIAGNOSISInitial diagnosis of diabetes, the first time a physician told the subject they had diabetesN2XYZTMHYPOGLYCEMIC EVENTHypoglycemic Event, the occurrence of a glucose level below (threshold level)Y 7.4 Trial Summary and Eligibility (TI and TS)This subsection contains the Trial Design datasets that describe:
The TI and TS datasets are tabular synopses of parts of the study protocol. 7.4.1 Trial Inclusion/Exclusion CriteriaTI – Description/OverviewA trial design domain that contains one record for each of the inclusion and exclusion criteria for the trial. This domain is not subject oriented. It contains all the inclusion and exclusion criteria for the trial, and thus provides information that may not be present in the subject-level data on inclusion and exclusion criteria. The IE domain (described in Section 6.3.4, Inclusion/Exclusion Criteria Not Met) contains records only for inclusion and exclusion criteria that subjects did not meet. TI – Specificationti.xpt, Trial Inclusion/Exclusion Criteria — Trial Design, Version 3.2. One record per I/E crierion, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). TI – Assumptions
TI – ExamplesExample This example shows records for a trial that had two versions of inclusion/exclusion criteria. Rows 1-3:Show the two inclusion criteria and one exclusion criterion for version 1 of the protocol.Rows 4-6:Show the inclusion/exclusion criteria for version 2.2 of the protocol, which changed the minimum age for entry from 21 to 18. ti.xpt RowSTUDYIDDOMAINIETESTCDIETESTIECATTIVERS1XYZTIINCL01Has disease under studyINCLUSION12XYZTIINCL02Age 21 or greaterINCLUSION13XYZTIEXCL01Pregnant or lactatingEXCLUSION14XYZTIINCL01Has disease under studyINCLUSION2.25XYZTIINCL02AAge 18 or greaterINCLUSION2.26XYZTIEXCL01Pregnant or lactatingEXCLUSION2.2 7.4.2 Trial SummaryTS – Description/OverviewA trial design domain that contains one record for each trial summary characteristic. This domain is not subject oriented. The Trial Summary (TS) dataset allows the sponsor to submit a summary of the trial in a structured format. Each record in the Trial Summary dataset contains the value of a parameter, a characteristic of the trial. For example, Trial Summary is used to record basic information about the study such as trial phase, protocol title, and trial objectives. The Trial Summary dataset contains information about the planned and actual trial characteristics. TS – Specificationts.xpt, Trial Summary Information — Trial Design, Version 3.2. One record per trial summary parameter value, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). TS – Assumptions
TS – ExamplesExample This example shows all of the parameters that are required or expected in the Trial Summary dataset. Use controlled terminology for TSVAL, available at: https://www.cancer.gov/research/resources/terminology/cdisc. ts.xpt RowSTUDYIDDOMAINTSSEQTSGRPIDTSPARMCDTSPARMTSVALTSVALNFTSVALCDTSVCDREFTSVCDVER1XYZTS1 Example This example shows an example of how to implement the null flavor in TSVALNF when the value in TSVAL is missing. Note that when TSVAL is null, TSVALCD is also null, and no code system is specified in TSVCDREF and TSVCDVER. Row 1:Shows that there was no upper limit on planned age of subjects, as indicated by TSVALNF = "PINF", the null value that means "positive infinity".Row 2:Shows that Trial Phase Classification is not applicable, as indicated by TSVALNF = "NA". ts.xpt RowSTUDYIDDOMAINTSSEQTSGRPIDTSPARMCDTSPARMTSVALTSVALNFTSVALCDTSVCDREFTSVCDVER1XYZTS1 7.4.2.1 Use of Null FlavorThe variable TSVALNF is based on the idea of a "null flavor" as embodied in the ISO 21090 standard, "Health Informatics – Harmonized data types for information exchange." A null flavor is an ancillary piece of data that provides additional information when its primary piece of data is null (has a missing value). There is controlled terminology for the null flavor data item which includes such familiar values as Unknown, Other, and Not Applicable among its fourteen terms. The proposal to include a null flavor variable to supplement the TSVAL variable in the Trial Summary dataset arose when it was realized that the Trial Summary model did not have a good way to represent the fact that a protocol placed no upper limit on the age of study subjects. When the trial summary parameter is AGEMAX, then TSVAL should have a value expressed as an ISO8601 time duration (e.g., P43Y for 43 years old or P6M for 6 months old). While it would be possible to allow a value such as NONE or UNBOUNDED to be entered in TSVAL, validation programs would then have to recognize this special term as an exception to the expected data format. Therefore, it was decided that a separate null flavor variable that uses the ISO 21090 null flavor terminology would be a better solution. It was also decided to specify the use of a null flavor variable with this updated release of trial summary as a way of testing the use of such a variable in a limited setting. As its title suggests, the ISO 21090 standard was developed for use with healthcare data, and it is expected that it will eventually see wide use in the clinical data from which clinical trial data is derived. CDISC already uses this data type standard in the BRIDG model and the CDISC SHARE project. The null flavor, in particular, is a solution to the widespread problem of needing or wanting to convey information that will help in the interpretation of a missing value. Although null flavors could certainly be eventually used for this purpose in other cases, such as with subject data, doing so at this time would be extremely disruptive and premature. The use of null flavors for the one variable TSVAL should provide an opportunity for sponsors and reviewers to learn about the null flavors and to evaluate their usefulness in one concrete setting. The controlled terminology for null flavor, which supersedes use of Appendix C1, Trial Summary Codes, is included below NullFlavor Enumeration. OID: 2.16.840.1.113883.5.10081NINo informationThe value is exceptional (missing, omitted, incomplete, improper). No information as to the reason for being an exceptional value is provided. This is the most general exceptional value. It is also the default exceptional value.2INVInvalidThe value as represented in the instance is not a member of the set of permitted data values in the constrained value domain of a variable.3OTHOtherThe actual value is not a member of the set of permitted data values in the constrained value domain of a variable (e.g., concept not provided by required code system).4PINFPositive infinityPositive infinity of numbers4NINFNegative infinityNegative infinity of numbers3UNCUnencodedNo attempt has been made to encode the information correctly, but the raw source information is represented (usually in original Text).3DERDerivedAn actual value may exist, but it must be derived from the information provided (usually an expression is provided directly).2UNKUnknownA proper value is applicable, but not known.3ASKUAsked but unknownInformation was sought but not found (e.g., patient was asked but didn't know).4NAVTemporarily unavailableInformation is not available at this time, but is expected to be available later.3NASKNot askedThis information has not been sought (e.g., patient was not asked).3QSSufficient quantityThe specific quantity is not known, but is known to be non-zero and is not specified because it makes up the bulk of the material. For example, if directions said, "Add 10 mg of ingredient X, 50 mg of ingredient Y, and sufficient quantity of water to 100 ml", the null flavor "QS" would be used to express the quantity of water.3TRCTraceThe content is greater than zero, but too small to be quantified.2MSKMasked There is information on this item available, but it has not been provided by the sender due to security, privacy or other reasons. There may be an alternate mechanism for gaining access to this information. WARNING — Use of this null flavor does provide information that may be a breach of confidentiality, even though no detailed data are provided. Its primary purpose is for those circumstances where it is necessary to inform the receiver that the information does exist without providing any detail. 2NANot applicableNo proper value is applicable in this context (e.g., last menstrual period for a male).The numbers in the first column of the table above describe the hierarchy of these values, i.e.:
The one value at level 1, No information, is the least informative. It merely confirms that the primary piece of data is null. The values at level 2 provide a little more information, distinguishing between situations where the primary piece of data is not applicable and those where it is applicable but masked, unknown, or "invalid", i.e., not in the correct format to be represented in the primary piece of data. The values at levels 3 and 4 provide successively more information about the situation. For example, for the MAXAGE case that provided the impetus for the creation of the TSVALNF variable, the value PINF means that there is information about the maximum age, but it is not something that can be expressed, as in the ISO8601 quantity of time format required for populating TSVAL. The null flavor PINF provides the most complete information possible in this case, i.e., that the maximum age for the study is unbounded. 7.5 How to Model the Design of a Clinical TrialThe following steps allow the modeler to move from more-familiar concepts, such as Arms, to less-familiar concepts, such as Elements and Epochs. The actual process of modeling a trial may depart from these numbered steps. Some steps will overlap; there may be several iterations; and not all steps are relevant for all studies.
The defined variables of the SDTM general observation classes could restrict the ability of sponsors to represent all the data they wish to submit. Collected data that may not entirely fit includes relationships between records within a domain, records in separate domains, and sponsor-defined "variables". As a result, the SDTM has methods to represent distinct types of relationships, all of which are described in more detail in subsequent sections. These include the following:
All relationships make use of the standard domain identifiers, STUDYID, DOMAIN, and USUBJID. In addition, the variables IDVAR and IDVARVAL are used for identifying the record-level merge/join keys. These keys are used to tie information together by linking records. The specific set of identifiers necessary to properly identify each type of relationship is described in detail in the following sections. Examples of variables that could be used in IDVAR include the following:
8.1 Relating Groups of Records Within a Domain Using the --GRPID VariableThe optional grouping identifier variable --GRPID is Permissible in all domains that are based on the general observation classes. It is used to identify relationships between records within a USUBJID within a single domain. An example would be Intervention records for a combination therapy where the treatments in the combination varies from subject to subject. In such a case, the relationship is defined by assigning the same unique character value to the --GRPID variable. The values used for --GRPID can be any values the sponsor chooses; however, if the sponsor uses values with some embedded meaning (rather than arbitrary numbers), those values should be consistent across the submission to avoid confusion. It is important to note that --GRPID has no inherent meaning across subjects or across domains. Using --GRPID in the general observation class domains can reduce the number of records in the RELREC, SUPP--, and CO datasets, when those datasets are submitted to describe relationships/associations for records or values to a "group" of general observation class records. 8.1.1 --GRPID ExampleThe following table illustrates --GRPID used in the Concomitant Medications (CM) domain to identify a combination therapy. In this example, both subjects 1234 and 5678 have reported two combination therapies, each consisting of three separate medications. The components of a combination all have the same value for CMGRPID. This example illustrates how CMGRPID groups information only within a subject within a domain. Rows 1-3:Show three medications taken by subject "1234". GMGRPID = "COMBO THPY 1" has been used to group these medications.Rows 4-6:Show three different medications taken by subject "1234" with CMGRPID = "COMBO THPY 2".Rows 7-9:Show three medications taken by subject "5678". CMGRPID = "COMBO THPY 1" has been used to group these medications. Note that the medications with GMGRPID "COMBO THPY 1" are completely different for subjects "1234" and "5678".Rows 10-12:Show three different medications taken by subject "5678" with CMGRPID = "COMBO THPY 2". Again, the medications with "COMBO THPY 2" are completely different for subjects "1234" and "5678". cm.xpt RowSTUDYIDDOMAINUSUBJIDCMSEQCMGRPIDCMTRTCMDECODCMDOSECMDOSUCMSTDTCCMENDTC11234CM12341COMBO THPY 1Verbatim Med AGeneric Med A100mg2004-01-172004-01-1921234CM12342COMBO THPY 1Verbatim Med BGeneric Med B50mg2004-01-172004-01-1931234CM12343COMBO THPY 1Verbatim Med CGeneric Med C200mg2004-01-172004-01-1941234CM12344COMBO THPY 2Verbatim Med DGeneric Med D150mg2004-01-212004-01-2251234CM12345COMBO THPY 2Verbatim Med EGeneric Med E100mg2004-01-212004-01-2261234CM12346COMBO THPY 2Verbatim Med FGeneric Med F75mg2004-01-212004-01-2271234CM56781COMBO THPY 1Verbatim Med GGeneric Med G37.5mg2004-03-172004-03-2581234CM56782COMBO THPY 1Verbatim Med HGeneric Med H60mg2004-03-172004-03-2591234CM56783COMBO THPY 1Verbatim Med IGeneric Med I20mg2004-03-172004-03-25101234CM56784COMBO THPY 2Verbatim Med JGeneric Med J100mg2004-03-212004-03-22111234CM56785COMBO THPY 2Verbatim Med KGeneric Med K50mg2004-03-212004-03-22121234CM56786COMBO THPY 2Verbatim Med LGeneric Med L10mg2004-03-212004-03-22 8.2 Relating Peer RecordsThe Related Records (RELREC) special purpose dataset is used to describe relationships between records for a subject (as described in this section), and relationships between datasets (as described in Section 8.3, Relating Datasets). In both cases, relationships represented in RELREC are collected relationships, either by explicit references or check boxes on the CRF, or by design of the CRF, such as vital signs captured during an exercise stress test. A relationship is defined by adding a record to RELREC for each record to be related and by assigning a unique character identifier value for the relationship. Each record in the RELREC special purpose dataset contains keys that identify a record (or group of records) and the relationship identifier, which is stored in the RELID variable. The value of RELID is chosen by the sponsor, but must be identical for all related records within USUBJID. It is recommended that the sponsor use a standard system or naming convention for RELID (e.g., all letters, all numbers, capitalized). Records expressing a relationship are specified using the key variables STUDYID, RDOMAIN (the domain code of the record in the relationship), and USUBJID, along with IDVAR and IDVARVAL. Single records can be related by using a unique-record-identifier variable such as --SEQ in IDVAR. Groups of records can be related by using grouping variables such as --GRPID in IDVAR. IDVARVAL would contain the value of the variable described in IDVAR. Using --GRPID can be a more efficient method of representing relationships in RELREC, such as when relating an adverse event (or events) to a group of concomitant medications taken to treat the adverse event(s). The RELREC dataset should be used to represent either:
8.2.1 RELREC Datasetrelrec.xpt, Related Records — Relationships, Version 3.3. One record per related record, group of records or dataset, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). 8.2.2 RELREC Dataset ExamplesExample This example illustrates the use of the RELREC dataset to relate records stored in separate domains for USUBJID = "123456". This example represents a situation in which an adverse event is related both to concomitant medications and to lab tests, but there is no relationship between the lab values and the concomitant medications. Rows 1-3:Show the representation of a relationship between an AE record and two concomitant medication records.Rows 4-6:Show the representation of a relationship between the same AE record and two laboratory findings records. relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1EFC1234AE123456AESEQ5 Example Example 2 is the same scenario as Example 1. In this case, however, the way the data were collected indicated that the concomitant medications and laboratory findings were all in a single relationship with the adverse event. relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1EFC1234AE123456AESEQ5 Example Example 3 is the same scenario as Example 2. However, the sponsor grouped the two concomitant medications using CMGRPID = "COMBO 1", allowing the relationship among these five records to be represented with four, rather than five, records in the RELREC dataset. relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1EFC1234AE123456AESEQ5 Additional examples may be found in the domain examples such as Section 6.2.4, Disposition, Example 4, and all of the Pharmacokinetics examples in Section 6.3.11.3, Relating PP Records to PC Records. 8.3 Relating DatasetsThe Related Records (RELREC) special purpose dataset can also be used to identify relationships between datasets (e.g., a one-to-many or parent-child relationship). The relationship is defined by including a single record for each related dataset that identifies the key(s) of the dataset that can be used to relate the respective records. Relationships between datasets should only be recorded in the RELREC dataset when the sponsor has found it necessary to split information between datasets that are related, and that may need to be examined together for analysis or proper interpretation. Note that it is not necessary to use the RELREC dataset to identify associations from data in the SUPP-- datasets or the CO dataset to their parent general-observation-class dataset records or special purpose domain records, as both these datasets include the key variable identifiers of the parent record(s) that are necessary to make the association. 8.3.1 RELREC Dataset Relationship ExampleExample This example illustrates RELREC records used to represent the relationship between records in two datasets that have a one-to-many relationship. In the example below, all the records in one domain are being related to all of the records in the other, so both USUBJID and IDVARVAL are null. relrec.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALRELTYPERELID1EFC1234TU In the sponsor's operational database, these datasets may have existed as either separate datasets that were merged for analysis, or one dataset that may have included observations from more than one general observation class (e.g., Events and Findings). The value in IDVAR must be the name of the key used to merge/join the two datasets. In the above example, the --LNKID variable is used as the key to identify the related observations. The values for the --LNKID variable in the two datasets are sponsor defined. Although other variables may also serve as a single merge key when the corresponding values for IDVAR are equal, --GRPID, --SPID, --REFID, --LNKID, or --LNKGRP are typically used for this purpose. The variable RELTYPE identifies the type of relationship between the datasets. The allowable values are ONE and MANY (controlled terminology is expected). This information defines how a merge/join would be written, and what would be the result of the merge/join. The possible combinations are the following:
Since IDVAR identifies the keys that can be used to merge/join records between the datasets, --SEQ cannot be used because --SEQ only has meaning within a subject within a dataset, not across datasets. 8.4 Relating Non-Standard Variables Values to a Parent DomainThe SDTM does not allow the addition of new variables. Therefore, the Supplemental Qualifiers special purpose dataset model is used to capture non-standard variables and their association to parent records in general-observation-class datasets (Events, Findings, Interventions) and Demographics. Supplemental Qualifiers are represented as separate SUPP-- datasets for each dataset containing sponsor-defined variables (see Section 8.4.2, Submitting Supplemental Qualifiers in Separate Datasets, for more on this topic). SUPP-- represents the metadata and data for each non-standard variable/value combination. As the name "Supplemental Qualifiers" suggests, this dataset is intended to capture additional Qualifiers for an observation. Data that represent separate observations should be treated as separate observations. The Supplemental Qualifiers dataset is structured similarly to the RELREC dataset, in that it uses the same set of keys to identify parent records. Each SUPP-- record also includes the name of the Qualifier variable being added (QNAM), the label for the variable (QLABEL), the actual value for each instance or record (QVAL), the origin (QORIG) of the value (see Section 4.1.8, Origin Metadata), and the Evaluator (QEVAL) to specify the role of the individual who assigned the value (such as ADJUDICATION COMMITTEE or SPONSOR). Controlled terminology for certain expected values for QNAM and QLABEL is included in Appendix C2, Supplemental Qualifiers Name Codes. SUPP-- datasets are also used to capture attributions. An attribution is typically an interpretation or subjective classification of one or more observations by a specific evaluator, such as a flag that indicates whether an observation was considered to be clinically significant. Since it is possible that different attributions may be necessary in some cases, SUPP-- provides a mechanism for incorporating as many attributions as are necessary. A SUPP-- dataset can contain both objective data (where values are collected or derived algorithmically) and subjective data (attributions where values are assigned by a person or committee). For objective data, the value in QEVAL will be null. For subjective data (when QORIG = "Assigned"), the value in QEVAL should reflect the role of the person or institution assigning the value (e.g., "SPONSOR" or "ADJUDICATION COMMITTEE"). The combined set of values for the first six columns (STUDYID…QNAM) should be unique for every record. That is, there should not be multiple records in a SUPP-- dataset for the same QNAM value, as it relates to IDVAR/IDVARVAL for a USUBJID in a domain. For example, if two individuals provide a determination on whether an Adverse Event is Treatment Emergent (e.g., the investigator and an independent adjudicator), then separate QNAM values should be used for each set of information, perhaps "AETRTEMI" and "AETRTEMA". This is necessary to ensure that reviewers can join/merge/transpose the information back with the records in the original domain without risk of losing information. Just as use of the optional grouping identifier variable, --GRPID, can be a more efficient method of representing relationships in RELREC, it can also be used in a SUPP-- dataset to identify individual qualifier values (SUPP-- records) related to multiple general-observation-class domain records that could be grouped, such as relating an attribution to a group of ECG measurements. 8.4.1 Supplemental Qualifiers – SUPP-- Datasetssupp--.xpt, Supplemental Qualifiers for [domain name] — Relationships, Version 3.3. One record per IDVAR, IDVARVAL, and QNAM value per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). A record in a SUPP-- dataset relates back to its parent record(s) via the key identified by the STUDYID, RDOMAIN, USUBJID, and IDVAR/IDVARVAL variables. An exception is SUPP-- dataset records that are related to Demographics (DM) records, where both IDVAR and IDVARVAL will be null because the key variables STUDYID, RDOMAIN, and USUBJID are sufficient to identify the unique parent record in DM (DM has one record per USUBJID). All records in the SUPP-- datasets must have a value for QVAL. Transposing source variables with missing/null values may generate SUPP-- records with null values for QVAL, causing the SUPP-- datasets to be extremely large. When this happens, the sponsor must delete the records where QVAL is null prior to submission. See Section 4.5.3, Text Strings That Exceed the Maximum Length for General-Observation-Class Domain Variables, for information on representing data values greater than 200 characters in length. See Appendix C2, Supplemental Qualifiers Name Codes, for controlled terminology for QNAM and QLABEL for some of the most common Supplemental Qualifiers. Additional QNAM values may be created as needed, following the guidelines provided in the CDISC Notes for QVAL. 8.4.2 Submitting Supplemental Qualifiers in Separate DatasetsThere is a one-to-one correspondence between a domain dataset and its Supplemental Qualifier dataset. The single SUPPQUAL dataset option that was introduced in SDTMIG v3.1 was deprecated. The set of Supplemental Qualifiers for each domain is included in a separate dataset with the name SUPP-- where "--" denotes the source domain which the Supplemental Qualifiers relate back to. For example, Demographics Qualifiers would be submitted in suppdm.xpt. When data have been split into multiple datasets (see Section 4.1.7, Splitting Domains), longer names such as SUPPFAMH may be needed. In cases where data about Associated Persons (see Associated Persons Implementation Guide) have been collected, Supplemental Qualifiers for Findings About events or interventions for an associated person may need to be represented. A dataset name with the SUPP fragment, e.g., SUPPAPFAMH, would be too long. In this case only, the "SUPP" portion should be shortened to "SQ", resulting in a dataset name such as SQAPFAMH. 8.4.3 SUPP-- ExamplesThe examples below llustrate how a set of SUPP-- datasets could be used to relate non-standard information to a parent domain. Example The two rows of suppae.xpt add qualifying information to adverse event data (RDOMAIN = "AE"). IDVAR defines the key variable used to link this information to the AE data (AESEQ). IDVARVAL specifies the value of the key variable within the parent AE record that the SUPPAE record applies to. The remaining columns specify the supplemental variables' names (AESOSP and AETRTEM), labels, values, origin, and who made the evaluation. suppae.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIGQEVAL11996001AE99-401AESEQ1AESOSPOther Medically Important SAESpontaneous AbortionCRF Example This example illustrates how the language used for a questionnaire might be represented. The parent domain (RDOMAIN) is QS, and IDVAR is QSCAT. QNAM holds the name of the Supplemental Qualifier variable being defined (QSLANG). The language recorded in QVAL applies to all of the subject's records where IDVAR (QSCAT) equals the value specified in IDVARVAL. In this case, IDVARVAL has values for two questionnaires (BPI and ADAS-COG) for two separate subjects. QVAL identifies the questionnaire language version (French or German) for each subject. suppqs.xpt RowSTUDYIDRDOMAINUSUBJIDIDVARIDVARVALQNAMQLABELQVALQORIGQEVAL11996001QS99-401QSCATBPIQSLANGQuestionnaire LanguageFRENCHCRF Additional examples may be found in the domain examples, such as in Section 5.2 Demographics, Examples 3 and 4, in Section 6.3.3, ECG Test Results, Example 1, and in Section 6.3.6, Laboratory Test Results, Example 1. 8.4.4 When Not to Use Supplemental QualifiersThe following are examples of data that should not be submitted as Supplemental Qualifiers:
8.5 Relating Comments to a Parent DomainThe Comments (CO) special purpose domain, which is described in Section 5.1, Comments, is used to capture unstructured free text comments. It allows for the submission of comments related to a particular domain (e.g., Adverse Events) or those collected on separate general-comment log-style pages not associated with a domain. Comments may be related to a Subject, a domain for a Subject, or to specific parent records in any domain. The Comments special purpose domain is structured similarly to the Supplemental Qualifiers (SUPP--) dataset, in that it uses the same set of keys (STUDYID, RDOMAIN, USUBJID, IDVAR, and IDVARVAL) to identify related records. All comments except those collected on log-style pages not associated with a domain are considered child records of subject data captured in domains. STUDYID, USUBJID, and DOMAIN (with the value CO) must always be populated. RDOMAIN, IDVAR, and IDVARVAL should be populated as follows:
If additional information was collected further describing the comment relationship to a parent record(s), and it cannot be represented using the relationship variables, RDOMAIN, IDVAR and IDVARVAL, this can be done by two methods:
As with Supplemental Qualifiers (SUPP--) and Related Records (RELREC), --GRPID and other grouping variables can be used as the value in IDVAR to identify comments with relationships to multiple domain records, for example as a comment that applies to a group of concomitant medications, perhaps taken as a combination therapy. The limitation on this is that a single comment may only be related to a group of records in one domain (RDOMAIN can have only one value). If a single comment relates to records in multiple domains, the comment may need to be repeated in the CO special purpose domain to facilitate the understanding of the relationships. Examples for Comments data can be found in Section 5.1, Comments. 8.6 How to Determine Where Data Belong in SDTM-Compliant Data Tabulations8.6.1 Guidelines for Determining the General Observation ClassSection 2.6, Creating a New Domain, discusses when to place data in an existing domain and how to create a new domain. A key part of the process of creating a new domain is determining whether an observation represents an Event, Intervention, or Finding. Begin by considering the content of the information in the light of the definitions of the three general observation classes (see Section 2.3, The General Observation Classes), rather than by trying to deduce the class from the information's physical structure; physical structure can sometimes be misleading. For example, from a structural standpoint, one might expect Events observations to include a start and stop date. However, Medical History data (data about previous conditions or events) is Events data regardless of whether dates were collected. An Intervention is something that is done to a subject (possibly by the subject) that is expected to have a physiological effect. This concept of an intended effect makes Interventions relatively easy to recognize, although there are gray areas around some testing procedures. For example, exercise stress tests are designed to produce and then measure certain physiological effects. The measurements from such a testing procedure are Findings, but some aspects of the procedure might be modeled as Interventions. An Event is something that happens to a subject spontaneously. Most, although not all, Events data captured in clinical trials is about medical events. Since many medical events must, by regulation, be treated as adverse events, a new Events domain will be created only for events that are clearly not adverse events; the existing Medical History and Clinical Events domain are the appropriate places to store most medical events that are not adverse events. Many aspects of medical events, including tests performed to evaluate them, interventions that may have caused them, and interventions given to treat them, may be collected in clinical trials. Where to place data on assessments of events can be particularly challenging, and is discussed further in Section 8.6.3, Guidelines for Differentiating Between Events, Findings, and Findings About Events. Findings general observation class data are measurements, tests, assessments, or examinations performed on a subject in the clinical trial. They may be performed on the subject as a whole (e.g., height, heart rate), or on a "specimen" taken from a subject (e.g., a blood sample, an ECG tracing, a tissue sample). Sometimes the relationship between a subject and a finding is less direct; a finding may be about an event that happened to the subject or an intervention they received. Findings about Events and Interventions are discussed further in Section 8.6.3, Guidelines for Differentiating Between Events, Findings, and Findings About Events. 8.6.2 Guidelines for Forming New DomainsIt may not always be clear whether a set of data represents one topic or more than one topic, and thus whether it should be combined into one domain or split into two or more domains. This implementation guide shows examples of both. In some cases, a single data structure works well for a variety of types of data. For example, all questionnaire data are placed in the QS domain, with particular questionnaires identified by QSCAT (see Section 6.3.13, Questionnaires, Ratings, and Scales (QRS) Domains). Although some operational databases may store urinalysis data in a separate dataset, SDTM places all lab data in the LB domain (see Section 6.3.6, Laboratory Test Results) with urinalysis tests identified using LBSPEC. In other cases, a particular topic may be very broad and/or require more than one data structure (and therefore require more than one dataset). Two examples in this implementation guide are the topics of microbiology and pharmacokinetics. Both have been modeled using two domain datasets (see Section 6.3.7, Microbiology Domains, and Section 6.3.11, Pharmacokinetics Domains). This is because, within these scientific areas, there is more than one topic, and each topic results in a different data structure. For example, the topic for PC is plasma (or other specimen) drug concentration as a function of time, and the structure is one record per analyte per time point per reference time point (e.g., dosing event) per subject. PP contains characteristics of the time-concentration curve such as AUC, Cmax, Tmax, half-life, and elimination rate constant; the structure is one record per parameter per analyte per reference time point per subject. 8.6.3 Guidelines for Differentiating Between Events, Findings, and Findings About EventsThis section discusses Events, Findings, and Findings about Events. The relationship between Interventions, Findings, and Findings about Interventions would be handled similarly. The Findings About domain was specially created to store findings about events. This section discusses Events and Findings generally, but it is particularly useful for understanding the distinction between the CE and FA domains. There may be several sources of confusion about whether a particular piece of data belongs in an Event record or a Findings record. One generally thinks of an event as something that happens spontaneously, and has a beginning and end; however, one should consider the following:
The structure of the data being considered, although not definitive, will often help determine whether the data represent an Event or a Finding. The questions below may assist sponsors in deciding where data should be placed in SDTM. QuestionInterpretation of AnswersIs this a measurement, with units, etc.?
The Events general observation class is intended for observations about a clinical event as a whole. Such observations typically include what the condition was (captured in --TERM, the topic variable) and when it happened (captured in its start and/or end dates). Other qualifier values collected (severity, seriousness, etc.) apply to the totality of the event. Note that sponsors may choose how they define the "event as a whole." Data that do not describe the event as a whole should not be stored in the record for that event or in a --SUPP record tied to that event. If there are multiple assessments of an event, then each should be stored in a separate FA record. When data related to an event do not fit into one of the existing Event general observation class Qualifiers, the first question to consider is whether the data represent information about the event itself, or about something (a Finding or Intervention) that is associated with the event.
If a Supplemental Qualifier is not appropriate, the data may be stored in Findings About. Section 6.4, Findings About Events or Interventions, provides additional information and examples. 8.7 Relating Study SubjectsRELSUB – Description/OverviewRELSUB describes collected relationships between study subjects. Some studies include subjects who are related to each other, and in some cases it is important to record those relationships. Studies in which pregnant women are treated, and both the mother and her child(ren) are study subjects are the most common case in which relationships between subjects are collected. There are also studies of genetically based diseases where subjects who are related to each other are enrolled, and the relationships between subjects are recorded. RELSUB – Specificationrelsub.xpt, Related Subjects — Relationships, Version 1.0. One record per relationship per related subject per subject, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). RELSUB – Assumptions
RELSUB – ExamplesExample The following data are from a hemophilia study (HEM021) in which the study subjects are a pair of fraternal (dizogotic) twins and their mother. Some expected and required variables not needed to illustrate the example are not shown. Row 1:Subject is the mother.Rows 2-3:Subjects are the children. dm.xpt RowSTUDYIDDOMAINUSUBJIDBRTHDTCAGEAGEUSEX1HEM021DMHEM021-0011941-05-1660YEARSF2HEM021DMHEM021-0021965-04-1235YEARSM3HEM021DMHEM021-0031965-04-1235YEARSM The RELSUB table for the three subjects whose demography data is shown above. Rows 1-2:The relationship of the mother to the two children.Rows 3, 5:The relationships of the children to the mother.Rows 4, 6:The relationships of the children to each other. relsub.xpt RowSTUDYIDUSUBJIDRSUBJIDSREL1HEM021HEM021-001HEM021-002MOTHER, BIOLOGICAL2HEM021HEM021-001HEM021-003MOTHER, BIOLOGICAL3HEM021HEM021-002HEM021-001CHILD, BIOLOGICAL4HEM021HEM021-002HEM021-003TWIN, DIZOGOTIC5HEM021HEM021-003HEM021-001CHILD, BIOLOGICAL6HEM021HEM021-003HEM021-002TWIN, DIZOGOTIC 9 Study ReferencesThere are occasions when it is necessary to establish study-specific terminology that will be used in subject data. Three such situations have been identified thus far:
9.1 Device IdentifiersThe Device Identifiers (DI) dataset establishes identifiers for devices, which are used to populate the variable SPDEVID. The dataset was introduced as part of the SDTMIG for Medical Devices (SDTMIG-MD). It was originally classified as a special purpose domain, but in SDTM v1.7, it is classified as a study reference dataset. The SDTMIG-MD includes the domain specification and assumptions and provides examples of its use. 9.2 Non-host Organism IdentifiersOI – Description/OverviewThe Non-host Organism Identifiers domain is for storing the levels of taxonomic nomenclature of microbes or parasites that have been either experimentally determined in the course of a study or are previously known, as in the case of lab strains used as reference in the study. The biological classification of a non-host organism typically stops at the taxonomic rank of "species". Scientific taxonomic nomenclature below the rank of species is not clearly defined, lacks a globally-accepted standard terminology, and is frequently organism-dependent. Therefore the OI domain addresses organism taxonomy with a series of parameters that name the taxa appropriate to the organism and the granularity with which the organism has been identified in the particular study. OI – Specificationoi.xpt, Non-host Organism Identifiers — Study Reference, Version 1.0. One record per taxon per non-host organism, Tabulation. Variable NameVariable LabelTypeControlled Terms, Codelist or Format1RoleCDISC NotesCoreSTUDYIDStudy IdentifierChar ¹ In this column, * indicates the variable may be subject to controlled terminology, and CDISC/NCI codelist code values are enclosed in (parenthesis). OI – Assumptions
OI – ExamplesExample This example shows taxonomic identifiers for HIV and HCV. NHOID is a unique non-host organism ID used to link findings on that organism in other datasets with details about its identification in OI. OIPARM shows the name of the individual taxa identified and OIVAL shows the experimentally determined values of those taxa. Rows 1-4:Show the taxonomy for the HIV organism given the NHOID of HIV1MC. This virus has been identified as HIV-1, Group M, Subtype C.Rows 5-8:Show the taxonomy for the HIV organism given the NHOID of HIV1MB, which was used as a reference. This virus has been identified as HIV-1, Group M, Subtype B.Rows 9-11:Show the taxonomy for the HCV organism given the NHOID of HCV2C. This virus has been identified as HCV 2c.Rows 12-14:Show the taxonomy for the HCV organism given the NHOID of H77. This virus is a known reference strain of HCV 1a. oi.xpt RowSTUDYIDDOMAINNHOIDOISEQOIPARMCDOIPARMOIVAL1STUDY123OIHIV1MC1SPCIESSpeciesHIV2STUDY123OIHIV1MC2TYPEType13STUDY123OIHIV1MC3GROUPGroupM4STUDY123OIHIV1MC4SUBTYPSubtypeC5STUDY123OIHIV1MB1SPCIESSpeciesHIV6STUDY123OIHIV1MB2TYPEType17STUDY123OIHIV1MB3GROUPGroupM8STUDY123OIHIV1MB4SUBTYPSubtypeB9STUDY123OIHCV2C1SPCIESSpeciesHCV10STUDY123OIHCV2C2GENTYPGenotype211STUDY123OIHCV2C3SUBTYPSubtypeC12STUDY123OIH771SPCIESSpeciesHCV13STUDY123OIH772GENTYPGenotype114STUDY123OIH773SUBTYPSubtypeA 9.3 Pharmacogenomic/Genetic Biomarker IdentifiersThe Pharmacogenomic/Genetic Biomarker Identifiers (PB) dataset establishes identifiers for pharmacogenomic/genetic biomarkers which are composed of groups of genetic variations. The dataset was introduced as part of the SDTMIG for Pharmacogenomic/Genetics (SDTMIG-PGx). It was originally classified as a special purpose domain, but it is to be reclassified as a study reference dataset. The SDTMIG-PGx includes the domain specification and assumptions and provides examples illustrating its use. AppendicesAppendix A: CDISC SDS Extended Leadership TeamThe CDISC SDS Extended Leadership Team would like to thank the many volunteers who contributed to the development, review, and publication of SDTMIG v3.3. Additionally, this publication would not have been possible without the support of the Foundational Team Leads, Global Governance Group, Regulatory Liaisons, and CDISC. SDS Extended Leadership TeamNameCompanyAmy AdyanthayaBiogenEllina BabouchkinaQuality Data ServicesAnthony ChowCDISCChristine Connolly - Current Leadership TeamEMD SeronoGary CunninghamThe Griesser GroupChris Gemma - Current Leadership TeamCDISCDan Godoy - Past Leadership TeamMedImmuneTom GuinterIndependentMike Hamidi - Current Leadership TeamPRA Health Sciences (formerly Merck & Co.)Sterling HardyMerck & Co. (formerly Bristol-Myers Squibb)Joyce HernandezIndependentMarcelina HungriaDIcore GroupKristin KellyPinnacle 21Éanna KielySyneosSteve KopkoCDISCBess LeRoyCDISCRichard LewisTalentMineStetson LineClinventiveTodd MaileyGlaxoSmithKlineBarrie Nelson - Past Leadership TeamNurocorJon NevilleCDISCAmy Palmer - Past Leadership TeamCDISCMelanie PaulesGlaxoSmithKlineCarlo Radovskyetera solutionsJanet Reich - Current Leadership TeamAmgenDonna SattlerEli LillyCary SmoakS-cubedSusan TierneyIQVIAMadhavi VemuriJanssen ResearchGary WalkerIQVIADarcy WoldCDISCDiane Wold - Past Leadership TeamCDISCFred Wood - Past Leadership TeamTalentMine Appendix B: Glossary and AbbreviationsThe following abbreviations and terms are used in this document. Additional definitions can be found in the CDISC Glossary available at https://www.cdisc.org/standards/semantics/glossary. ADaMCDISC Analysis Dataset ModelATC codeAnatomic Therapeutic Chemical code from WHO DrugCDISCClinical Data Interchange Standards ConsortiumCRFCase report form (sometimes case record form)CRTCase report tabulationcSDRGClinical Study Data Reviewers GuideCTCAECommon Terminology Criteria for Adverse EventsDatasetA collection of structured data in a single fileDefine-XMLCDISC standard for transmitting metadata that describes any tabular dataset structure.DomainA collection of observations with a topic-specific commonalityeDTElectronic Data TransferFDAFood and Drug AdministrationHL7Health Level 7ICHInternational Conference on Harmonisation of Technical Requirements for Registration of Pharmaceuticals for Human UseICH E2AICH guidelines on Clinical Safety Data Management: Definitions and Standards for Expedited ReportingICH E2BICH guidelines on Clinical Safety Data Management: Data Elements for Transmission of Individual Cases Safety ReportsICH E3ICH guidelines on Structure and Content of Clinical Study ReportsICH E9ICH guidelines on Statistical Principles for Clinical TrialsISOInternational Organization for StandardizationISO 8601ISO character representation of dates, date/times, intervals, and durations of time. The SDTM uses the extended format.ISO 3166ISO codelist for representing countries; the Alpha-3 codelist uses 3-character codes.LOINCLogical Observation, Identifiers, Names, and CodesMedDRAMedical Dictionary for Regulatory ActivitiesNCINational Cancer Institute (NIH)SDSSubmission Data Standards. Also the name of the team that created the SDTM and SDTMIG.SDTMStudy Data Tabulation ModelSDTMIGStudy Data Tabulation Model Implementation Guide: Human Clinical Trials [this document]SDTMIG-APStudy Data Tabulation Model Implementation Guide: Associated PersonsSDTMIG-MDStudy Data Tabulation Model Implementation Guide for Medical DevicesSDTMIG-PGxStudy Data Tabulation Model Implementation Guide: Pharmacogenomics/GeneticsSENDStandard for Exchange of Non-Clinical DataSF-36A multi-purpose, short-form health survey with 36 questionsSNOMEDSystematized Nomenclature of Medicine (a dictionary)SOCSystem Organ Class (from MedDRA)TDMTrial Design ModelUUIDUniversally Unique IdentifierWHODRUGWorld Health Organization Drug DictionaryXMLeXtensible Markup Language Appendix C: Controlled TerminologyCDISC Terminology is centrally managed by the CDISC Controlled Terminology Team, supporting the terminology needs of all CDISC foundational standards (SDTM, CDASH, ADaM, SEND) and all disease/therapeutic area standards. New/modified terms have a three-month development period during which the Controlled Terminology Team evaluates the requests received, incorporates as much as possible for each quarterly release, and has a quarterly public review comment period followed by a publication release. Visit the CDISC Controlled Terminology page (http://www.cdisc.org/terminology) to find the most recently published terminology packages (final or under review), or visit the NCI Enterprise Vocabulary Services CDISC Terminology website at https://www.cancer.gov/research/resources/terminology/cdisc for access to the full list of CDISC terminology. Note that the SDTM terminology was previously provided separately for questionnaires and other domains. However, as of the 2015-12-18 release, these were merged into a single publication. SDTM Implementation Guides (v3.1.3 or earlier) included several appendices regarding Controlled Terminology. Starting with SDTMIG 3.2, Appendix C was simplified to contain only a couple of important Terminology Code Lists that are specific to this Implementation Guide. Appendix C1: Trial Summary CodesThe Parameter table includes text to indicate if the parameter should be included in the dataset. To make this domain useful, a minimum number of trial summary parameters should be provided as shown below. The column titled "Record with this Parameter" indicates whether the parameter should be included in the dataset. If a record is included, either TSVAL or TSVALNF must be populated. Most of the new parameters are coming from http://www.clinicaltrials.gov/ and the controlled terminology shown below is aligned with that source. All definitions of the parameters are maintained in NCI EVS. The Notes column provides some additional information about the specific parameter or its values. TSPARMCDTSPARMTSVAL (Codelist Name or Format)Record with this ParameterNotesADDONAdded on to Existing TreatmentsNo Yes ResponseRequired Appendix C2: Supplemental Qualifiers Name CodesThe following table contains an initial set of standard name codes for use in the Supplemental Qualifiers (SUPP--)special purpose datasets. There are no specific conventions for naming QNAM and some sponsors may choose to include the 2-character domain in the QNAM variable name. Note that the 2-character domain code is not required in QNAM since it is present in the variable RDOMAIN in the SUPP-- datasets. QNAMQLABELApplicable DomainsAESOSPOther Medically Important SAEAEAETRTEMTreatment Emergent FlagAE--CLSIGClinically SignificantFindings--REASReasonAll general observation classes Appendix D: CDISC Variable-Naming FragmentsThe CDISC SDS group has defined a standard list of fragments to use as a guide when naming variables in SUPP-- datasets (as QNAM) or assigning --TESTCD values that could conceivably be treated as variables in a horizontal listing derived from a v3.x dataset. In some cases, more than one fragment is used for a given keyword. This is necessary when a shorter fragment must be used for a --TESTCD or QNAM that incorporates several keywords that must be combined while still meeting the 8-character variable naming limit of SAS transport files. When using fragments, the general rule is to use the fragment(s) that best conveys the meaning of the variable within the 8-character limit; thus, the longer fragment should be used when space allows. If the combination of fragments still exceeds 8 characters, a character should be dropped where most appropriate (while avoiding naming conflicts if possible) to fit within the 8-character limit. In other cases the same fragment may be used for more than one meaning, but these would not normally overlap for the same variable. Keyword(s)FragmentACTIONACNADJUSTMENTADJANALYSIS DATASETADASSAYASBASELINEBLBIRTHBRTHBODYBODCANCERCANCATEGORYCATCHARACTERCCLASSCLASCLINICALCLCODECDCOMMENTCOMCONCOMITANTCONCONDITIONCNDCONGENITALCONGDATE TIME - CHARACTERDTCDAYDYDEATHDTHDECODEDECODDERIVEDDRVDESCRIPTIONDESCDISABILITYDISABDOSE, DOSAGEDOS, DOSEDURATIONDURELAPSEDELELEMENTETEMERGENTEMENDEND, ENETHNICITYETHNICEVALUATIONEVLEVALUATOREVALEXTERNALXFASTINGFASTFILENAMEFNFLAGFLFORMULATION, FORMFRMFREQUENCYFRQGRADEGRGROUPGRPHOSPITALIZATIONHOSPIDENTIFIERIDINDICATIONINDCINDICATORINDINTERPRETATIONINTPINTERVALINTINVESTIGATORINVLIFE-THREATENINGLIFELOCATIONLOCLOINC CODELOINCLOWER LIMITLOMEDICALLY-IMPORTANT EVENTMIENAMENAMNON-STUDY THERAPYNSTNORMAL RANGENRNOT DONENDNUMBERNUMNUMERICNOBJECTOBJONGOINGONGOORDERORDORIGINORIGORIGINALOROTHEROTH, OOUTCOMEOUTOVERDOSEODPARAMETERPARMPATTERNPATTPOPULATIONPOPPOSITIONPOSQUALIFIERQUALREASONREASREFERENCEREF, RFREGIMENRGMRELATEDREL, RRELATIONSHIPRELRESULTRESRULERLSEQUENCESEQSERIOUSS, SERSEVERITYSEVSIGNIFICANTSIGSPECIMENSPEC, SPCSPONSORSPSTANDARDST, STDSTARTSTSTATUSSTATSUBCATEGORYSCATSUBJECTSUBJSUPPLEMENTALSUPPSYSTEMSYSTEXTTXTTIMETMTIME POINTTPTTOTALTOTTOXICITYTOXTRANSITIONTRANSTREATMENTTRTUNIQUEUUNITUUNPLANNEDUPUPPER LIMITHIVALUEVALVARIABLEVARVEHICLEV Appendix E: Revision HistoryThis appendix summarizes revisions since the last production version.
Section numberSectionChange1.1PurposeRemoved outdated language.1.2Organization of this DocumentAdded new Section 9 for Study Reference Datasets.1.3Relationship to Prior CDISC DocumentsUpdated to include new domains and Section 9.1.4How to Read this Implementation GuideAdded mentions of other SDTM implementation guides SDTMIG-AP, SDTMIG-MD, and SDTMIG-PGx.1.4.1How to Read a Domain SpecificationNew section1.5How to Submit CommentsDeleted. The CDISC Discussion Forum has been decommissioned. A replacement will be communicated when details are available.2.2Datasets and DomainsRemoved information available in the Define-XML Specification.2.3The General Observation ClassesSwitched order with what is now Section 2.4.2.4Datasets Other Than General Observation Class DomainsUpdated to include references to new Section 9.2.5The SDTM Standard Domain Models
Removed the restriction that EPOCH be used only when DSCAT = "DISPOSITION EVENT". Revised domain specification and assumptions to explicitly recognize and provide advice on:
New groupings of related domains:
Revised to limit this domain to data collected in a traditional physical examination of the body.
Expanded scope of the RS domain to include clinical classifications in addition to oncology disease response. The RS domain was moved from the old Oncology grouping to the Questionnaires, Ratings, and Scales grouping. 6.3.16Tumor/Lesion DomainsScope of the TU and TI domains was expanded to include lesions in addition to tumors.6.4Skin ResponseExamples 1 and 2 revised in consultation with a subject matter expert to be more accurate and realistic.7.1.2Definitions of Trial Design ConceptsPresented definitions in a table.7.2.2Trial ElementsCorrected erroneous domain values in third example.7.3.3Trial Disease MilestonesNew domain8.4.3SUPP-- ExamplesRemoved example showing population flags, since these supplemental qualifiers were removed.9Study ReferencesNew section9.1Device IdentifiersNew section. Provides basic information on domain and refers the user full information on this domain in the SDTMIG-MD.9.2Non-host Organism IdentifiersNew domain9.3Pharmacogenomic/Genetic Biomarker IdentifiersNew section. Provides basic information on domain and refers the user full information on this domain in the SDTMIG-PGx.Appendix ACDISC SDS Extended Leadership TeamReplaced former team list.Appendix CControlled TerminologyUpdated language describing past changes in controlled terminology publication and the SDTMIG Controlled Terminology appendices.Appendix C2Supplemental Qualifiers Name CodesRemoved population flag supplemental qualifiers.Appendix ERevision HistoryReplaced with summary of changes between SDTMIG v3.2 and SDTMIG v3.3.Appendix F: Representations and Warranties, Limitations of Liability, and DisclaimersCDISC Patent Disclaimers It is possible that implementation of and compliance with this standard may require use of subject matter covered by patent rights. By publication of this standard, no position is taken with respect to the existence or validity of any claim or of any patent rights in connection therewith. CDISC, including the CDISC Board of Directors, shall not be responsible for identifying patent claims for which a license may be required in order to implement this standard or for conducting inquiries into the legal validity or scope of those patents or patent claims that are brought to its attention. Representations and Warranties "CDISC grants open public use of this User Guide (or Final Standards) under CDISC's copyright." Each Participant in the development of this standard shall be deemed to represent, warrant, and covenant, at the time of a Contribution by such Participant (or by its Representative), that to the best of its knowledge and ability: (a) it holds or has the right to grant all relevant licenses to any of its Contributions in all jurisdictions or territories in which it holds relevant intellectual property rights; (b) there are no limits to the Participant's ability to make the grants, acknowledgments, and agreements herein; and (c) the Contribution does not subject any Contribution, Draft Standard, Final Standard, or implementations thereof, in whole or in part, to licensing obligations with additional restrictions or requirements inconsistent with those set forth in this Policy, or that would require any such Contribution, Final Standard, or implementation, in whole or in part, to be either: (i) disclosed or distributed in source code form; (ii) licensed for the purpose of making derivative works (other than as set forth in Section 4.2 of the CDISC Intellectual Property Policy ("the Policy")); or (iii) distributed at no charge, except as set forth in Sections 3, 5.1, and 4.2 of the Policy. If a Participant has knowledge that a Contribution made by any Participant or any other party may subject any Contribution, Draft Standard, Final Standard, or implementation, in whole or in part, to one or more of the licensing obligations listed in Section 9.3, such Participant shall give prompt notice of the same to the CDISC President who shall promptly notify all Participants. No Other Warranties/Disclaimers. ALL PARTICIPANTS ACKNOWLEDGE THAT, EXCEPT AS PROVIDED UNDER SECTION 9.3 OF THE CDISC INTELLECTUAL PROPERTY POLICY, ALL DRAFT STANDARDS AND FINAL STANDARDS, AND ALL CONTRIBUTIONS TO FINAL STANDARDS AND DRAFT STANDARDS, ARE PROVIDED "AS IS" WITH NO WARRANTIES WHATSOEVER, WHETHER EXPRESS, IMPLIED, STATUTORY, OR OTHERWISE, AND THE PARTICIPANTS, REPRESENTATIVES, THE CDISC PRESIDENT, THE CDISC BOARD OF DIRECTORS, AND CDISC EXPRESSLY DISCLAIM ANY WARRANTY OF MERCHANTABILITY, NONINFRINGEMENT, FITNESS FOR ANY PARTICULAR OR INTENDED PURPOSE, OR ANY OTHER WARRANTY OTHERWISE ARISING OUT OF ANY PROPOSAL, FINAL STANDARDS OR DRAFT STANDARDS, OR CONTRIBUTION. Limitation of Liability IN NO EVENT WILL CDISC OR ANY OF ITS CONSTITUENT PARTS (INCLUDING, BUT NOT LIMITED TO, THE CDISC BOARD OF DIRECTORS, THE CDISC PRESIDENT, CDISC STAFF, AND CDISC MEMBERS) BE LIABLE TO ANY OTHER PERSON OR ENTITY FOR ANY LOSS OF PROFITS, LOSS OF USE, DIRECT, INDIRECT, INCIDENTAL, CONSEQUENTIAL, OR SPECIAL DAMAGES, WHETHER UNDER CONTRACT, TORT, WARRANTY, OR OTHERWISE, ARISING IN ANY WAY OUT OF THIS POLICY OR ANY RELATED AGREEMENT, WHETHER OR NOT SUCH PARTY HAD ADVANCE NOTICE OF THE POSSIBILITY OF SUCH DAMAGES. Note: The CDISC Intellectual Property Policy can be found at: cdisc_policy_003_intellectual_property_v201408.pdf. How many 5 digit numbers can be formed from 12345 if repetition is allowed?Yes the answer is 120.
How many three digit numbers can be formed from the digits 1234 and 5 if repetition is not allowed?so 60(ans.)
How many 3Therefore 120 such numbers are possible.
How many 3ANSWER: 120 three-digit numbers can be formed WITHOUT REPETITION OF DIGITS.
|