ADaM 2.1 Implementation: A Challenging Next Step in the Process

Download Report

Transcript ADaM 2.1 Implementation: A Challenging Next Step in the Process

CDISC ADaM 2.1 Implementation: A Challenging Next Step in the Process Presented by Tineke Callant 2014-03-14

Agenda  CDISC - Introduction  CDISC - Foundational standards  CDISC ADaM V2.1 - Analysis data flow  CDISC ADaM V2.1 - ADaM data structures  CDISC ADaM V2.1 - Traceability  CDISC ADaM V2.1 - ADaM metadata  CHKSTRUCT macro  Linear method - Challenges and solutions  Take home messages 2

Clinical Data Interchange Standards Consortium - Introduction  1997 - Inception  2000 - 32 global companies CDISC is a global, open, multidisciplinary, non-profit organization that has established standards to support the acquisition, exchange, submission and archive of clinical research data and metadata.

 2014     ± 200 organizations biotechnology and pharmaceutical development companies device and diagnostic companies CROs and technology providers government institutions, academic research centers and other non-profit organizations 3

Clinical Data Interchange Standards Consortium - Introduction 4

Clinical Data Interchange Standards Consortium - Introduction  Mission statement The CDISC mission is to develop and support global, platform-independent data standards that enable information system interoperability to improve medical research and related areas of healthcare.

Data standards to improve clinical research 5

Clinical Data Interchange Standards Consortium - Introduction - 2001: Biomedical Research Integrated Domain Group (BRIDG) Model 6

Agenda  CDISC - Introduction  CDISC - Foundational standards  CDISC ADaM V2.1 - Analysis data flow  CDISC ADaM V2.1 - ADaM data structures  CDISC ADaM V2.1 - Traceability  CDISC ADaM V2.1 - ADaM metadata  CHKSTRUCT macro  Linear method - Challenges and solutions  Take home messages 7

CDISC - Foundational standards 8

CDISC - Foundational standards content transport 9

CDISC - Foundational standards 10

CDISC - Foundational standards  Study Data Tabulation Model (SDTM) The content standard for regulatory submission of case report form data tabulations from clinical research studies.

Datasets containing data collected during the study and organized by clinical domain.

 Analysis Data Model (ADaM) The content standard for regulatory submission of analysis datasets and associated files.

Datasets used for statistical analysis and reporting by the sponsor, submitted in addition to the SDTM domains.

11

Agenda  CDISC - Introduction  CDISC - Foundational standards  CDISC ADaM V2.1 - Analysis data flow  CDISC ADaM V2.1 - ADaM data structures  CDISC ADaM V2.1 - Traceability  CDISC ADaM V2.1 - ADaM metadata  CHKSTRUCT macro  Linear method - Challenges and solutions  Take home messages 12

CDISC ADaM V2.1 - Analysis data flow ADaM 13

Agenda  CDISC - Introduction  CDISC - Foundational standards  CDISC ADaM V2.1 - Analysis data flow  CDISC ADaM V2.1 - ADaM data structures  CDISC ADaM V2.1 - Traceability  CDISC ADaM V2.1 - ADaM metadata  CHKSTRUCT macro  Linear method - Challenges and solutions  Take home messages 14

CDISC ADaM V2.1 - ADaM data structures  The Subject-Level Analysis Dataset (ADSL) structure  The Basic Data Structure (BDS)  Other 15

CDISC ADaM V2.1 - ADaM data structures The Subject-Level Analysis Dataset (ADSL) structure  One record per subject  Variables (required + other)      Study identifiers (e.g. DM.STUDYID) Subject demographics (e.g. DM.AGE) Population indicator(s) (e.g. RANDFL) Treatment variables (e.g. DM.ARM) Trial dates (e.g. RANDDT)  Required in a CDISC-based submission 16

CDISC ADaM V2.1 - ADaM data structures  The Subject-Level Analysis Dataset (ADSL) structure  The Basic Data Structure (BDS)  Other 17

CDISC ADaM V2.1 - ADaM data structures The Basic Data Structure (BDS)  One or more records per subject, per analysis parameter, per analysis time point (conditionally required)  Variables      e.g. PARAM and related variables e.g. AVAL and AVALC and related variables e.g. the subject identification e.g. DTYPE e.g. treatment variables, covariates  Supports the majority of statistical analyses 18

CDISC ADaM V2.1 - ADaM data structures  The Subject-Level Analysis Dataset (ADSL) structure  The Basic Data Structure (BDS)  Other 19

CDISC ADaM V2.1 - ADaM data structures Other  CDISC ADaM Basic Data Structure for Time-to-Event Analysis Version 1.0 - May 8, 2012  CDISC ADaM Data Structure for Adverse Event Analysis Version 1.0 - May 10, 2012 20

Agenda  CDISC - Introduction  CDISC - Foundational standards  CDISC ADaM V2.1 - Analysis data flow  CDISC ADaM V2.1 - ADaM data structures  CDISC ADaM V2.1 - Traceability  CDISC ADaM V2.1 - ADaM metadata  CHKSTRUCT macro  Linear method - Challenges and solutions  Take home messages 21

CDISC ADaM V2.1 - Analysis data flow ADaM 22

CDISC ADaM V2.1 - Traceability  Understanding the relationship of element vs. predecessor  Enabling transparancy  Analysis results → Analysis datasets → SDTM 23

CDISC ADaM V2.1 - Traceability Strategies for implementing SDTM and ADaM standards Susan Kenny – Michael Litzsinger  Parallel method DBMS Extract SDTM Domains Analysis Datasets  Retrospective method DBMS Extract → Analysis Datasets → SDTM Domains  Linear method DBMS Extract → SDTM Domains → Analysis Datasets  Hybrid method DBMS Extract → SDTM Draft Domains → Analysis Datasets → SDTM Final Domains 24

CDISC ADaM V2.1 - Traceability  Traceability 25

CDISC ADaM V2.1 - Traceability  CDISC ADaM V2.1  Fundamental principles – Provide traceability between the analysis data and its source data  Practical considerations – Maintain the values and attributes of SDTM variables  CDISC ADaM implementation guide (IG) V1.0

 General variable naming conventions 26

CDISC ADaM V2.1 - Traceability General variable naming conventions Any ADaM variable whose name is the same as an SDTM variable must be a copy of the SDTM variable, and its label, meaning, and values must not be modified 27

CDISC ADaM V2.1 - Traceability Strategies for implementing SDTM and ADaM standards Susan Kenny – Michael Litzsinger  Parallel method DBMS Extract SDTM Domains Analysis Datasets  Retrospective method DBMS Extract → Analysis Datasets → SDTM Domains  Linear method DBMS Extract → SDTM Domains → Analysis Datasets  Hybrid method DBMS Extract → SDTM Draft Domains → Analysis Datasets → SDTM Final Domains 28

CDISC ADaM V2.1 - Traceability Strategies for implementing SDTM and ADaM standards Susan Kenny – Michael Litzsinger  Linear method DBMS Extract → SDTM Domains → Analysis Datasets    Traceability CDISC SDTM/ADaM Pilot Project Recommended 29

CDISC ADaM V2.1 - Traceability Strategies for implementing SDTM and ADaM standards Susan Kenny – Michael Litzsinger  Hybrid method DBMS Extract → SDTM Draft Domains → Analysis Datasets → SDTM Final Domains    Traceability Amendment 1 SDTM V1.2 and SDTM IG V3.1.2

Future?!?

30

CDISC ADaM V2.1 - Traceability  Traceability → Recommended: Linear method  Flexible  Delivery of consistent analysis datasets  Easy to use (Excel file)  Easy to maintain (Excel file) 31

Agenda  CDISC - Introduction  CDISC - Foundational standards  CDISC ADaM V2.1 - Analysis data flow  CDISC ADaM V2.1 - ADaM data structures  CDISC ADaM V2.1 - Traceability  CDISC ADaM V2.1 - ADaM metadata  CHKSTRUCT macro  Linear method - Challenges and solutions  Take home messages 32

CDISC ADaM V2.1 - ADaM metadata  Microsoft Office Excel spreadsheet as framework  Metadata 33

CDISC ADaM V2.1 - ADaM metadata  Microsoft Office Excel spreadsheet as framework  analysis dataset  %CHKSTRUCT(ds_ = )   Automatization Compliance  define.xml

34

CDISC ADaM V2.1 - ADaM metadata  Analysis dataset metadata  Analysis variable metadata  Analysis parameter value-level metadata  Analysis results metadata 35

CDISC ADaM V2.1 - ADaM metadata Analysis dataset metadata ! ≠ SDTM !

The key variables should define uniqueness  Illustration from CDISC ADaM V2.1

 Practical consideration: ADxxxxxx 36

CDISC ADaM V2.1 - ADaM metadata Analysis dataset metadata Analysis dataset naming convention  ADxxxxxx  The subject-level analysis dataset is named ADSL  max. 8 characters 37

CDISC ADaM V2.1 - ADaM metadata  Analysis dataset metadata  Analysis variable metadata  Analysis parameter value-level metadata  Analysis results metadata 38

CDISC ADaM V2.1 - ADaM metadata Analysis variable metadata  Illustration from CDISC ADaM V2.1

39

CDISC ADaM V2.1 - ADaM metadata  Analysis dataset metadata  Analysis variable metadata  Analysis parameter value-level metadata  Analysis results metadata 40

CDISC ADaM V2.1 - ADaM metadata Analysis parameter value-level metadata  Illustration from CDISC ADaM V2.1

41

CDISC ADaM V2.1 - ADaM metadata  Analysis dataset metadata  Analysis variable metadata  Analysis parameter value-level metadata  Analysis results metadata (not required) 42

CDISC ADaM V2.1 - ADaM metadata Analysis variable metadata in practice   Analysis dataset metadata Analysis variable metadata Dataset name Variable name Variable label Variable type Display format Codelist / Controlled terms Source / Derivation Parameter identifier (Basic Data Structure (BDS))  Analysis results metadata (not required) 43

CDISC ADaM V2.1 - ADaM metadata  Microsoft Office Excel spreadsheet as framework  Metadata 44

CDISC ADaM V2.1 - ADaM metadata Analysis variable metadata in practice   SAS variable attributes To work in a SAS environment – – – NAME TYPE LENGTH – – – – FORMAT INFORMAT LABEL POSITION IN OBSERVATION – INDEX TYPE  Analysis variable metadata fields – – – – – – – – DATASET NAME VARIABLE NAME VARIABLE LABEL VARIABLE TYPE DISPLAY FORMAT CODELIST / CONTROLLED TERMS SOURCE / DERIVATION BASIC DATA STRUCTURE: PARAMETER IDENTIFIER 45

CDISC ADaM V2.1 - ADaM metadata Analysis variable metadata in practice  Example ...

46

CDISC ADaM V2.1 - ADaM metadata Analysis variable metadata in practice Subposition in observation  Example  ADSL – SITEGR* (Char) and SITEGR*N (Num) * = a single digit [1-9]  SITEID  SITEID grouped together by city in the variable SITEGR1 (SITEGR1N)  SITEID grouped together by province in the variable SITEGR2 (SITEGR2N) 47

ORDER 1 CDISC ADaM V2.1 - ADaM metadata Analysis variable metadata in practice Subposition in observation 1 2 2 %CHKSTRUCT(ds_ = ADSL) 48

CDISC ADaM V2.1 - ADaM metadata Analysis variable metadata in practice Subposition in observation ORDER 1 2 1 2 49

CDISC ADaM V2.1 - ADaM metadata Analysis variable metadata in practice Subposition in observation  Example  ADSL – SITEGR* (Char) and SITEGR*N (Num) * = a single digit [1-9] 2 3 4 POSITION IN OBSERVATION 1 4 1 2 SUBPOSITION IN OBSERVATION VARIABLE NAME STUDYID USUBJID SITEID SITEGR* SITEGR*N 50

CDISC ADaM V2.1 - ADaM metadata Analysis variable metadata in practice ...

51

CDISC ADaM V2.1 - ADaM metadata Analysis variable metadata in practice - Core

CDISC SDTM

Req - Required

CDISC ADaM

Req - Required The variable must be included in the dataset and cannot be null for any record.

Exp - Expected The variable must be included in the dataset.

Cond - Conditionally required ... and may contain some null values.

Perm - Permissible ... in certain circumstances.

Perm - Permissible The variable should be used in a domain as appropriate when collected or derived.

The variable may be included in the dataset, but is not required.

 Nulls are allowed 52

Agenda  CDISC - Introduction  CDISC - Foundational standards  CDISC ADaM V2.1 - Analysis data flow  CDISC ADaM V2.1 - ADaM data structures  CDISC ADaM V2.1 - Traceability  CDISC ADaM V2.1 - ADaM metadata  CHKSTRUCT macro  Linear method - Challenges and solutions  Take home messages 53

CHKSTRUCT macro  Microsoft Office Excel spreadsheet as framework  analysis dataset  %CHKSTRUCT(ds_ = )   Automatization Compliance  define.xml

54

Before CHKSTRUCT macro - Automatization 4 6 5 7 1 2 3 After ORDER THE ANALYSIS VARIABLES %CHKSTRUCT(ds_ = ADSL) 1 2 3 4 5 6 7 55

Before CHKSTRUCT macro - Automatization After LABEL THE ANALYSIS VARIABLES %CHKSTRUCT(ds_ = ADSL) 56

CHKSTRUCT macro - Automatization Key variables Before 2 1 6 9 8 10 3 4 7 5 SORT THE ANALYSIS DATASET Key variables %CHKSTRUCT(ds_ = ADSL) After 1 2 3 4 5 6 7 8 9 10 57

CHKSTRUCT macro – Compliance Analysis dataset Analysis variable metadata 58

CHKSTRUCT macro – Compliance Analysis dataset Analysis variable metadata 59

CHKSTRUCT macro – Compliance Analysis dataset Analysis variable metadata 60

CHKSTRUCT macro  Excel spreadsheet as framework  Purpose    Reference Automatization Compliance 61

Agenda  CDISC - Introduction  CDISC - Foundational standards  CDISC ADaM V2.1 - Analysis data flow  CDISC ADaM V2.1 - ADaM data structures  CDISC ADaM V2.1 - Traceability  CDISC ADaM V2.1 - ADaM metadata  CHKSTRUCT macro  Linear method - Challenges and solutions  Take home messages 62

Linear method - Challenges and solutions

Step 1

63

Linear method - Challenges and solutions Step 1 - CDISC SDTM Implementation Guide ...

...

64

Linear method - Challenges and solutions Step 1 - CDISC SDTM Implementation Guide Any ADaM variable whose name is the same as an SDTM variable must be a copy of the SDTM variable, and its label, meaning, and values must not be modified 65

Linear method - Challenges and solutions Step 1 - CDISC SDTM Implementation Guide Challenge: Flexible variable length ...

...

...

66

Linear method - Challenges and solutions Step 1 - CDISC SDTM Implementation Guide Challenge: Flexible variable length 

CDISC SDTM IG

 Variables of the same name in split datasets should have the same SAS Length attribute    Version 5 SAS transport file format: max. 200 characters -- TESTCD and QNAM: max. 8 characters -- TEST and QLABEL: max. 40 characters 

Example: DM.RACE: $41, $50, and $200

Amendment 1 to SDTM V1.2 and SDTM IG V3.1.2

 Version 5 SAS transport file format: max. 200 characters

! only if necessary !

67

Linear method - Challenges and solutions Step 1 - CDISC SDTM Implementation Guide Challenge: Flexible variable length  Traceability  Flexible  Delivery of consistent analysis datasets  Easy to use  Easy to maintain 68

Linear method - Challenges and solutions Step 1 - CDISC SDTM Implementation Guide Solution: [sdtm] ↔ %CHKSTRUCT(ds_ = ) 69

Linear method - Challenges and solutions Step 1 - CDISC SDTM Implementation Guide Challenge: Permissible variables Example: LB.LBSCAT

Solution: [sdtm] ↔ %CHKSTRUCT(ds_ = ) 70

Linear method - Challenges and solutions

Step 2

71

Linear method - Challenges and solutions Step 2 - SUPP-    QNAM QLABEL QVAL e.g. SUPPDM SDTM dataset → variable name → variable label → variable type → variable length e.g. ADSL ADaM dataset 72

Linear method - Challenges and solutions Step 2 - SUPP- Challenge: Flexible code list  QLABEL is different for the same QNAM – Example ELIGCONF Subject Still Eligible ELIGCONF Still Fulfill Eligibility Criteria  QLABEL format – Example RANDNO RANDOMIZATION NUMBER RANDNO Randomization Number  QLABEL changes during the course of a study – Example ELIGIBLE

Suject

Eligible For Dosing ELIGIBLE

Subject

Eligible For Dosing 73

Linear method - Challenges and solutions Step 2 - SUPP- Solution: [supp] ↔ %CHKSTRUCT(ds_ = ) 74

Linear method - Challenges and solutions

Step 3

75

Linear method - Challenges and solutions - Step 3 ADaM 76

1 2 Linear method - Challenges and solutions - Step 3 Challenge: 12 SDTM → 12 ADaM?!?

3 6 SDTM

?

?

?

?

ADaM

?

?

?

?

?

?

?

?

7 8 4 12 5 11 10 9 77

Linear method - Challenges and solutions - Step 3 Solution: 1 central model + sponsor specific add-ons central ADaM model domlist.sas7bdat

varlist.sas7bdat

codelist.sas7bdat

1

domlist.sas7bdat

varlist.sas7bdat

codelist.sas7bdat

2

sponsor specific add-on domlist.sas7bdat

varlist.sas7bdat

codelist.sas7bdat

1 1

Convert Excel file to SAS datasets (by ADaM administrator)

2

Combine central model and sponsor specific add-on (by study programmer) 78

Linear method - Challenges and solutions - Step 3 Solution: 1 central model + sponsor specific add-ons  Traceability  Flexible  Delivery of consistent analysis datasets  Easy to use  Easy to maintain 79

Linear method - Challenges and solutions

Step 4

80

1 2 Linear method - Challenges and solutions - Step 4 Challenge: SDTM model no. 1, 2, 3 ... ?

3 6 SDTM

?

?

?

?

ADaM

?

?

?

?

?

?

?

?

7 8 4 12 5 11 10 9 81

Linear method - Challenges and solutions - Step 4 Solution: Central metadata repository  CDISC metadata    SDTM version SDTM metadata ...

 Study characteristics     Therapeutic area Clinical phase Trial design characteristics ...

 Project metadata    Study timelines Key Performance Indicators ...

82

Linear method - Challenges and solutions

Step 5

83

Linear method - Challenges and solutions – Step 5 Challenge: Future 84

Linear method - Challenges and solutions – Step 5 Challenge: Future 85

Agenda  CDISC - Introduction  CDISC - Foundational standards  CDISC ADaM V2.1 - Analysis data flow  CDISC ADaM V2.1 - ADaM data structures  CDISC ADaM V2.1 - Traceability  CDISC ADaM V2.1 - ADaM metadata  CHKSTRUCT macro  Linear method - Challenges and solutions  Take home messages 86

  Take home messages Message no. 1 SDTM and ADaM go hand in hand Thus, without a CDISC compliant SDTM database to start from, ADaM cannot exist ADaM SDTM  But do realize a strong analysis data model needs more than a CDISC compliant SDTM database alone 87

Take home messages Message no. 2  Linear method:   Recommended Challenging  Solution:   SDTM: Central metadata repository ADaM: Automatization, e.g. [sdtm], [supp] … Study medata differences are handled efficiently 88

E-mail: [email protected]

Internet: www.sgs.com/cro 89