THRio - CREATE Biostats Core

Download Report

Transcript THRio - CREATE Biostats Core

THRio
Outline

Data flow and database

Database matching (linkage) issues
Data flow

Data abstracted from medical charts





Specific forms (several)
Forms are reviewed on site
Forms get to our central office from Health
Care Units over the mail
Further inconsistencies are spotted by the
DB system
Double data entry
Database

Flexibility X Complexity

Access + VBA



Simpler
User friendly
Not all the features we needed
Database

SQL Server + Delphi





Very flexible
Secure remote access
Simultaneous transactions
Good freeware servers (MySQL, PostgreSQL)
Interface with programming languages (S,
Perl)
Database


Single server running PostgreSQL (Linux)
5 clients connected through intranet
(Windows)




No Internet access (security concerns)
Electronic forms – Delphi
Allows logs for every action in the DB
Double data entry analyzed within the
system
Database

Problems

System getting too complex


Auditing takes a long time



Will suffer major update
Will be done in real time now
We are still having bugs in the system that
have to be resolved
Not all forms are ready
Database matching

There are several systems that do not
“talk” to each other





SINAN – reportable diseases (TB, AIDS)
SIM – Mortality
SICOM – Pharmaceutical database (ARVs)
THRio – Our DB
We will need to match THRio with all other
3 DBs above
Database matching

Problems



THRio




There is no unique identifier common for all
systems
We use name, gender and DOB as surrogates
Standardization of names abbreviations
Double data entry
Not enough – names are misspelled
The other databases – even worse

No QC
Database matching

SICOM



Monthly cumulative DB
One Excel spreadsheet for each month
Names are repeated




No guarantee that the name will remain the same
for all spreadsheets
Missing DOBs
These problems are more prevalent in older
spreadsheets
Hard to reconstruct ARV history for patients
Database matching

Proposed solution

Compare different approaches
Translated SOUNDEX
 Reclink – probabilistic linkage
 Other algorithms



Apply to different examples and get
sensitivity/specificity for each one
SICOM – it’ll be trickier


Sequential matching
Match TB before doing the sequential
DSMB Issues

TB case definition



Very important
Our primary outcome
Special form to get info about Dx and Tx



Including alternative sources (e.g. “Black book”)
Discussed today in THRio meeting
Came up with a classification like this…
Clinical presentation
compatible with TB
TB Classification
Culture +
Culture -
Smear Clinical response to Tx
AND no alternative Dx
Smear +
Definite
Probable
Yes
Possible
No
No TB
DSMB Issues

Adherence evaluation

Sample of patients – standardized survey
Adapted from ARV adherence survey
 Already validated


Adherence should be reported in a more
timely fashion



Originally, data would be collected only after
course is complete
Now we will collect it whenever it is available
We’ll still have a delay
DSMB Issues

Education level


Years of study (self-reported)
Race/ethnicity




Population highly admixtured in Brazil
Very hard to objectively classify
Even the Brazilian Census Bureau uses selfdeclared race/ethnicity
Could lead to distortions