Transcript PPT

CSE5810
Computer Science Issues In a Patient’s Perspective
Exploring the application of data mining
to Bioinformatics
Guanming Wu
Computer Science & Engineering Department
University of Connecticut
[email protected]
Background
CSE5810

Nature of being asynchronous


Management of patients’ medical records
Lack of a full(fuller) perception




Speciality
Multiple medical providers
Transparency issue
Separated medical record
2
Motivation
CSE5810

Developed in a patient’s perspective

More outsider-friendly

Used by participants in other levels and domains

Improvements in efficiency
3
Goal
CSE5810

Higher data transparency

Better comprehensibility

Higher efficiency
4
Basic Information
CSE5810
There are progresses in Bioinformatics researches:

Literature extraction based on keywords

Literature extraction from words combined
with the context of the words

Information extraction from nature language
(with high error rate)

Automated database curation and ontology
development
5
Data storage
CSE5810

Update(download) medical data on demand

Store the data locally

Does not upload data locally generated

One database per account

Architecture

Sequence
6
Standardization & quantization
CSE5810

Diverse formats of medical records



Medical test reports
Diagnosis
Prescriptions




Standardization


Easier to process
More comprehensible
Detail-oriented
Formats
Quantization

Tests and results
7
Application of data mining (1)
CSE5810
Data Mining

Helps medical systems better benefit from
data and analytics

Helps improve user-friendliness

Helps Reduces inefficiency and low-term costs
8
Application of data mining (2)
Analysis of large datasets to discover their patterns
CSE5810
Use the patterns to build models
Predict the likelihood of a patient having
a certain type of disease;
(Possible) stage of a patient
1
2
9
Application of data mining (3)
CSE5810
1 For patients:



Chronic disease management
Reminders/warning
etc.
2 For medical professionals/administration



Follow-up of patients with chronic disease(s)
Patients/professionals/Staffing management
etc.
10
Train data
CSE5810
Generate models
Data
Test Models
- from different providers
- in different formats
Run Models
- in different orders
Result
Original
database
Data
local
database
11
CSE5810
Error rate issue
Not a solid evidence/proof
Recommandation
Decision Making
12
CSE5810
Thank you
for your attention (and patience)