13
LINDI Miner • Goal – Investigate causes of high cholesterol to inform new research directions • Use Scenario – User loads a collection of Medline abstracts for mining – User explores documents containing the string cholesterol

LINDI Miner

  • Upload
    makan

  • View
    40

  • Download
    2

Embed Size (px)

DESCRIPTION

LINDI Miner. Goal Investigate causes of high cholesterol to inform new research directions Use Scenario User loads a collection of Medline abstracts for mining User explores documents containing the string cholesterol. LINDI Text Miner. Create a new project using. T ext File(s). - PowerPoint PPT Presentation

Citation preview

Page 1: LINDI Miner

LINDI Miner

• Goal– Investigate causes of high cholesterol to inform

new research directions

• Use Scenario– User loads a collection of Medline abstracts for

mining– User explores documents containing the string

cholesterol

Page 2: LINDI Miner

LINDI Text Miner File Project Window Help

Summary

Query

Analysis

Term Sets

Document Sets

x x

a c u y m z

History

No documents available

LINDI Text Miner

Create a new project usingText File(s)

XML File(s)

Database(s)

Open an existing project

More Files…tobacco.lpf\…\cancerdata.lpf\\share\…\water.lpf

CancelOK

Don’t show this dialog box again

HTML File(s)

Page 3: LINDI Miner

LINDI Text Miner File Project Window Help

Summary

Query

Analysis

Term Sets

Document Sets

x x

a c u y m z

History

No documents available

Project Files

Cancel

OK

IncludeSubfolders

Look in: medline

Q1Q2Q3Q4

Add All

Add

Remove

File name:

Files of type: Text Files(*.txt)

Files in Project:

Page 4: LINDI Miner

LINDI Text Miner File Project Window Help

Summary

Query

Analysis

Term Sets

Document Sets

x x

a c u y m z

History

No documents available

Project Options

CancelOK

Exclude During ParsingStop WordsNamed Entities

Include During ParsingMetadataSynonyms

Indexing

Term Log termFrequency:

None InverseScaling: Entropy 1 Norm

None 1 NormNormalization:

Independent term and document normalization

Inclusion Range: of documents

Page 5: LINDI Miner

LINDI Text Miner File Project Window Help

Summary

Query

Analysis

Term Sets

Document Sets

x x

a c u y m z

History

Save Project As

Cancel

Save

Look in: medline

Q1Q2Q3Q4

File name:

Save as type: LINDI Project(*.lpf)

medline.lpf

No documents available

Page 6: LINDI Miner

LINDI Text Miner File Project Window Help

Summary

Query

Analysis

Term Sets

Document Sets

x x

a c u y m z

History

No documents available

Project Setup

Status: Processing file 99459448.txt

Page 7: LINDI Miner

LINDI Text Miner File Project Window Help

Summary

Query

Analysis

Term Sets

Document Sets

x x

a c u y m z

History

All docs (12793): .*

- medline.lpf

View topics

Page 8: LINDI Miner

LINDI Text Miner File Project Window Help

Summary

Query

Analysis

Term Sets

Document Sets

x x

a c u y m z

History

All docs (12793): .*

- medline.lpf

Topics – Step 1 of 2

Use document setsAll docs (12793)

Apply term setsNo term sets available

Cancel | < Back | Next > | Create

Page 9: LINDI Miner

LINDI Text Miner File Project Window Help

Summary

Query

Analysis

Term Sets

Document Sets

x x

a c u y m z

History

All docs (12793): .*

- medline.lpf

Terms Documents

Yes No

Topics – Step 2 of 2

Map Creation

Algorithm: Self-organizing map

Number of Topics: 40

Display Options

Color Coding:Display Map: Yes No

Topic Labels:

Display Table: Yes No

Cancel | < Back | Next > | Create

Page 10: LINDI Miner

LINDI Text Miner File Project Window Help

Summary

Query

Analysis

Term Sets

Document Sets

x x

a c u y m z

History

All docs (12793): .*

- medline.lpf

Topic Map (All terms (7944); Cholesterol docs)

Topic Size Actionarterial 123.73fluvastatin 29.18diabetic 29.01association 18.81triglycerides 17.61uptake 15.73data 11.50tgf 11.18chinese 11.08mediated 9.80

Search for:

Status: map creation completed

arterial fluvastatindiabetic

triglyceridestgf

chinese

association

mediatedvldl

uptake

data

methylglutaryl

nephropathy

detection

ambulatory

cisplatin

shock

antihypersentive

separation

glycosylphos-phatidylinositol

estimates

hemoglobine

implicated

edcfepa

lcfa

subcutaneous

compartments

cycle

establish

haemostatic

start

synthetic

released

radicalscavenge

describes

vldl 7.94methylglutaryl 7.54nephropathy 6.89describes 6.81detection 6.02ambulatory 5.35cisplatin 5.17shock 5.03antihypertensive 4.91separation 4.69glycosylphosphati… 4.31estimates 4.31epa 4.19lcfa 3.97hemoglobine 3.94

Topics (All docs)

data determined decrease presence suggest treated

Select All | Clear All | Save Terms | Save Docs

Options Find

Page 11: LINDI Miner

LINDI Text Miner File Project Window Help

Summary

Query

Analysis

Term Sets

Document Sets

x x

a c u y m z

History

All docs (12793): .*

- medline.lpf

Topic Map (All terms (7944); Cholesterol docs)

Topic Size Actionarterial 123.73fluvastatin 29.18diabetic 29.01association 18.81triglycerides 17.61uptake 15.73data 11.50tgf 11.18chinese 11.08mediated 9.80

Search for:

Status: map creation completed

arterial fluvastatindiabetic

triglyceridestgf

chinese

association

mediatedvldl

uptake

data

methylglutaryl

nephropathy

detection

ambulatory

cisplatin

shock

antihypersentive

separation

glycosylphos-phatidylinositol

estimates

hemoglobine

implicated

edcfepa

lcfa

subcutaneous

compartments

cycle

establish

haemostatic

start

synthetic

released

radicalscavenge

describes

vldl 7.94methylglutaryl 7.54nephropathy 6.89describes 6.81detection 6.02ambulatory 5.35cisplatin 5.17shock 5.03antihypertensive 4.91separation 4.69glycosylphosphati… 4.31estimates 4.31epa 4.19lcfa 3.97hemoglobine 3.94

Topics (All docs)

arterial early hemodialysis artherogenisis arthrogenic

Select All | Clear All | Save Terms | Save Docs

Options Find

Page 12: LINDI Miner

LINDI Text Miner File Project Window Help

Summary

Query

Analysis

Term Sets

Document Sets

x x

a c u y m z

History

All docs (12793): .*

- medline.lpf

Topic Map (All terms (7944); Cholesterol docs)

Topic Size Actionarterial 123.73fluvastatin 29.18diabetic 29.01association 18.81triglycerides 17.61uptake 15.73data 11.50tgf 11.18chinese 11.08mediated 9.80

Search for:

Status: map creation completed

arterial fluvastatindiabetic

triglyceridestgf

chinese

association

mediatedvldl

uptake

data

methylglutaryl

nephropathy

detection

ambulatory

cisplatin

shock

antihypersentive

separation

glycosylphos-phatidylinositol

estimates

hemoglobine

implicated

edcfepa

lcfa

subcutaneous

compartments

cycle

establish

haemostatic

start

synthetic

released

radicalscavenge

describes

vldl 7.94methylglutaryl 7.54nephropathy 6.89describes 6.81detection 6.02ambulatory 5.35cisplatin 5.17shock 5.03antihypertensive 4.91separation 4.69glycosylphosphati… 4.31estimates 4.31epa 4.19lcfa 3.97hemoglobine 3.94

Topics (All docs)

Select All | Clear All | Save Terms | Save Docs

Options Find

View topic

Page 13: LINDI Miner

LINDI Text Miner File Project Window Help

Summary

Query

Analysis

Term Sets

Document Sets

x x

a c u y m z

History

All docs (12793): .*

- medline.lpf

Topic Map (All terms (7944); Cholesterol docs)

Topic Size Actionarterial 123.73fluvastatin 29.18diabetic 29.01association 18.81triglycerides 17.61uptake 15.73data 11.50tgf 11.18chinese 11.08mediated 9.80

Search for:

Status: map creation completed

arterial fluvastatindiabetic

triglyceridestgf

chinese

association

mediatedvldl

uptake

data

methylglutaryl

nephropathy

detection

ambulatory

cisplatin

shock

antihypersentive

separation

glycosylphos-phatidylinositol

estimates

hemoglobine

implicated

edcfepa

lcfa

subcutaneous

compartments

cycle

establish

haemostatic

start

synthetic

released

radicalscavenge

describes

vldl 7.94methylglutaryl 7.54nephropathy 6.89describes 6.81detection 6.02ambulatory 5.35cisplatin 5.17shock 5.03antihypertensive 4.91separation 4.69glycosylphosphati… 4.31estimates 4.31epa 4.19lcfa 3.97hemoglobine 3.94

Topics (All docs)

Select All | Clear All | Sav e Terms | Sav e Docs

Options Find

Topic Map (All terms (7944); Cholesterol docs)

Topic Size Actionarterial 123.73fluvastatin 29.18diabetic 29.01association 18.81triglycerides 17.61uptake 15.73data 11.50tgf 11.18chinese 11.08mediated 9.80

Search for:

Status: map creation completed

arterial fluvastatindiabetic

triglyceridestgf

chinese

association

mediatedvldl

uptake

data

methylglutaryl

nephropathy

detection

ambulatory

cisplatin

shock

antihypersentive

separation

glycosylphos-phatidylinositol

estimates

hemoglobine

implicated

edcfepa

lcfa

subcutaneous

compartments

cycle

establish

haemostatic

start

synthetic

released

radicalscavenge

describes

vldl 7.94methylglutaryl 7.54nephropathy 6.89describes 6.81detection 6.02ambulatory 5.35cisplatin 5.17shock 5.03antihypertensive 4.91separation 4.69glycosylphosphati… 4.31estimates 4.31epa 4.19lcfa 3.97hemoglobine 3.94

Topics (All docs)Topics (All docs)

Select All | Clear All | Sav e Terms | Sav e Docs

Options Find

Topic ExplorerTopic: arterial (65 terms, 557 docs)

arterialearlyhemodialysisatherogenesisatherogenicdevelopmentbackgroundarterytherapyformation

Term Action

Search for:Select All | Clear All | Save Terms

Find

View Terms | View Docs