From Aimless Searching to Personalized Knowledge Exploration / From OPAC to Personalized Discovery


Foster Zhang
Systems Integration
Digital Library Systems and Services
fjzhang@stanford.edu

The Problem

Today’s library catalog no longer meets the expectation of users accustomed to Internet search engines.
-- Eric Celeste, Associate University Librarian, University of Minnesota

Only 30% of students turn to library websites when searching for scholarly information sources.

Only 2% of students begin their searches at library sites. -- OCLC survey

Library Use Compared with Search Engine Use (LibQUAL’05)

CNI Spring 2005 Task force meeting

What Users Expect from Library Services

Start from one place to search all
Full-text, please!
Easy move from a citation to the item itself
Systems will provide lots of intelligent assistance
Authenticated single sign-on
Security and privacy
Do not have to come to the library to use the service

Is a Portal the Solution?

Web portals are sites on the World Wide Web that typically provide personalized capabilities to their visitors.

-- http://en.wikipedia.org/wiki/Web_portal

The Gap Between Librarians' Thinking and Users' Thinking

User’s problem: personal research interests vs. the world of knowledge

Library’s solution: classification and cataloging for library-owned materials

1. Subject: Topic
2. Subject: Genre
3. Format
4. Library
5. Subject: Region
6. Subject: Era
7. Language
8. Author
9. Availability
10. Library of Congress Classification

Usage patterns

Requests by Search Type

Search: 51%
Search -> Navigation: 29%
Navigation: 20%

Usage patterns, cont.

Navigation Requests by Dimension

[Bar chart: number of requests per dimension; axis "Requests", 0 to 150,000. Dimensions, in label order: Author, Language, Subject: Era, Subject: Region, Library, Format, Subject: Genre, Subject: Topic, LC Classification, Availability. Bar values, in extraction order: 70,516; 38,074; 38,605; 59,248; 87,221; 74,985; 65,545; 155,856; 169,249; 23,848.]

North Carolina State University's new search interface is built on the Endeca ProFind navigation software

Technical Overview

Endeca co-exists with SirsiDynix Unicorn ILS and Web2 online catalog.
Endeca indexes MARC records exported from Unicorn.
Index is refreshed nightly with records added/updated during previous day.
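The nightly refresh can be pictured as a date filter over the exported records. The following is a minimal sketch, not the actual Unicorn export or Endeca update mechanism; the record shape and the `modified` field are assumptions made for illustration.

```javascript
// Hypothetical record shape: { id: string, modified: "YYYY-MM-DD" }.
// Returns the records added or updated on the day before `today` (UTC).
function recordsForNightlyRefresh(records, today) {
  const previousDay = new Date(today.getTime());
  previousDay.setUTCDate(previousDay.getUTCDate() - 1);
  const stamp = previousDay.toISOString().slice(0, 10); // "YYYY-MM-DD"
  return records.filter((record) => record.modified === stamp);
}

const exported = [
  { id: "a1", modified: "2006-05-01" },
  { id: "a2", modified: "2006-05-02" },
];
const batch = recordsForNightlyRefresh(exported, new Date("2006-05-03T02:00:00Z"));
console.log(batch.map((record) => record.id));
```

Only the records stamped with the previous day are selected, so the index update stays small compared with a full re-export.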

Basic Architecture

[Diagram labels]
Offline Process: Raw Data Sources; Endeca Data Foundry; Endeca Studio; Data Foundry Configuration (XML); Navigation Engine Indices
Online System: Endeca Navigation Engine; Endeca Presentation Server; Client Browser (HTTP)

Step 1: Data Transformation

Set up Data Pipeline: MARC Records -> MARC -> MARCXML -> RecordXML
Transform: Data Foundry RecordXML Transform Rules (XSLT scripts)

Endeca Forge

Activity: Transform data source(s) into RecordXML
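The idea of this step, turning a structured MARC record into flat name/value properties, can be sketched as follows. The slides perform the transformation with XSLT scripts and do not show the RecordXML schema, so the input shape and the flat output below are assumptions standing in for MARCXML and RecordXML.

```javascript
// Assumed input shape (a stand-in for MARCXML): a leader string plus
// data fields of the form { tag, subfields: [[code, value], ...] }.
const marcRecord = {
  leader: "00000nam a2200000 a 4500",
  fields: [
    { tag: "245", subfields: [["a", "Swann's way"]] },
    { tag: "650", subfields: [["a", "Memory"], ["v", "Fiction"]] },
  ],
};

// Flatten every subfield into a { name, value } pair such as "650$a".
function toFlatRecord(record) {
  const pairs = [{ name: "leader", value: record.leader }];
  for (const field of record.fields) {
    for (const [code, value] of field.subfields) {
      pairs.push({ name: field.tag + "$" + code, value: value });
    }
  }
  return pairs;
}

const flat = toFlatRecord(marcRecord);
console.log(flat.map((pair) => pair.name + " = " + pair.value).join("\n"));
```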

Step 2: Data Pipeline Editing

Endeca Foundry

Filters & Scripts map MARC fields to facets (RH = record header, CP = character position):

[RH CP6 + 008 CP23] Physical Form
[008 CP22] Target Audience
[008 CP24-27 + CP28 + CP33] Content Form
[008 CP35-37] Language
[650,654,656,657 $a + 6## $x] Subject - General
[651 $a + 6## $z] Subject - Geographic
[6## $d + 6## $y] Subject – Time Period
[6## $v] Format

Source fields in the MARC record:

Record Header
…
008
…
650 $a
651 $a Geographic Subject
654 $a
…
656 $a
657 $a
6## $d, $y Time Period
6## $v Format
6## $x General, modifies $a
6## $z Geographic Subject

Activity: Create orthogonal facets from MARC fields; clean up data
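The field-to-facet mapping listed above can be sketched in a few lines. This is an illustration under assumptions, not the filters and scripts actually used in the Endeca pipeline: the record shape, the function names, and the sample record are hypothetical, and Content Form is omitted for brevity.

```javascript
// Assumed record shape: leader and 008 as strings, data fields as
// { tag, subfields: [[code, value], ...] }.
function subfieldValues(record, tagMatches, codes) {
  const values = [];
  for (const field of record.fields) {
    if (!tagMatches(field.tag)) continue;
    for (const [code, value] of field.subfields) {
      if (codes.includes(code)) values.push(value);
    }
  }
  return values;
}

const isAnySubject = (tag) => tag.startsWith("6"); // 6##
const isGeneralSubject = (tag) => ["650", "654", "656", "657"].includes(tag);

function extractFacets(record) {
  return {
    recordType: record.leader.charAt(6),        // RH CP6
    formOfItem: record.field008.charAt(23),     // 008 CP23
    targetAudience: record.field008.charAt(22), // 008 CP22
    language: record.field008.slice(35, 38),    // 008 CP35-37
    subjectGeneral: subfieldValues(record, isGeneralSubject, ["a"])
      .concat(subfieldValues(record, isAnySubject, ["x"])),
    subjectGeographic: subfieldValues(record, (tag) => tag === "651", ["a"])
      .concat(subfieldValues(record, isAnySubject, ["z"])),
    subjectTimePeriod: subfieldValues(record, isAnySubject, ["d", "y"]),
    format: subfieldValues(record, isAnySubject, ["v"]),
  };
}

// Hypothetical sample; the 008 string is built piecewise so that
// position 22 is "j" and positions 35-37 are "eng".
const sample = {
  leader: "00000nam a2200000 a 4500",
  field008: ["060501", "s", "1992", " ".repeat(4), "nyu", " ".repeat(4), "j",
    " ", " ".repeat(4), " ", "000", " ", "1", " ", "eng", " d"].join(""),
  fields: [
    { tag: "650", subfields: [["a", "Memory"], ["x", "Psychological aspects"],
      ["z", "France"], ["y", "20th century"], ["v", "Fiction"]] },
    { tag: "651", subfields: [["a", "Paris (France)"]] },
  ],
};
const facets = extractFacets(sample);
console.log(facets);
```

Keeping each facet in its own property is what makes the facets orthogonal: a geographic subdivision in `$z` ends up in the geographic facet no matter which 6## field carried it.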

Step 3: Create Dynamic Dimensions

Endeca Foundry

Publication Date

Language

Content Form

Target Audience

Physical Form

Subject – Other Name

Subject – Person Name

Subject – General

Subject – Time Period

Activity: Use GUI in Dev Studio to designate & configure Dynamic Dimensions. Underlying data automatically drives dimension category values.

Publication Date: from / to
Language: English, Spanish, French, German
Content Form: Not Fiction, Fiction, Novels, Speeches
Target Audience: General, Juvenile, Adolescent
Physical Form: Language Material, Projected Material, Cartographic Material, Manuscript
Subject – Other Name: IBM, American Express, Endeca, Barnes & Noble
Subject – Person Name: Jack Blount, Ric Rodriguez, Mark Calkins, Steve Nielsen, Rob Madsen
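"Underlying data automatically drives dimension category values" means that the list of values is never typed in by hand; it is whatever occurs in the records. A minimal sketch of that idea, with a hypothetical record shape and function name (this is not the Endeca API):

```javascript
// Collect the distinct values of one dimension, with a count per value.
function dimensionValues(records, dimension) {
  const counts = new Map();
  for (const record of records) {
    const value = record[dimension];
    if (value === undefined) continue; // record has no value for this dimension
    counts.set(value, (counts.get(value) || 0) + 1);
  }
  return counts;
}

// Hypothetical records, using value names from the slide.
const records = [
  { language: "English", targetAudience: "General" },
  { language: "Spanish", targetAudience: "Juvenile" },
  { language: "English", targetAudience: "General" },
];
const languages = dimensionValues(records, "language");
console.log(Array.from(languages.entries()));
```

Adding a record in a new language would add a new category value without any configuration change, which is the point of a dynamic dimension.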

Step 4: Hierarchy Transformation

Break out Facets: LOC Taxonomy
Transform: Data Foundry DimensionXML Transform Rules (scripts)

Class A – General Works
Class B – Philosophy
Class C – Auxiliary Sciences of History
Class D – History, General and Old World
Class E – History, America: US
Class F – History, America: All
Class G – Geography, Anthropology…
Class H – Social Sciences
Class J – Political Science
Class K – Law

Activity: Create and run a script to transform input hierarchies into DimensionXML. Alternatively, smaller hierarchies can be built with the GUI in the Developer Studio.

Step 5: Create Edited Dimensions

Endeca Foundry

Edited dimensions: LOC Classification, Subject – Geography

Activity: Use GUI in Dev Studio to designate & configure Edited Dimensions. Underlying data automatically matches to values from hierarchy; dead ends suppressed & stored for future matches.

LOC Classification
Class A – General Works
Class B – Philosophy
Class C – Auxiliary Sciences of History
Class D – History, General and Old World
Class E – History, America: US
Class F – History, America: All
Class G – Geography, Anthropology…
Class H – Social Sciences
Class J – Political Science
Class K – Law
Class L – Education
…

Subject – Geography
North America
South America
Europe
Asia
Africa

Book Records (RecordXML), for example: 050 H1.xxx.xxx
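Matching a record's call number (MARC 050) against the classification hierarchy can be sketched as a lookup on the leading class letter. This is a simplified illustration, not Endeca's matching logic: only top-level classes from the slide are included, and the function name and the `unmatched` list are hypothetical.

```javascript
// Top-level classes as listed on the slide (subset).
const lcClasses = {
  A: "General Works",
  B: "Philosophy",
  H: "Social Sciences",
  J: "Political Science",
  K: "Law",
  L: "Education",
};

// Call numbers with no matching class are not shown, but are kept
// here so they can be matched later if the hierarchy grows.
const unmatched = [];

function classifyCallNumber(callNumber) {
  const letter = callNumber.trim().charAt(0).toUpperCase();
  if (Object.prototype.hasOwnProperty.call(lcClasses, letter)) {
    return "Class " + letter + ": " + lcClasses[letter];
  }
  unmatched.push(callNumber);
  return null;
}

console.log(classifyCallNumber("H1.xxx.xxx"));
```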

Step 6: Load Indices and Create UI

Data Foundry
Navigation Engine Indices
MDEX Engine: DGraph (exe / binary)
Endeca API: ASP, JSP, .Net

Activity: Use GUI in Dev Studio to designate & configure all dimension options. Underlying data automatically matches to values from hierarchy; dead ends are not shown, but stored for future potential matches. Use API and standard Web coding to modify reference implementation to final UI.

Easy move from a citation to the item itself

Citation
OpenURL resolver
Full text sources
Discovery: how to find the relevant citations

Giving Users Control over Search and Research

Expanded possibilities for users
Guide users back to library’s resources

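Continuing the Search in Resources Outside the Library

Continuing the Search in Resources Outside the Library: Open WorldCat

Continuing the Search in Resources Outside the Library: Google Scholar

Continuing the Search in Resources Outside the Library: RLG Union Catalog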

Google Style Spelling Check

Look Inside (similar to Amazon)

Examples of Leading Users Back to the Library

Users control where they want to go and what they want to see
Script code is made available to share for free
Currently works only on Firefox
Personalized information services

Google Book Search Link to OPAC Item Status

Library item status displayed on Google

Link leads to library item status

Link from Amazon

Library item status displayed on Amazon

Link to library recall request page

Example user script (script code)

// ==UserScript==
// @name Google Books Socrates Link
// @namespace http://www.superpatron.com
// @description Search Stanford Library Catalog (Socrates) from Google Books listings.
// @include http://books.google.*
// ==/UserScript==

(function () {
  // Catalog search URL; the ISBN is appended to it.
  var libraryUrlPattern = 'http://library.stanford.edu/searchcat?isbn=';
  var libraryUrlPatternTrailer = '';
  // Catalog search screen.
  var librarySearchScreen = 'http://socrates.stanford.edu/';
  // …
})();
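The body of the script is elided on the slide. As a rough idea of what such a script has to do, the sketch below finds an ISBN in page text and appends it to the catalog URL pattern shown above. Only `libraryUrlPattern` comes from the slide; the function names and the ISBN-matching rule are assumptions, and a real user script would also need to read the page's DOM and insert the link.

```javascript
// URL pattern taken from the slide.
const libraryUrlPattern = "http://library.stanford.edu/searchcat?isbn=";

// Find the first ISBN following the label "ISBN" in a piece of text.
// Hyphens are removed; only 10- and 13-character ISBNs are accepted.
function findIsbn(text) {
  const match = /ISBN[:\s]*([0-9][0-9-]{8,15}[0-9Xx])/.exec(text);
  if (!match) return null;
  const isbn = match[1].replace(/-/g, "");
  return /^([0-9]{9}[0-9Xx]|[0-9]{13})$/.test(isbn) ? isbn : null;
}

// Build the catalog search link, or null when no ISBN is found.
function buildLibraryLink(text) {
  const isbn = findIsbn(text);
  return isbn ? libraryUrlPattern + isbn : null;
}

console.log(buildLibraryLink("ISBN: 0-07-016781-8"));
```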

Fixed-Format Searchable URLs

System provided
Customized for easy use

Example: URL of an ISBN Search with Search Qualifiers

/uhtbin/cgisirsi/x/LIBPOLNAME/0/5?searchdata1=0070167818{020}&srchfield1=GENERAL^SUBJECT^GENERAL^^words+or+phrase&searchoper1=AND&thesaurus1=GENERAL&search_entries1=GENERAL&search_type1=SUBJECT&special_proc1=&library=ALL&pubyear=&language=ANY&format=ANY&item_type=ANY&location=ANY&item_1cat=ANY&item_2cat=ANY&match_on=KEYWORD&shadow=YES&sort_by=NONE&user_id=GUEST&password=xxxx

Create Custom URL for Easy Reference

http://library.stanford.edu/searchcat?author=proust+marcel

The complete list of parameters is:

ckey=
isbn=
issn=
author=
xauthor=
bauthor=
title=
xtitle=
btitle=
everything=
subject=
xsubject=
bsubject=
periodical=
xperiodical=
bperiodical=
series=
xseries=
bseries=
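A custom URL of this kind is easy to generate from a parameter name and a search term. The sketch below uses the base URL and parameter names listed above; the function name and the encoding rule (words joined with `+`, as in the `proust+marcel` example) are assumptions.

```javascript
const SEARCHCAT_BASE = "http://library.stanford.edu/searchcat";

// Parameter names as listed on the slide.
const SEARCHCAT_PARAMS = [
  "ckey", "isbn", "issn",
  "author", "xauthor", "bauthor",
  "title", "xtitle", "btitle",
  "everything",
  "subject", "xsubject", "bsubject",
  "periodical", "xperiodical", "bperiodical",
  "series", "xseries", "bseries",
];

// Build a search URL; words in the term are joined with "+".
function buildSearchcatUrl(param, term) {
  if (!SEARCHCAT_PARAMS.includes(param)) {
    throw new Error("Unknown search parameter: " + param);
  }
  const encoded = term.trim().split(/\s+/).map(encodeURIComponent).join("+");
  return SEARCHCAT_BASE + "?" + param + "=" + encoded;
}

console.log(buildSearchcatUrl("author", "proust marcel"));
```

Rejecting unknown parameter names early keeps a typo from silently producing a URL that the catalog would not understand.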

OpenURL Linking Diagram

[Diagram labels: Link Source; Resource; Hook; Provision of Hooks (OpenURL); Appropriate Server Reference; Link Server; Service Component; Context-Sensitive Resolution into Links; Link; Link Destination (four shown)]

OpenURL Example and Structure

An OpenURL contains three parts:
Address of the OpenURL resolver: http://caslon.stanford.edu:3210/sfxlcl3
Database source (for example, America: History and Life from ABC-CLIO)
Citation metadata
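The three parts can be assembled as follows. This is a sketch assuming OpenURL 0.1-style keys (`sid`, `genre`, `issn`, `volume`, `spage`); the resolver address is the one on the slide, while the source identifier and the citation values are placeholders, not a real citation.

```javascript
// Part 1: address of the OpenURL resolver (from the slide).
const resolverBase = "http://caslon.stanford.edu:3210/sfxlcl3";

// Combine resolver address, source identifier, and citation metadata.
function buildOpenUrl(base, sourceId, citation) {
  const params = new URLSearchParams();
  params.set("sid", sourceId); // Part 2: the database the link comes from
  for (const [key, value] of Object.entries(citation)) {
    params.set(key, value);    // Part 3: citation metadata
  }
  return base + "?" + params.toString();
}

// Placeholder values for illustration only.
const openUrl = buildOpenUrl(resolverBase, "example-source", {
  genre: "article",
  issn: "1234-5678",
  volume: "12",
  spage: "34",
});
console.log(openUrl);
```

Because the resolver address is kept separate from the citation, the same metadata can be sent to a different institution's link server by changing only the base.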

Example of OpenURL Linking: Google Scholar

Why Work with Google?

Better user search experience
Rich content, including citations and full text of English and non-English materials
Can be easily integrated into your library environment
It is the user’s choice, not ours

Google Scholar Registration Form

Enable SFX Google Scholar registration

Access to full text

Stanford library does not have access to the article

Campus Registry (students, faculty, staff)
Book vendors (invoices and catalog records)
RLIN, OCLC, Library of Congress (catalog records, vernacular data)
Delphi financial system (pay invoices & get reports)
SAL3 inventory system (LAS)
Unicorn (Socrates)
ERM from Innovative (e-resource management)
Grokker from Groxis
SFX link server (OpenURLs)
TDNet (e-journal presentation)
SUL/AIR web space
Stanford Digital Repository
Authentication & Authorization
CourseWork/Sakai
Locally digitized and Google digitized content

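Summary

Do not force users to use a single method to find library materials
Support users in choosing the search methods they prefer
Compute all the possible next search steps for users and suggest the most relevant ones (Guided Navigation)
Do not limit the scope of library services to the library's own collections; actively develop tools and resources for teaching and research
Make it a long-term mission of the library to analyze and link related information according to users' needs and to provide technical support services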

Thank you all!
