An Introduction to VIVO

Preview:

DESCRIPTION

 

Citation preview

Introduction to VIVOPaul Albert & Ryan CobineCode4LibFebruary 7, 2011

VIVO aims to address certain intractable problems of academia

- Finding collaborators - Automatically generating list of all a person’s

publications- Inferring a researcher’s expertise- Outputting a researcher’s work in a standard format

such as a CV or a biosketch.- Easily moving institutional data from one system to

another

Preview of version 1.2 http://is.gd/562IU5

Index lists most classes

Browse by class

Search usingfacets

Individual profile

Faculty Affairs

Scopus

User Input

Grants DB

Course management

Faculty Affairs

Co-author visualizations

Is VIVO really Facebook for researchers???

FarmVille application for VIVO currently in development!Version 1.0 drops June 2011

Opensource

Not Facebook Reason #1

Not Facebook Reason #2

Opendata

Not Facebook Reason #3

VIVO’s data is from authoritarian sources.

LOL he means authoritative #hatemyjob

Embraces semantic approach

Not Facebook Reason #4

2003 – VIVO created for local use at Cornell University (Ithaca) by life sciences

2009 – The US National Center for Research Resources (NIH) awards the VIVO Collaboration a two-year $12.2 million grant to improve VIVO

2010 Apr. – Version 1.0 released

2010 Aug. – Version 1.12010 Oct. – First VIVO conference (NYC); version 1.1.1

2011 Feb. – Version 1.2 and Harvester version 1.0

2011 Aug. – Second VIVO conference (D.C.)

A Brief History of VIVO

!"#$%&&' ($)*%#+),-!" #$%&" '(%)*" +,-(&$.." /012" 3%&-.-4$56%2" 768" 4.%9$2" :6;9"

,%<<%=-&%2"4(6%&",%(>?-2"7-&",-(?-&@A69$(*2"B..C",(%8

$(2"3$=D%"#$5%($2"B.6E%F$*D"

G6&$?2" G>=%" 'D%&2" 4(6%&" H-I$2" 7-?$<D" 3;B&$(&$C2" G-..C" 36?*.$F%>$(2" J*$..%"

36*;D$..2" K&><J%I%&*2" ,D(6?*-<D$(" L$?*.6&M2" A$F$;;%" N->&$?O

" ($)*%#+),-' ".'

/&"#)01!"369$",-&.-&"+P0PQ"%&="RS"/012",D(6?"4%(&$?2",$;6.6%"4-*$

(-2"'$((C"4(6T2"B(6&"

4(--9?2" K8C" 4>D.$(2" B..6$" 4>?DD->?$&2" ,D(6?" ,%?$2" ,D(6?U&$" ,-

M%(2" P%.(6$" #%56?2"

3%(C" B=I%(=?2" :6*%" S$(($$2" V$-(M$" G%;92" ,D(6?" G%6&

$?2" A%$" 7$?%&-2" 3%(M$%>W"

7-D&?-&2" J%(%" '($6&$?*2" 3$M%&" H%*-(($2" N%&M" H62" /%>

.%" 3%(9$?2" G%&&%D" :-(*-&2"

:%(%C%&A%>82" K.$W%&=$(" A-;9I$..2" J%(%" A>??$.." V-&E%.$E2" :%&;C

" J;D%$X$(2" #%.$"

J;D$<<.$(2" :6;D-.%?" J9%MM?2" 3%TD$I" Y$==$(2" 36;D$.$" AO" Y$&&%&*2" K.6;6%" Y

>(&$(2"

J*$<D$&"L6..6%8?O"2$0)1$1'($)*%#+),-!"'%*C"4-(&$("+0R"/012"'%56*D%,%&=(%?D$9%(2"46&"

,D$&2"JD%&?D%&",D$&2"7$&6",-)$C2"J>($?D"#$65%?6M%8

%&62"N6&M"#6&M2"A>??$.."#>D-&2"

7-&" #>&&2" /--(&68%" V-<6&%*D2" 7>.6$" G%(=$?*C2" 4(6%&" '$$?$2" :%8(%*%H$.$2" 36;%D"

H6&&$8$6$(2" :6%&.6" 3%2" A-F$(*" GO" 3;#-&%.=2" K?69/(%=D%&V-&M%Z>2" 3%(9" /(6;$2"

N>C6&"J>&2",D6&*%&"Y%&92"K.%&"L%.?D2"4(6%&"LD$$.$(2"S$&M"L>2"K&M$.%"[-??O"""3"$4%'

546""&' ".' 7%0)4)$%!" A6;D%(=" :-$." +/-&;$" /012" A6;%(=-" B?<%=%2" #%8%(6?" Y-(($?2"

36;D%$."P$M%":$M(\&O" "86%'54#)99+':%+%1#46' 2$+;,<,%!"V$(%.=" 7-C;$" +J;(6<<?"/012"

V($M"#>&.%<2" ,%*D$(6&$" #>&&2" 4(%&*" '$..$C2" /%>.%" '

6&M2" " K&M$.%"3>(($..2" 4%(F%(%"

:-F.$2" ,%(C" YD-8%?2" 36;D%$.$$&Y(68%(;D6O" " =1+6)$>,"$' ($)*%#+),-?' 5,@' A"<)+!"

A%9$?D:%M%(%Z%&" +LRJYH" /012" '(6?U" HO" G-.8$?2" ,%$(6$G->;D6&?2" V$-(M$" 7-?$<D2"

J>&6*%"4O"'->.2"H$?.6$"#O"3;0&*-?DO""=%)&&'!"#$%&&'7%0)41&'!"&&%>%!",>(U?",-.$"+L$6.."

/012"/%>."K.F$(*2"P6;*-("4(-=?9C2"3%(9"4(-&&68%&&2"K=%8",D$(6)2"Q?;%(",(>E2"#%&"

#6;96&?-&2" A6;D%(="G>2",D(6?"G>%&M2" 0*%C'.%E2"'$&&$*

D"H$$2"/$*$("36;D$.6&62"V(%;$"

36M.6-(6?62"7-D&"A>]&M2"7%?-&"J<$;.%&=2"Y(>"Y(%&2"P6&%CP%(>MD$?$2"P6(M6."L

-&MO

VIVO Collaboration

Publishers and aggregators – Elsevier, Thomson Reuters, ORCID, Collexis, Information Today, CiteSeer, ArxivOntology – Eagle-I, BIBO, FOAF, UCSFFederal agencies – OTSP, NIH, NSF, VA, USDASearch providers – Google, Bing, YahooProfessional societies – AAASSemantic web community – DERI, Tim Berners-Lee, MyExperiment, ConceptWeb, Linked DataSchools and consortia – SURA, CTSA, CIC, CBC, HubZero, FLR, dozens of individual schoolsExisting application and service providers – over 100

Collaboration & coordination

Human resources

Individuals ortheir proxies

Data aggregators and repositories

Local systemsof record

Academic affairs

Grants databases

Clinical databases

Events calendar

Credentialing DB

Course database

Sources of data

> > >

> RDF harvestSPARQL endpoint

Local data flow in VIVO

VIVO(RDF)

data ingest ontologies

(RDF)

shared as RDF

interactiveinput

local systems of

record

national sources

SubjectWalter Mondale

Data in VIVO is stored using Resource Description Framework (RDF)

Predicatehead of

ObjectTrilateral Commission

Andrew McDonald

author of

has author

research arearesearch area for

academic staff in

academic staff

Susan Riha

Mining the record: Historical evidence for…

author of has author

teaches research area for

research area

headed by

crop management

CSS 4830head of

faculty appointment in

faculty members

taught by

featured in

features person

Semantic representation of data

NYS WRI

Cornell’s supercomputers crunch weather data to help farmers manage chemicals

Earth and Atmospheric Sciences

VIVOs can connect with one another

VIVO enables authoritative data about researchers to

join the Linked Data cloud.

DT<!^^(6;D%(=O;CM%&6%9O=$^_``a^b`^.-=^.-=@=%*%?$*?c_`b`@`d@__c;-.-($=O<&M

An accessible introduction to the semantic web

Alex RockwellUniversity of Florida

AbstractVIVO provides complete information on organizational structures of institutions.Each organization object in VIVO has parent and child organizations. Startingat any particular organization, it is easy to use a simple recursion algorithm totraverse the organizations that report up to the starting point. If the startingpoint is the institution “root”, the algorithm will produce an organizational chartfor the entire organization. Using Ruby and some open source extensions, wehave developed simple software to draw pictures of organizations. We willpresent code, algorithm, commentary and sample output. All code is availableas open source at http://github.com/arockwell/vivo_org_chart/

Drawing Graphs with Labels

• Drawing graphs with 500+ nodes and corresponding labels is extremelydifficult.

• We made two major attempts to prune the graph:• Removing all non-college organizations that are direct children of UF left 300nodes remaining. The graph in the center of the poster shows these nodes.• Removing all non-college and non-department nodes from the graph left ~150nodes. These nodes are the basis for an interactive version of the graph thatincludes labels.

• We created over 100 graphs during the making of this poster.• Tweaking the settings on graph drawing programs (Graphviz and NetworkWorkBench) consumed more time than any other part of this project.

Graph of all Colleges, Departments, Centers & Institutes at UFOverview

Further Research

Copyright Information Here

Purpose• One of the goals of VIVO is to show which organizations, faculty, staff, andstudents belong to.• UF’s academic structure is highly complex and does not correspond to itsfinancial structure.• UF does not have a facility to create organizational charts. Mostorganizational charts are created by hand.

Practical Uses

Visualizing the Organization StructureThe structure is generally regular and has 4 levels:

• University of Florida (the root of the graph)• Colleges• Departments (along with some Centers and Institutes)• Centers and Institutes

Some organizations do not fit into this pattern. For example, organizations atthe college level with no sub-organizations stick out on the graph.

Finding Data Integrity ProblemsLooking at graphs generated by the program has uncovered many problemsin our data, including missing, misplaced, and duplicated records. Withoutgraphs, we might not have been able to find these inconsistencies.

We can understood organizational structure much faster by looking at thegraph rather than manually following the links from one organization toanother.

Finding all UF Organizations

We added over 100 external organizations to VIVO during CV entry of theshowcase departments. As a result, it is no longer possible to consider allentries in our database to be UF organizations. Since SPARQL cannot dorecursive queries, there was also no way to find automatically all sub-organizations at UF.

We added a rootOrganization data property to the local ontology. Thisproperty allowed us to directly mark sub-organizations as being part of UF.Solving this problem alone likely justified the time spent writing the program.

Challenges

Program Design

Extending to People

We plan to include people in graphs for a college or department, which will be particularly challenging.

• UF’s VIVO will include close to 30,000 people by the end of the grant. • We lack reliable data linking people to departments.• We need to import data to show the heads of departments.

Drawing Organizational Charts with VIVO

University of Florida

Medicine

Liberal Arts and Sciences

Engineering

Agricultural and Life Sciences

Fine Arts

Dentistry

Nursing

Education

Health and Human Performance

Veterinary Medicine

Design, Construction and Planning

Law

Pharmacy

Business Journalism

Draw organizational charts http://vivoweb.org/files/orgLast.pdf

Repurpose content into Drupal http://bit.ly/gmm8Ng

!"!#$%&$'()(*+$$%,-%.&$%/001234567$85+94$2:7$;5:<969:=9$;9:+96&$>2*?@:)+5:$A;

Get involved

- Adopt VIVO- Provide data- Develop an application- Ask questions – vivoweb.org/contact- Chat – irc.freenode.net #vivo