: Your 24hr Library
California Digital Library 25 Apr 2012
TECHNOLOGY ADVANCES
http://www.biodiversitylibrary.org/
@chrisfreeland
Points we’ll cover
• Development Methodology
• Usage Stats
• APIs
• Data Mining
• External Connections
• iTunes U
• DOIs & Mendeley
• Data Challenges & Gaming
Methodology: Evolutionary Prototyping
• Find a problem, code a solution, test & evaluate with staff & users, push it out to production
• Example: Fuzzy Searches for bibliographic metadata
• Proceedings Philadelphia
• Darwin 1859
• Proc Soc Zool Lond
Moved into production yesterday
http://blog.biodiversitylibrary.org/2012/04/new-improved-bhl-search-interface.html
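The example queries above suggest a search that tolerates abbreviated or partial bibliographic terms. A minimal sketch of that idea follows. It is not BHL's search implementation: the small catalogue, the `fuzzy_search` helper, and the token-prefix matching rule are all assumptions made for illustration.

```python
import re

# Hypothetical mini-catalogue; real BHL titles and metadata will differ.
CATALOGUE = [
    "Proceedings of the Zoological Society of London",
    "Proceedings of the Academy of Natural Sciences of Philadelphia",
    "On the origin of species by means of natural selection (Darwin, 1859)",
]


def tokens(text):
    """Lowercase a string and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def fuzzy_search(query, titles):
    """Return titles in which every query token is a prefix of some title token.

    Word order is ignored, so "Proc Soc Zool Lond" can match
    "Proceedings of the Zoological Society of London".
    """
    query_tokens = tokens(query)
    hits = []
    for title in titles:
        title_tokens = tokens(title)
        if all(any(word.startswith(part) for word in title_tokens)
               for part in query_tokens):
            hits.append(title)
    return hits


if __name__ == "__main__":
    for q in ("Proceedings Philadelphia", "Darwin 1859", "Proc Soc Zool Lond"):
        print(q, "->", fuzzy_search(q, CATALOGUE))
```

Prefix matching is only one way to handle journal abbreviations; an edit-distance or n-gram approach would additionally tolerate misspellings.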
YTD Usage stats
Darwin’s Library Launch
BHL APIs http://biodivlib.wikispaces.com/Developer+Tools+and+API
http://biodivlib.wikispaces.com/Data+Exports
Can address & download all of BHL through our APIs
Tropicos TitleID: 648 = BHL TitleID: 64
http://www.biodiversitylibrary.org/openurl?pid=title:64&volume=10&issue=&spage=151&date=1827
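The link above can be rebuilt programmatically once an external identifier has been mapped to a BHL TitleID. The sketch below reproduces the URL from the slide using only the query parameters visible in it (`pid`, `volume`, `issue`, `spage`, `date`). The `TROPICOS_TO_BHL` lookup table and the `bhl_openurl` helper are hypothetical names introduced for illustration, and the table holds only the one mapping shown on the slide.

```python
from urllib.parse import urlencode

BASE = "http://www.biodiversitylibrary.org/openurl"

# Hypothetical lookup table; only the single mapping from the slide is known.
TROPICOS_TO_BHL = {648: 64}


def bhl_openurl(title_id, volume="", issue="", spage="", date=""):
    """Build a BHL OpenURL link from a BHL TitleID and citation details."""
    params = [
        ("pid", f"title:{title_id}"),
        ("volume", volume),
        ("issue", issue),
        ("spage", spage),
        ("date", date),
    ]
    # safe=":" keeps the colon in "title:64" unescaped, as in the slide's URL.
    return BASE + "?" + urlencode(params, safe=":")


if __name__ == "__main__":
    bhl_id = TROPICOS_TO_BHL[648]
    print(bhl_openurl(bhl_id, volume=10, spage=151, date=1827))
```

Empty parameters such as `issue` are kept in the query string so the output matches the form of the link shown on the slide.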
EOL
• 12,800,000 BHL pages linked to EOL
• 65,245,000 occurrences of EOL names in BHL
• Total of 95,900,000 names found < Big Data
Pagination & Flickr Integration
public Flickr view
Direct line to EOL
iTunes U
• Since Launch on Feb 2:
• 50,120 browses
• 1,646 subscriptions
• 6,007 downloads
DOIs
Have assigned 49,000 DOIs to BHL monographs
Purpose of exercise
• Make legacy scientific literature citable via modern systems
• Make 250+ years of scholarly communications available via new tools
• Assign contemporary identifiers to legacy literature
• DOI
• ISBN
• ISSN
DOI, ISBN, ISSN, ei, ei,…uh oh…
• Ran into trouble with each agency
• BHL isn’t a publisher
• BHL is a consortium, not a separate legal entity
• BHL doesn’t own all the content it serves
• “We know we need a policy on that…”
…and on…and on…for more than 3 years…
CrossRef DOIs: path of least resistance
Challenge: Title Matching for Uniqueness
• Only want 1 DOI per intellectual, citable unit
• Differences between cataloging & publishing
• Libraries:
<datafield tag="245" ind1="1" ind2="4">
  <subfield code="a">The amoebae living in man;</subfield>
  <subfield code="b">a zoological monograph,</subfield>
  <subfield code="c">by Clifford Dobell.</subfield>
</datafield>
• Publishers:
<title>The amoebae living in man; a zoological monograph</title>
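To compare a catalogue record with a publisher record, the MARC 245 subfields have to be flattened into a single title string first. The sketch below shows one way this might be done for the example above. It is an illustration, not BHL's matching code: the helper names `title_from_marc` and `match_key` are hypothetical, and the parsing assumes a datafield without XML namespaces, as on the slide (real MARCXML usually carries a namespace).

```python
import re
import xml.etree.ElementTree as ET

# The library (MARC 245) and publisher forms of the title, as on the slide.
MARC_245 = """<datafield tag="245" ind1="1" ind2="4">
  <subfield code="a">The amoebae living in man;</subfield>
  <subfield code="b">a zoological monograph,</subfield>
  <subfield code="c">by Clifford Dobell.</subfield>
</datafield>"""

PUBLISHER_TITLE = "The amoebae living in man; a zoological monograph"


def title_from_marc(xml_text):
    """Join subfields a and b, ignoring c (the statement of responsibility)."""
    field = ET.fromstring(xml_text)
    parts = [sf.text.strip() for sf in field.findall("subfield")
             if sf.get("code") in ("a", "b") and sf.text]
    # Strip trailing cataloguing punctuation such as "," "/" ":" or ";".
    return re.sub(r"[\s,/:;.]+$", "", " ".join(parts))


def match_key(title):
    """Reduce a title to lowercase words so punctuation differences vanish."""
    return " ".join(re.findall(r"[a-z0-9]+", title.lower()))


if __name__ == "__main__":
    library_title = title_from_marc(MARC_245)
    print(library_title)
    print(match_key(library_title) == match_key(PUBLISHER_TITLE))
```

Comparing on a normalized key rather than the raw string means that differences in punctuation or capitalization between cataloguing and publishing conventions do not produce a second DOI for the same work.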
Challenge: Monographs/Series/Monographic Series
• “Report on the Rhynchota collected by the Wollaston Expedition in Dutch New Guinea”
• Published in 1914
• Bound & catalogued as a monograph
• Scanned as a monograph
• Assigned a DOI as a monograph in BHL: http://dx.doi.org/10.5962/bhl.title.13791
• Transactions of the Zoological Society of London, vol. 20, pt. 11.
• Presented by current publisher of that title as an article
• Assigned a DOI as a journal article: http://dx.doi.org/10.1111/j.1469-7998.1912.tb07839x
Guess who provides free access?
Challenge: Ownership of Backfiles
• Who owns public domain works?
• Some publishers consider that they own the backfiles of journals they currently publish
• Long-running series in natural history
• Curtis’s Botanical Magazine, since 1787
• Those publishers assign DOIs to their current volumes
• BHL assigns DOIs to the public domain works we’ve digitized
Guess who is upset?
BHL DOIs in Use
Gaming
More than Angry Books
https://biodivlib.wikispaces.com/BHL+and+Gaming
Email: [email protected] Skype: cfreeland27
Twitter: @chrisfreeland
http://www.biodiversitylibrary.org/