This discussion is now closed.
show all (21)
There are no comments. Click the text to your left to make a new comment.

need to add definitions for “cultural organization” stating that “they could serve as memory institutions”

reply

“since it uses HTTP, the Web’s standard retrieval protocol.” – Just noting a typo, in the lack of the possessive apostrophe. Please excuse me if this is out of scope.

reply

This is a thorough, well-researched, report on a topic that is very important to the future of libraries. I would like to see the report make the strongest possible case for moving forward on its various recommendations for library linked data, while at the same time not minimizing the obstacles that will need to be overcome to make such initiatives successful. To that end, I will contribute suggestions throughout the report that I believe will help to make it as persuasive as possible for supporting library linked data.
Jennifer Bowen
University of Rochester
eXtensible Catalog Organization

reply

This seems to be lacking something here – add an explanation about how libraries have valuable skills and experience in creating and maintaining authority data and how those activities are important for linked data.

reply

This whole section needs to make an even stronger case for why sharing library data as linked data is vitally important for the future of libraries, as well as how library data can contribute valuable data to the semantic web. I would suggest expanding this section and making it as persuasive as possible.

reply

Not exactly sure who the audience for this report is likely to be. If it is general library administrators, they are not likely to understand the term “global graph” unless it is defined here.

reply

1st part of last sentence is very awkward.

reply

Last line here is great! Possible to highlight this in a text box or something?

reply

It would be more accurate to characterize these as potential benefits, as they are still to be demonstrated on a meaningful scale.

The analogy of “stone soup” is inappropriate. It implies that libraries have nothing of value to contribute. Whereas library metadata is valuable information that is rich in relationships. Expressing this data as linked data is not trivial as it was not designed for this purpose.

reply

Exposure of library authority data could be of much wider benefit than to libraries and their users

reply

“Links can be used to expand indexes much easier than required for todays federated searching, and can offer users a nearly unlimited number of pathways for browsing”

Not sure what this sentence means.

reply

The top down model has been driven by economics: we can’t afford to provide the granularity that would serve the customers best. The potential to invert this is one of the exciting possibilities offered by Linked Data.

reply

BL’s experience with opening BNB data is that any restrictions on use will prevent re-use or aggregation of linked data.

reply

The distinction between “information produced or curated by libraries that describes resources or aids their discovery” and “data used primarily for library-management purposes” isn’t clear.

It’s clear that e.g. holdings information and user data are omitted. But what about circulation data for a resource, the number and frequency of lendings and co-occurence with lendings of other books? This can aid the discovery of resources through suggestions like “People who borrowed resource A also borrowed resource B, C and D” and thus might be covered by the report (but I think it isn’t).

reply

The distinction between “information produced or curated by libraries that describes resources or aids their discovery” and “data used primarily for library-management purposes” isn’t clear.

It’s clear that e.g. holdings information and user data are omitted. But what about circulation data for a resource, the number and frequency of lendings and co-occurence with lendings of other books? This can aid the discovery of resources through suggestions like “People who borrowed resource A also borrowed resource B, C and D” and thus might be covered by the report (but I think it isn’t).

reply

“Citation management can be made as simple as cutting and pasting URLs.”

I’d use the broader term “URIs” here.

reply

The term open is used here for the first time in the report. Its meaning should be made clear, e.g. by means of linking to a section where it is defined.

reply

Again, it is talked about openness but it hasn’t been made clear, what it actually means. In general, the paragraph seems a bit out of context to me.

reply

“The openness of data is more an opportunity than a threat. One benefit may be a clarification of the licensing of descriptive metadata towards openness, thus facilitating the reusing and sharing of data and improving institutional visibility.”

Open licenses aren’t a benefit but an aspect of open data.

reply

This section makes some fairly high level assumptions about user understanding of HTTP.

Such interactions with the data would depend largely upon the nature of the applications built around it.

reply
1 0

@@ To read the most-up-to-date of version of this section, in the context of the entire report, please see our wiki page

2 0

Library Linked Data: Scope of this report

3 0

The scope of this report library linked data can be understood as follows:

4 1

Library. The word library (analogously to archive and museum) refers to three distinct but related concepts: a collection, a place where the collection is located, and an agent which curates the collection and administers the location. Collections may be public or private, large or small, and are not limited to any particular types of resources.

5 1

Library data. Library data refers to any type of digital information produced or curated by libraries that describes resources or aids their discovery. Data used primarily for library-management purposes is generally out of scope. As discussed in more detail below, this report pragmatically distinguishes three types of library data based on their typical use: datasets, element sets, and value vocabularies.

6 0

Linked Data. Linked Data (LD) refers to data published in accordance with principles designed to facilitate linkages among datasets, element sets, and value vocabularies. Linked Data uses Web addresses (URIs) as globally unique identifiers for dataset items, elements, and value concepts, analogously to the library worlds identifiers for authority control. Linked Data defines relationships between things; these relationships can be used for navigating between, or integrating, complementary sources of information.

7 0

Library Linked Data. Library Linked Data (LLD) is any type of library data that is either natively maintained, or merely exposed, in the form of RDF triples, thus facilitating linking.

8 3

Benefits of the Linked Data approach

9 3

The Linked Data approach offers significant advantages over current practices for creating and delivering library data while providing a natural extension to the collaborative sharing models historically employed by libraries, archives, and museums (memory institutions). Linked data is sharable, extensible, and easily re-usable. It supports internationalization of data and user services. These characteristics are inherent in the linked data standards and are supported by the use of web-friendly identifiers for data and concepts. Resources can be described in collaboration with other libraries, and linked to data contributed by other communities or even individuals. Like the linking that takes place today between web documents, linked data allows anyone to contribute their unique expertise so that it can be reused and recombined with the expertise of others. The use of identifiers ensures that the diverse descriptions are all talking about the same thing. Through rich linkages with complementary data from trusted sources, libraries can increase the value of their own data beyond the sum of its sources taken individually, as in the story of the stone soup, where the hungry travellers boiling a pot of stones attracted from the locals enough curiosity, and small contributions of herbs and carrots, to create a nourishing meal.

10 0

By using globally unique identifiers to designate works, places, people, events, subjects, and other objects or concepts of interest, memory institutions can make trusted metadata descriptions available for common use, allowing resources to be cited across a broad range of data sources. An important aspect of the identifier system is its use of the Domain Name System of the Web. This assures stability and trust in a regulated and well-understood ownership and maintenance context. This is fully compatible with the long-term mandate of memory institutions. Libraries, and memory institutions generally, are thus in a unique position to provide the metadata for resources of long-term cultural importance as data on the Web.

11 1

Library authority data for names and subjects will help reduce redundancy of bibliographic descriptions on the Web by clearly identifying key entities that are shared across linked data. This will also aid in the reduction of redundancy of metadata representing library holdings.

12 0

Benefits to Researchers, Students and Patrons

13 1

Users of library and cultural institution services may not be immediately aware that linked data is being employed. Although the changes that occur will be under the hood, the underlying structured data will become more richly linked and the user experience will provide greater discovery and use capabilities. The resulting data webs will result in more sophisticated discovery and navigation across library and non-library information resources. Links can be used to expand indexes much easier than required for todays federated searching, and can offer users a nearly unlimited number of pathways for browsing.

14 3

Library users should be comfortable with the basic concepts of linked data since it uses HTTP, the Webs standard retrieval protocol. Applications may allow users to follow their nose (i.e., resolve trails of URI links) to the data itself. Once retrieved, the recombinational nature of RDF will allow information seekers to extract the parts of the data they need and understand, re-mix as required, or even to add their own annotations as contributions to the global graph. These capabilities meet expectations for an interactive user experience, such as is found in social networking applications.

15 1

Relationships to and from non-library services such as Wikipedia, Geonames, and Musicbrainz will help connect local collections into the larger universe of information available on the Web. The rise of semantics in HTML, which plays a role in the crawling and relevancy algorithms of Google, Google Scholar, and Facebook, will provide a way for libraries to enhance their visibility through search engine optimization (SEO), allowing resources to be discovered from Websites they use routinely. Citation management can be made as simple as cutting and pasting URLs. Embedding structured data in Web pages using, for example, RDFa markup in HTML pages, will also facilitate re-use of library data in services to information seekers. Automating the retrieval of citations from linked data or creating links from Web resources to library resources will mean that library data is fully integrated into research documents and bibliographies.

16 0

Benefits to Organizations

17 1

By promoting a bottom-up approach to publishing data, Linked Data creates an opportunity for cultural organizations (including libraries) to improve the value proposition of describing their assets.

18 1

The technology itself can help organizations improve their internal data curation processes and maintain better links between, for instance, digitized objects and their descriptions, and improve data publishing process within the organization, even in a context where all the data isn’t necessarily open. Cultural organizations will be able to make use of mainstream technologies to manage their data. (Todays library technology is specific to library data formats, leading to the existence of a special Integrated Library Systems industry specific to libraries). Library system vendors will benefit from the adoption of mainstream technology as it will give them an opportunity to broaden their user base.

19 0

Linked Data may be a first step toward an in the cloud approach to managing cultural information one which will be more cost-effective than individual systems in institutions. This approach will make it possible for small institutions or individual projects to be visible and connected, with reduced infrastructure costs.

20 3

Moreover, in an open data context, these institutions will gain greater visibility on the Web, which is where most information seekers may be found. The focus on identifiers allows descriptions to be tailored to specific communities such as museums, archives, galleries, and audiovisual archives. The openness of data is more an opportunity than a threat. One benefit may be a clarification of the licensing of descriptive metadata towards openness, thus facilitating the reusing and sharing of data and improving institutional visibility. Data thus exposed will be put to unexpected uses, as in the adage: “The best thing to do to your data will be thought of by somebody else.”

21 0

Benefits to Librarians, Archivists and Curators

22 0

The benefits to Patrons and Organizations will also have a direct impact on library and memory-institution professionals. By using Linked Data, memory institutions will create an open, global pool of shared data that can be used and re-used to describe resources, with a limited amount of redundant effort compared with current cataloguing processes.

23 0

The use of the Web and Web-based identifiers will make resources immediately available, up-to-date, for cataloguers to re-use. They will be able to pull together descriptions for resources outside their domain environment, across all cultural heritage datasets, and even the web at large. They will be able to concentrate their effort on their domain of local expertise, rather than having to re-create existing descriptions that have been already elaborated by others.

24 0

Benefits to Developers

25 1

Linked Data methods support the retrieval and re-mixing of data in a way that is consistent across all metadata providers. Instead of requiring data to be accessed using library-centric protocols such as Z39.50, linked data uses well-known standard Web protocols like HTTP. Developers will also no longer have to work with library-specific data formats such as MARC and EAD, which require custom software tools and applications. Linked Data methods involve pushing data onto the Web in a form that is understandable to Web applications. By leveraging RDF and HTTP, library developers are freed from the need to use domain-specific software, opening a growing range of generic tools, much of which are open-source. Thus they will find it much easier to build new services on top of their data. This also opens up a much larger developer community to provide support to IT professional in libraries. In a sea of RDF triples, no developer is an island.