Los árboles y el bosque: 23 November 2010

Open data is the electricity of the 21st century « Web of Data

Lonclass and RDF

Lonclass and RDF: "Lonclass and RDF
By DANBRI | Published: 2010-11-18
Lonclass is one of the BBC’s in-house classification systems – the “London classification”. I’ve had the privilege of investigating lonclass within the NoTube project. It’s not currently public, but much of what I say here is also applicable to the Universal Decimal Classification (UDC) system upon which it was based. UDC is also not fully public yet; I’ve made a case elsewhere that it should be, and I hope we’ll see that within my lifetime. UDC and Lonclass have a fascinating history and are rich cultural heritage artifacts in their own right, but I’m concerned here only with their role as the keys to many of our digital and real-world archives.

Why would we want to map Lonclass or UDC subject classification codes into RDF?"

Universal Decimal Classification: Announcement: Classification & Ontology: International UDC Seminar 2011

culturegraph

culturegraph: "culturegraph.org is a Linked Open Data service that aims to establish shared identifiers (Uniform Resource Identifiers) for cultural works (books and other text, paintings, sculptures, piece of music etc.) to ensure these resources can be reliably and persistently referenced. The service is currently being developed cooperatively by the German National Library (DNB) and the North Rhine-Westphalian Library Service Center with support from the German-speaking Working Group of Library Networks."

The Nature of Connectedness on the Web » AI3:::Adaptive Information

The Nature of Connectedness on the Web » AI3:::Adaptive Information: "What does it mean to interoperate information on the Web? With linked data and other structured data now in abundance, why don’t we see more information effectively combined? Why express your information as linked data if no one is going to use it?

Interoperability comes down to the nature of things and how we describe those things or quite similar things from different sources. This was the major thrust of my recent keynote presentation to the Dublin Core annual conference. In that talk I described two aspects of the semantic “gap”:

One aspect is the need for vetted reference sources that provide the entities and concepts for aligning disparate content sources on the Web, and
A second aspect is the need for accurate mapping predicates that can represent the often approximate matches and overlaps of this heterogeneous content.
I’ll discuss the first “gap” in a later post. What we’ll discuss here is the fact that most relationships between putatively same things on the Web are rarely exact, and are most often approximate in nature."

Importing lots of nodes from DBpedia -OR- Import under request | groups.drupal.org

LISTSERV 15.5 - NGC4LIB Archives

LISTSERV 15.5 - NGC4LIB Archives: "The Variations/FRBR Project at Indiana University (http://vfrbr.info) has released version 1.1 of a set of XML Schemas designed for the representation of FRBR (http://www.ifla.org/en/publications/functional-requirements-for-bibliographic-records) data in XML. The 1.1 Schema release represents some significant improvements over our earlier 1.0 release, particularly in the handling of FRBR relationships. As before, the Variations/FRBR XML Schemas are defined at three 'levels': frbr, which embodies faithfully only those features defined by the FRBR and FRAD reports; efrbr, which adds additional features we hope will make the data format more 'useful'; and vfrbr, which both contracts and extends the FRBR and FRAD models to create a data representation optimized for the description of musical materials and we hope provides a model for other domain-specific applications of FRBR."

Querying the British National Bibliography

Following up on the earlier announcement that the British Library has made the British National Bibliography available under a public domain dedication, the JISC Open Bibliography project has worked to make this data more useable.

The data has been loaded into a Virtuoso store that is queriable through the SPARQL Endpoint and the URIs that we have assigned each record use the ORDF software to make them dereferencable, supporting perform content auto-negotiation as well as embedding RDFa in the HTML representation.

The data contains some 3 million individual records and some 173 million triples. Indexing the data was a very CPU intensive process taking approximately three days. Transforming and loading the source data took about five hours.

To get an idea of the shape of the data, let us consider a sample resource, http://bnb.bibliographica.org/entry/GB8102507 . Apart from linkage between the various representations, the description of the entity itself is as follows

JISC Beginner’s Guide to Digital Preservation

JISC Beginner’s Guide to Digital Preservation: "JISC Beginner’s Guide to Digital Preservation
Welcome to the JISC Beginner’s Guide to Digital Preservation

The Guide has been written for those working on JISC projects who would like help with preserving their outputs.

It is aimed at those who are new to digital preservation but can also serve as a resource for those who have specific requirements or wish to find further resources in certain areas."

Los árboles y el bosque

Pages

23 November 2010