11 April 2011
eFoundations: RDTF metadata guidelines - Limp Data or Linked Data?
eFoundations: RDTF metadata guidelines - Limp Data or Linked Data?: "Having just finished reading thru the 196 comments we received on the draft metadata guidelines for the UK RDTF I'm now in the process of wondering where we go next. We (Pete and I) have relatively little effort to take this work forward (a little less than 5 days to be precise) so it's not clear to me how best we use that effort to get something useful out for both RDTF and the wider community."
Fihrist - Home
Fihrist - Home: "Welcome to Fihrist. This catalogue provides a searchable interface to more than 3,000 basic manuscript descriptions taken from printed and card catalogues of the collections of the Bodleian Libraries, Oxford and Cambridge University Library. It contains records for Arabic manuscripts at Oxford and both Arabic and Persian records for Cambridge. Fihrist was created with JISC funding by the OCIMCO project."
[1103.5046] From Linked Data to Relevant Data -- Time is the Essence
[1103.5046] From Linked Data to Relevant Data -- Time is the Essence: "From Linked Data to Relevant Data -- Time is the Essence
Markus Kirchberg, Ryan K L Ko, Bu Sung Lee
(Submitted on 25 Mar 2011)
The Semantic Web initiative puts emphasis not primarily on putting data on the Web, but rather on creating links in a way that both humans and machines can explore the Web of data. When such users access the Web, they leave a trail as Web servers maintain a history of requests. Web usage mining approaches have been studied since the beginning of the Web given the log's huge potential for purposes such as resource annotation, personalization, forecasting etc. However, the impact of any such efforts has not really gone beyond generating statistics detailing who, when, and how Web pages maintained by a Web server were visited."
Markus Kirchberg, Ryan K L Ko, Bu Sung Lee
(Submitted on 25 Mar 2011)
The Semantic Web initiative puts emphasis not primarily on putting data on the Web, but rather on creating links in a way that both humans and machines can explore the Web of data. When such users access the Web, they leave a trail as Web servers maintain a history of requests. Web usage mining approaches have been studied since the beginning of the Web given the log's huge potential for purposes such as resource annotation, personalization, forecasting etc. However, the impact of any such efforts has not really gone beyond generating statistics detailing who, when, and how Web pages maintained by a Web server were visited."
[1103.5043] An Empirical Study of Real-World SPARQL Queries
[1103.5043] An Empirical Study of Real-World SPARQL Queries: "An Empirical Study of Real-World SPARQL Queries
Mario Arias, Javier D. Fernández, Miguel A. Martínez-Prieto, Pablo de la Fuente
(Submitted on 25 Mar 2011)
Understanding how users tailor their SPARQL queries is crucial when designing query evaluation engines or fine-tuning RDF stores with performance in mind. In this paper we analyze 3 million real-world SPARQL queries extracted from logs of the DBPedia and SWDF public endpoints. We aim at finding which are the most used language elements both from syntactical and structural perspectives, paying special attention to triple patterns and joins, since they are indeed some of the most expensive SPARQL operations at evaluation phase. We have determined that most of the queries are simple and include few triple patterns and joins, being Subject-Subject, Subject-Object and Object-Object the most common join types. The graph patterns are usually star-shaped and despite triple pattern chains exist, they are generally short."
Mario Arias, Javier D. Fernández, Miguel A. Martínez-Prieto, Pablo de la Fuente
(Submitted on 25 Mar 2011)
Understanding how users tailor their SPARQL queries is crucial when designing query evaluation engines or fine-tuning RDF stores with performance in mind. In this paper we analyze 3 million real-world SPARQL queries extracted from logs of the DBPedia and SWDF public endpoints. We aim at finding which are the most used language elements both from syntactical and structural perspectives, paying special attention to triple patterns and joins, since they are indeed some of the most expensive SPARQL operations at evaluation phase. We have determined that most of the queries are simple and include few triple patterns and joins, being Subject-Subject, Subject-Object and Object-Object the most common join types. The graph patterns are usually star-shaped and despite triple pattern chains exist, they are generally short."
[1103.4295] Linking Literature and Data: Status Report and Future Efforts
[1103.4295] Linking Literature and Data: Status Report and Future Efforts: "Linking Literature and Data: Status Report and Future Efforts
Alberto Accomazzi
(Submitted on 22 Mar 2011)
In the current era of data-intensive science, it is increasingly important for researchers to be able to have access to published results, the supporting data, and the processes used to produce them. Six years ago, recognizing this need, the American Astronomical Society and the Astrophysics Data Centers Executive Committee (ADEC) sponsored an effort to facilitate the annotation and linking of datasets during the publishing process, with limited success. I will review the status of this effort and describe a new, more general one now being considered in the context of the Virtual Astronomical Observatory."
Alberto Accomazzi
(Submitted on 22 Mar 2011)
In the current era of data-intensive science, it is increasingly important for researchers to be able to have access to published results, the supporting data, and the processes used to produce them. Six years ago, recognizing this need, the American Astronomical Society and the Astrophysics Data Centers Executive Committee (ADEC) sponsored an effort to facilitate the annotation and linking of datasets during the publishing process, with limited success. I will review the status of this effort and describe a new, more general one now being considered in the context of the Virtual Astronomical Observatory."
SUSHI Server - National Information Standards Organization
SUSHI Server - National Information Standards Organization: "Author(s): Brinda Shah is a web programmer at H.W. Wilson..
doi: 10.3789/isqv23n1.2011.05
Citation: Shah, Brinda. SUSHI Implementation: The Server Side Experience. Information Standards Quarterly, 2011 Winter, 23(1):20-22.
Abstract: The author describes her experience in implementing the server side of the Standardized Usage Statistics Harvesting Initiative (SUSHI) Protocol (ANSI/NISO Z39.93) at H.W. Wilson. She describes her learning curve with web services, the steps involved in implementing a SUSHI server for delivering usage data to clients. Tools used include J2EE framework, Apache Tomcat web application server, Axis SOAP engine, and Eclipse development tool."
doi: 10.3789/isqv23n1.2011.05
Citation: Shah, Brinda. SUSHI Implementation: The Server Side Experience. Information Standards Quarterly, 2011 Winter, 23(1):20-22.
Abstract: The author describes her experience in implementing the server side of the Standardized Usage Statistics Harvesting Initiative (SUSHI) Protocol (ANSI/NISO Z39.93) at H.W. Wilson. She describes her learning curve with web services, the steps involved in implementing a SUSHI server for delivering usage data to clients. Tools used include J2EE framework, Apache Tomcat web application server, Axis SOAP engine, and Eclipse development tool."
SUSHI Client - National Information Standards Organization
SUSHI Client - National Information Standards Organization: "Author(s): Omar Villa is IT Development Manager at Grupo Integra in Mexico City, Mexico.
doi: 10.3789/isqv23n1.2011.04
Citation: Villa, Omar. SUSHI Implementation: The Client Side Experience. Information Standards Quarterly, 2011 Winter, 23(1):18-19.
Abstract: The author describes his experience in implementing the client side of the Standardized Usage Statistics Harvesting Initiative (SUSHI) Protocol (ANSI/NISO Z39.93) at Grupo Integra. He developed a module for their Kenvo Stats system, which generates statistics on the usage of electronic resources, to automate the retrieval of the COUNTER report statistics. After trying some PHP tools and a Java implementation, the final client was built using PHP Sockets."
doi: 10.3789/isqv23n1.2011.04
Citation: Villa, Omar. SUSHI Implementation: The Client Side Experience. Information Standards Quarterly, 2011 Winter, 23(1):18-19.
Abstract: The author describes his experience in implementing the client side of the Standardized Usage Statistics Harvesting Initiative (SUSHI) Protocol (ANSI/NISO Z39.93) at Grupo Integra. He developed a module for their Kenvo Stats system, which generates statistics on the usage of electronic resources, to automate the retrieval of the COUNTER report statistics. After trying some PHP tools and a Java implementation, the final client was built using PHP Sockets."
Democratizing Information with Semantics » AI3:::Adaptive Information
Democratizing Information with Semantics » AI3:::Adaptive Information: "Self-service Information Management for Knowledge Workers
Though I have alluded to it numerous times in my past writings [1], I think one of the most pervasive and important benefits from semantic technologies in the enterprise will come from the democratization of information. These benefits will arise mostly from a fundamental change in how we manage and consume information. A new “system” of semantic technologies is now largely available that can put the collection, assembly, organization, analysis and presentation of information directly in the hands of those who need it most — the consumers of information."
Though I have alluded to it numerous times in my past writings [1], I think one of the most pervasive and important benefits from semantic technologies in the enterprise will come from the democratization of information. These benefits will arise mostly from a fundamental change in how we manage and consume information. A new “system” of semantic technologies is now largely available that can put the collection, assembly, organization, analysis and presentation of information directly in the hands of those who need it most — the consumers of information."
Mar 16: Metadata Harmonization (NISO/DCMI Webinar) - National Information Standards Organization
Mar 16: Metadata Harmonization (NISO/DCMI Webinar) - National Information Standards Organization: "Metadata Harmonization: Making Standards Work Together"
Subscribe to:
Posts (Atom)