03 Aug 2015
Hybrid Question Answering at QALD 5 challenge by Ricardo Usbeck The plethora of datasets on the web, both structured and unstructured, enables answering complex questions such as "Which anti-apartheid activist was born in Mvezo?" Some of those hybrid (source) question answering system have been benchmarked at the QALD 5 challenge at CLEF conference. Ricardo is going to present some of the results and give future research directions. Slides: https://docs.google.com/presentation/d/1dccMwbPMIeOpzvV1PNCKKxg96xZBSdK2Gav9JynAJjo/edit?usp=sharing BDE, Hadoop MapR and HDFS by Hajira Jabeen Hajira will present brief introduction to BigData Europe project (BDE). Followed by , Hadoop HDFS and map reduce for distributed processing of large data ...
03 Aug 2015 11:13am GMT
Business is becoming more and more globalised, and enterprises and organisations are acting in several different regions and thus facing more challenges of different cultural aspects as well as respective language barriers. Looking at the European market, we even see 24 working languages in EU28, which make cross-border services considerably complicated. As a result, powerful language technology is needed, and intense efforts have already been taken in the EU to deal with this situation and enable the vision of a multilingual digital single market (a priority area of the European Commission this year, see: http://ec.europa.eu/priorities/digital-single-market/ ). Here at the Semantic ...
03 Aug 2015 9:49am GMT
29 Jul 2015
2015-07-29, DCMI is please to announce that the National Diet Library of Japan has translated " Guidelines for Dublin Core Application Profiles ", a DCMI Recommended Resource. The link to the new Japanese translation is available on the DCMI Documents Translation page at http://dublincore.org/resources/translations/index.shtml .
29 Jul 2015 11:59pm GMT
2015-07-29, São Paulo State University (UNESP) and the Conference Committee of DC-2015 in São Paulo, Brazil on 1-4 September have published the final program of the DCMI International Conference at http://dcevents.dublincore.org/IntConf/index/pages/view/schedule-15 . Join us in São Paulo for an exciting agenda including papers, project reports and best practice posters and presentations. Parallel with the peer reviewed program is an array of special sessions of panels and discussions on key metadata issues, challenges and new opportunities. Pre- and post-conference Professional Program workshops round out the program by providing full-day instruction. Every year the DCMI community gathers for both its Annual Meeting ...
29 Jul 2015 11:59pm GMT
28 Jul 2015
28 Jul 2015 8:09am GMT
27 Jul 2015
The ESIP 2015 Summer Meeting was held at Pacific Grove, CA in the week of July 14-17. Pacific Grove is such a beautiful place with the coast line, sand beach and sun set. What excited me more are the science and technical topics covered in the meeting sessions, as well as the opportunity to catch up with friends in the ESIP community. Excellent topics + a scenic place + friends = a wonderful meeting. Thanks a lot to the meeting organizers! The theme of this summer meeting is "The Federation of Earth Science Information Partners & Community Resilience: Coming Together." ...
27 Jul 2015 5:13pm GMT
22 Jul 2015
Dear all, we are happy to announce DL-Learner 1.1. DL-Learner is a framework containing algorithms for supervised machine learning in RDF and OWL. DL-Learner can use various RDF and OWL serialization formats as well as SPARQL endpoints as input, can connect to most popular OWL reasoners and is easily and flexibly configurable. It extends concepts of Inductive Logic Programming and Relational Learning to the Semantic Web in order to allow powerful data analysis. Website: http://dl-learner.org GitHub page: https://github.com/AKSW/DL-Learner Download: https://github.com/AKSW/DL-Learner/releases ChangeLog: http://dl-learner.org/development/changelog/ DL-Learner is used for data analysis in other tools such as ORE and RDFUnit. Technically, it uses refinement ...
22 Jul 2015 2:14pm GMT
16 Jul 2015
Enterprise Linked Data Networks (PhD progress report) by Marvin Frommhold The topic of the thesis is the scientific utilization of the LUCID research project, in particular the LUCID Endpoint Prototype . In LUCID we research and develop on Linked Data technologies in order to allow partners in supply chains to describe their work, their companies and their products for other participants. This allows for building distributed networks of supply chain partners on the Web without a centralized infrastructure. About the AKSW Colloquium This event is part of a series of events about Semantic Web technology. Please see http://wiki.aksw.org/Colloquium for further information ...
16 Jul 2015 9:20am GMT
15 Jul 2015
2015-07-15, Conference host UNESP and DCMI are pleased to announce that Ex Libris and Elsevier are now among the sponsors of DC-2015 in São Paulo, Brazil, 1-4 September 2015. Elsevier is a world-leading provider of scientific, technical and medical information products and services and a world-leading provider of information solutions that enhance the performance of science, health, and technology professionals, empowering them to make better decisions. Ex Libris is a leading provider of library automation solutions, offering the only comprehensive product suite for the discovery, management, and distribution of all materials--print, electronic, and digital. For information about how your organization ...
15 Jul 2015 11:59pm GMT
2015-07-15, Early Bird registration for DC-2015 in São Paulo, Brazil closes on 31 July 2015. In addition to Keynote Speakers Paul Walk of EDINA and Ana Alice Baptista of the University of Minho, there is a full Technical Program of peer-reviewed papers, project reports and posters , as well as a Professional Program of full-day Workshops and Conference Special Sessions . For more about the conference, visit the conference website at http://purl.org/dcevents/dc-2015 .
15 Jul 2015 11:59pm GMT
In this article we will look at Virtuoso vs. Impala with 100G TPC-H on two R3.8 EC2 instances. We get a single user win for Virtuoso by a factor of 136, and a five user win by a factor of 55. The details and analysis follow. The load setup is the same as ever, with copying from CSV files attached as external tables into Parquet tables . We get lineitem split over 88 Parquet files, which should provide enough parallelism for the platform. The Impala documentation states that there can be up to one thread per file, and here we ...
15 Jul 2015 8:12pm GMT
Gang Fu and Evan Bolton have blogged about it previously, but their PubChemRDF paper is out now (doi: 10.1186/s13321-015-0084-4 ). It very likely defines the largest collection of RDF triples using the CHEMINF ontology and I congratulate the authors with a increasingly powerful PubChem database. With this major provider of Linked Open Data for chemistry now published, I should soon see where my Isbjørn stands . The release of this publication is also very timely with respect to the CHEMINF ontology, as I last week finished a transition from Google to GitHub, by moving the important wiki pages, including one about ...
15 Jul 2015 5:21pm GMT
With some help from SPARQL.
15 Jul 2015 1:34pm GMT
14 Jul 2015
The Innovation Radar is a DG Connect support initiative which focuses on the identification of high potential innovations and the key innovators behind them in FP7, CIP and H2020 projects. The Radar supports the innovators by suggesting a range of targeted actions that can assist them in fulfilling their potential in the market place. The first Innovation Radar Report reviews the innovation potential of ICT projects funded under 7th Framework Programme and the Competitiveness and Innovation Framework Programme. Between May 2014 and January 2015, the Commission reviewed 279 ICT projects, which had resulted in a total of 517 innovations, delivered by ...
14 Jul 2015 10:15am GMT
13 Jul 2015
This article discusses the relationship between vectored execution and column- and row-wise data representations. Column stores are traditionally considered to be good for big scans but poor at indexed access. This is not necessarily so, though. We take TPC-H Q9 as a starting point, working with different row- and column-wise data representations and index choices. The goal of the article is to provide a primer on the performance implications of different physical designs. All the experiments are against the TPC-H 100G dataset hosted in Virtuoso on the test system used before in the TPC-H series : dual Xeon E5-2630, 2x6 ...
13 Jul 2015 5:46pm GMT
Two papers presented at SIGMOD 2015 have been added to the Virtuoso Science Library . Orri Erling (OpenLink Software); Alex Averbuch (Neo Technology); Josep Larriba-Pey (Sparsity Technologies); Hassan Chafi (Oracle Labs); Andrey Gubichev (TU Munich); Arnau Prat-Pérez (Universitat Politècnica de Catalunya); Minh-Duc Pham (VU University Amsterdam); Peter Boncz (CWI): The LDBC Social Network Benchmark: Interactive Workload . Proceedings of SIGMOD 2015, Melbourne . This paper is an overview of the challenges posed in the LDBC social network benchmark, from data generation to the interactive workload. Mihai Capotă (Delft University of Technology), Tim Hegeman (Delft University of Technology), Alexandru Iosup (Delft ...
13 Jul 2015 4:52pm GMT