01 Oct 2014

feedPlanet RDF

Simmons College and Infocom Corporation renew organizational memberships

2014-10-01, DCMI is very pleased to announce the renewal of two of its organizational members for the coming year. The Graduate School of Library and Information Science at Simmons College in Boston, USA, has renewed as an Institutional Member and Infocom Corporation of Japan has renewed as a Supporting Member. The DCMI Supporting Member Program is open to all private sector companies that want to support DCMI financially in continuing its work to the benefit of a healthy metadata ecosystem. The Institutional Member Program is open to all public sector organizations interested in supporting DCMI while participating actively in DCMI ...

01 Oct 2014 11:59pm GMT

30 Sep 2014

feedPlanet RDF

In Hoc Signo Vinces (part 19 of n): Scalability, 1000G, and 3000G

Scalability, specifically linear scalability, means that twice the data takes twice as long to process, or that double the gear processes the same data in half the time. This is only literally true for "embarrassingly parallel" workloads. There are parts of TPC-H which have an embarrassingly parallel nature, like Q1 and Q7 . There are parts that are almost as easy, like Q14, Q17 , Q19 , and Q21 , where there is a big scan and a selective hash join with a hash table small enough to replicate everywhere. The scan scales linearly; building the hash does not, since ...

30 Sep 2014 8:33pm GMT

Read Write Web — Q3 Summary — 2014

Summary The schedule for ISWC 2014 in Trentino, Italy has been announced. A packed program was under way with a seemingly increasing number of mature demos . MIT / DIG will present an excellent paper, entitled, " CIMBA - Client-Integrated MicroBlogging Architecture ". A massive amount of work has come together under the EU funded, LOD2 banner, which has now been renamed to Linked Data Stack. Far too much to pack into one blog post, but an excellent overview is available under Open Access, called, " Linked Open Data - Creating Knowledge Out of Interlinked Data ". Congrats to everyone ...

30 Sep 2014 12:35pm GMT

Data Shapes Working Group Launched

It's taken a while but we've finally been able to launch the RDF Data Shapes Working Group. As the charter for the new WG says, the mission is to produce a language for defining structural constraints on RDF graphs. In … Continue reading →

30 Sep 2014 8:14am GMT

Infoboxer: using statistical semantic knowledge to help create Wikipedia infoboxes

Tweet In this week's ebiquity meeting (10am Tue. Oct 1 in ITE346), Varish Mulwad will present Infoboxer, a prototype tool he developed with Roberto Yus that overcomes these challenges using statistical and semantic knowledge from linked data sources to ease the process of creating Wikipedia infoboxes. Wikipedia infoboxes serve as input in the creation of knowledge bases such as DBpedia, Yago, and Freebase. Current creation of Wikipedia infoboxes is manual and based on templates that are created and maintained collaboratively. However, these templates pose several challenges: Different communities use different infobox templates for the same category articles Attribute names differ ...

30 Sep 2014 12:56am GMT

26 Sep 2014

feedPlanet RDF

In Hoc Signo Vinces (part 18 of n): Cluster Dynamics

This article is about how scale-out differs from single-server. This shows large effects of parameters whose very existence most would not anticipate, and some low level metrics for assessing these. The moral of the story is that this is the stuff which makes the difference between merely surviving scale-out and winning with it. The developer and DBA would not normally know about this; thus these things fall into the category of adaptive self-configuration expected from the DBMS. But since this series is about what makes performance, I will discuss the dynamics such as they are and how to play these. ...

26 Sep 2014 9:02pm GMT

In Hoc Signo Vinces (part 17 of n): 100G and 300G Runs on Dual Xeon E5 2650v2

This is an update presenting sample results on a newer platform for a single-server configuration. This is to verify that performance scales with the addition of cores and clock speed. Further, we note that the jump from 100G to 300G changes very little about the score. 3x larger takes approximately 3x longer, as long as things are in memory. The platform is one node of the CWI cluster which was also used for the 500Gt RDF experiments reported on this blog . The specification is dual Xeon E5 2650v2 (8 core, 16 thread, 2.6 GHz) with 256 GB RAM. The ...

26 Sep 2014 9:02pm GMT

25 Sep 2014

feedPlanet RDF

OpenTox Europe 2014 presentation: "Open PHACTS: solutions and the foundation"

CC-BY 2.0 by Dmitry Valberg . Where the OpenTox Europe 2013 presentation focused on the technical layers of Open PHACTS , this presentation addressed a key knowledge management solution to scientific questions and the Open PHACTS Foundation . I stress here too, as in the slides, that the presentation is on behalf of the full consortium! For the knowledge management, I think Open PHACTS did really interested work in the field of "identity" and am happy to have been involved in this [ Brenninkmeijer2012 ]. The platform implementation is, furthermore, based on the BridgeDb platform, that originated in our group [ ...

25 Sep 2014 8:59am GMT

24 Sep 2014

feedPlanet RDF

In Hoc Signo Vinces (part 16 of n): Introduction to Scale-Out

So far, we have analyzed TPC-H in a single-server, memory-only setting. We will now move to larger data and cluster implementations. In principle, TPC-H parallelizes well, so we should expect near-linear scalability; i.e., twice the gear runs twice as fast, or close enough. In practice, things are not quite so simple. Larger data, particularly a different data-to-memory ratio, and the fact of having no shared memory, all play a role. There is also a network, so partitioned operations, which also existed in the single-server case, now have to send messages across machines, not across threads. For data loading and refreshes, ...

24 Sep 2014 5:05pm GMT

19 Sep 2014

feedPlanet RDF

Infocom Corporation renews DCMI Supporting Membership

2014-09-19, DCMI is very pleased to announce that Infocom Corporation of Japan has renewed its Supporting Membership status until 30 June 2015. Please see the DCMI membership page for more details. The DCMI Supporting Member Program is open for all private sector companies that want to support DCMI financially to continue its work to the benefit of the global audience and a healthy metadata ecosystem.

19 Sep 2014 11:59pm GMT

Rafiki: A Semantic and Collaborative Approach to Community Health-Care in Underserved Areas

Tweet Primal Pappachan, Roberto Yus, Anupam Joshi and Tim Finin, Rafiki: A Semantic and Collaborative Approach to Community Health-Care in Underserved Areas , 10th IEEE International Conference on Collaborative Computing: Networking, Applications and Worksharing, 22-15 October2014, Miami. Community Health Workers (CHWs) act as liaisons between health-care providers and patients in underserved or un-served areas. However, the lack of information sharing and training support impedes the effectiveness of CHWs and their ability to correctly diagnose patients. In this paper, we propose and describe a system for mobile and wearable computing devices called Rafiki which assists CHWs in decision making and facilitates ...

19 Sep 2014 12:26pm GMT

18 Sep 2014

feedPlanet RDF

Taming Wild Big Data

Tweet Jennifer Sleeman and Tim Finin, Taming Wild Big Data , AAAI Fall Symposium on Natural Language Access to Big Data, Nov. 2014. Wild Big Data is data that is hard to extract, understand, and use due to its heterogeneous nature and volume. It typically comes without a schema, is obtained from multiple sources and provides a challenge for information extraction and integration. We describe a way to subduing Wild Big Data that uses techniques and resources that are popular for processing natural language text. The approach is applicable to data that is presented as a graph of objects and ...

18 Sep 2014 1:36am GMT

13 Sep 2014

feedPlanet RDF

A schemaless computer database in 1965

To enable flexible metadata aggregation, among other things.

13 Sep 2014 4:09pm GMT

12 Sep 2014

feedPlanet RDF

Schema.org v1.91: Offer/price documentation fixes, cleanup and community contributions.

Schema.org has been updated to v1.91 . From the release notes: Updated text of the price property to include practical usage guidance, alongside links to information from GS1 for the gtin-related Offer properties. Updated all our examples to follow that guidance; primarily by using priceCurrency and the content= attribute. Noted our thanks to the OpenDomain project for our domain name. Updated the text of the ' image ' property to match its expected types. Thanks, Dan Scott! Changed spelling of 'supercededBy' to the more conventional supersededBy . Thanks, Sachini Aparna Herath! Noted that ' logo ' and ' photo ' ...

12 Sep 2014 7:52am GMT

09 Sep 2014

feedPlanet RDF

Major UMBEL Release: 1.10

After more than 2 years, we are now finally releasing a new version of the UMBEL ontology and reference concept structure . One might think that we haven't worked on the project all that time, but it is not strictly true. We did improve the mapping to external vocabularies/ontologies, we worked much on linking Wikipedia pages to the UMBEL structure, but we haven't had time to release a new version… until now! For people new to the ontology, UMBEL is a general reference structure of about 28,000 reference concepts, which provides a scaffolding to link and interoperate other datasets and ...

09 Sep 2014 2:57pm GMT

08 Sep 2014

feedPlanet RDF

SEMANTiCS 2014 (part 3 of 3): Conversations

I was asked for an oracular statement about the future of relational database (RDBMS) at the conference. The answer, without doubt or hesitation, is that this is forever. But this does not mean that the RDBMS world would be immutable, quite the opposite. The specializations converge. The RDBMS becomes more adaptable and less schema-first. Of course the RDBMS also take new data models beside the relational. RDF and other property graph models, for instance. The schema-last-ness is now well in evidence. For example, PostgreSQL has an hstore column type which is a list of key-value pairs. Vertica has a feature ...

08 Sep 2014 8:11pm GMT