24 Nov 2015

Planet RDF

Who is the intended audience for open data?

This post is part of my ongoing series:

24 Nov 2015 5:29pm GMT

23 Nov 2015

AKSW Colloquium, 23-11-2015, CVtec and Patty

CVtec and model-driven semantification by Andreas Nareike: In this presentation, I will give a short introduction to our project CVtec (http://www.cv-tec.de/). CVtec is concerned with knowledge management for technical facilities and uses methods of model-driven software development. Although CVtec currently uses relational databases to persist data, we are researching possibilities to move towards a semantic data model. I will give an overview of our different approaches and hope to get some valuable feedback.

René Speck will present "Patty: A Taxonomy of Relational Patterns with Semantic Types" by Ndapandula Nakashole, Gerhard Weikum, Fabian Suchanek (Max Planck ...)

23 Nov 2015 12:28pm GMT

21 Nov 2015

Semantic Interpretation of Structured Log Files

Piyush Nimbalkar, Semantic Interpretation of Structured Log Files, M.S. thesis, University of Maryland, Baltimore County, August 2015. Log files comprise a record of different events happening in various applications, operating systems and even network devices. Originally they were used to record information for diagnostic and debugging purposes. Nowadays, logs are also used to track events for auditing and forensics in case of malicious activities or system attacks. Various software systems, such as intrusion detection systems, web servers, anti-virus and anti-malware systems, firewalls and network devices, generate logs with useful information that can be used ...

21 Nov 2015 3:44pm GMT

17 Nov 2015

13 ways to make your writing look more professional

Simple copyediting things.

17 Nov 2015 7:35pm GMT

15 Nov 2015

Managing risks when publishing open data

A question that I frequently encounter when talking to organisations about publishing open data is: "what if someone misuses or misunderstands our data?". These concerns stem from several different sources: that the data might be analysed incorrectly, drawing incorrect conclusions that might be attributed to the publisher; that the data has known limitations and this might reflect on the publisher's abilities, e.g. exposing issues with their operations; that the data might be used against the publisher in some way, e.g. to paint them in a bad light; that the data might be used for causes with which the ...

15 Nov 2015 11:24am GMT

13 Nov 2015

Fictional data

The phrase "fictional data" popped into my head recently, largely because of odd connections between a couple of projects I've been working on. It's stuck with me because, if you set aside the literal meaning of "data that doesn't actually exist", there are some interesting aspects to it. For example, the phrase could apply to: data that is deliberately wrong or inaccurate in order to mislead - lies or spam; data that is deliberately wrong as a proof of origin or claim of ownership - e.g. inaccuracies introduced into maps to identify their sources, or ...

13 Nov 2015 6:19pm GMT

10 Nov 2015

Radiodan Part 2: Unexpectedly discovering latent user needs

As I explained in the

10 Nov 2015 12:00pm GMT

Radiodan Part 2: Drawing customisations to discover latent user needs

As I explained in the

10 Nov 2015 12:00pm GMT

08 Nov 2015

Supporting Situationally Aware Cybersecurity Systems

Zareen Syed, Tim Finin, Ankur Padia and M. Lisa Mathews, Supporting Situationally Aware Cybersecurity Systems, Technical Report, Computer Science and Electrical Engineering, UMBC, 30 September 2015. In this report, we describe the Unified Cyber Security ontology (UCO) to support situational awareness in cyber security systems. The ontology is an effort to incorporate and integrate heterogeneous information available from different cyber security systems and the most commonly used cyber security standards for information sharing and exchange. The ontology has also been mapped to a number of existing cyber security ontologies as well as concepts in the Linked Open Data cloud. ...

08 Nov 2015 10:59pm GMT

07 Nov 2015

AKSW Colloquium, 09-11-2015, Versioning of Arbitrary RDF Data (PhD progress report) and GraphLab Platform

GraphLab Platform - Overview and History by Simon Bin: GraphLab is a graph-based distributed computation framework. Its development began in 2009 at Carnegie Mellon University. At that time it was competing with Hadoop on graph processing. The typical example algorithm demonstrated with it is the PageRank calculation. It still appears today in the Spark GraphX documentation as a filler for the computation step. We will look at the architecture, sample code and what has become of GraphLab today.

Versioning of Arbitrary RDF Data (PhD progress report) by Marvin Frommhold: A major challenge of B2B Data Networks is efficient synchronization of data between the ...

07 Nov 2015 10:43am GMT

06 Nov 2015

Schema.org: what's new?

[starburst visualization of schema.org's hierarchy] It's time for a round-up of recent developments at schema.org. We have just published version 2.2. As usual, this combines many small fixes with a mix of new vocabulary, as well as efforts to improve the integration and documentation of our existing vocabulary. And as always you can read the full details in our releases page, which in turn links to our issue tracker for even more details. Here are some highlights: We made a number of improvements relating to the description of services, including the addition of providerMobility to indicate dynamic locations, ...

06 Nov 2015 3:13pm GMT

If you like “Friends” you probably also will like “Veronica’s Closet” (find out with SPARQL why)

In a previous blog post I discussed the power of SPARQL to go beyond data retrieval to analytics. Here I look into the possibilities of implementing a product recommender entirely in SPARQL. Products are considered to be similar if they share relevant characteristics, and the higher the overlap, the higher the similarity. In the case of movies or TV programs there are static characteristics (e.g. genre, actors, director) and dynamic ones like the viewing patterns of the audience. The static part of this we can look up in resources like DBpedia. If we look at the data related to the ...
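The overlap idea can be illustrated outside SPARQL with a minimal Python sketch. It assumes the Jaccard index (shared characteristics divided by all characteristics) as the overlap measure, which is one common choice and not necessarily the one used in the post. The function names and the catalogue entries are hypothetical placeholders, not data from DBpedia.

```python
def overlap_similarity(features_a, features_b):
    """Jaccard index: shared characteristics divided by all characteristics."""
    union = features_a | features_b
    if not union:
        return 0.0  # two items with no characteristics share nothing
    return len(features_a & features_b) / len(union)


def rank_similar(target, catalogue):
    """Return (title, score) pairs for every other item, most similar first."""
    scores = [
        (title, overlap_similarity(catalogue[target], features))
        for title, features in catalogue.items()
        if title != target
    ]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


# Hypothetical placeholder data: each item maps to a set of static
# characteristics such as genre, actors or network.
catalogue = {
    "show_a": {"genre:sitcom", "actor:x", "actor:y", "network:n1"},
    "show_b": {"genre:sitcom", "actor:x", "network:n1"},
    "show_c": {"genre:drama", "actor:z", "network:n2"},
}

ranking = rank_similar("show_a", catalogue)
for title, score in ranking:
    print(f"{title}: {score:.2f}")
```

Here show_b shares three of four characteristics with show_a and therefore ranks above show_c, which shares none. Dynamic characteristics such as viewing patterns could be added to the same sets, though weighting them is a separate design decision.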

06 Nov 2015 12:40pm GMT

Extracting Structured Summaries from Text Documents

Dr. Zareen Syed, Research Assistant Professor, UMBC. 10:30am, Monday, 9 November 2015, ITE 346, UMBC.

In this talk, Dr. Syed will present unsupervised approaches for automatically extracting structured summaries composed of slots and fillers (attributes and values) and important facts from articles, thus effectively reducing the amount of time and effort humans spend on gathering intelligence using traditional keyword-based search approaches. The approach first extracts important concepts from text documents and links them to unique concepts in the Wikitology knowledge base. It then exploits the types associated with the linked concepts to ...

06 Nov 2015 2:48am GMT

03 Nov 2015

ADEQUATe for the Quality of Open Data

The ADEQUATe project builds on two observations: an increasing amount of Open Data is becoming available as an important resource for emerging businesses; and, furthermore, the integration of such open, freely re-usable data sources into organisations' data warehouse and data management systems is seen as a key success factor for competitive advantage in a data-driven economy. The project identifies crucial issues which have to be tackled to fully exploit the value of open data and its efficient integration with other data sources: the overall quality issues with metadata and the data itself, and the lack of interoperability between data sources. The project's approach is now to address ...

03 Nov 2015 10:19am GMT

01 Nov 2015

AKSW Colloquium, 2 November, 3pm, Automating Geo-spatial RDF Dataset Integration and Enrichment

On November 2nd at 3 PM, Mohamed Sherif will present the progress of his PhD titled "Automating Geo-spatial RDF Dataset Integration and Enrichment". Abstract: Within this thesis, we will spur the transition from islands of isolated Geographic Information Systems (GIS) to enriched geo-spatial Linked Data sets with which geographic information can easily be integrated and processed. To achieve this goal, we will provide concepts, approaches and use cases that facilitate the combination and manipulation of geographic information with other data types that are already present on the Linked Data Web. Moreover, we will provide means to automate the ...

01 Nov 2015 7:38pm GMT

31 Oct 2015

The KELVIN Information Extraction System

In this week's ebiquity lab meeting (10:30am Monday Nov 2), Tim Finin will describe recent work on the Kelvin information extraction system and its performance in two tasks in the 2015 NIST Text Analysis Conference. Kelvin has been under development at the JHU Human Language Center of Excellence for several years. Kelvin reads documents in several languages and extracts entities and relations between them. This year it was used for the Coldstart Knowledge Base Population and Trilingual Entity Discovery and Linking tasks. Key components in the tasks are a system for cross-document coreference and another that links entities to ...

31 Oct 2015 3:41am GMT