30 Nov 2016

PhD Proposal: Ankur Padia, Dealing with Dubious Facts in Knowledge Graphs

Dissertation Proposal: Dealing with Dubious Facts in Knowledge Graphs. Ankur Padia. 1:00-3:00pm Wednesday, 30 November 2016, ITE 325b, UMBC. Knowledge graphs are structured representations of facts where nodes are real-world entities or events and edges are the associations between pairs of entities. Knowledge graphs can be constructed using automatic or manual techniques. Manual techniques construct high-quality knowledge graphs but are expensive, time-consuming and not scalable. Hence, automatic information extraction techniques are used to create scalable knowledge graphs, but the extracted information can be of poor quality due to the presence of dubious facts. An extracted fact ...

30 Nov 2016 2:25am GMT

26 Nov 2016

AKSW Colloquium, 28.11.2016, NED using PBOH + Large-Scale Learning of Relation-Extraction Rules.

In the upcoming Colloquium, on November 28th at 3 PM, two papers will be presented. Probabilistic Bag-Of-Hyperlinks Model for Entity Linking: Diego Moussallem will discuss the paper "Probabilistic Bag-Of-Hyperlinks Model for Entity Linking" by Octavian-Eugen Ganea et al., which was accepted at WWW 2016. Abstract: Many fundamental problems in natural language processing rely on determining what entities appear in a given text. Commonly referenced as entity linking, this step is a fundamental component of many NLP tasks such as text understanding, automatic summarization, semantic search or machine translation. Name ambiguity, word polysemy, context dependencies and a heavy-tailed distribution of entities contribute to ...

26 Nov 2016 11:30am GMT

21 Nov 2016

Leveraging KBpedia Aspects To Generate Training Sets Automatically

In previous articles I have covered multiple ways to create training corpuses for unsupervised learning and positive and negative training sets for supervised learning [1, 2, 3] using Cognonto and KBpedia. Different structures inherent to a knowledge graph like KBpedia can lead to quite different corpuses and sets. Each of these corpuses or sets may yield different predictive powers depending on the task at hand. So far we have covered two ways to leverage the KBpedia Knowledge Graph to automatically create positive and negative training corpuses: Using the links that exist between each KBpedia reference concept and their ...

21 Nov 2016 11:14am GMT

17 Nov 2016

Dynamic Machine Learning Using the KBpedia Knowledge Graph – Part 2

In the first part of this series we found good hyperparameters for a single linear SVM classifier. In part 2, we will try another technique to improve the performance of the system: ensemble learning. So far, we have already reached 95% accuracy by tweaking the hyperparameters and the training corpuses, but the F1 score is still around 70% with the full gold standard, which can be improved. There are also situations when precision should be nearly perfect (because false positives are really not acceptable) or when the recall should be optimized. Here we will try to improve this ...

17 Nov 2016 11:05am GMT

Dynamic Machine Learning Using the KBpedia Knowledge Graph – Part 1

In my previous blog post, Create a Domain Text Classifier Using Cognonto, I explained how one can use the KBpedia Knowledge Graph to automatically create positive and negative training corpuses for different machine learning tasks. I explained how SVM classifiers could be trained and used to check if an input text belongs to the defined domain or not. This article is the first of two. In this first part I will expand on this idea to explain how the KBpedia Knowledge Graph can be used, along with other machine learning techniques, to cope with different situations and use cases. I ...

17 Nov 2016 11:00am GMT

16 Nov 2016

Triplifying a real dictionary

The Linked Data Lexicography for High-End Language Technology (LDL4HELTA) project was started in cooperation between Semantic Web Company (SWC) and K Dictionaries. LDL4HELTA combines lexicography and Language Technology with semantic technologies and Linked (Open) Data mechanisms and technologies. One of the implementation steps of the project is to create a language graph from the dictionary data. The input data, described further, is a Spanish dictionary core translated into multiple languages and available in XML format. This data should be triplified (that is, converted to RDF, the Resource Description Framework) for several purposes, including to enrich it with ...

16 Nov 2016 12:07pm GMT

14 Nov 2016

Accepted paper in AAAI 2017

Hello Community! We are very pleased to announce that our paper "Radon - Rapid Discovery of Topological Relations" was accepted for presentation at the Thirty-First AAAI Conference on Artificial Intelligence (AAAI-17), which will be held on February 4-9 at the Hilton San Francisco, San Francisco, California, USA. In more detail, we will present the following paper: "Radon - Rapid Discovery of Topological Relations", Mohamed Ahmed Sherif, Kevin Dreßler, Panayiotis Smeros, and Axel-Cyrille Ngonga Ngomo. Abstract. Datasets containing geo-spatial resources are increasingly being represented according to the Linked Data principles. Several time-efficient approaches for discovering links between RDF resources ...

14 Nov 2016 1:48pm GMT

13 Nov 2016

Pulling RDF out of MySQL

With a command line option and a very short stylesheet.

13 Nov 2016 3:09pm GMT

11 Nov 2016

SUB Göttingen joins DCMI as Institutional Member

2016-11-11, DCMI is pleased to announce that Göttingen State and University Library (SUB Göttingen) has joined DCMI as an Institutional Member. SUB Göttingen, one of the most important research libraries in Germany, plays a leading role in a large number of national and international projects involving the optimization of literature and information provision and the establishment and development of digital research and information infrastructures. Its scope of activities includes the cooperative development of a Germany-wide service infrastructure for the acquisition, licensing and provision of electronic resources; the coordination of large-scale joint research projects for developing research infrastructures in the humanities ...

11 Nov 2016 11:59pm GMT

A speaking camera using Pi3 and TensorFlow

11 Nov 2016 12:39pm GMT

10 Nov 2016

Donate to the commons this holiday season

Holiday season is nearly upon us. Donating to a charity is an alternative form of gift giving that shows you care, whilst directing your money towards helping those that need it. There are a lot of great and deserving causes you can support, and I'm certainly not going to tell you where you should donate your money. But I've been thinking about the various ways in which I can support projects that I care about. There are a lot of them as it turns out. And it occurred to me that I could ask friends and family who might want to buy me a gift to ...

10 Nov 2016 7:23pm GMT

08 Nov 2016

The practice of open data

Open data is data that anyone can access, use and share. Open data is the result of several processes. The most obvious one is the release process that results in data being made available for reuse and sharing. But there are other processes that may take place before that open data is made available: collecting and curating a dataset; running it through quality checks; or ensuring that data has been properly anonymised. There are also processes that happen after data has been published. Providing support to users, for example. Or dealing with error reports or service issues with an API ...

08 Nov 2016 7:52pm GMT

07 Nov 2016

Building and Maintaining the KBpedia Knowledge Graph

The Cognonto demo is powered by an extensive knowledge graph called the KBpedia Knowledge Graph, as organized according to the KBpedia Knowledge Ontology (KKO). KBpedia is used for all kinds of tasks, some of which are demonstrated by the Cognonto use cases. KBpedia powers dataset linkage and mapping tools, machine learning training workflows, entity and concept extractions, category and topic tagging, etc. The KBpedia Knowledge Graph is a structure of more than 39,000 reference concepts linked to 6 major knowledge bases and 20 popular ontologies in use across the Web. Unlike other knowledge graphs that analyze big corpuses of ...

07 Nov 2016 7:57pm GMT

04 Nov 2016

Discogs: a business based on public domain data

When I'm discussing business models around open data I regularly refer to a few different examples. Not all of these have well developed case studies, so I thought I'd start trying to capture them here. In this first write-up I'm going to look at

04 Nov 2016 10:28pm GMT

Machine learning links

[work in progress - I'm updating it gradually] Machine Learning

04 Nov 2016 4:11pm GMT

01 Nov 2016

Checking Fact Checkers

As of last month

01 Nov 2016 7:39pm GMT