25 Oct 2016

Planet RDF

Create a Domain Text Classifier Using Cognonto

A common task required by systems that automatically analyze text is to classify an input text into one or multiple classes. A model needs to be created to scope the class (what belongs to it and what does not) and then a classification algorithm uses this model to classify an input text. Multiple classification algorithms exist to perform such a task: Support Vector Machine (SVM), K-Nearest Neighbours (KNN), C4.5 and others. What is hard with any such text classification task is not so much how to use these algorithms: they are generally easy to configure and use once implemented ...
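As a rough illustration of the kind of classifier the excerpt mentions, here is a minimal K-Nearest Neighbours sketch over bag-of-words vectors. It is not the article's method and does not use Cognonto; the function names, the toy training set and the choice of cosine similarity are all assumptions made for this example.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def knn_classify(text, training, k=3):
    """Return the majority label among the k most similar training texts."""
    query = Counter(tokenize(text))
    scored = sorted(
        ((cosine(query, Counter(tokenize(doc))), label) for doc, label in training),
        key=lambda pair: pair[0],
        reverse=True,
    )
    votes = Counter(label for _, label in scored[:k])
    return votes.most_common(1)[0][0]


# Hypothetical toy training set: (text, class label) pairs.
TRAINING = [
    ("the guitar solo and the drums were loud", "music"),
    ("the band released a new album of songs", "music"),
    ("the singer performed her songs with the band", "music"),
    ("the court ruled against the new law", "legal"),
    ("the judge and the lawyer argued in court", "legal"),
    ("the law was passed after the trial", "legal"),
]

if __name__ == "__main__":
    print(knn_classify("the band played songs on guitar", TRAINING))
```

The training set plays the role of the "model" that scopes each class. In practice, the quality of that training data matters far more than the few lines of voting logic.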

25 Oct 2016 12:49am GMT

24 Oct 2016

A presence robot with Chromium, WebRTC, Raspberry Pi 3 and EasyRTC

Here's how to make a presence robot with Chromium 51, WebRTC, Raspberry Pi 3 and EasyRTC. It's actually very easy, especially now that Chromium 51 comes with Raspbian Jessie, although it's taken me a long time to find the exact incantation. If you're going to use it for real, I'd suggest using the

24 Oct 2016 9:53pm GMT

21 Oct 2016

For the purposes of having something to point to in future, here's a list of different meanings of "open" that I've encountered. XYZ is "open" because:

It's on the web
It's free to use
It's published under an open licence
It's published under a custom licence, which limits some types of use (usually commercial, often everything except personal)
It's published under an open licence, but we've not checked too deeply in whether we can do that
It's free to use, so long as you do so within our app or application
There's a restricted/limited access free version
There's documentation on how it works ...

21 Oct 2016 2:51pm GMT

Current gaps in the open data standards framework

In this post I want to highlight what I think are some fairly large gaps in the standards we have for publishing and consuming data on the web. My purpose for writing these down is to try and fill in gaps in my own knowledge, so leave a comment if you think I'm missing something (there's probably loads!). To define the scope of those standards, let's try and answer two questions. Question 1: What are the various activities that we might want to carry out around an open dataset? A. Discover the metadata and documentation about a dataset B. Download ...

21 Oct 2016 2:22pm GMT

17 Oct 2016

AKSW Colloquium, 17.10.2016, Version Control for RDF Triple Stores + NEED4Tweet

In the upcoming Colloquium, on October 17th at 3 PM, two papers will be presented.

Version Control for RDF Triple Stores

Marvin Frommhold will discuss the paper "Version Control for RDF Triple Stores" by Steve Cassidy and James Ballantine, which forms the foundation of his own work regarding versioning for RDF.

Abstract: RDF, the core data format for the Semantic Web, is increasingly being deployed both from automated sources and via human authoring either directly or through tools that generate RDF output. As individuals build up large amounts of RDF data and as groups begin to collaborate on authoring knowledge stores in RDF, ...

17 Oct 2016 7:55am GMT

14 Oct 2016

LIMES 1.0.0 Released

Dear all, the LIMES Dev team is happy to announce LIMES 1.0.0. LIMES, the Link Discovery Framework for Metric Spaces, is a link discovery framework for the Web of Data. It implements time-efficient approaches for large-scale link discovery based on the characteristics of metric spaces. Our approaches facilitate different approximation techniques to compute estimates of the similarity between instances. These estimates are then used to filter out a large number of those instance pairs that do not satisfy the mapping conditions. By these means, LIMES can reduce the number of comparisons needed during the mapping process by ...
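To make the filtering idea concrete, here is a small sketch of one standard way to estimate distances in a metric space: the triangle inequality gives a cheap lower bound on the distance between two instances from their distances to a shared reference point. This is an illustrative assumption, not LIMES code. The function name, the single `exemplar` reference point and the use of Euclidean distance are all hypothetical choices for this example.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two points (requires Python 3.8+)."""
    return math.dist(a, b)


def link_with_filter(source, target, exemplar, threshold, distance=euclidean):
    """Link source/target pairs whose distance is at most `threshold`.

    Returns (links, comparisons), where `comparisons` counts only the
    full source-to-target distance computations that were not filtered out.
    """
    # Distances from every target instance to the exemplar, computed once.
    target_to_exemplar = [distance(t, exemplar) for t in target]
    links = []
    comparisons = 0
    for s in source:
        s_to_exemplar = distance(s, exemplar)
        for t, t_to_exemplar in zip(target, target_to_exemplar):
            # Triangle inequality: d(s, t) >= |d(s, e) - d(t, e)|.
            # If this lower bound already exceeds the threshold,
            # the pair cannot match and is skipped.
            if abs(s_to_exemplar - t_to_exemplar) > threshold:
                continue
            comparisons += 1
            if distance(s, t) <= threshold:
                links.append((s, t))
    return links, comparisons


if __name__ == "__main__":
    source = [(0, 0), (10, 10)]
    target = [(0, 1), (10, 9), (5, 5)]
    print(link_with_filter(source, target, exemplar=(0, 0), threshold=1.5))
```

With these toy points, only two of the six possible pairs pass the filter and need a full comparison. The filter never discards a true match, because the bound it uses holds for any metric.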

14 Oct 2016 9:38am GMT

11 Oct 2016

DL-Learner 1.3 (Supervised Structured Machine Learning Framework) Released

Dear all, the Smart Data Analytics group at AKSW is happy to announce DL-Learner 1.3. DL-Learner is a framework containing algorithms for supervised machine learning in RDF and OWL. DL-Learner can use various RDF and OWL serialization formats as well as SPARQL endpoints as input, can connect to most popular OWL reasoners and is easily and flexibly configurable. It extends concepts of Inductive Logic Programming and Relational Learning to the Semantic Web in order to allow powerful data analysis.

Website: http://dl-learner.org
GitHub page: https://github.com/AKSW/DL-Learner
Download: https://github.com/AKSW/DL-Learner/releases
ChangeLog: http://dl-learner.org/development/changelog/

DL-Learner is used for data analysis tasks within other tools such as ...

11 Oct 2016 7:41pm GMT

07 Oct 2016

Mapping Datasets, Schema and Ontologies Using the Cognonto Mapper

There are many situations where we want to link named entities from two different datasets or to find duplicate entities to remove in a single dataset. The same is true for vocabulary terms or ontology classes that we want to integrate and map together. Sometimes we want to use such a linkage system to help save time when creating gold standards for named entity recognition tasks. There exist multiple data linkage and deduplication frameworks developed in several different programming languages. At Cognonto, we have our own system called the Cognonto Mapper. Most mapping frameworks work more or less the same ...

07 Oct 2016 12:20pm GMT

05 Oct 2016

OntoWiki 1.0.0 released

Dear Semantic Web and Linked Data Community, we are proud to finally announce the releases of OntoWiki 1.0.0 and the underlying Erfurt Framework in version 1.8.0. After 10 years of development we've decided to release the teenager OntoWiki from the cozy home of 0.x versions. Since the last release of 0.9.11 in January 2014 we did a lot of testing to stabilize OntoWiki's behavior and accordingly made a lot of bug fixes. We are also now using PHP Composer for dependency management, improved the testing workflow, gave a new structure and home to the documentation and we have ...

05 Oct 2016 2:50pm GMT

04 Oct 2016

Improving Machine Learning Tasks By Integrating Private Datasets

In the last decade, we have seen the emergence of two big families of datasets: the public and the private ones. Invaluable public datasets like Wikipedia, Wikidata, Open Corporates and others have been created and leveraged by organizations world-wide. However, as great as they are, most organizations still rely on private datasets of their own curated data. In this article, I want to demonstrate how high-value private datasets may be integrated into Cognonto's KBpedia knowledge base to produce a significant impact on the quality of the results of some machine learning tasks. To demonstrate this impact, I ...

04 Oct 2016 3:00pm GMT

02 Oct 2016

Read Write Web — Q3 Summary — 2016

Summary: The community group celebrates its 5th birthday this quarter. With almost 3000 posts (roughly 2 per day) from around 100 members, a large number of topics have been raised, discussed and resolved. A big thank you to everyone that has been involved! On the subject of statistics, there was a great paper produced by AKSW, "LODStats: The Data Web Census Dataset", which provides a comprehensive picture of the current state of a significant part of the Data Web. There was also a status update from the LDP Next Community Group and Data on the Web Best Practices is ...

02 Oct 2016 4:04pm GMT

28 Sep 2016

Using Cognonto to Generate Domain Specific word2vec Models

word2vec is a two-layer artificial neural network that processes a text corpus to learn the relationships between its words and produces a model of those relationships. The text corpus that a word2vec process uses to learn the relationships between words is called the training corpus. In this article I will show you how Cognonto's knowledge base can be used to automatically create highly accurate domain-specific training corpuses that can be used by word2vec to generate word relationship models. However, you have to understand that what is being discussed ...

28 Sep 2016 7:27pm GMT

25 Sep 2016

Semantic web semantics vs. vector embedding machine learning semantics

It's all semantics.

25 Sep 2016 4:01pm GMT

24 Sep 2016

Using Recurrent Neural Networks to Hallucinate New Model Army Lyrics

I decided to follow the example of

24 Sep 2016 3:36pm GMT

23 Sep 2016

Danish Bibliographic Centre (DBC) becomes DC-2016 Sponsor

2016-09-23, The Danish Bibliographic Centre (DBC) joins in supporting DC-2016 in Copenhagen as Sponsor of the Conference Delegate Bags. The DBC's main task in Denmark is the development and maintenance of the bibliographic and IT infrastructure of Danish libraries. The DBC handles registration of books, music, AV materials, Internet documents, articles and reviews in newspapers and magazines in the National Bibliography, develops Danbib, the Danish union catalogue, and the infrastructure for interlibrary loan. Danbib is comprised of the National Bibliography and the holdings of the libraries. DBC also develops bibliotek.dk - the citizen's access to all Danish publications and the ...

23 Sep 2016 11:59pm GMT