We are very happy to announce that Pundit has made it to the final round of the LODLAM (Linked Open Data in Libraries, Archives and Museums) challenge.
This means the Pundit team will be presenting the tool to semantically annotate text to an international crowd of linked open data professionals in Montréal. During the summit in June, the winner will be picked out of four finalists.
The Pundit tool is being developed by the Italian company Net7 as part of the DM2E project. It enables users to annotate web pages and create structured data. The annotations can be collected in notebooks and shared with others to create collaborative structured knowledge. Annotations span from simple comments to semantic links to the Web of Data (as Freebase.com and Dbpedia.org), to fine granular cross-references and citations.
A specialisation of the EDM for handwritten manuscripts
This month, we have finished the first operational version of our DM2E model (v1.0), a specialisation of the EDM for handwritten manuscripts. The ontology has been developed within Work Package 2 with a lot of input from others in the project. Especially results of extensive mapping workshops with our data providers were integrated into the model. Metadata of various formats like TEI, EAD and MARC21 was analysed and used to create new classes and properties that specialise the current EDM.
Screenshot from the DM2E model representation on Neologism.
How to get the DM2E model?
You can find the specification of the model…
… as a pdf file and in the owl ontology, which can both be downloaded from the documents section
Earlier this year we put out a call to humanities academics and technologists to see if they could come up with innovative ideas for small technology projects that would further humanities research by using open content, open data and/or open source.
We’re very pleased to announce that the winners are Dr Bernhard Haslhofer (University of Vienna) and Dr Robyn Adams (Centre for Editing Lives and Letters, University College London). Both winners will receive financial support to help them undertake the work they proposed and will be blogging about the progress of their project. You can follow their progress via the DM2E blog.
Award 1: Semantic tagging for old maps… and other things
The first Award goes to Dr Bernhard Haslhofer of Vienna University. His project will involve building on an open source web application he has been working on called Maphub.
Dr Haslhofer told us a little bit about the inspiration for his project:
“People love old maps” is a statement that we heard a lot from curators in libraries. This combined with the assumption that many people also have knowledge to share or stories to tell about historical maps, was our motivation to build Maphub.
In essence Maphub is an open source Web application that, first of all, pulls out digitized historical maps from closed environments, adds zooming functionality, and assigns Web URIs so that people can talk about them online. It also supports two main use cases:
(i) georeferencing maps by linking points on the map to Geonames locations;
(ii) commenting on maps or map regions by creating annotations. While users are entering their comments, Maphub analyzes the entered text on the fly and suggests so-called semantic tags, which the user accepts or rejects.
Semantic tags appear like “normal” tags on the user interface, but are in fact links to DBpedia resources. In that way, the user links her annotations and therefore also the underlying historical map with resources from two open data sources. Besides consuming open data during the annotation authoring process, Maphub also contributes collected knowledge back as open data by exposing all annotations following the W3C Open Annotation specification. In that way, Maphub supports people in a loop of using and producing open data in the context of historical maps.
Dr Haslhofer looks forward to seeing how collaborations will blossom between these various web annotation systems:
Annotorious and Maphub have common origins and the Open Humanities will support us in unifing parallel development streams into a single, reusable annotation tool that works for digitized maps but also for other media. We will also conduct another user study to inform the design of that function for other application contexts.
Award 2: Joined Up Early Modern Diplomacy: Linked Data from the Correspondence of Thomas Bodley
The second award goes to Dr Robyn Adams of Centre for Editing Lives and Letters, University College London. The project will re-purpose the open resource that Dr Adams has been building with a team of others: the Diplomatic Correspondence of Thomas Bodley.
The project will use ‘additional’ information that was encoded into the digitisation of early modern letters that took place at the Centre for Editing Lives and Letters. In the initial incarnation of the project this data which included biographical and geographical information contained within letters was not used (although it was encoded).
Dr Adams told us a little bit about what she plans on doing with the money from the Awards:
With the prize funding from the Open Humanities Awards, we propose to mine the data that was
generated but not fully used in the first phase of the project. This data is a rich source of
biographical and geographical information, the visualization of which evokes
the complex and diverse texture of the late sixteenth-century European
diplomatic and military landscape. Bodley’s position in The Hague as the only
English representative on the Dutch Council of State put him at the centre of a
heterogeneous nexus of correspondents a time long before the Republic of
Letters burgeoned in the subsequent century.
The project will interrogate three data fields within the larger data set of Bodley’s diplomatic
correspondence in order to generate visualizations; the network of correspondents
and recipients, and the people and places mentioned within the letters. These
visualizations will be incorporated into the project website, where they will
enhance and extend the knowledge derived from the existing corpus of
correspondence. The visualizations, which will have scope to be playful while
drawn from scrupulous scholarship, will offer an alternative pathway for
scholars and the interested public to understand that in this period
especially, the political, university and kinship networks were fundamental to
advancement and prosperity.
“In mapping the relational activity between data sets,” Dr Adams went on, “I hope to further illuminate and reanimate Bodley’s position within the Elizabethan compass. Furthermore, I hope to demonstrate that fruitful routes of enquiry can result if scholars commit to going the extra mile to encode and record data in their research that may not have immediate relevance to their own studies.”
On Thursday the 23d of May, the DM2E project and the The National library of the Netherlands will organise the first Pundit workshop focussed on the digital humanities. We invite humanities scholars to join us during this one day event to learn what Pundit is and how it can be used in humanities research.
The Pundit tool is being developed by the Italian company Net7 as part of the DM2E project. It enables users to annotate web pages and create structured data. The annotations can be collected in notebooks and shared with others to create collaborative structured knowledge. Annotations span from simple comments to semantic links to the Web of Data (as Freebase.com and Dbpedia.org), to fine granular cross-references and citations. For more information about the tool you can watch this video:
We will start the day with a series of presentation about the tool and the possibilities of it for humanities research. After lunch we will work with Pundit and see how it can help us answering research questions in a collaborative way. This session is being led by one of the lead-developers of Pundit and the Open Knowledge Foundation.
10:00-10:30 – coffee and registration
10:30-12:00 – Presentations.
Welcome and introduction to manuscript collection
Introduction into the potential of Linked Open Data in the humanities
Net7, The Open Knowledge Foundation and the DM2E project present the first Pundit Hackday: an open hackathon about Pundit, the novel semantic web annotation tool that is being developed as part of the DM2E project. Parallel we will run a users workshop.
To get an idea what Pundit is about, have a look at this new video:
This free event aims to bring together software developers, scholars and all kind of users to try to steer Pundit’s roadmap following crowdsourced suggestions and ideas, building useful and interesting proof of concepts or pure hacks.
An initial plenary brainstorming session will determine what the hackers will (try to) develop during the two days. A non-technical workshop will run alongside the coding, a sort of Pundit users meeting or focus group. In the end, a final meet-up on the afternoon of the 4th will be open to all, to see what hackers hacked!
Some ideas to get you started:
connect more linked data providers (VIAF? dbpedia.de/it/es? .. your own!)
Taking place on 10th June 2013 and hosted by the British Library the event will bring together leading digital humanists and cultural heritage professionals and will feature a keynote from Ted Nelson the inventor of hypertext.
Attendance is free, but places are limited. Sign-up here to guarantee yourself a place.
The programme includes:
A keynote by the inventor of hypertext, Ted Nelson
A dedicated Digital Humanities panel on modelling the humanities with Professor Stefan Gradmann (KU Leuven), Dr Tobias Blanke (Kings College London) and Dominic Oldman (Research Space, British Museum)
The European Digital Library and Linked Open Data with a presentation from Antoine Isaac
Hands-on workshops with DM2E’s flagship tool, Pundit, for semantically annotating and linking texts and images
We are excited to announce the first ever Open Humanities Awards. The are €15,000 worth of prizes on offer for 3-5 projects that use open content, open data or open source tools to further humanities teaching and research. Whether you’re interested in patterns of allusion in Aristotle, networks of correspondence in the Jewish Enlightenment or digitising public domain editions of Dante, we’d love to hear about the kinds of open projects that could support your interest!
Why are we running these Awards?
Humanities research is based on the interpretation and analysis of a wide variety of cultural artefacts including texts, images and audiovisual material. Much of this material is now freely and openly available on the internet enabling people to discover, connect and contextualise cultural artefacts in ways previously very difficult.
We want to make the most of this new opportunity by encouraging budding developers and humanities researchers to collaborate and start new projects that use this open content and data paving the way for a vibrant cultural and research commons to emerge.
Who can apply?
The Awards are open to any citizen of the EU.
Who is judging the Awards?
The Awards will be judged by a stellar cast of leading Digital Humanists:
On Monday the Digital Humanities Advisory Board, responsible for steering the research direction of the project and checking that the technical development on the project responds to the needs of scholars, met in Humboldt University Department for Library Sciences and Information Science.
Comments were taken on the current draft of the co-authored paper on the scholarly domain which will is being readied for publication.
Christian Morbidonni of Net7 updated the group on the progress that was being made with Korbo and Pundit. Demoing fantastic integrations of Pundit with LODLive and notebook functionalities, as well as improvement to the genera user experience.
Alois Pilcher of Bergen University took the stage to present the plans for the up-and-coming “Wittgenstein Incubator” in which scholars will use Pundit to annotate Wittgenstein’s Brown Book via Wittgenstein Source in order to gain further user feedback on the technical development of the annotation tool. More to follow very soon on this blog.
The day was rounded off with a brainstorming session on further modelling of the scholarly domain. A detailed look at was taken at the various meanings of modelling as Professor Gradmann honed in on the concept of modelling that stands at the heart of much of the DM2E Advisory Board’s work on this project:
During the DM2E meeting in Vienna, some exciting news was announced. Ontotext, a Bulgarian company that develops core semantic technology, text mining and web mining solutions will become an associated partner of the project. It was agreed that Ontotext will provide licenses to their OWLIM-SE triple store and will also provide technical assistance for installation and configuration to set up OWLIM-SE as a backend repository for the tools being developed within the DM2E project.
OWLIM is a family of semantic repositories, RDF database management system, implemented in Java, delivering full performance through both Sesame and Jena, robust support for the semantics of RDFS, OWL 2 RL and OWL 2 QL, best scalability, loading and query evaluation. More information about OWLIM is available at http://www.ontotext.com/owlim.
In the cultural heritage domain OWLIM powers the SPARQL end point of the British Museum, it was selected as the semantic repository of choice by the Yale Center of British Art, and Europeana SPARQL end point hosted at Ontotext , which allows access to the entire Europeana collection in EDM (close to 1 billion RDF statements) in a reason-able view, which employs inference and provides 4 times more (close to 4 billion) statements available to retrieve, is also powered by OWLIM.
DM2E aims at developing tools to enable as many providers as possible to get their data into Europeana, and to stimulate the creation of new tools and services for re-use of Europeana data in the Digital Humanities. OWLIM will provide a solid, scalable and powerful backend to allow the project to meet its objectives. In DM2E OWLIM will be used as a backend to the annotation tools KORBO and PUNDIT, developed by Net7, and as an overall data integration repository which will carry experiments with the semantic data produced during the project.
We are very excited about the possibilities Ontotext will add to DM2E and are looking forward to working together.
On Friday the 30th of December, all the Work Packages of the Digitised Manuscripts to Europeana project gathered in Vienna to discuss the results of the last 6 months.
Stefan Gradman presents the Scholarly Domain Model during the All-WP meeting in the Austrian National Library
The morning started with a presentation from all the Work Packages and was followed by more specific presentations about the tools being developed in the project.
WP1 – Content providers
The different content providers of DM2E have finished their first requirements report for WP2 and will continue delivering the digitised content and metadata to the project. In early January the Wittgenstein Incubator will start where a group of scholars will work on Wittgenstein’s Brown Book, being provided by the University of Bergen (WAB)
In WP2, many important progress has been made in the last six months. The goal of WP2 is to create a workflow for memory institutions to easily convert their metadata into EDM ready data.
First, Evelyn Dröge from the Humboldt University in Berlin presented the work they have been doing on developing an improved version of the European Data Model (EDM) and how they have worked on developing an EDM specification for manuscripts.
Kai Eckert from the University of Mannheim, now leading WP2, made a number of proposals how the improved EDM should be further developed in order to make it as usable as possible for both the user, as well as the memory institutions.
Stefan Gradman of the Humboldt University and leader of the project, presented the talk that he gave at the Leipzig Digital Humanities Seminar earlier in November. Here, he gave an extensive introduction into DM2E and its possibilites. He also presented the Scholarly Domain Model and how it will the change the way research is being done in the humanities.
It was good to see that all deliverables were achieved and especially the recent work of WP2 with the mapping of metadata into the new EDM was very impressive. In early 2013, DM2E will start with the testing of several of the tools being developed during the Wittgenstein pilot.