Dr. Lloyd Rutledge, Ontology-driven data system creation software

Ontology-driven data system creation software
Lloyd Rutledge
TouW-dag, 21 March 2015

Semantic Web in a nutshell
• A unifiable world-wide database on the World Wide Web
  – Graph of node-edge triples is like one sparse table
• URIs as universal identifiers
• Open World Assumption
  – Current data collection may be incomplete
• No unique naming assumption
  – Multiple URIs for the same thing
• Much inferencing
  – Helps handle incomplete data
• Ontologies
  – Define data structure and inferencing

Fresnel Forms plugin for Protégé
• Protégé: top tool for editing a Semantic Web ontology
• Fresnel Forms plugin lets the user design an interface
  – Generates default system interface for ontology
  – GUI editor for interface specification
  – Fresnel is the standard for semantic browser interface specification
  – Interface export to semantic wiki
  – Interface is browsing and form-based annotation
• Educational context
  – Developed by two ABI projects, some functionality from a third
  – Principles from MDD and Business Rules
  – Protégé used in Semantic Web and MSC Rule based Design
  – Potential use by Master's thesis students

[Figure: Linked Data Cloud, 2007. Source: http://lod-cloud.net/]

Great contribution, but can't it be easier?
• Model-driven development
  – Start with ontology
  – Automatically generate default interface from it
  – Efficiently tailor that interface
• Form-based annotation
  – Instead of text editing template parameter code
  – Guides user to enter field values that fit ontology
• Direct export to Semantic Web

ABI 29: MDD Protégé-OWL
• Team
  – Thomas Brenninkmeijer, Protégé guru
  – Tim Zwanenberg, XForms/Fresnel guru
  – Harold Pootjes, supervisor
• Contribution
  – First version of the plugin for Protégé
  – Default and GUI for Fresnel
• Done! August 2014

ABI 30: Protégé Forms
• Team
  – Alex Mekkering, Protégé guru
  – Teun Theunissen, Fresnel guru
  – Joop van de Heijning, wiki guru
  – Rik Bos, supervisor
• Contribution
  – Now also export to wiki
  – Extend default and GUI functions
  – Extend Fresnel ontology
• Expected June 2015

Fresnel Forms plugin for Protégé: generating the default interface

Default interface for an ontology
(ontology construct: resulting interface feature)
• Domain: property-box assignment
• Class range: autocompletion; wiki link to form for page creation
• Data type range: box displays; entry fields
• Cardinality <, = or > 1: mandatory or optional; list or single

[Diagram: mapping from ontology constructs, through Fresnel, to MediaWiki and its extensions (components tagged MW, SMW, SF and OWF). The layout of the diagram did not survive transcription; the recoverable labels are grouped below.]
• Ontology: owl:Thing, owl:Ontology, owl:imports, owl:ObjectProperty, owl:DataProperty, owl:AnnotationProperty, :Class, :Property, rdf:type, rdf:List, :domain, :range, :subClassOf, :subPropertyOf, rdfs:isDefinedBy, rdfs:seeAlso, rdfs:label, skos:prefLabel, rdfs:comment, xml:lang, string literal, namespace, prefix, fragment identifier, foundation URIs
• Fresnel: :Lens, :classLensDomain, :showProperties, :hideProperties, :allProperties, :mergeProperties, :alternateProperties, :member, :label, :containerStyle (whole table), :resourceStyle, :propertyStyle (row), :labelStyle (left cell), :valueStyle (right cell), delimiters, CSS, pseudo-classes, link style; default Fresnel and cascading Fresnel; some Fresnel is not automatically generated from the ontology
• MediaWiki and extensions: Category:..., Template:..., Form:... and Property:... pages; [[Category:...]]; [[Has default form::...]]; [[Subproperty of::...]]; [[EquivalentURI::...]]; [[Imported from::...]]; #arraymap/list; delimiter=; class= for forms; autocomplete on category=...
• Resulting wiki behavior:
  – Wiki page is member of given category
  – User can enter sorted list as comma-delimited
  – Property recognized in wiki data
  – Data display table/template
  – Form for data entry
  – Clicking to new pages leads to given form for data entry
  – Wiki and queries recognize as subcategory
  – Queries using property also recognize superproperties
  – Properties in templates and forms are placed, removed, sorted
  – Values from multiple properties display as if from one
  – Pull-down list shows current pages in target class(es)
  – Entered as wiki page name; links to wiki page
  – Entered and displayed as unlinked string
  – Labels for property and class on templates and forms; mouseover on label shows description; language adapts to user
  – Put text: before, after, between, starting, ending, if empty
  – Check box to assign classes if no domain; each domainless property gets its own template and form
  – Ontology URIs used in RDF export
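
The default-interface rules listed above (domain assigns a property to a box, a class range gives autocompletion, a data type range gives an entry field, and cardinality decides mandatory or optional and list or single) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the names `OntologyProperty`, `FormField`, `default_field` and `default_interface`, and the sample data, are hypothetical and are not taken from the Fresnel Forms plugin.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class OntologyProperty:
    """Hypothetical, simplified view of one property in an ontology."""
    name: str
    domain: str                      # class whose form box shows the property
    range_: str                      # class name or data type name
    range_is_class: bool             # True for a class range, False for a data type
    min_card: int = 0                # minimum cardinality
    max_card: Optional[int] = None   # None means unbounded


@dataclass
class FormField:
    """Hypothetical description of one generated form field."""
    box: str
    label: str
    widget: str                      # "autocomplete" or "entry"
    mandatory: bool
    multiple: bool


def default_field(prop: OntologyProperty) -> FormField:
    # Class range -> autocompletion on class members; data type range -> entry field.
    widget = "autocomplete" if prop.range_is_class else "entry"
    return FormField(
        box=prop.domain,                      # domain -> property-box assignment
        label=prop.name,
        widget=widget,
        mandatory=prop.min_card >= 1,         # min cardinality 1 -> mandatory
        multiple=prop.max_card is None or prop.max_card > 1,  # max > 1 -> list
    )


def default_interface(props: List[OntologyProperty]) -> Dict[str, List[FormField]]:
    # Group the generated fields per domain class, keeping the input order.
    boxes: Dict[str, List[FormField]] = {}
    for prop in props:
        boxes.setdefault(prop.domain, []).append(default_field(prop))
    return boxes


# Made-up sample data for illustration only.
sample_props = [
    OntologyProperty("title", "Album", "xsd:string", False, min_card=1, max_card=1),
    OntologyProperty("track", "Album", "Track", True, min_card=0, max_card=None),
]
interface = default_interface(sample_props)
```

In this sketch the result is a plain dictionary from class name to field list; a real generator would emit Fresnel lenses and wiki forms instead, which is outside the scope of this illustration.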

Fresnel: RDF interface in RDF
[Image source: http://en.wikipedia.org/wiki/Fresnel_lens]
• Specifies how RDF is displayed
• Separates style from content: like CSS for XML
• Also in RDF
• Supported by multiple semantic browsers such as Longwell
http://www.w3.org/2005/04/fresnel-info/manual/
http://dig.csail.mit.edu/2006/Talks/0724-fresnel/#%2814%29

[Figures: domain-independent default for browser (Longwell); the same with (handmade) Fresnel style]

[Figure: Wikipedia next to Fresnel Forms, default]

Default browser
• Property order not normalized
• Property display for type
• No tailored CSS
• Simple two-column table

Default form
• Order same as browser
• Shows all properties
  – Browser shows only those with values
• Input fields special for data types
• Cardinality min/max = 1
  – Optional or mandatory
  – List or single
• Autocompletion to class members

Falco Paul's BPMIT Master's thesis
• What is the best default property order?
• Compares algorithms for sorting ontology properties in browsers
• Wikipedia infoboxes are the truth set
• Simple heuristic outperforms various text-processing algorithms
• An ABI student's research context assignment will apply this to Fresnel Forms
• The resulting Fresnel Forms makes a better default interface
• Especially for Wikipedia, albeit with a chicken-and-egg paradox

Editing the interface with Fresnel Forms

[Figure: Wikipedia next to Fresnel Forms, edited]

Edited browser
• Property order easy to change
• CSS
  – Fits in Fresnel standard
  – <div>-based better than table
• "Awards" is more page link than data annotation
• Maybe with SPARQL queries
  – Age calculation
  – Honorific acronyms
• Ordering of multiple values works
  – But via the wiki code
  – Not via Semantic Web

That's fine for one infobox, but what about all the others*?
• Fresnel Forms scales to process the entire DBpedia ontology
  – Default GUI display appears in a few minutes
  – Fresnel generation takes about half an hour
  – Protégé on a special server makes the wiki in about an hour
• The new, improved Wikipedia interface is right here on my laptop
  – Jimmy Wales just needs to import the wiki XML
  – And install a few Semantic MediaWiki extensions
  – Existing infoboxes share the same foundational syntax
    • Wikibots can automate some simple cleanup
    • And/or modified existing infobox templates can map to new ones
  – DBpedia ontology needs to be up-to-date
  – Default interface could use some tailoring
* ~500 from the Fresnel Forms algorithm applied to the DBpedia ontology

Paper submission by the ABI teams

Fresnel Forms summary
• Efficient building of a data system interface for a given ontology
• Wikipedia infoboxes can be rebuilt with this approach and tool
• Valorization: the OU made Wikipedia resilient
• … and now some research context …

Future work: the same, but with rules
• Apply the course Rule based Design
  – Relation algebra logic
  – Ampersand tool
• Relation algebra constraints plus Semantic Web inferencing
• Master's thesis students already busy with Ampersand to Semantic Web case study conversions
• Potential software
  – Automatic conversion from Ampersand to Semantic Web plus Fresnel
  – New Protégé plugin adds relation algebra rules to ontology
    • Plus reasoner in Protégé?
  – Fresnel Forms generates wiki that handles rules as constraints
    • Like the Ampersand wiki
• And then with BI

Semantic Web in OU education
• Bachelor's
  – Elective (vrije ruimte) course Semantic Web
  – ABI projects
• Master's
  – 2009: Capita Selecta theme: Semantic Web
  – Maybe: Ontwerpen van Bedrijfsregels (Designing Business Rules) for MCS
    • OBR assignment, but on Protégé
  – Graduation topics
• Ph.D.
  – You?

Model-driven development
[Figure, from the OU course T37121 Model-driven development workbook: business rules (behavior) plus an information model give, through generation, a default interface specification. The example information model is Album (no., title, number of tracks) with a 1-to-many relation to Track (album, sequence no., title, composer); the generated forms show the fields Album no., Title, Count and Album, Sequence no., Title, Composer name.]
[Figure: from the OU course T64211 Semantic Web workbook and textbooks]

Ontwerpen met Bedrijfsregels (Designing with Business Rules): Ampersand
    RULE "Toegelaten identificatiemiddelen": vogAanvragerIDMSoort~;vogAanvragerIDMSoort | 'Paspoort' \/ 'IDkaart' \/ 'Rijbewijs'
    MEANING "Voor het identificeren van de aanvrager met fysieke middelen zijn toegestaan: Paspoort, IDkaart, Rijbewijs"
    MESSAGE "U mag hier alleen 'Paspoort', 'IDkaart' of 'Rijbewijs' invullen."
(Rule name: "Permitted means of identification". Meaning: "For identifying the applicant by physical means, the following are permitted: passport, ID card, driving licence." Message: "You may only enter 'Paspoort', 'IDkaart' or 'Rijbewijs' here.")
From Pim Bos's Master's thesis

From Ampersand to Semantic Web
• Ampersand
    gbaBSN :: NatuurlijkPersoon -> BurgerServiceNummer [INJ,UNI] --burgerservicenummer ingeschrevene;
  (the comment reads: citizen service number of the registered person)
• Semantic Web / Protégé: [figure not transcribed]
From Pim Bos's Master's thesis

Master's graduates in logic
• Pim Bos, BPMIT
  – Existing official legal text (VOG), Ampersand in Protégé
  – Done!
• Azar Lalmohamed, BPMIT
  – Existing health legal text (HIPAA) in Ampersand
  – Done!
• Pieter Slootweg, BPMIT
  – HIPAA in Protégé
• Enno Beversluis, BPMIT
  – Scheduling administration in BPMN and SBVR, then in Protégé

Business Intelligence
[Image source: http://www.rutgerbregman.nl/diplomafabriek.html]
• Kishan Shri, BPMIT
  – BI maturity models for hospitals
  – Done!
• Jurrian Wijffels, BPMIT
  – Success factors for BI dashboards in education
  – 14 April!
• Gert-Jan den Besten, ABI
  – Learning Analytics for the OU on SpagoBI
• Justian Knobbout, Ph.D.
  – Learning Analytics for formative tests
  – With Johan Versendaal, BISS

Semantic wikis, ABI and Master's CS
• Semantic Friendly Forms, ABI team 10
  – Rens te Kloese, Henri Gerrits and Mark Rotteveel
  – Default interface from an ontology entered on the wiki itself
• Rineke Oostenrijk, Master's CS
  – CSA: OWL inference on semantic wikis, part of a publication
  – Thesis: Semantic Search for research documents
  – 14 April!
• Astrid Ramakers, Master's CS
  – Semantic wiki for multilingual semantic registries

Semantic wikis, BPMIT
• Falco Paul, BPMIT
  – Automating property order on Wikipedia
  – Done!
• Hein Koops, BPMIT
  – Semantic wiki for education process management
• Henk Langbroek, BPMIT
  – Semantic wiki for formative tests