Wednesday, 23. August 2006

stop fumbling the semantic web, do science

A short note to myself and the community:

Stop fumbling around with the semantic web, make quality science.


A prototype for a search engine with a bad user interface, an implementation of a rdf database that only works half, an ontology that is never used, we all know these projects.

Our discipline is a crossover, we need results from artificial intelligence, web 2.0, usability, personalization, databases, data integration, software engineering . . .

So - science would be to concentrate on one aspect and then improve that, for example to fix yourself on a scaleable rdf database. You develop a scaleable algorithm and prove in a test setup that it works - voila. But then, it takes YEARS until you yourself or others can use this result in their other projects. Saying "we need named graphs" is far away from having an RDF store that supports them in a scaleable way, but the distance is often underestimated by us.

so, I should concentrate on writing down the good ideas we have and wait -YEARS- until I can benefit from my own ideas using software written by somebody else. Like TimBl using a Firefox.
;-)
QR barcode by i-nigma.com/CreateBarcodes
Cédric Mesnage (guest) - 24. Aug, 18:07

You are on the way

right way to go leo, you should then definitely join TF-Abstraction...

Wednesday, 16. August 2006

Photos from Strasbourg online

We took some photos in Strasbourg when we were there weeks ago. They are on flickr, my favorite is this here:

Strasbourg Cathedral

here i made it to ARTE! the great, entertaining, cultural TV station showing quality tv documentations and movies. etc...
leobard at ARTE! you're at home baby

Ingrid at L'Epicerie

L'Epicerie
QR barcode by i-nigma.com/CreateBarcodes

making sesame2 SPARQL protocol conformant

At the moment we need a SPARQL conformant interface to Sesame2, and as there is none I know of, the power of open source allows us to write one.
openrdf

My first question is: did somebody already write a Servlet that does map Sesame2 servers to the SPARQL protocol?


At the moment sesame2 does not support full SPARQL querying, but it will soon. We don't have to insist on SPARQL as query language, we can pass in SERQL queries and treat them like they were SPARQL, but we have to start with a conformant servlet :-)

Then some longer questions I also asked on the Sesame developer mailinglist:

We think that a SPARQL protocol conformant HTTP servlet is most important for any use of Sesame2 and are willing to invest 10 hours a week into this, more precisely, a clever student worker. We hope to get this done until the end of September.

We would implement a SPARQL protocol conformat query server and a SPARQL protocol conformant query client (issue tracker) for the reading operations of a HTTPSail. For updates of the model, we would stick to the current implementation of the latest CVS of sesame2.

want to know what the sparql protocol is?

I understand that these are MANY questions, I tried to think of all the calamities we are going to face in the next months. And I expect that some hackers out there already handled half of these questions, so don't hesitate to write me, or comment here, or to the sesame devel list.

* The org.openrdf.sesame.server.http.RepositoryServlet is not conformant to the SPARQL protocol,
as defined in the WSDL, or?
https://www.w3.org/TR/rdf-sparql-protocol/#query-bindings-http
the protocol described at org.openrdf.sesame.server.http.protocol.txt does not say anything about sparql

* If not, does anybody know how to generate stubs for the servlets automatically (so that they strictly conform to the protocol)?

* If not, we would examine the Jena / Joseki implementation, as it serves as reference implementation.

* When we implement a SPARQL conformat servlet - can we put it directly into the package org.openrdf.sesame.server.http.SparqlReadServlet, directly in the latest CVS, to have the best uptake and feedback possible?

* If yes, is there also a parser for query results, that can be used on the Client side HTTP sail to read results written by the server?

* What is the status of the HTTP Client? Did anybody do since we last mailed? if yes, please add comments to this ticket:
https://www.openrdf.org/issues/browse/SES-205

* Is the query string already part of the Query object? Jeen said this is a prerequisite for this hack. If not, Jeen: could you do this? This is such a core thing that I don't want to touch it and for you its probably only 50 lines of code.
I mean the solution 1) suggested here:
https://www.openrdf.org/issues/browse/SES-205#action_10533

* Can Sesame2 serialize Query results according to the SPARQL protocol?
I see the QueryResultFormat.SPARQL which would indicate that.

* last but not least: any news about SPARQL query support?

* Do you have a debug environment to test the existing servlets from org.openrdf.sesame.server.http?

* Does the WebClient work? (the code looks SOOO COOL! spring rocks)
- I cannot find any code in the webclient project that actually *changes* triples... hm.

* When Sebastian starts hacking, whom can he jabber/icq for help?
QR barcode by i-nigma.com/CreateBarcodes
leobard - 16. Aug, 17:37

a wiki page

on that is available here:
https://gnowsis.opendfki.de/wiki/SesameDeveloping

I will collect some thoughts on how to do it there.

Damian (guest) - 16. Aug, 17:53

Try joseki, arq, and sesame?

It isn't that difficult to plug sesame into jena as a graph. That should do the trick. You could also try hooking ARQ into sesame directly, although that's less trivial.

leobard - 21. Aug, 09:12

we tried jena and joseki

last year. The gnowsis 0.0 version, and 0.8 version both used Jena. The SPARQL support in Jena is quite a hack on top of the DataSet implementation. The problem here is that Jena doesn't support named graphs "naturally" in the Model / Graph abstraction layer but only on top. So, if you are unlucky, named graphs will take endlessly in evaluating queries.

We also stacked Jena and ARQ on top of Sesame, which is also useless, as you then have the SPARQL query dissected into find(spo) calls that get forwarded, whereas you would want the store to handle the whole query, etc.
https://gnowsis.opendfki.de/wiki/SesameToJena

A nice hack is Richard Cyganiak's sparql2sql, which was published here:
https://jena.sourceforge.net/sparql2sql/

We also tried it, but it has not such a big community as Sesame, and at the end the Jena Database mapping was - at that time - somehow "mediocre". The literals and resources in the object table are marked with "l:" and "r:" (or something similar), so every call to the db involved heavy string parsing .... they could've just separated into table columns, etc etc etc... rant .. rant ... slow ... slow ...

Sesame2 has named graphs in from bottom to top, and a transaction oriented architecture. I feel more comfortable there now. Lets see what happens this time .... ;-)
Jeen (guest) - 17. Aug, 16:03

Most of this stuff is actually already there in Sesame 2

I already replied in some detail sesame developers list to this
(see https://sourceforge.net/mailarchive/message.php?msg_id=36390015 ). but it can't hurt to repeat it in short here I guess: most of the stuff you mention is already there in Sesame 2. There is a servlet that implements SPARQL protocol (a superset actually), and there are parsers and writers for serialized query results (in SPARQL, binary and JSON format).

Damian, I would expect that using ARQ on top of Sesame would result in very bad performance for anything bigger than a toy example, really. For good query performance the engine really needs low-level access to the store...

speednews.it (guest) - 1. Sep, 16:24

speednews.it - web information directory


Saturday, 12. August 2006

lost in spam bei twoday

Comment und Trackback spam nervt genauso wie der Spießrutenlauf durch eine Fußgängerzone gefüllt mit Abo-Keilenden Greenpeace Activisten.

Worum gehts? Irgendwer will eine Porno Seite bewerben, und um im Google rank nach oben zu hopsen wird nun ein Haufen von etwa 100 links auf die Seite in meine Kommentare reingestellt, darüber steht die freundliche Meldung "Tolle Seite. Das Himmel ist Grün", also irgendwas das den Anschein ehrlicher Bewunderung meines bloggens zeigen soll. Die links zeigen am Anfang auf irgendwelche Weblogs oder unscheinbaren seiten (bei Trackback Spam ist das so), diese Seiten werden aber irgendwann entfernt und durch Porn ersetzt.

Alsdann, irgendwie kommt in meinem Blog immer noch Comment Spam und Trackback Spam durch und solange twoday.net das Problem nicht löst muss ich mir selber helfen.

Dazu hab ich mir eine seite gebaut, wo ich alle Kommentare in einer Liste sehe (der RSS feed) mit praktischen Links zum löschen der Kommentare.

https://www.dfki.uni-kl.de/~sauermann/2006/08/twoday/

falls du ein ähnliches Problem hast, vielleicht hilft dir das.
QR barcode by i-nigma.com/CreateBarcodes
medienjunkie2.0 - 17. Aug, 16:16

seit einer ganzen zeit hab ich nun keinen trackbackspam mehr und hoffe, dass das jetzt nicht wieder anfängt. =(

aber danke für den link. :-)

Tuesday, 8. August 2006

going to burningman-who will be there?

Ok, who from the Semantic Web guys and girls is going to Burning Man this year?

I am going, and we are blogging here about that: I tried to find you guys by using search engines, but I am not witty enought to get the right results, so blogging and asking this question is probably ok for today.

Here is the story how I tried to find you:

This time I swoogled the semantic web to find out who lives near SFO, which indicates that they may go to burningman.

My first try was to go to swoogle.com, a very interesting website where you may lose hours during your office time. Then I found the search engine at swoogle.umbc.edu.

I tried to search for:
  • ns:foaf SFO but this broke. Internal Server Error in /work/swoogle/www/swoogle/3.1/components/com_frontpage/writer.search.php on line 61
  • perhaps not search documents, search ontologies instead. SFO - no results
  • just entering francisco returned too much.
  • refining to ns:foaf francisco, zero.
  • at this point, I registered for an account at swoogle, perhaps then more. Hm, no.
Ok, perhaps the search syntax is wrong. Lets go to Intellidimension's Semantic Web Search engine at www.semanticwebsearch.com. ok, change of tactics: shoot straight at the target of burning man.
  • semanticwebsearch for burningman. Brings some RSS feeds. Ok, that are bloggers. I need semweb bloggers though.
  • perhaps some foaf person said something in interests or so? search, nothing.
  • with "burning man" we get two livejournal users.
ok, lets see what the market leader does. ok, so I didn't hear of these guys before and I see that this use case is interesting. Anyway:

Ok, who from the Semantic Web guys and girls is going to Burning Man this year?
QR barcode by i-nigma.com/CreateBarcodes
tim finin (guest) - 8. Aug, 13:58

Swoogle's down for the moment

Yesterday we discovered that the box that runs swoogle's web interface was compromised and being used to launch attacks on other systems. We also found that Swoogle's database server was being overwhelmed by queries. These probably are not related, but we've had to take Swoogle offline while we address the issues.

BTW, all of your searching on swoogle.com has been noted and is being integrated into AOL's search database, along with your interest in burningman.

leobard - 11. Aug, 22:03

sorry to hear that swoogle is down

that is sad news. Hope you do not have too many troubles. Looking at the comment spam in this blog here I also get nervous.

> BTW, all of your searching on swoogle.com
> has been noted and is being
> integrated into AOL's search database,
> along with your interest in burningman.

What should this tell me? All your base are belong to us? fnord.
nym (guest) - 10. Aug, 00:44

i'm going

my name is nym (tom longson) and i'm going. i should update my foaf to express my interest in black rock city.

will be in the broken staple camp, which is 4:00 and D... next to Pocket, which should be easier to find since a lot of DJs are in there.

hope to meet you!

leobard - 11. Aug, 22:15

hi Tom

I jot down the coordinates. We will hop by sometimes during the bm.

thanx god for the web.
nym (guest) - 13. Aug, 21:13

playainfo

i will put down my info in playainfo (near center camp), and will try to check my messages on there under 'nym'.

leobard, i suggest you do the same, playainfo is very cool (although still very much a closed system). i had aspirations in the past to make a competitive system, but they're already doing a huge amount of work.

see you in 14 days!

Monday, 7. August 2006

three of a perfect pair: gromgull's gnowsis improvements

Gunnar invested much time in the last week to improve his gnowsis experience - adding features he (and everybody else) missed.

read his post on everything that is now available in the gnowsis SVN!

a tag-cloud
tagcloud

a tagging bookmarklet
webtag

synchronising two gnowsis installations
sync
QR barcode by i-nigma.com/CreateBarcodes
icon

semantic weltbild 2.0

Building the Semantic Web is easier together

and then...

foaf explorer
foaf

Geo Visitors Map
I am a hard bloggin' scientist. Read the Manifesto.
www.flickr.com
lebard's photos More of lebard's photos
Skype Me™!

Search

 

Users Status

You are not logged in.

I support

Wikipedia Affiliate Button

Archive

September 2025
Sun
Mon
Tue
Wed
Thu
Fri
Sat
 
 1 
 2 
 3 
 4 
 5 
 6 
 7 
 8 
 9 
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
 
 
 
 
 
 
 

Credits


austriaca
Chucknorrism
digitalcouch
gnowsis
Jesus
NeueHeimat
route planning
SemWeb
travel
zoot
Profil
Logout
Subscribe Weblog
development