From a valid RDF/XML for Twitter users to a dynamic SPARQL Endpoint

This weekend I upgraded the semantic profiling framework. Now it annotates for every Twitter user:

  • Its profile as SIOC UserAccount
  • The timeline as SIOC(Types) MicroBlog
  • All the tweets as SIOC(Types) MicroBlogPost

It grabs the tweets from a user in Grabeeter if the user has registered there. If not they are being retrieved with the Twitter API.

Other ontologies that are used are the Dublin Core, FOAF and GeoNames.
The result for my own account:
A proof of the valid xml can be obtained on http://www.w3.org/RDF/Validator/. It is possible to generate a graph image based on this document.
Valid RDF/XML for a twitter user's annotated data

Valid RDF/XML for a Twitter user's annotated data

You can get your own Twitter account annotated in rdf/xml format on:
It takes about 3 minutes to extract about 12000 triples for a user with 2000 tweets. For now there is time limit on the script to prevent system crashing. A user with more than 30000 triples takes at the moment more than 5 minutes to generate the XML document. This is because the extraction algorithm is >O(n^2), hard to avoid because there are three layers in the profile:
for (every tweet) {
for (all properties) {
something >O(1)
}
}
The triples that are contained in these document will form a basis as a resource repository of people, represented as their Twitter identities. Another repository, currently under development, will keep track of conferences. The next step will be to create a dynamic SPARQL endpoint, such that Twitter user data is loaded on request. A way that could help doing so is by using the friends of a Twitter user. Basically every information request into the database will start from a user’s own profile. The system will then follow paths and if necessary load more users. Another possibility is using DERI Pipes to combine several resources for to support an information query.
Advertisements

About laurensdv
Computer Science Student, interested in creating more innovating user experiences for information access. Fond of travelling around Europe!

2 Responses to From a valid RDF/XML for Twitter users to a dynamic SPARQL Endpoint

  1. Pingback: SPARQL Endpoint set-up and load any twitter profile into the RDF Store « Laurens goes semantic…

  2. Pingback: SPARQL Endpoint set-up and load any twitter profile into the RDF Store | Laurens goes semantic

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

%d bloggers like this: