Google Fusion Tables: Databases on the Cloud
From the Google Research Blog: Google Fusion Tables.
Now it is possible to upload tabular data sets on Google, let other people use the data, and provide easy-to-use visualizations. No complicated joins or other heavy-duty relational stuff but there is functionality to connect (fuse) tables. There is also functionality embedded to discuss the contents of the data set.
Here is an early example. I took the data from a survey of Mechanical Turkers and imported it in Google Tables. Here is the resulting intensity map that shows the distribution of workers per country:
and the "lift" of the distribution of workers per state (we are comparing actual population percentage with percentage of Turkers):
I am truly excited about this feature. Just the idea that it will be possible to release "live" data sets, without having to set up complicated web interfaces, worrying about security, SQL injections, and so on, makes this absolutely wonderful for me.
For comparison, see the corresponding visualizations from Many Eyes:
But the flexibility of Google Tables for data management counters the relative lack of visualization options.
My only real complaint: The 100Mb limit. I was ready to upload my Mechanical Turk archive (see the related blog post) there, and let other people use it. Unfortunately, it is larger than the 100Mb limit. If only I could use the extra storage that I bought from Google for my Gmail account...

2 comments:
The Many Eyes applets are horribly buggy for me. Google wins this one.
Swivel.com is another nice data upload/viz site.
But none of them seem to be good for constantly updated datasets.
When I need to give someone lots of data it's usually through flatfiles -- or an SQL dump if there's significant structure -- behind an HTTP password. Cutting-edge technologies as of 1995. Well, sometimes I manually make a few summary histograms in R and write a HEADER.html. That's regressing to what, 1990 now...
I did not see an API for Google Tables so far, but I would like to believe that it is coming soon. Even better if we can see the App Engine datastore as tables with the ability to share and visualize.
Post a Comment