<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Then each went to his own home &#187; RawSugar</title>
	<atom:link href="http://www.pui.ch/phred/archives/category/rawsugar/feed" rel="self" type="application/rss+xml" />
	<link>http://www.pui.ch/phred</link>
	<description>Philipp Kellers weblog</description>
	<lastBuildDate>Wed, 15 Dec 2010 12:37:04 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>RawSugar: post mortem analysis</title>
		<link>http://www.pui.ch/phred/archives/2007/01/rawsugar-post-mortem-analysis.html</link>
		<comments>http://www.pui.ch/phred/archives/2007/01/rawsugar-post-mortem-analysis.html#comments</comments>
		<pubDate>Sun, 14 Jan 2007 15:41:31 +0000</pubDate>
		<dc:creator>Philipp Keller</dc:creator>
				<category><![CDATA[RawSugar]]></category>
		<category><![CDATA[Simpy]]></category>

		<guid isPermaLink="false">http://www.pui.ch/phred/archives/2007/01/rawsugar-post-mortem-analysis.html</guid>
		<description><![CDATA[As various blogs already posted, RawSugar has closed its research and development efforts. That is, the service will still be up for a while but there won&#8217;t me new features or bugfixes (see Rawsugars announcement).

I&#8217;m sad to hear this as I was a happy user of RawSugar. It was the only tag based service that [...]]]></description>
			<content:encoded><![CDATA[<p>As <a href="http://www.techcrunch.com/2006/12/30/rawsugar-in-deadpool/">various</a> <a href="http://mashable.com/2006/12/30/rawsugar-crashes-sells-assets/">blogs</a> already posted, <a href="http://www.rawsugar.com">RawSugar</a> has closed its research and development efforts. That is, the service will still be up for a while but there won&#8217;t me new features or bugfixes (see <a href="http://blog.rawsugar.com/blog/?p=64">Rawsugars announcement</a>).</p>
<p><span id="more-44"></span></p>
<p>I&#8217;m sad to hear this as I was a happy user of RawSugar. It was the only tag based service that didn&#8217;t just copy delicious to add a new features or polished design.<br />
Apart from a few academic papers and proof of concept applications <a href="http://www.pui.ch/phred/archives/2005/11/how-tagging-could-gain-ground.html">I never saw anything new in the tagging world</a> until Rawsugar came up with its implementation of tag clustering and auto-tagging. These features probably need some more polish but they were steps into the right direction.</p>
<p>It&#8217;s not hard to decide where I should move all my bookmarks: <a href="http://www.simpy.com">Simpy</a> has nice search features, a friendly user base and a very likeable <a href="http://www.simpy.com/user/otis">developer</a>. Otis does care for Rawsugar users and implemented an <a href="http://blog.simpy.com/blojsom/blog/news/2007/01/07/Simpy-Welcomes-RawSugar-Users.html">importer</a>.</p>
<p>Apart from that decision I&#8217;m left to ask: &quot;what was wrong with RawSugar?&quot;. I think this is an important question as tagging is web 2.0 and web 2.0 most certainly is another bubble going to pop somewhen &#8211; the question then is who is going to stay.</p>
<h3>what was wrong?</h3>
<p>Knowing that I&#8217;m not in the best position to analyze, I have a go figuring out why this project failed. Some of these objections <a href="http://www.pui.ch/phred/archives/2006/10/tagsessibility.html">I already mentioned</a>, some can be found reading <a href="http://www.techcrunch.com/2006/12/30/rawsugar-in-deadpool/#comments">some (sometimes very mean: has techcrunch turned into digg?) comments</a>.</p>
<h4>user interface</h4>
<p>The web site has always been too cluttered. The page structure even grew with the number of features. Rawsugars biggest competitor, <a href="http://del.icio.us">delicious</a>,  was always very reluctant in exposing new features to the user.<br />
Then its whole look doesn&#8217;t really fit into the web 2.0 world (Ok, <a href="http://del.icio.us">delicious</a> isn&#8217;t that much better, before the redesign about a year ago it looked like plain xhtml without any styling). It&#8217;s not about rounded corners, it&#8217;s about freshness. RawSugar always felt a little bit heavy (although it had the best <a href="http://www.pui.ch/phred/archives/2006/10/tagsessibility.html">response times</a>).</p>
<h4>interaction with community</h4>
<p>I always felt like RawSugar doesn&#8217;t have the early adopters in mind: For questions it did use a forum, its developers weren&#8217;t really involved in the blogosphere. But then they wanted to gain ground in the blogosphere. It very much focused on a feature called &quot;tag search for your blog&quot; (you can see this feature in action when you switch to the <a href="http://www.pui.ch/phred/">index view of this blog</a>, it is a very good feature indeed). This just didn&#8217;t really fit. It didn&#8217;t feel they were &quot;of us&quot;. I hope you get what I mean &#8211; I cannot really describe it</p>
<h4>ah well..</h4>
<p>
Maybe these arguments are all in vain. Why did delicious take off? Because it was the first social bookmark service (in fact it wasn&#8217;t)? There are many things Rawsugar did right. Why didn&#8217;t the user base grow more substantially? I&#8217;m keen to hear your thoughts!
</p>
<p>Anyway &#8211; best luck to all the guys involved in RawSugar. Especially Frank &#8211; I did have a great time with you preparing our talk for www2006!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.pui.ch/phred/archives/2007/01/rawsugar-post-mortem-analysis.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Tagsessibility</title>
		<link>http://www.pui.ch/phred/archives/2006/10/tagsessibility.html</link>
		<comments>http://www.pui.ch/phred/archives/2006/10/tagsessibility.html#comments</comments>
		<pubDate>Tue, 31 Oct 2006 12:38:18 +0000</pubDate>
		<dc:creator>Philipp Keller</dc:creator>
				<category><![CDATA[Del.icio.us]]></category>
		<category><![CDATA[RawSugar]]></category>
		<category><![CDATA[Simpy]]></category>
		<category><![CDATA[Web Search]]></category>

		<guid isPermaLink="false">http://www.pui.ch/phred/archives/2006/10/tagsessibility.html</guid>
		<description><![CDATA[Imagine you have a closet where you store all your documents. Each time you want to archive an important document you tell your closet: &#8220;put that under important&#8221;, a magic hand, coming out of the closet, takes your document and puts it into its immense pile of documents. The other day you are in a [...]]]></description>
			<content:encoded><![CDATA[<p>Imagine you have a closet where you store all your documents. Each time you want to archive an important document you tell your closet: &#8220;put that under important&#8221;, a magic hand, coming out of the closet, takes your document and puts it into its immense pile of documents. The other day you are in a hurry to be on time to your next meeting. You need that important document from yesterday so you ask your closet: &#8220;please show me all important documents&#8221;. Then you hear printing and rustling in the closet until, after half a minute, 20 magic hands stick out of your closet, each one holding a document, some of your documents have sticky notes on it saying: &#8220;this is not filed “important” but it is similar to a document you filed important&#8221;, another sticky note says: &#8220;these are all the other categories you put your documents in&#8221; and every document you really filed “important” has got a sticky note on the number of coworker who filed this document under &#8220;important&#8221;. You say to your self: Tomorrow I&#8217;ll reanimate my pile &#8220;important documents&#8221; that was on my desktop before they put this silly closet into my office.</p>
<p>That&#8217;s the feeling that arises when I think of all the bookmark services out there that ought to file urls I somehow find notworthy so that I can quickly recall them afterwards: There&#8217;s too much clutter and the services are just too slow. Therefore I again begin to save my bookmarks at other places: In firefox or in some text documents lying somewhere on my hard drive (probably I should tag them?).<span id="more-43"></span></p>
<h3>«I want it now»</h3>
<p>As for myself, if I don&#8217;t get my data within a fraction of a second, I don&#8217;t feel that I have good access to my data. I think one of the best things of Google Web search is that it is incredible fast. Tag services are not. That&#8217;s a pity because within tag services I search my lowly amount of bookmarks (around 2000 bookmarks at that moment) whereas with Google I search the whole net. I often find myself at the situation searching for a url on Google rather than looking it up on my bookmark service. Even if I have to do 3 Google searches &#8211; I feel at control. If I have to wait for 3 seconds for an answer, that&#8217;s simply too much.</p>
<p>An experiment: I ran <code>for ((i=0;i&lt;10;i++)) do wget -q --user-agent="Mozille Firefox 1.9" $url; done;</code> for the three big players in tagging services: <a id="__CONK_181" href="http://del.icio.us"><span style="border: 1px solid gray; padding: 0pt; color: black; background-color: lightgray; font-weight: normal; font-family: sans-serif; font-size: small; text-align: center; -moz-border-radius-topleft: 0.5em; -moz-border-radius-topright: 0.5em; -moz-border-radius-bottomright: 0.5em; -moz-border-radius-bottomleft: 0.5em; display: none">181</span>del.icio.us</a>, <a id="__CONK_182" href="http://www.simpy.com"><span style="border: 1px solid gray; padding: 0pt; color: black; background-color: lightgray; font-weight: normal; font-family: sans-serif; font-size: small; text-align: center; -moz-border-radius-topleft: 0.5em; -moz-border-radius-topright: 0.5em; -moz-border-radius-bottomright: 0.5em; -moz-border-radius-bottomleft: 0.5em; display: none">182</span>Simpy</a> and <a id="__CONK_183" href="http://www.rawsugar.com"><span style="border: 1px solid gray; padding: 0pt; color: black; background-color: lightgray; font-weight: normal; font-family: sans-serif; font-size: small; text-align: center; -moz-border-radius-topleft: 0.5em; -moz-border-radius-topright: 0.5em; -moz-border-radius-bottomright: 0.5em; -moz-border-radius-bottomleft: 0.5em; display: none">183</span>Rawsugar</a>. For $url I chose a page displaying all bookmarks saved with a certain tag. I also ran a query on Google for that &#8220;tag&#8221;. The numbers are number of seconds per query. The queries were run on different hosts with different internet connection, all in Switzerland.</p>
<table>
<thead>
<th>service</th>
<th>Sun 18:00 CET</th>
<th>Mon 18:50 CET</th>
<th>Tue 09:00 CET</th>
<th>Tue 12:40 CET</th>
<th><strong>Average</strong></th>
</thead>
<tbody>
<tr>
<td>Delicious</td>
<td>2.1s</td>
<td>2.3s</td>
<td>2.6s</td>
<td>2.4s</td>
<td><strong>2.4s</strong></td>
</tr>
<tr>
<td>Simpy</td>
<td>3s</td>
<td>6s</td>
<td>2.7s</td>
<td>3.3s</td>
<td><strong>3.8s</strong></td>
</tr>
<tr>
<td>Rawsugar</td>
<td>1s</td>
<td>1.5s</td>
<td>1s</td>
<td>1.1s</td>
<td><strong>1.2s</strong></td>
</tr>
<tr>
<td>Google</td>
<td>0.2s</td>
<td>0.5s</td>
<td>0.3s</td>
<td>0.3s</td>
<td><strong>0.3s</strong></td>
</tr>
</tbody>
</table>
<p>I&#8217;m happy to see that with Rawsugar I&#8217;m with the fastest bookmark service (2 times faster than Delicious, 3 times faster than Simpy). On the other hand: all the bookmark services are at least 4 times slower than Google. I know that Google has set a high mark &#8211; but emotionally that&#8217;s the response time I&#8217;d like to have when querying for my data.</p>
<h3>«I just want my bookmarks»</h3>
<p>Have a look at the &#8220;result areas&#8221; (highlighted in green) of the three bookmark services.</p>
<div class="caption"><img alt="Delicious results" id="__CONK_186" src="/phred/images/delicious_results.png" /><br />
<strong>Delicious: Result area in green</strong></div>
<div class="caption"><img alt="Simpy results" id="__CONK_187" src="/phred/images/simpy_results.png" /><br />
<strong>Simpy: Result area in green</strong></div>
<div class="caption"><img alt="Rawsugar results" id="__CONK_188" src="/phred/images/rawsugar_results.png" /><br />
<strong>Rawsugar: Result area in green</strong></div>
<p>To come back to the comparison with the closet: Even if there are 20 magic hands with documents sticking out of the closet, it is crucial that the most important document is the one closest to my face. That means, when using those bookmark services, my eye should first notice the most important link. My opinion: The service that solves this best is Delicious. The results appear on the leftmost part of my screen and the result area covers 64% of the screen. Regarding that matter Rawsugar is far worse: My eye has to search for the first result. I find it natural to start reading at the left of the screen. But at Rawsugar there are so many links in the head of the page and then the left column helps me to refine the result &#8211; a great feature but at the wrong place &#8211; furthermore: if I want just get a link I <strong>know</strong> I filed under &#8220;important&#8221;, I don&#8217;t need this at all. Even Delicious&#8217; &#8220;saved by xxx other people&#8221; &#8211; shaped in different colorings &#8211; is too much clutter for me. Maybe I&#8217;m a puritan but &#8211; what the heck &#8211; I just want my 9 bookmarks!</p>
<table>
<thead>
<th>service</th>
<th>result area ratio</th>
</thead>
<tbody>
<tr>
<td>Delicious</td>
<td>63.6%</td>
</tr>
<tr>
<td>Simpy</td>
<td>60.2%</td>
</tr>
<tr>
<td>Rawsugar</td>
<td>51.2%</td>
</tr>
</tbody>
</table>
<h3>Beyond the criticism</h3>
<p>Sorry to only criticize. I like those services a lot. They are helping organizing links and they are for free. To turn this post into a constructive comment, two possible solutions to the problems mentioned.</p>
<h4>Bookmark services offer a «Minimal mode»</h4>
<p>Yes, tagging is about collaboration. But I fear the personal value of tagging is too small &#8211; people might leave tagging services because they don&#8217;t feel their bookmark problem solved. There&#8217;s too much network and too many features. I propose a &#8220;minimal mode&#8221;, a result page that just shows <code>select * from bookmarks where user="phred" and one_of_its_tags="important"</code>. In firefox there are <a id="__CONK_184" href="http://www.mozilla.org/products/firefox/smart-keywords.html"><span style="border: 1px solid gray; padding: 0pt; color: black; background-color: lightgray; font-weight: normal; font-family: sans-serif; font-size: small; text-align: center; -moz-border-radius-topleft: 0.5em; -moz-border-radius-topright: 0.5em; -moz-border-radius-bottomright: 0.5em; -moz-border-radius-bottomleft: 0.5em; display: none">184</span>Smart Keywords</a> also known as &#8220;quick search&#8221;. I set up &#8220;myr&#8221; to search for my rawsugar bookmarks. Typing &#8220;myr important&#8221; into my location bar I end up on www.rawsugar.com/links/phred/important. If there would be a site on rawsugar that displays the same results just in this minimal mode, I would bend my smart keyword to this result page and would be happy.</p>
<h4>Someone codes a «Tag agent»</h4>
<p>I imagine a &#8220;tag agent&#8221; that has incredible response time and no clutter. It could either get the results via rss/api from the bookmark service or it could hold all my tagged bookmarks in it&#8217;s own database. Such an agent could be installed on a password-protected part of my website so I can access it from wherever I am. The problem is: I don&#8217;t have time to write an application my own. I have thought about such an application, I&#8217;ve got plenty of ideas how it should look but I&#8217;m afraid I won&#8217;t find time to code such an application. I repeatedly get emails from people writing yet another thesis on collaborative categorisation. Instead of writing about insights on the mental actions taking place while tagging &#8211; do the tagging world a favour and write such an agent. If there&#8217;s such a tag agent already, please leave a comment.</p>
<p><span style="border: 1px solid gray; padding: 0pt; color: black; background-color: pink; font-weight: normal; font-family: sans-serif; font-size: small; text-align: center; -moz-border-radius-topleft: 0.5em; -moz-border-radius-topright: 0.5em; -moz-border-radius-bottomright: 0.5em; -moz-border-radius-bottomleft: 0.5em; display: none; left: 4px; top: 227px; position: absolute; opacity: 0.8; z-index: 999">185</span><span style="border: 1px solid gray; padding: 0pt; color: black; background-color: pink; font-weight: normal; font-family: sans-serif; font-size: small; text-align: center; -moz-border-radius-topleft: 0.5em; -moz-border-radius-topright: 0.5em; -moz-border-radius-bottomright: 0.5em; -moz-border-radius-bottomleft: 0.5em; display: none; left: 4px; top: 698px; position: absolute; opacity: 0.8; z-index: 999">186</span><span style="border: 1px solid gray; padding: 0pt; color: black; background-color: pink; font-weight: normal; font-family: sans-serif; font-size: small; text-align: center; -moz-border-radius-topleft: 0.5em; -moz-border-radius-topright: 0.5em; -moz-border-radius-bottomright: 0.5em; -moz-border-radius-bottomleft: 0.5em; display: none; left: 4px; top: 967px; position: absolute; opacity: 0.8; z-index: 999">187</span><span style="border: 1px solid gray; padding: 0pt; color: black; background-color: pink; font-weight: normal; font-family: sans-serif; font-size: small; text-align: center; -moz-border-radius-topleft: 0.5em; -moz-border-radius-topright: 0.5em; -moz-border-radius-bottomright: 0.5em; -moz-border-radius-bottomleft: 0.5em; display: none; left: 4px; top: 1237px; position: absolute; opacity: 0.8; z-index: 999">188</span></p>
]]></content:encoded>
			<wfw:commentRss>http://www.pui.ch/phred/archives/2006/10/tagsessibility.html/feed</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Automated tag clustering</title>
		<link>http://www.pui.ch/phred/archives/2006/07/automated-tag-clustering.html</link>
		<comments>http://www.pui.ch/phred/archives/2006/07/automated-tag-clustering.html#comments</comments>
		<pubDate>Tue, 11 Jul 2006 06:03:37 +0000</pubDate>
		<dc:creator>Philipp Keller</dc:creator>
				<category><![CDATA[Clustering]]></category>
		<category><![CDATA[Del.icio.us]]></category>
		<category><![CDATA[RawSugar]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Tags]]></category>

		<guid isPermaLink="false">http://www.pui.ch/phred/archives/2006/07/automated_tag_clustering.html</guid>
		<description><![CDATA[Grigory Begelman (Technion &#8211; Israel Institute of Technology Computer Science Dpt), Frank Smadja (RawSugar) and I did a paper for www2006 called &#8220;automated tag clustering&#8221;. It deals with why clustering the tag space makes sense and how this could be done.
After the presentation at the tagging workshop at www2006 we felt the need to give [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.cs.technion.ac.il/%7Egbeg/">Grigory Begelman</a> (<a href="http://www.cs.technion.ac.il/">Technion &#8211; Israel Institute of Technology Computer Science Dpt)</a>, <a href="http://smadja.us/">Frank Smadja</a> (<a href="http://www.rawsugar.com/">RawSugar</a>) and I did a paper for <a href="http://www2006.org">www2006</a> called &#8220;automated tag clustering&#8221;. It deals with why clustering the tag space makes sense and how this could be done.</p>
<p>After the presentation at the <a href="http://blog.rawsugar.com/wikka/wikka.php?wakka=HomePage">tagging workshop</a> at www2006 we felt the need to give our paper a more www-friendly, I-don&#8217;t-want-to-read-through-those-theoretical-equation-flooded-papers face.</p>
<p>So, here you go: <a href="http://www.pui.ch/phred/automated_tag_clustering/">Automated Tag Clustering: Improving search and exploration in the tag space</a>. To read this document you should have a clue what tags are about, you should also know some tag services as <a href="http://del.icio.us">delicious</a> or <a href="http://www.flickr.com">flickr</a> so you can understand the limitations these services currently have. <span id="more-41"></span><a href="http://www.pui.ch/phred/automated_tag_clustering/#cluster"><img title="clustering the tag space" alt="clustering the tag space" id="image42" src="http://www.pui.ch/phred/wp-content/uploads/2006/07/clusters.png" /></a>If you don&#8217;t want to read through the whole papers, the numerous figures give you a good summary. Finally, to wet your appetite, here a few excerpts of the document:</p>
<blockquote><p>Currently tagging services still provide a relatively marginal value for information discovery and we claim that with the use of clustering techniques this can be greatly improved [from <a href="http://www.pui.ch/phred/automated_tag_clustering/#p_motivation">introduction</a>]</p></blockquote>
<blockquote><p>The whole promise of collaborative tagging is that by exploring the tag space you can discover a lot of useful information you would not find with traditional search engines.  When your information need is not well defined, the idea that you can explore and see what other people tagged with certain tags is very attractive. We believe that tagging will be able to reach a very wide audience only when exploration techniques will be effective. [from <a href="http://www.pui.ch/phred/automated_tag_clustering/#p_exploration">limited exploration</a>]</p></blockquote>
<blockquote><p>Although a great visualization paradigm, we believe that with today&#8217;s tagclouds it is hard to find more than one or two tags to click on. Tags are not grouped, there is too much information, so that you find lot of related tags scattered on the tag cloud.  One or two popular topics and all their related tags tend to dominate the whole cloud.  For example, looking at the del.icio.us tagcloud, one would mostly see tags related to web design and technologies. This is because these topics are overwhelmingly more frequent than anything else. [from <a href="http://www.pui.ch/phred/automated_tag_clustering/#p_exploration">limited exploration</a>]</p></blockquote>
<blockquote><p>Tag <em>web2.0</em> nowadays is so popular and is combined wildly with anything. In fact this tag is so overused that if you look at <a href="http://del.icio.us/tag/bookmarks">tag <em>bookmarks</em> in the del.icio.us dataset</a>, the most used cotag is <em>web2.0</em>[...]. Basing tag similarity on these numbers often doesn&#8217;t make sense at all. The similarity measure should be chosen so the popularity of a tag doesn&#8217;t affect the set of a tags related tags. Don&#8217;t cut the <a href="http://en.wikipedia.org/wiki/Long_tail">long tail</a>. The success of blogs is driven by the importance of the long tail. We all know that it is crucial to support the niches. Tagging applications should empower the long tail too. If you just sort by popularity, you&#8217;d loose all those niches. [from <a href="http://www.pui.ch/phred/automated_tag_clustering/#p_similarity">choosing a similarity measure</a>]</p></blockquote>
<p>We&#8217;d be happy to get any kind of feedback on the article. Just post a comment to this blog post.</p>
<p><strong>Edit (4 years later!)</strong>: A few guys asked me about the source code: <a href="http://pastie.org/1098455">Source code with syntax highlighting</a>, <a href="http://www.pui.ch/phred/archives/cluster.py">download</a>.<br />
You need <a href="http://people.sc.fsu.edu/~jburkardt/c_src/kmetis/kmetis.html">kmetis</a> to make this run, see <code>usage()</code> to see how it should be used.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.pui.ch/phred/archives/2006/07/automated-tag-clustering.html/feed</wfw:commentRss>
		<slash:comments>17</slash:comments>
		</item>
	</channel>
</rss>

