<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Tags and Whitespace &#8211; more feedback requested</title>
	<atom:link href="http://blog.surfulater.com/2008/07/16/tags-and-whitespace-more-feedback-requested/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.surfulater.com/2008/07/16/tags-and-whitespace-more-feedback-requested/</link>
	<description>Surfulater, the journey continues...</description>
	<lastBuildDate>Thu, 09 Feb 2012 01:22:49 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
	<item>
		<title>By: nevf</title>
		<link>http://blog.surfulater.com/2008/07/16/tags-and-whitespace-more-feedback-requested/#comment-61676</link>
		<dc:creator>nevf</dc:creator>
		<pubDate>Thu, 17 Jul 2008 22:14:55 +0000</pubDate>
		<guid isPermaLink="false">http://blog.surfulater.com/2008/07/16/tags-and-whitespace-more-feedback-requested/#comment-61676</guid>
		<description>@John,
&quot;Why not just take the tags to be whatever people enter between commas? Then no changes are needed to shift to multi-word tags later!&quot;

That is the plan. Avi and I discussed all of your feedback at length yesterday and decided to change to multi-word comma separated tags. 

The only question was when to do it, and in the end we decided it needs to be in the V3.0 release, else some folks get too used to space as a separator.

Neville</description>
		<content:encoded><![CDATA[<p>@John,<br />
&#8220;Why not just take the tags to be whatever people enter between commas? Then no changes are needed to shift to multi-word tags later!&#8221;</p>
<p>That is the plan. Avi and I discussed all of your feedback at length yesterday and decided to change to multi-word comma separated tags. </p>
<p>The only question was when to do it, and in the end we decided it needs to be in the V3.0 release, else some folks get too used to space as a separator.</p>
<p>Neville</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: avi</title>
		<link>http://blog.surfulater.com/2008/07/16/tags-and-whitespace-more-feedback-requested/#comment-61638</link>
		<dc:creator>avi</dc:creator>
		<pubDate>Thu, 17 Jul 2008 14:14:12 +0000</pubDate>
		<guid isPermaLink="false">http://blog.surfulater.com/2008/07/16/tags-and-whitespace-more-feedback-requested/#comment-61638</guid>
		<description>John,

The sentiments expressed by you here and most of the others, seem pretty strongly in favour of natural language, i.e. space is a valid character within a tag, and commas are natural separators. 

The *only* reasons to keep whitespace as a separator are:
- cross-compatibility with other systems that might not allow whitespace
- user experience with other systems that use whitespace as a delimiter

Neither of these seems sufficient to override the natural language and ability to use compound words that users want. 

In other words: we agree with you, and intend to keep whitespace as a valid character within the tag, not as a separator.

Avi</description>
		<content:encoded><![CDATA[<p>John,</p>
<p>The sentiments expressed by you here and most of the others, seem pretty strongly in favour of natural language, i.e. space is a valid character within a tag, and commas are natural separators. </p>
<p>The *only* reasons to keep whitespace as a separator are:<br />
- cross-compatibility with other systems that might not allow whitespace<br />
- user experience with other systems that use whitespace as a delimiter</p>
<p>Neither of these seems sufficient to override the natural language and ability to use compound words that users want. </p>
<p>In other words: we agree with you, and intend to keep whitespace as a valid character within the tag, not as a separator.</p>
<p>Avi</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John Hanna</title>
		<link>http://blog.surfulater.com/2008/07/16/tags-and-whitespace-more-feedback-requested/#comment-61637</link>
		<dc:creator>John Hanna</dc:creator>
		<pubDate>Thu, 17 Jul 2008 14:09:20 +0000</pubDate>
		<guid isPermaLink="false">http://blog.surfulater.com/2008/07/16/tags-and-whitespace-more-feedback-requested/#comment-61637</guid>
		<description>One more try...

The conclusion last expressed in @12  doesn&#039;t seem to make sense.

It suggests that people don&#039;t effectively use punctuation [&quot;when you *look* at a text field full of tags (as in Surfulater) to my eyes each word looks like a separate tag&quot;].

It will seem to keep us from using natural multi-word terms for tagging.

It will force us to use awkward and unnatural work-arounds like making the single-word tags from multiple words connected with some &quot;special&quot; characters (like the &quot;_&quot;, &quot;-&quot;, &quot;.&quot;, or capitalization shifts) that most everyone indicated they didn&#039;t like.


Why not just take the tags to be whatever people enter between commas? Then no changes are needed to shift to multi-word tags later!</description>
		<content:encoded><![CDATA[<p>One more try&#8230;</p>
<p>The conclusion last expressed in @12  doesn&#8217;t seem to make sense.</p>
<p>It suggests that people don&#8217;t effectively use punctuation ["when you *look* at a text field full of tags (as in Surfulater) to my eyes each word looks like a separate tag"].</p>
<p>It will seem to keep us from using natural multi-word terms for tagging.</p>
<p>It will force us to use awkward and unnatural work-arounds like making the single-word tags from multiple words connected with some &#8220;special&#8221; characters (like the &#8220;_&#8221;, &#8220;-&#8221;, &#8220;.&#8221;, or capitalization shifts) that most everyone indicated they didn&#8217;t like.</p>
<p>Why not just take the tags to be whatever people enter between commas? Then no changes are needed to shift to multi-word tags later!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: nevf</title>
		<link>http://blog.surfulater.com/2008/07/16/tags-and-whitespace-more-feedback-requested/#comment-61569</link>
		<dc:creator>nevf</dc:creator>
		<pubDate>Wed, 16 Jul 2008 22:09:44 +0000</pubDate>
		<guid isPermaLink="false">http://blog.surfulater.com/2008/07/16/tags-and-whitespace-more-feedback-requested/#comment-61569</guid>
		<description>Morning all, gray skies and probably rain ahead today, at least here at The Cape. 

For folks addressing me (Neville) here, note that Avi wrote this blog post, hopefully the first of many.

Before I started work on Tags I did a lot of research on tagging systems and how people use tags. You will find that I&#039;ve written more posts on the blog on tagging than on any other subject.

The bad news is there are no standard use cases for tags. Some allow single words, others multiple words. Some get you to enclose multiple words in quotes, others use a comma separator. Some only let you pick pre-defined tags from a list, others let you type them in and create new tags on the fly.

Rightly or wrongly most tagging systems only allow single words. Delicious is a stand-out example of this. My decision was therefore to start off with single words, and treat space as a separator, as this fitted in with popular use. 

Further when you are *typing* in tags, space sort of makes sense as a separator. The other comment I&#039;d make is when you *look* at a text field full of tags (as in Surfulater) to my eyes each word looks like a separate tag, versus words grouped by a comma separator. Finally as Avi wrote there are issues of compatibility with other tagging systems as we move forward.

Note that I said &quot;start&quot; with single words. If I&#039;d started with allowing multiple words and folks didn&#039;t want that, then going backwards to  single word tags would have been troublesome.

My personal preference aligns with most of you here and that is to allow multiple words and don&#039;t treat space as anything special. Further I would use comma as a separator as we do now, and not make this configurable. Blog followers will know I hate options.

Having said that, there remains a strong case for single word, space separated tags. There are no outright winners here.

PS. I loath the notion of auto-converting whitespace. Folks will think their keyboard is acting up and want to throw it against the wall.</description>
		<content:encoded><![CDATA[<p>Morning all, gray skies and probably rain ahead today, at least here at The Cape. </p>
<p>For folks addressing me (Neville) here, note that Avi wrote this blog post, hopefully the first of many.</p>
<p>Before I started work on Tags I did a lot of research on tagging systems and how people use tags. You will find that I&#8217;ve written more posts on the blog on tagging than on any other subject.</p>
<p>The bad news is there are no standard use cases for tags. Some allow single words, others multiple words. Some get you to enclose multiple words in quotes, others use a comma separator. Some only let you pick pre-defined tags from a list, others let you type them in and create new tags on the fly.</p>
<p>Rightly or wrongly most tagging systems only allow single words. Delicious is a stand-out example of this. My decision was therefore to start off with single words, and treat space as a separator, as this fitted in with popular use. </p>
<p>Further when you are *typing* in tags, space sort of makes sense as a separator. The other comment I&#8217;d make is when you *look* at a text field full of tags (as in Surfulater) to my eyes each word looks like a separate tag, versus words grouped by a comma separator. Finally as Avi wrote there are issues of compatibility with other tagging systems as we move forward.</p>
<p>Note that I said &#8220;start&#8221; with single words. If I&#8217;d started with allowing multiple words and folks didn&#8217;t want that, then going backwards to  single word tags would have been troublesome.</p>
<p>My personal preference aligns with most of you here and that is to allow multiple words and don&#8217;t treat space as anything special. Further I would use comma as a separator as we do now, and not make this configurable. Blog followers will know I hate options.</p>
<p>Having said that, there remains a strong case for single word, space separated tags. There are no outright winners here.</p>
<p>PS. I loath the notion of auto-converting whitespace. Folks will think their keyboard is acting up and want to throw it against the wall.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: barrabas45</title>
		<link>http://blog.surfulater.com/2008/07/16/tags-and-whitespace-more-feedback-requested/#comment-61548</link>
		<dc:creator>barrabas45</dc:creator>
		<pubDate>Wed, 16 Jul 2008 21:15:18 +0000</pubDate>
		<guid isPermaLink="false">http://blog.surfulater.com/2008/07/16/tags-and-whitespace-more-feedback-requested/#comment-61548</guid>
		<description>hi nev -- maybe i&#039;m missing something, but here&#039;s a question I thought of: 

I would ask: why does the conceptual differnce (between white space as separator vs non-separator) really matter? Arent separators conceptually kind of irrelevant when it comes to tags? 

Lets say I typed in &quot;sierra leone&quot;. 

scenario 1: White space is separator, and it goes in as two tags. Later on, I search for &#039;sierra leone&#039;, and it shows all results that have both sierra &quot;and&quot; leone, and thus the article i want is in the results list. (articles that only have &#039;sierra&#039; or only have &#039;leone&#039; as tags can perhaps show up in the results in a section called  &#039;close matches&#039;, if they show up at all).

Scenario 2: white space is not a separator, and it goes in as one tag, &#039;sierra leone&#039;. Later I do a search for &#039;sierra leone&#039;, and the article i want shows up in the results page. 

In either scenario, the article i&#039;m looking for shows up in the results page.  What am I missing? Is it a question of being able to prioritize the search results? 

well, If the white space question really does matter, then I for one would vote against either camelcase or underscores (or anything that moves TOO far away from natural language). Simpy.com for instance does tag separators with commas, which feels natural and is easy to type rapidly. I think metafilter does the same. 

But again, it seems to me that white space could just be a separator and the idea of linking &#039;sierra&#039; with &#039;leone&#039; is maybe less of an issue than it seems? Again, maybe i&#039;m missing something...</description>
		<content:encoded><![CDATA[<p>hi nev &#8212; maybe i&#8217;m missing something, but here&#8217;s a question I thought of: </p>
<p>I would ask: why does the conceptual differnce (between white space as separator vs non-separator) really matter? Arent separators conceptually kind of irrelevant when it comes to tags? </p>
<p>Lets say I typed in &#8220;sierra leone&#8221;. </p>
<p>scenario 1: White space is separator, and it goes in as two tags. Later on, I search for &#8216;sierra leone&#8217;, and it shows all results that have both sierra &#8220;and&#8221; leone, and thus the article i want is in the results list. (articles that only have &#8216;sierra&#8217; or only have &#8216;leone&#8217; as tags can perhaps show up in the results in a section called  &#8216;close matches&#8217;, if they show up at all).</p>
<p>Scenario 2: white space is not a separator, and it goes in as one tag, &#8216;sierra leone&#8217;. Later I do a search for &#8216;sierra leone&#8217;, and the article i want shows up in the results page. </p>
<p>In either scenario, the article i&#8217;m looking for shows up in the results page.  What am I missing? Is it a question of being able to prioritize the search results? </p>
<p>well, If the white space question really does matter, then I for one would vote against either camelcase or underscores (or anything that moves TOO far away from natural language). Simpy.com for instance does tag separators with commas, which feels natural and is easy to type rapidly. I think metafilter does the same. </p>
<p>But again, it seems to me that white space could just be a separator and the idea of linking &#8216;sierra&#8217; with &#8216;leone&#8217; is maybe less of an issue than it seems? Again, maybe i&#8217;m missing something&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: avi</title>
		<link>http://blog.surfulater.com/2008/07/16/tags-and-whitespace-more-feedback-requested/#comment-61529</link>
		<dc:creator>avi</dc:creator>
		<pubDate>Wed, 16 Jul 2008 13:16:48 +0000</pubDate>
		<guid isPermaLink="false">http://blog.surfulater.com/2008/07/16/tags-and-whitespace-more-feedback-requested/#comment-61529</guid>
		<description>I am glad that I put in the response comment above. People seem to be very strongly in favour of whitespace, and every other character, as legitimate in a tag, except for a well-defined separator, which would be a comma.

We agree wholeheartedly with the option comment. We do not want additional options, as it both violates KISS from a user perspective, but also greatly increases our support burden.

Avi</description>
		<content:encoded><![CDATA[<p>I am glad that I put in the response comment above. People seem to be very strongly in favour of whitespace, and every other character, as legitimate in a tag, except for a well-defined separator, which would be a comma.</p>
<p>We agree wholeheartedly with the option comment. We do not want additional options, as it both violates KISS from a user perspective, but also greatly increases our support burden.</p>
<p>Avi</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Craig Prichard</title>
		<link>http://blog.surfulater.com/2008/07/16/tags-and-whitespace-more-feedback-requested/#comment-61522</link>
		<dc:creator>Craig Prichard</dc:creator>
		<pubDate>Wed, 16 Jul 2008 12:24:46 +0000</pubDate>
		<guid isPermaLink="false">http://blog.surfulater.com/2008/07/16/tags-and-whitespace-more-feedback-requested/#comment-61522</guid>
		<description>White space is white space and should be ignored. Has anyone ever heard of a &quot;white space delimited file&quot;? No, comma delimited (CSV) is standard. &quot;Sierra Leone&quot; is a single tag. If you want to separate the words into individual tags, delimit them properly, e.g. with a comma. All leading and trailing spaces are always thrown out.

Yes, you could offer a configurable tag separator but why? All touch typists create lists separated by commas, tabs, or carriage returns. Surfulater doesn&#039;t need to be any fancier than that, in my humble opinion (BTW, I don&#039;t like acronyms either, ergo, no IMHO).

And auto-convert can be done but &quot;auto&quot; anything is always fraught with peril. Most of the time &quot;auto&quot; is acceptable. But when I don&#039;t want something to &quot;auto&quot; I must have a way to either prevent it or revert it. In the absence of a prevent or revert option I suggest avoiding &quot;auto&quot;. Besides, Sierra Leone is the country, not Sierra_Leone or SierraLeone. So my tags will be words/phrases I don&#039;t use in everyday life (&quot;natural language&quot;)? I don&#039;t think so. I agree with David Laing comment re: del.icio.us and with everything John Hanna wrote.

Start with the KISS principle before adding complexity. And multiple options is an absolute no-no.

Craig</description>
		<content:encoded><![CDATA[<p>White space is white space and should be ignored. Has anyone ever heard of a &#8220;white space delimited file&#8221;? No, comma delimited (CSV) is standard. &#8220;Sierra Leone&#8221; is a single tag. If you want to separate the words into individual tags, delimit them properly, e.g. with a comma. All leading and trailing spaces are always thrown out.</p>
<p>Yes, you could offer a configurable tag separator but why? All touch typists create lists separated by commas, tabs, or carriage returns. Surfulater doesn&#8217;t need to be any fancier than that, in my humble opinion (BTW, I don&#8217;t like acronyms either, ergo, no IMHO).</p>
<p>And auto-convert can be done but &#8220;auto&#8221; anything is always fraught with peril. Most of the time &#8220;auto&#8221; is acceptable. But when I don&#8217;t want something to &#8220;auto&#8221; I must have a way to either prevent it or revert it. In the absence of a prevent or revert option I suggest avoiding &#8220;auto&#8221;. Besides, Sierra Leone is the country, not Sierra_Leone or SierraLeone. So my tags will be words/phrases I don&#8217;t use in everyday life (&#8220;natural language&#8221;)? I don&#8217;t think so. I agree with David Laing comment re: del.icio.us and with everything John Hanna wrote.</p>
<p>Start with the KISS principle before adding complexity. And multiple options is an absolute no-no.</p>
<p>Craig</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Saltheart</title>
		<link>http://blog.surfulater.com/2008/07/16/tags-and-whitespace-more-feedback-requested/#comment-61507</link>
		<dc:creator>Saltheart</dc:creator>
		<pubDate>Wed, 16 Jul 2008 10:24:34 +0000</pubDate>
		<guid isPermaLink="false">http://blog.surfulater.com/2008/07/16/tags-and-whitespace-more-feedback-requested/#comment-61507</guid>
		<description>I just reread your intro and realise that underscores are NOT necessary at all and therefore I&#039;m repeating your suggested &#039;allow&#039; option (ie Gmail). John&#039;s idea of using a configurable separator makes it a nobrainer imho.</description>
		<content:encoded><![CDATA[<p>I just reread your intro and realise that underscores are NOT necessary at all and therefore I&#8217;m repeating your suggested &#8216;allow&#8217; option (ie Gmail). John&#8217;s idea of using a configurable separator makes it a nobrainer imho.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Saltheart</title>
		<link>http://blog.surfulater.com/2008/07/16/tags-and-whitespace-more-feedback-requested/#comment-61506</link>
		<dc:creator>Saltheart</dc:creator>
		<pubDate>Wed, 16 Jul 2008 10:16:08 +0000</pubDate>
		<guid isPermaLink="false">http://blog.surfulater.com/2008/07/16/tags-and-whitespace-more-feedback-requested/#comment-61506</guid>
		<description>Yes, it is desireable to allow whitespace. The choice of an encoding is less clear. From previous comments I take it that you intend the database to be independently accessible. If that is the case you want a simple encoding scheme like the underscore replacement. But that conflicts somewhat with other suggestions to allow the underscore character itself and even the punctuation characters used to separate tags. 

My take would be to allow any character in a tag, including whitespace embedded in a tag, except for a nominated separation character, such as a comma. You should also strip whitespace from the start/end of tags to avoid confusion.  

New questions: should tags be case-sensitive? what about other character sets?</description>
		<content:encoded><![CDATA[<p>Yes, it is desireable to allow whitespace. The choice of an encoding is less clear. From previous comments I take it that you intend the database to be independently accessible. If that is the case you want a simple encoding scheme like the underscore replacement. But that conflicts somewhat with other suggestions to allow the underscore character itself and even the punctuation characters used to separate tags. </p>
<p>My take would be to allow any character in a tag, including whitespace embedded in a tag, except for a nominated separation character, such as a comma. You should also strip whitespace from the start/end of tags to avoid confusion.  </p>
<p>New questions: should tags be case-sensitive? what about other character sets?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Joel</title>
		<link>http://blog.surfulater.com/2008/07/16/tags-and-whitespace-more-feedback-requested/#comment-61498</link>
		<dc:creator>Joel</dc:creator>
		<pubDate>Wed, 16 Jul 2008 07:37:46 +0000</pubDate>
		<guid isPermaLink="false">http://blog.surfulater.com/2008/07/16/tags-and-whitespace-more-feedback-requested/#comment-61498</guid>
		<description>I&#039;m for allowing white space, a la &quot;Sierra Leone&quot; on the basis that it&#039;s easier, more natural and more aesthetically pleasing. 

I&#039;m not sure where the compatibility issue comes into play, but there are already numerous other standards out there and Surfulater can&#039;t be compliant with all of them. So maybe the best approach is to have a utility to convert tags to other formats as required, e.g. convert all white space between delimiters to an underscore or convert the first character of any term following a white space to upper case and delete the white space (wiki-ize).</description>
		<content:encoded><![CDATA[<p>I&#8217;m for allowing white space, a la &#8220;Sierra Leone&#8221; on the basis that it&#8217;s easier, more natural and more aesthetically pleasing. </p>
<p>I&#8217;m not sure where the compatibility issue comes into play, but there are already numerous other standards out there and Surfulater can&#8217;t be compliant with all of them. So maybe the best approach is to have a utility to convert tags to other formats as required, e.g. convert all white space between delimiters to an underscore or convert the first character of any term following a white space to upper case and delete the white space (wiki-ize).</p>
]]></content:encoded>
	</item>
</channel>
</rss>

