<?xml version='1.0' encoding='utf-8' standalone='yes'?>
<?xml-stylesheet type='text/xsl' href='/style/atom-comments.xsl'?>
<feed xmlns='http://www.w3.org/2005/Atom'>
<title>norman.walsh.name: Comments on /2008/02/07/xml105e</title>
<link rel='alternate' type='text/html' href='http://norman.walsh.name/2008/02/07/xml105e'/>
<id>http://norman.walsh.name/2008/02/07/xml105e/comments.atom</id>
<updated>2008-02-25T16:23:54Z</updated>

<entry xmlns:foaf='http://xmlns.com/foaf/0.1/'>
<title>Comment 1 on /2008/02/07/xml105e</title>
<link rel='alternate' type='text/html' href='http://norman.walsh.name/2008/02/07/xml105e#comment0001'/>
<id>http://norman.walsh.name/2008/02/07/xml105e#comment0001</id>
<published>2008-02-07T19:12:05Z</published>
<updated>2008-02-07T19:12:05Z</updated>
<author>
  <name>Mark Nottingham</name>
  <foaf:mbox_sha1sum>ea1818ad55858b4b9429b0a0e00bd8ae25c09791</foaf:mbox_sha1sum>
  <uri>http://www.mnot.net/</uri>
</author>
<content type='xhtml'><div xmlns="http://www.w3.org/1999/xhtml"><p>Isn't it a bad idea to refer to what's essentially a moving target?
</p><p>
Let's imagine that implementer X produces a parser based upon Unicode 5.0.0, and it gets *really* popular; it's the dominant parser shipped in OSs, used in lots of code around the world, and so on.
</p><p>
Then, implementer Y ships a XML generation implementation (maybe an editor) based upon Unicode 5.1.0 or something, and they produce documents that contain characters which X's implementation knows nothing about. 
</p><p>
Both conform to 5th edition, but it doesn't look like they'll interoperate. I'm hardly a Unicode expert -- what am I missing here?</p></div></content>
</entry>

<entry xmlns:foaf='http://xmlns.com/foaf/0.1/'>
<title>Comment 2 on /2008/02/07/xml105e</title>
<link rel='alternate' type='text/html' href='http://norman.walsh.name/2008/02/07/xml105e#comment0002'/>
<id>http://norman.walsh.name/2008/02/07/xml105e#comment0002</id>
<published>2008-02-07T21:11:58Z</published>
<updated>2008-02-07T21:11:58Z</updated>
<author>
  <name>John Cowan</name>
  <foaf:mbox_sha1sum>02e728ba6f922a41e37329c3a52b7fa0a787765f</foaf:mbox_sha1sum>
  <uri>http://www.ccil.org/~cowan</uri>
</author>
<content type='xhtml'><div xmlns="http://www.w3.org/1999/xhtml"><p>Norm: Technically, certain well-formed but invalid documents become valid: where the DTD says an attribute is of type NMTOKEN, it couldn't validly contain a character not in Unicode 2.0, whereas under the 5th Edition rules that would (for the most part) be valid.</p>

<p>Mark: X and Y <i>will</i> interoperate, because in 5th Edition (like 1.1 before it), any Unicode code point not explicitly forbidden is permitted, including many that are not assigned.  So even if X's parser does not recognize the characters used in a document shipped by Y's software, it knows whether or not they are legal in names.

</p><p>I've just <a rel='nofollow' href="http://recycledknowledge.blogspot.com/2008/02/which-characters-are-excluded-in-xml.html">blogged</a> about the details of which characters are forbidden and why.</p></div></content>
</entry>

<entry xmlns:foaf='http://xmlns.com/foaf/0.1/'>
<title>Comment 3 on /2008/02/07/xml105e</title>
<link rel='alternate' type='text/html' href='http://norman.walsh.name/2008/02/07/xml105e#comment0003'/>
<id>http://norman.walsh.name/2008/02/07/xml105e#comment0003</id>
<published>2008-02-07T21:35:11Z</published>
<updated>2008-02-07T21:35:11Z</updated>
<author>
  <name>Norman Walsh</name>
  <foaf:mbox_sha1sum>9f5c771a25733700b2f96af4f8e6f35c9b0ad327</foaf:mbox_sha1sum>
  <uri>http://norman.walsh.name/</uri>
</author>
<content type='xhtml'><div xmlns="http://www.w3.org/1999/xhtml"><p>Thank you, John. Good point. I was thinking mostly of documents becoming invalid, but that's not what I said.
</p><p>
Thank you also for answering Mark's question before I got to it :-)</p></div></content>
</entry>

<entry xmlns:foaf='http://xmlns.com/foaf/0.1/'>
<title>Comment 4 on /2008/02/07/xml105e</title>
<link rel='alternate' type='text/html' href='http://norman.walsh.name/2008/02/07/xml105e#comment0004'/>
<id>http://norman.walsh.name/2008/02/07/xml105e#comment0004</id>
<published>2008-02-08T00:11:26Z</published>
<updated>2008-02-08T00:11:26Z</updated>
<author>
  <name>David Carlisle</name>
  <foaf:mbox_sha1sum>1d1b1ca2e25fe15c3b091a924df14357aed8b199</foaf:mbox_sha1sum>
  <uri>http://www.dpcarlisle.blogspot.com/</uri>
</author>
<content type='xhtml'><div xmlns="http://www.w3.org/1999/xhtml"><p>It's not just xml parsers that need to change, it's any software that implements a spec that references the Name production. XPath (1 and 2)  XQuery, XSD and Relax NG Systems would all need to change. Worse if any of them (such as Xpath2 at least) reference undated versions of the XML spec then the fact that the W3C is planning to make an incompatible change _in place_ at 
http://www.w3.org/TR/REC-xml/
means that currently conforming XPath parsers that reject Unicode 5 characters in QNames, or XSLT systems that reject these characters in template names, will become non conforming, even though the software and the xpath/xslt specifications have not changed.
</p><p>
This is bad.
</p><p>
The change is in fact a good one, but you should call it what it is, XML 1.2 (or 2.0) and deprecate XML 1.1.
If the world is not willing to follow you this time, so be it, but don't try to trick them into following you by passing this off as an editorial "edition".
</p><p>
Yes I know I should moan to the offical comment list not your blog, but it's your blog entry that popped up on my screen first...
</p><p>
David</p></div></content>
</entry>

<entry xmlns:foaf='http://xmlns.com/foaf/0.1/'>
<title>Comment 5 on /2008/02/07/xml105e</title>
<link rel='alternate' type='text/html' href='http://norman.walsh.name/2008/02/07/xml105e#comment0005'/>
<id>http://norman.walsh.name/2008/02/07/xml105e#comment0005</id>
<published>2008-02-08T01:29:01Z</published>
<updated>2008-02-08T01:29:01Z</updated>
<author>
  <name>Michael Day</name>
  <foaf:mbox_sha1sum>f2ecff4d3f408aeea35b3073bfa80143e0dcf464</foaf:mbox_sha1sum>
  <uri>http://yeslogic.com/mikeday/</uri>
</author>
<content type='xhtml'><div xmlns="http://www.w3.org/1999/xhtml"><p>
Yay, I think this is a brilliant solution, so much so that I <a rel='nofollow' href="http://mail.gnome.org/archives/xml/2007-June/msg00073.html">proposed it</a> last year:
</p>
<p><i>The only part of XML 1.1 that everyone agrees is reasonable is the 
support for more scripts in name characters, and avoiding explicitly 
specifying the set of UNICODE characters that can be used in names.</i></p>
<p><i>So, why not introduce a fifth edition of XML 1.0 that expands the set of 
characters that may be used as name characters. This would be backwards 
compatible, as every well-formed XML 1.0 document would remain legal. It 
would allow the use of more scripts in XML markup, like Mongolian. 
Parsers could support it quite easily, and the change would be unlikely 
to confuse applications.</i></p>
<p><i>Then admit that XML 1.1 was an experiment that failed, and has not 
achieved widespread interoperable implementation, and deprecate it.</i></p>
<p>(By the way Norm, why can't I use &lt;blockquote&gt; in comments? :)
</p></div></content>
</entry>

<entry xmlns:foaf='http://xmlns.com/foaf/0.1/'>
<title>Comment 6 on /2008/02/07/xml105e</title>
<link rel='alternate' type='text/html' href='http://norman.walsh.name/2008/02/07/xml105e#comment0006'/>
<id>http://norman.walsh.name/2008/02/07/xml105e#comment0006</id>
<published>2008-02-08T01:44:22Z</published>
<updated>2008-02-08T01:44:22Z</updated>
<author>
  <name>Mark Nottingham</name>
  <foaf:mbox_sha1sum>ea1818ad55858b4b9429b0a0e00bd8ae25c09791</foaf:mbox_sha1sum>
  <uri>http://www.mnot.net/</uri>
</author>
<content type='xhtml'><div xmlns="http://www.w3.org/1999/xhtml"><p>OK, so the risk seems like it's restricted to implementation Z (of, say, the 3rd edition) coming across a 5th edition document and blowing up, correct?
</p><p>
I agree that this doesn't seem like an erratum...</p></div></content>
</entry>

<entry xmlns:foaf='http://xmlns.com/foaf/0.1/'>
<title>Comment 7 on /2008/02/07/xml105e</title>
<link rel='alternate' type='text/html' href='http://norman.walsh.name/2008/02/07/xml105e#comment0007'/>
<id>http://norman.walsh.name/2008/02/07/xml105e#comment0007</id>
<published>2008-02-10T00:21:55Z</published>
<updated>2008-02-10T00:21:55Z</updated>
<author>
  <name>James Abley</name>
  <foaf:mbox_sha1sum>ceb5b79966bb43fd49bced8c0822de129a37cd66</foaf:mbox_sha1sum>
  <uri>http://eternusuk.blogspot.com/</uri>
</author>
<content type='xhtml'><div xmlns="http://www.w3.org/1999/xhtml"><p>Thanks for highlighting this change. Submitted feedback to the working group, but I'm pretty aligned with David Carlisle (and yourself?) on this one.</p></div></content>
</entry>

<entry xmlns:foaf='http://xmlns.com/foaf/0.1/'>
<title>Comment 8 on /2008/02/07/xml105e</title>
<link rel='alternate' type='text/html' href='http://norman.walsh.name/2008/02/07/xml105e#comment0008'/>
<id>http://norman.walsh.name/2008/02/07/xml105e#comment0008</id>
<published>2008-02-15T17:38:36Z</published>
<updated>2008-02-15T17:38:36Z</updated>
<author>
  <name>Michael Kay</name>
  <foaf:mbox_sha1sum>8bf4af8c10a4eac0238ca4b343d5a1b54b84f9c9</foaf:mbox_sha1sum>
  <uri>http://www.saxonica.com/</uri>
</author>
<content type='xhtml'><div xmlns="http://www.w3.org/1999/xhtml"><p>All my years of software engineering and quality assurance training tell me this is bad, bad, bad. If you want to change something in a significant way, you should give it a new version number.
</p><p>
And yet: all my years of experience tell me that sometimes it's in everyone's best interests to ignore the rules; don't let process get in the way of giving users what they want.
</p><p>
XML 1.1 failed because the cost of the change was too high for the community as a whole, given that 99% of the community gets zero benefit. This proposal means that the suffering will fall more squarely on the shoulders of those who actually need the change.
</p><p>
So, with deep reluctance, I think I'm not going to stand in front of the tanks.</p></div></content>
</entry>

<entry xmlns:foaf='http://xmlns.com/foaf/0.1/'>
<title>Comment 9 on /2008/02/07/xml105e</title>
<link rel='alternate' type='text/html' href='http://norman.walsh.name/2008/02/07/xml105e#comment0009'/>
<id>http://norman.walsh.name/2008/02/07/xml105e#comment0009</id>
<published>2008-02-16T02:43:46Z</published>
<updated>2008-02-16T02:43:46Z</updated>
<author>
  <name>Elliotte Rusty Harold</name>
  <foaf:mbox_sha1sum>fcc9a2c3412d8d046a24619e3aa59dadeec7dc91</foaf:mbox_sha1sum>
  <uri>http://www.cafeconleche.org/</uri>
</author>
<content type='xhtml'><div xmlns="http://www.w3.org/1999/xhtml">This is a blatant abuse of the W3C process, though sadly not without precedent. The XML Core working group got away with this chicanery before when the namespaces spec was revised to map the <code>xmlns</code> prefix to a namespace.
<p> 

I didn't like that, but this is just beyond the pale. If this goes through, I suspect I will completely lose faith in the W3V as a fair and honest maintainer of standards. Frankly, if we can't rely on the stability of the base specs, then  I think it may be time to give up on XML completely. :-(</p><p></p></div></content>
</entry>

<entry xmlns:foaf='http://xmlns.com/foaf/0.1/'>
<title>Comment 10 on /2008/02/07/xml105e</title>
<link rel='alternate' type='text/html' href='http://norman.walsh.name/2008/02/07/xml105e#comment0010'/>
<id>http://norman.walsh.name/2008/02/07/xml105e#comment0010</id>
<published>2008-02-16T23:23:23Z</published>
<updated>2008-02-16T23:23:23Z</updated>
<author>
  <name>John Cowan</name>
  <foaf:mbox_sha1sum>02e728ba6f922a41e37329c3a52b7fa0a787765f</foaf:mbox_sha1sum>
  <uri>http://www.ccil.org/~cowan</uri>
</author>
<content type='xhtml'><div xmlns="http://www.w3.org/1999/xhtml"><p>David, Elliotte:
</p><p>
It's about justice, not about process.  I'll use any process, fair or foul, to eliminate the blatant injustice that says people who speak English or French get to use their languages (or the official languages of their countries) to define XML schemas (lower-case "s"), and people who speak Cantonese or Khmer don't, because of an accident of Unicode history.  (Romanians do get to use their language, but only if they are willing to misspell it just slightly.)  <i>That</i> is what would be unfair and dishonest.
</p><p>
Saying "If the world won't follow you, so be it" is as much as to say "The majority rules, and devil take the minorities."  That's a world I'm not willing to live in without trying to change.
</p><p>
As for giving up on XML, where you gonna go?  Like democracy, it is a terrible idea, but better than any alternative.</p></div></content>
</entry>

<entry xmlns:foaf='http://xmlns.com/foaf/0.1/'>
<title>Comment 11 on /2008/02/07/xml105e</title>
<link rel='alternate' type='text/html' href='http://norman.walsh.name/2008/02/07/xml105e#comment0011'/>
<id>http://norman.walsh.name/2008/02/07/xml105e#comment0011</id>
<published>2008-02-18T05:53:23Z</published>
<updated>2008-02-18T05:53:23Z</updated>
<author>
  <name>David Webber</name>
  <foaf:mbox_sha1sum>6260e10a3a990e6158baea0719360548ce58e22c</foaf:mbox_sha1sum>
  <uri>http://wiki.oasis-open.org/cam</uri>
</author>
<content type='xhtml'><div xmlns="http://www.w3.org/1999/xhtml"><p>Norm, seems to me the reality check here is that for 98%+ of the world this is a big snooze.  I cannot see them changing from their cozy UTF-8 land and XML that uses that explicitly.
</p><p>
If someone is using UTF-xxx that supports this UNICODE 5,+ stuff - they are going to take special means to alert their partners that they need software capable of supporting this.
</p><p>
I mean I cannot see eBay, Amazon, Google and Wal-Mart suddenly sending out UNICODE 5+ content that noone can open in their browsers, right?!?
</p><p>
So I expect this one to pass muster - without too much fuss and mush - but it does make for good headlines and copy material.
</p><p>
Cheers, DW</p></div></content>
</entry>

<entry xmlns:foaf='http://xmlns.com/foaf/0.1/'>
<title>Comment 12 on /2008/02/07/xml105e</title>
<link rel='alternate' type='text/html' href='http://norman.walsh.name/2008/02/07/xml105e#comment0012'/>
<id>http://norman.walsh.name/2008/02/07/xml105e#comment0012</id>
<published>2008-02-18T08:29:03Z</published>
<updated>2008-02-18T08:29:03Z</updated>
<author>
  <name>Buck "Marbux" Martin</name>
  <foaf:mbox_sha1sum>d6dafa322fe0a3b144337fa66ac2da9449863e0c</foaf:mbox_sha1sum>
  <uri>http://www.universal-interop-council.org</uri>
</author>
<content type='xhtml'><div xmlns="http://www.w3.org/1999/xhtml"><p>The perceived process violation may be illusory, depending on whether the change is regarded as a bug fix or as a new feature. I'll be a bit lengthy here because I think the discussion might remove the the procedural issue from the table.
</p><p>
On the fair vs. foul means issue, it should be recognized that a norm once bent to achieve a particular result is no longer a norm. There is a bit of text borrowed by the U.S. Supreme Court in its landmark <a rel='nofollow' href="http://caselaw.lp.findlaw.com/scripts/getcase.pl?court=US&amp;vol=437&amp;invol=153">Tennessee Valley Authority v. Hill</a> Endangered Species Act decision regarding Tellico Dam and the snail darter that eloquently explains the danger of bending behavioral norms to achieve a desired result:
</p><p>
"The law, Roper, the law. I know what's legal, not what's right. And I'll stick to what's legal. . . . I'm not God. The currents and eddies of right and wrong, which you find such plain-sailing, I can't navigate, I'm no voyager. But in the thickets of the law, oh there I'm a forester. . . . What would you do? Cut a great road through the law to get after the Devil? . . . And when the last law was down, and the Devil turned round on you - where would you hide, Roper, the laws all being flat? . . . This country's planted thick with laws from coast to coast - Man's laws, not God's - and if you cut them down . . . d'you really think you could stand upright in the winds that would blow them? . . . Yes, I'd give the Devil benefit of law, for my own safety's sake." R. Bolt, A Man for All Seasons, Act I, p. 147 (Three Plays, Heinemann ed. 1967)."
</p><p>
So I think it important to inquire whether or not the subject procedural norm of a version change being required for major new features is the last word in the particular circumstance presented.  I suggest that it is not. 
</p><p>
The preparation, adoption, and application of technical specifications by standards development organizations is governed globally by the Agreement on Technical Barriers to Trade. I'll nominate its <a rel='nofollow' href="http://www.wto.org/english/res_e/booksp_e/analytic_index_e/tbt_01_e.htm#article2">article 2 section 2.1</a> as the final word on whether the subject change proposed by XML 1.0 Fifth Edition is a bug fix or a new feature (ignoring the fact that the WTO Appellate Body would actually have the final say):
</p><p>
"Members shall ensure that in respect of technical regulations, products imported from the territory of any Member shall be accorded treatment no less favourable than that accorded to like products of national origin and to like products originating in any other country."
</p><p>
ISO/IEC have interpreted that section and section 2.2 read together as requiring that international standards must exhibit the "strategic characteristic" of "cultural and linguistic adapatability." <a rel='nofollow' href="http://www.jtc1sc34.org/repository/0856rev.pdf">ISO/IEC JTC 1 Directives,</a> 5th ed., v. 3.0, pg. 11 (PDF). I have thoroughly studied the law in this area and am of the firm opinion that the ISO/IEC interpretation is sound.
</p><p>
I'll omit the analysis that explains the ATBT's direct applicability to W3C recommendations because it is complex. But it might suffice for now to observe that W3C recommendations are ineligible for the status of government technical regulations or government procurement specifications if they have "the effect of creating unnecessary obstacles to international trade." ATBT section 2.2; Agreement on Government Procurement <a rel='nofollow' href="http://www.wto.org/english/res_e/booksp_e/analytic_index_e/gpa_02_e.htm#article6A">article VI section 1</a>. A cultural and linguistic adaptability barrier would beyond question fit in that category. 
</p><p>
The procedural norm under discussion is subservient to substantive international law, which takes precedence to the extent of the inconsistency. A <b>legal</b> exception to the procedural norm of a version change for new features is created by application of superior law to the extent a markup language standard is not culturally and linguistically adaptable. 
</p><p>
Law is hierarchical, not flat. If you do not like the result of applying one norm, before transforming the norm into a non-norm, it is prudent to determine whether or not there is a superior norm applicable to the particular circumstance whose application supplies a different result . In this situation, applying the superior norm definitively classifies the proposed change to XML 1.0 as a bug fix rather than a new feature.  
</p><p>
Bug fixes are appropriately proclaimed in an erratum rather than requiring a new version of the specification. From a legal standpoint, the only remaining question I see is whether the bug fix proposed is the least trade-restrictive means of repair. <i>See</i> ATBT article 2, <i>supra,</i> sections 2.2 and 2.3. The legal requirement of interoperability also flows from those two sections, so the least-trade restrictive means requirement supplies the guiding light in balancing the requirements of interoperability and cultural and linguistic adaptability that are apparently competing in this particular situation. 
</p><p>
The issues presented thus devolve into two questions of mixed law and fact, whether: [i] there is in fact an apple and orange choice that must be made between [a] cultural and linguistic adaptability and [b] backward and forward compatibility; and [ii] if so, the proposed change produces the best apple and orange that can be feasibly achieved. 
</p><p>
In other words, I suggest that folks talk about the technical merits of the proposed change and alternatives to it, if any, rather than discussing the procedural issue. Anyone have a better bug fix? :-)
</p><p>
My 2 cents.</p></div></content>
</entry>

<entry xmlns:foaf='http://xmlns.com/foaf/0.1/'>
<title>Comment 13 on /2008/02/07/xml105e</title>
<link rel='alternate' type='text/html' href='http://norman.walsh.name/2008/02/07/xml105e#comment0013'/>
<id>http://norman.walsh.name/2008/02/07/xml105e#comment0013</id>
<published>2008-02-20T20:33:31Z</published>
<updated>2008-02-20T20:33:31Z</updated>
<author>
  <name>Andrew Thompson</name>
  <foaf:mbox_sha1sum>9c2ba9eedd10e969a2fe67fd8a2568db2c5b679c</foaf:mbox_sha1sum>
</author>
<content type='xhtml'><div xmlns="http://www.w3.org/1999/xhtml"><pre>Norm, seems to me the reality check here is that for 98%+ of the
 world this is a big snooze. I cannot see them changing from their 
cozy UTF-8 land and XML that uses that explicitly.</pre>

<pre>If someone is using UTF-xxx that supports this UNICODE 5,+ 
stuff - they are going to take special means to alert their 
partners that they need software capable of supporting this.</pre>

Pardon my possible ignorance, but does anything in Unicode 5 require an encoding other than UTF-8? I thought all higher plane characters can be represented just fine in UTF-8?</div></content>
</entry>

<entry xmlns:foaf='http://xmlns.com/foaf/0.1/'>
<title>Comment 14 on /2008/02/07/xml105e</title>
<link rel='alternate' type='text/html' href='http://norman.walsh.name/2008/02/07/xml105e#comment0014'/>
<id>http://norman.walsh.name/2008/02/07/xml105e#comment0014</id>
<published>2008-02-25T16:23:55Z</published>
<updated>2008-02-25T16:23:55Z</updated>
<author>
  <name>David Carlisle</name>
  <foaf:mbox_sha1sum>ac4bbb0ce3a3e02cc386fe410164dc831b49c1ce</foaf:mbox_sha1sum>
  <uri>http://dpcarlisle.blogspot.com/</uri>
</author>
<content type='xhtml'><div xmlns="http://www.w3.org/1999/xhtml"><p>John,
</p><p>
You argue that it is an issue of justice but in what sense is justice improved by making the change in place at version 1.0. It just makes systems that claim to implement the same version (of xml or xpath or whatever) incompatible. XML 1.1 is unpopular not because its version is not 1.0 but because it contained incompatible changes; notably the white space rules and C1 characters. You can fix the problems in 1.1 by having a 1.2 not by rewriting history and changing 1.0.
</p><p>
David Webber,
</p><p>
there appears to be some confusion in your comment as utf8 encoding isn't relevant here utf8 can encode all characters in any version of unicode. there is no proposed change in encodings, or in the characters allowed in element or attribute content, only a change in the characters allowed in element attribute and id names
</p><p>
Buck,
</p><p>
Anyone have a better bug fix? :-)
</p><p>
yes, make the same change but call it version 1.2</p></div></content>
</entry>

</feed>
