[relaxng-user] Latest proposal for smart regexes in RELAX NG

David Tolpin dvd at davidashen.net
Thu May 6 22:27:35 ICT 2004


> > are ordered and at fixed places. A-Z is always ABCDEFGHIJKLMNOPQRSTUVWXYZ,
> > it was not so in the past, and I am glad it is now.
> 
> Well, it is so if the locale is Unicode.

No. This is so always. Because the locale only matters when documents
are read or written. When documents are processed (validated) or
matched against Unicode regular expressions, the encoding is always
Unicode.

> > 1) Strings are not trees. XML documents are trees. That's why 
> > regular expressions (which can be represented in either tree-like
> > (XML) form or in the form of a sequence of instructions (traditional
> > string regular expressions)) should provide 
> > 
> > - XML structured representation (or compact but still structured tree-like)
> >   for XML documents in whole
> > - string representation to match strings.
> 
> We don't provide a string-regex-ish formulation for XML, though.

That's what I am saying; exactly for the same matter there is no
string-regex-ish formulation for XML, there shouldn't be xml-regex-ish
formulation for strings.

> 
> I agree about the bidirectional mapping, but I think using string
> syntax as the base regex syntax is a hangover from the past that
> ought to be discarded.
> 

Is XML syntax for XML is a hangover from the past that ought to
be discarded in favour of the compact syntax?

David


More information about the relaxng-user mailing list