[tahoe-dev] [tahoe-lafs] #1224: Unicode bug in grid to grid copies

tahoe-lafs trac at tahoe-lafs.org
Thu Oct 14 01:40:50 UTC 2010


#1224: Unicode bug in grid to grid copies
-----------------------------------+----------------------------------------
     Reporter:  francois           |       Owner:  francois        
         Type:  defect             |      Status:  new             
     Priority:  major              |   Milestone:  1.8.1           
    Component:  code-frontend-cli  |     Version:  1.8.0           
   Resolution:                     |    Keywords:  unicode tahoe-cp
Launchpad Bug:                     |  
-----------------------------------+----------------------------------------

Comment (by davidsarah):

 I had assumed that {{{urllib.quote}}} would encode Unicode strings as
 UTF-8 and then percent-encode the result, but
 [http://docs.python.org/library/urllib.html#urllib.quote it's not
 documented as doing so], so that was probably wishful thinking.

 This seems to be [http://bugs.python.org/issue1712522]. Apparently
 [http://old.nabble.com/Re:-Problem:-neither-urllib2.quote-nor-urllib.quote-encode-the--unicode-strings-arguments-p19823144.html you have to
 convert to UTF-8 manually].

 Note that we have a {{{unicode_to_url}}} function in
 [source:src/allmydata/util/encodingutil.py] that should probably be used
 for this (or maybe we should add a {{{quote_unicode_url}}} function, if it
 turns out that we normally need to convert and percent-escape at the same
 time).
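
 A minimal sketch of the manual conversion described above: encode the
 Unicode string to UTF-8 bytes first, then percent-escape those bytes.
 The {{{quote_unicode_url}}} helper here is hypothetical (it only borrows
 the name proposed in this comment) and is not the Tahoe-LAFS
 implementation; the sample path is made up for illustration. The import
 fallback is only there so the snippet also runs on Python 3.

```python
try:
    from urllib import quote        # Python 2
except ImportError:
    from urllib.parse import quote  # Python 3


def quote_unicode_url(text):
    """Hypothetical helper: UTF-8-encode a Unicode string, then percent-escape it."""
    # Encode explicitly so quote() only ever sees bytes, never a Unicode string.
    utf8_bytes = text.encode("utf-8")
    # By default quote() leaves "/" unescaped, which keeps path separators intact.
    return quote(utf8_bytes)


# Illustrative path containing non-ASCII characters.
print(quote_unicode_url(u"caf\u00e9/\u00e4.txt"))
```

 Doing the encoding step explicitly avoids relying on whatever
 {{{quote}}} happens to do when it is handed a Unicode string.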

-- 
Ticket URL: <http://tahoe-lafs.org/trac/tahoe-lafs/ticket/1224#comment:1>
tahoe-lafs <http://tahoe-lafs.org>
secure decentralized storage

