[ Date Index ] [ Thread Index ] [ <= Previous by date / thread ] [ Next by date / thread => ]
On 07/06/11 19:47, Gordon Henderson wrote: > > > Oh, and pounds signs... àwhat has happened? I view emails with à> signs in them and they look like inverted question marks. I type a à> sign and it's adding 2 extra spaces. UTF-8 encoding of the  sign is 0xC2 0xA3 Since 0xA3 is a pound sign in ISO-8859-1 and 0xC2 is capital letter A with circumflex, is you ever see "ÃÂ" it almost invariably means that you have a character set encoding issue, and have treated UTF-8 as ISO-8859-1 (don't do that) or similar 8 bit character set. These days Debian installs should probably have a UTF-8 locale (en_GB.UTF-8 is a good choice for Devon and Cornwall) as the default, oh and make sure that this is all perfect before installing Postgres otherwise you'll hate character encodings until you dump, hack, and restore all your databases (and the Postgres thingy - urm cluster that is the word. although that was an etch -> lenny grief not a lenny -> squeeze thing IIRC). Applications should by now all be UTF-8 aware. I think this was what they had in mind about Unix becoming ten times larger, but not ten times better, except in this case we can blame Ken Thompson personally - he should have stuck to chess programming if you ask me ;) srw@derek:~$ locale LANG=en_GB.UTF-8 LC_CTYPE="en_GB.UTF-8" LC_NUMERIC="en_GB.UTF-8" LC_TIME="en_GB.UTF-8" LC_COLLATE="en_GB.UTF-8" LC_MONETARY="en_GB.UTF-8" LC_MESSAGES="en_GB.UTF-8" LC_PAPER="en_GB.UTF-8" LC_NAME="en_GB.UTF-8" LC_ADDRESS="en_GB.UTF-8" LC_TELEPHONE="en_GB.UTF-8" LC_MEASUREMENT="en_GB.UTF-8" LC_IDENTIFICATION="en_GB.UTF-8" LC_ALL= Okay I'm done. -- The Mailing List for the Devon & Cornwall LUG http://mailman.dclug.org.uk/listinfo/list FAQ: http://www.dcglug.org.uk/listfaq