Home Page   #c  #ruby-lang  #cisco  #mysql  #apache  #javascript  #java  #perl  #php  #openmoko   Wallpapers Girl
Reliable $1 Web Hosting by 3iX

Channels


#perl

15 October 2007


Total 32 pages. You are browsing page 2/32.

First :: Prev :: [1] [2] [3] [4] [5] [...] :: Next :: Last

00:30 <****> wolverian: Variable length bytes DO NOT EXIST
00:30 <****> wolverian: no, as long as your "bytes" are really characters
00:31 <****> if you have a unicode string, it's not utf8
00:31 <****> it's a bit like people talking about "hex numbers". those don't exist either
00:31 <****> wolverian: A guy named Marc Lehmann had the same notion on p5p once, and I tried real hard to convince him otherwise, in vain.
00:31 <****> if it's in hex, it's a string not a number
00:31 <****> Juerd, it's the only way I can make any sense out of what mauke is saying
00:31 <****> So now I give people one warning, and then I ignore them if they repeat this silly "variable width bytes" thing.
00:31 <****> A byte is exactly 8 bits.
00:31 <****> I'm not saying it's not my fault.. :)
00:32 <****> And not any 8 bits, but a sequential octet at a %8 bit offset boundary.
00:32 <****> wolverian: Do you know the difference between Unicode and UTF8, by the way?
00:32 <****> I'm saying utf-8 means all your characters have values <=255, so they fit in a byte
00:32 <****> Juerd, I think so, yes
00:32 <****> wolverian: Wanna test your knowledge?
00:33 <****> Juerd, ..sure..
00:33 <****> wolverian: What is U+00A0, and don't answer "non breaking space" :)
00:33 <****> Is that unicode, utf-8, or something else?
00:34 <****> I don't know
00:34 <****> Then you don't know the difference :)
00:34 <****> actual character
00:34 <****> it's a unicodepoint. definitely not utf-8.
00:35 <****>ping($host,10) should be enough shouldn't it?
00:35 <****> These elements are called codepoints. Strictly, a codepoint is a number. Usually formatted as U+%04x
00:35 <****> nollan, it means i don't know what's $c blessed to
00:35 <****> so i can't provide any answer at all but 'mu'
00:35 <****>new("icmp");
00:36 <****> If you have a Unicode string, and all text strings in Perl are Unicode strings, you have a sequence of unicode characters (or codepoints)
00:36 <****> nollan, then read the Net::Ping documentation
00:36 <****> i just checked it and you should find an answer there
00:37 <****> This can't be communicated through normal I/O, because I/O works with bytes, and these numbers don't always *fit* in bytes.
00:37 <****> So they must be encoded. The number must be expressed in binary, just like all numbers are expressed in binary internally.
00:37 <****> An "unsigned long" is the encoding of a number.
00:38 <****> UTF-32 is very similar. Most of the time, it is just a long concatenated series of unsigned longs.
00:38 <****> UTF-8 is a more complex encoding scheme for Unicode codepoint numbers.
00:39 <****> It is fully ASCII compatible, and typically takes much less space than the same string encoded as UTF-32.
00:39 <****> U+00A0 is a unicode codepoint.
00:39 * reflexive wonders how one discovers the secret to choosing a good domain name in a world where they're practically all taken.
00:39 <****> When encoded to UTF-8, it is the byte sequence C2 A0
00:40 <****> wolverian: Is this sufficient to realise the difference between UTF-8 and Unicode?
00:43 <****> reflexive: start your own new top level.
00:44 <****> reflexive: Invent a compound word that does not make sense, change the spelling a little.
00:44 <****> Juerd: How very Web 2.0!
00:45 <****> Juerd, yes, and I'm pretty sure I knew that before, actually, if not quite so clearly as you put it. I still don't know what mauke meant though :)
00:45 <****> reflexive: rectangularwheelz.com is available.
00:45 <****> Juerd, (also, thanks)
00:45 <****> oh god
00:45 <****> wolverian: No, but now that you know this, you know enough to learn the difference between characters and bytes at http://tnx.nl/perlunitut :)
00:45 <****> wolverian: And maybe even http://tnx.nl/perlunifaq
00:45 <****> Erlkönig - Web 2.0 or not?
00:46 <****> (didn't mean to sound inappreciative)
00:46 <****> You didn't :)
00:46 <****> reflexive: Need more domain names?
00:46 <****> nor unappreciative either
00:47 <****> reflexive: append a random unicode character
00:47 <****> utf-2.7
00:48 <****> hello..is there any function of perl that will list all timezones?
00:48 <****> I'm waiting for UTF-64 to match my CPU.
00:48 <****> eval: [ -12 .. 12 ]
00:48 <****> mauke: [-12,-11,-10,-9,-8,-7,-6,-5,-4,-3,-2,-1,0,1,2,3,4,5,6,7,8,9,10,11,12]
00:48 <****> magdiwang: I guess "print -12..12" doesn't satisfy?
00:49 <****> Juerd: I've actually got a few to choose from after writing a script to pick out terms from the Jargon File and check for availability. Mostly I'm just a terrible decision-maker :)
00:49 <****> Hm, my alarm clock will wake me up in 6 hours from now. I should probably get some rest.
00:49 <****> reflexive: Send me the list and I'll make the decision for you.
00:49 <****> sleep when you're dead.
00:49 <****> reflexive: #####@juerd.nl
00:50 <****> cfedde: The less I sleep, the sooner that will be.
00:50 <****> Anyway, good night!
00:50 <****> Juerd: awesome address
00:50 <****> Juerd: unfortunately true. And sleep well
00:50 <****> g'night
00:50 <****> wolverian: If you have any questions, /msg me and I'll answer them tomorrow :)
00:50 <****> afk
01:04 <****> OMFG
01:04 <****> this is totally ludicrous
01:04 <****> this is absolutely excellent
01:04 <****> http://www.cs.utexas.edu/users/EWD/video-audio/EWDonNPR.mp3
01:04 <****> looolz
01:05 <****> "and so even tough we haven't found the source of the problem,the bug is still there, and we end up with a very reliable system"
01:07 <****> Gumbybrain: FIX IT!
01:07 <****> It uses whatever it can be opened? This reeks of a comparison on how geeks might call for "but this guitar isn't like playing a real annoyance with it can be explained by your own new top level.
01:07 <****> YES!
01:07 <****> IT WAS THE WILL OF THE BINGOS!
01:12 <****> I'm diggin through the documentation for HTML::Parser and HTML::Treebuilder because everybody in here curses the use of regex for scraping data from html.
01:13 <****> I am not quite finding its usefullness yet, are there any examples of it in use that I might look at?
01:14 <****> Treebuilder's usefulness is hidden away in HTML::Element
01:14 <****> 'div' )
01:15 <****>HATE
01:15 <****> :P
01:16 <****> I'm still torn on camel.
01:18 <****> camel?
01:18 <****> dec: I adapt to the language.
01:18 <****> dec: If I write Java, I use it.
01:18 <****> idiotben: namesLikeThis vs. names_like_this
01:18 <****> ah right
01:19 <****> Called "camel case" because of the "humps" formed by capital letters.
01:19 <****> The thing is, it's a short-sighted notation that doesn't translate well.
01:20 <****> And is hard on people who don't natively speak the programmer's language.
01:21 <****> hrm
01:21 <****> I used to be 100% _'s


Total 32 pages. You are browsing page 2/32.

First :: Prev :: [1] [2] [3] [4] [5] [...] :: Next :: Last


Tutti i nuovi CAP Italiani. Come ottenere il database completo