... What would be interesting is to test and try to use UC_STRING with non-ascii characters not only for the subject, but also for the regexp pattern (the...
... I removed that mod 256 for byte code as you see in the patch, which mean byte code compiled from a pattern accepts Unicode points. ... Yes, I tried the...
... Ted> Yes, I tried the following as subjects: 1) -- Ted> 一个例子 2) -- Ted> 另一个例子子a That...
Colin Paul Adams
colin@...
Jun 14, 2008 5:28 am
1687
... Sorry for my misunderstanding. Look at the following code: make is -- Run application. local pattern, subject: UC_UTF8_STRING i, max: INTEGER_32 matcher:...
... Ted> The result is "False" with patched Gobo Regexp. But using Ted> UC_STRING, with original Gobo Regexp, the result is "False" Ted> too. Then it looks...
Colin Paul Adams
colin@...
Jun 14, 2008 8:32 pm
1689
... Would it be acceptable to use Ted's patch (with some protection when accessing the 256-bit pair of arrays) and make it clear in the doc of the regexp...
... Eric> Would it be acceptable to use Ted's patch (with some Eric> protection when accessing the 256-bit pair of arrays) and Eric> make it clear in the doc...
Colin Paul Adams
colin@...
Jun 15, 2008 6:03 am
1691
... Do you have an example searching what against what the crash happens? That is interesting to me. Because I tried to apply the modified Gobo Regexp to...
... Ted> Do you have an example searching what against what the crash Ted> happens? That is interesting to me. Because I tried to apply Ted> the modified Gobo...
Colin Paul Adams
colin@...
Jun 15, 2008 10:55 am
1693
... I debugged to see if there are rescues. My answer is no. So I still don't find how "a case-insensitive search against a target string containing a...
... I just committed your modifications in SVN. However, in addition to Colin's remark about case-insensitivity, I also noticed that character classes (e.g....
... Ted> Thank you, Eric. Looking forward to full Unicode Regexp. Best if you take it on then. I've haven't done anything on the XSLT library for the past two...
Colin Paul Adams
colin@...
Jul 1, 2008 5:27 am
1697
I could have a look if you don't mind how much time I would take. Because I am quite slow reading C code. And not too much free time left for this. Regards, ...
... That would be great if you could have a look at that. If you want I will send you the C package PCRE 3.9 which I believe was originally used when...
Hello All, Does gec have a minimum object allocation size? I'm trying to figure out why reading a 4MB file with strings balloons into 70MB of memory use. -- ...
... Obviously some garbage is generated somewhere. Did you plug the Boehm GC? Note that I have no idea about minimum object allocation size in the Boehm GC. --...
... Eric> Obviously some garbage is generated somewhere. Did you plug Eric> the Boehm GC? Note that I have no idea about minimum object Eric> allocation size...
... not if you've translated "." properly to UTF-8 (the regexp that matches UTF8 byte sequences, from a cursory look your dot should be translated to the...
... Franck> not if you've translated "." properly to UTF-8 (the regexp Franck> that matches UTF8 byte sequences, from a cursory look your Franck> dot should be...
Colin Paul Adams
colin@...
Aug 14, 2008 3:37 pm
1707
... Great. I would be interested in your experience with the algorithm described in ETL2. I have heard that it is rather pessimistic, i.e. it flags many...
... Yes, it's pessimistic. But all solutions that I heard of are pessimistic. The advantage of this one is that it exists, and it does not require any new...
... That's good news. Catcalls are due to programming errors. But your experience leads me to the hypothesis that a code with catcall potential (in the sense,...
... Heiko> Hello, I want to execute a http file upload from my Windows Heiko> application. Can someone point me to an example to do so? There's no code in Gobo...