Re: dmd 1.060 and 2.045 release
Steven Schveighoffer wrote: It may not be an issue, the spell checker is simply a nice hint, but isn't essential to determine errors. It's a good summary of the situation. I've felt that as long as the message took under a second to generate, it was ok. Realistically, I don't see anyone using identifiers as long as you tested unless they were machine generated. In my own code, I've found the spell checker is more than a nice hint. Most of the time it nails it and speeds fixing the problem because I don't need to myself go and manually look up the correct spelling. Iterating through the symbol table is not impossible, it's just a lot of work as it is not designed for that. There are also some issues with shadowing. Furthermore, I'd like the speller to work with the symbol table in the C++ compiler, too, which I don't wish to reimplement.
Re: dmd 1.060 and 2.045 release
On Mon, 10 May 2010 11:41:08 -0400, BCS wrote: Hello Steven, Several others have privately brought up this problem to Walter. He does not want to change how the symbol lookup tables work, and there is no way to iterate them. Is it fundamentally impossible to iterate or is the code just not there and/or nasty to write? I can only speculate, but I would guess the latter. What I can think of is there may be multiple keys for the same symbol (for example alises, or fully qualified names). But even those should be enumerable. I think the tables themselves are simple hash-maps, but the layers and shadowing is where the problems lie. I don't have a very educated opinion on this, so it may be something else entirely. -Steve
Re: dmd 1.060 and 2.045 release
Hello Steven, Several others have privately brought up this problem to Walter. He does not want to change how the symbol lookup tables work, and there is no way to iterate them. Is it fundamentally impossible to iterate or is the code just not there and/or nasty to write? -- ... <
Re: dmd 1.060 and 2.045 release
On Mon, 10 May 2010 09:22:21 -0400, Ary Borenszweig wrote: Steven Schveighoffer wrote: Several others have privately brought up this problem to Walter. He does not want to change how the symbol lookup tables work, and there is no way to iterate them. I can't imagine why something as simple as iterating a symbol lookup table can't be implemented. (in fact I iterate that lookup table in Descent to show autocompletions... I don't generate every possible string in the world and see if it's a symbol, and if so, suggest it as an autocompletion... hmmm...) On that point I'm just the messenger, I'm not sure why Walter has that position, and I don't know enough about the code to know how to fix it. -Steve
Re: dmd 1.060 and 2.045 release
Steven Schveighoffer wrote: On Sun, 09 May 2010 02:11:21 -0400, Lionello Lunesu wrote: I'm in the middle of moving from one city to another so don't wait for me. I have attached the D version of the code in the wikipedia article (including the patch for transpositions.) It's not straightforward to drop this in (apart from it being in D), since speller.c creates all variations on a string (=inefficient) and uses a callback function to check if a variation is a valid symbol. I'll have more time to look at this next week. Several others have privately brought up this problem to Walter. He does not want to change how the symbol lookup tables work, and there is no way to iterate them. Therefore, without a way to iterate current symbols, this is the only way the algo can be written. However, according to reports on the latest beta, he's sped up the lookup times for symbols significantly enough to perceptually reduce the problem. Because of the nature of the algorithm, the longer the invalid symbol, the slower the algorithm. It would be interesting to see a comparison between the current beta code and code that does a full iteration with very long symbols. I don't know if anyone wants to look at modifying the symbol lookup data structures to allow iteration, it may be perceptually insignificant, and unimportant for most developers. A quick test on a long symbol name: void s023456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567891() {} void main() { s123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890(); } And on the various compilers: 2.043: 0.052s, does not suggest 2.045: 5.4s, suggests the alternative beta: 0.8s, does not suggest 2.043 only does spell checking with a Levenshtein distance of 1, 2.045 does 2, but is extremely slow. The beta does a distance of 2, but only if the errors are close together (Walter added this as an optimization to remove one factor from the runtime complexity). So clearly, there is still room for improvement, I think with a proper symbol iteration, we could get the time down to be as quick as 2.043 or faster *and* provide the ability to check for a complete LD of 2 where the errors are not close together. We might be able to even push the LD to 3 or 4. I've thought about the optimization of errors close together being checked, and I think the counter case is the case which takes the longest -- a long symbol. Typically people use camelCase to denote symbols, what if two of the words in the symbol were misspelled by one character (or a capitalization was forgotten)? It may not be an issue, the spell checker is simply a nice hint, but isn't essential to determine errors. -Steve I can't imagine why something as simple as iterating a symbol lookup table can't be implemented. (in fact I iterate that lookup table in Descent to show autocompletions... I don't generate every possible string in the world and see if it's a symbol, and if so, suggest it as an autocompletion... hmmm...)
Re: dmd 1.060 and 2.045 release
On Sun, 09 May 2010 02:11:21 -0400, Lionello Lunesu wrote: I'm in the middle of moving from one city to another so don't wait for me. I have attached the D version of the code in the wikipedia article (including the patch for transpositions.) It's not straightforward to drop this in (apart from it being in D), since speller.c creates all variations on a string (=inefficient) and uses a callback function to check if a variation is a valid symbol. I'll have more time to look at this next week. Several others have privately brought up this problem to Walter. He does not want to change how the symbol lookup tables work, and there is no way to iterate them. Therefore, without a way to iterate current symbols, this is the only way the algo can be written. However, according to reports on the latest beta, he's sped up the lookup times for symbols significantly enough to perceptually reduce the problem. Because of the nature of the algorithm, the longer the invalid symbol, the slower the algorithm. It would be interesting to see a comparison between the current beta code and code that does a full iteration with very long symbols. I don't know if anyone wants to look at modifying the symbol lookup data structures to allow iteration, it may be perceptually insignificant, and unimportant for most developers. A quick test on a long symbol name: void s023456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567891() {} void main() { s123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890(); } And on the various compilers: 2.043: 0.052s, does not suggest 2.045: 5.4s, suggests the alternative beta: 0.8s, does not suggest 2.043 only does spell checking with a Levenshtein distance of 1, 2.045 does 2, but is extremely slow. The beta does a distance of 2, but only if the errors are close together (Walter added this as an optimization to remove one factor from the runtime complexity). So clearly, there is still room for improvement, I think with a proper symbol iteration, we could get the time down to be as quick as 2.043 or faster *and* provide the ability to check for a complete LD of 2 where the errors are not close together. We might be able to even push the LD to 3 or 4. I've thought about the optimization of errors close together being checked, and I think the counter case is the case which takes the longest -- a long symbol. Typically people use camelCase to denote symbols, what if two of the words in the symbol were misspelled by one character (or a capitalization was forgotten)? It may not be an issue, the spell checker is simply a nice hint, but isn't essential to determine errors. -Steve
Re: dmd 1.060 and 2.045 release
On 7-5-2010 12:01, Lionello Lunesu wrote: > On 7-5-2010 9:10, Brad Roberts wrote: >> On Fri, 7 May 2010, Lionello Lunesu wrote: >> >>> On 6-5-2010 22:37, Michel Fortin wrote: On 2010-05-05 23:45:50 -0400, Walter Bright said: > Walter Bright wrote: >> Alex Makhotin wrote: >>> It takes ~40 seconds 50% load on the dual core processor(CentOS 5.3 >>> kernel 2.6.32.4), to get the actual error messages about the >>> undefined identifier. >> >> Definitely there's a problem. > > The problem is the spell checker is O(n*n) on the number of characters > in the undefined identifier. That's an algorithm that can't scale then. Checking the Levenshtein distance for each known identifier within a small difference in length would be a better idea. (Clang is said to use the Levenshtein distance, it probably does something of the sort.) http://en.wikipedia.org/wiki/Levenshtein_distance >>> and especially this line: >>> >>> # If we are only interested in the distance if it is smaller than a >>> threshold k, then it suffices to compute a diagonal stripe of width 2k+1 >>> in the matrix. In this way, the algorithm can be run in O(kl) time, >>> where l is the length of the shortest string. >> >> The source for this is pretty isolated.. anyone want to volunteer take a >> shot at improving this part of dmd? >> >> Later, >> Brad > I see, speller.c, I'll have a look.. I'm in the middle of moving from one city to another so don't wait for me. I have attached the D version of the code in the wikipedia article (including the patch for transpositions.) It's not straightforward to drop this in (apart from it being in D), since speller.c creates all variations on a string (=inefficient) and uses a callback function to check if a variation is a valid symbol. I'll have more time to look at this next week. L. T minimum(T)(T[] all ...) in { assert(all.length > 0); } out(result) { foreach (t; all) assert(result <= t); } body { T ret = all[0]; foreach (t; all) if (t < ret) ret = t; return ret; } struct array2d(T) { private int w; private T[] data; public this(int w, int h) { this.data = new T[w*h]; this.w = w; } public T get(int x, int y) { return data[x+y*w]; } public void set(int x, int y, T t) { data[x+y*w] = t; } } int LevenshteinDistance(string s, string t) { const int m = s.length; const int n = t.length; // d is a table with m+1 rows and n+1 columns array2d!(int) d = array2d!(int)(m+1,n+1); foreach (i; 0..m+1) d.set(i, 0, i); // deletion foreach (j; 0..n+1) d.set(0, j, j); // insertion int k = 0; foreach (j; 1..n+1) { foreach (i; 1..m+1) { int cost = 1; if (s[i-1] == t[j-1]) cost = 0; k = minimum( d.get(i-1, j) + 1, // deletion d.get(i, j-1) + 1, // insertion d.get(i-1, j-1) + cost // substitution ); // transposition ab -> ba if (i > 1 && j > 1 && s[i-1] == t[j-2] && s[i-2] == t[j-1]) k = minimum( k, d.get(i-2, j-2) + cost // transposition ); d.set(i, j, k); } } return k;//d.get(m,n); } unittest { assert(LevenshteinDistance("hello", "hello") == 0);//nop assert(LevenshteinDistance("hello", "helo") == 1);//del assert(LevenshteinDistance("hello", "heello") == 1);//ins assert(LevenshteinDistance("hello", "hallo") == 1);//change assert(LevenshteinDistance("hello", "hlelo") == 1);//transp assert(LevenshteinDistance("hello", "hlelq") == 2); } import std.stdio; void main(string[] args) { writefln("%s", LevenshteinDistance(args[1], args[2])); }
Re: dmd 1.060 and 2.045 release
That code is in the public domain, by the way. DMD should require a copyright notice in each source file :) L.
Re: dmd 1.060 and 2.045 release
Walter Bright wrote: Walter Bright wrote: I recompiled dmd with the profiler (-gt switch) which confirmed it. For those interested, try out changeset 470. On my timing tests, the time spent is linear with the number of characters in the identifier. It's still too slow, though.
Re: dmd 1.060 and 2.045 release
On 7-5-2010 9:10, Brad Roberts wrote: > On Fri, 7 May 2010, Lionello Lunesu wrote: > >> On 6-5-2010 22:37, Michel Fortin wrote: >>> On 2010-05-05 23:45:50 -0400, Walter Bright >>> said: >>> Walter Bright wrote: > Alex Makhotin wrote: >> It takes ~40 seconds 50% load on the dual core processor(CentOS 5.3 >> kernel 2.6.32.4), to get the actual error messages about the >> undefined identifier. > > Definitely there's a problem. The problem is the spell checker is O(n*n) on the number of characters in the undefined identifier. >>> >>> That's an algorithm that can't scale then. >>> >>> Checking the Levenshtein distance for each known identifier within a >>> small difference in length would be a better idea. (Clang is said to use >>> the Levenshtein distance, it probably does something of the sort.) >>> >>> http://en.wikipedia.org/wiki/Levenshtein_distance >>> >> and especially this line: >> >> # If we are only interested in the distance if it is smaller than a >> threshold k, then it suffices to compute a diagonal stripe of width 2k+1 >> in the matrix. In this way, the algorithm can be run in O(kl) time, >> where l is the length of the shortest string. > > The source for this is pretty isolated.. anyone want to volunteer take a > shot at improving this part of dmd? > > Later, > Brad I see, speller.c, I'll have a look..
Re: dmd 1.060 and 2.045 release
Walter Bright wrote: I recompiled dmd with the profiler (-gt switch) which confirmed it. For those interested, try out changeset 470.
Re: dmd 1.060 and 2.045 release
On Fri, 7 May 2010, Lionello Lunesu wrote: > On 6-5-2010 22:37, Michel Fortin wrote: > > On 2010-05-05 23:45:50 -0400, Walter Bright > > said: > > > >> Walter Bright wrote: > >>> Alex Makhotin wrote: > It takes ~40 seconds 50% load on the dual core processor(CentOS 5.3 > kernel 2.6.32.4), to get the actual error messages about the > undefined identifier. > >>> > >>> Definitely there's a problem. > >> > >> The problem is the spell checker is O(n*n) on the number of characters > >> in the undefined identifier. > > > > That's an algorithm that can't scale then. > > > > Checking the Levenshtein distance for each known identifier within a > > small difference in length would be a better idea. (Clang is said to use > > the Levenshtein distance, it probably does something of the sort.) > > > > http://en.wikipedia.org/wiki/Levenshtein_distance > > > and especially this line: > > # If we are only interested in the distance if it is smaller than a > threshold k, then it suffices to compute a diagonal stripe of width 2k+1 > in the matrix. In this way, the algorithm can be run in O(kl) time, > where l is the length of the shortest string. The source for this is pretty isolated.. anyone want to volunteer take a shot at improving this part of dmd? Later, Brad
Re: dmd 1.060 and 2.045 release
On 6-5-2010 22:37, Michel Fortin wrote: > On 2010-05-05 23:45:50 -0400, Walter Bright > said: > >> Walter Bright wrote: >>> Alex Makhotin wrote: It takes ~40 seconds 50% load on the dual core processor(CentOS 5.3 kernel 2.6.32.4), to get the actual error messages about the undefined identifier. >>> >>> Definitely there's a problem. >> >> The problem is the spell checker is O(n*n) on the number of characters >> in the undefined identifier. > > That's an algorithm that can't scale then. > > Checking the Levenshtein distance for each known identifier within a > small difference in length would be a better idea. (Clang is said to use > the Levenshtein distance, it probably does something of the sort.) > > http://en.wikipedia.org/wiki/Levenshtein_distance > and especially this line: # If we are only interested in the distance if it is smaller than a threshold k, then it suffices to compute a diagonal stripe of width 2k+1 in the matrix. In this way, the algorithm can be run in O(kl) time, where l is the length of the shortest string.
Re: dmd 1.060 and 2.045 release
Steven Schveighoffer wrote: On Thu, 06 May 2010 17:07:12 -0400, Walter Bright wrote: Steven Schveighoffer wrote: That can't be it. The identifier shown by Alex is only 33 characters. O(n^2) is not that slow, especially for smaller variables. There must be other factors you're not considering... I recompiled dmd with the profiler (-gt switch) which confirmed it. So a single unknown symbol (from Alex's example) which can be checked against each existing symbol in O(n^2) time, takes 40 seconds on a decent CPU? How many other symbols are there? 33^2 == 1089, if there are 1 symbols, that's 10 million iterations, that shouldn't take 40 seconds to run, should it? Are there more symbols to compare against? Do you use heuristics to prune the search? For example, if the max distance is 2, and the difference in length between two strings is >2, you should be able to return immediately. Check out the profiler output. It's clearly the vast number of calls to the symbol lookup, not the time spent in each call. - Num TreeFuncPer CallsTimeTimeCall 50409318 632285778 145858160 2 Dsymbol *syscall ScopeDsymbol::search(Loc ,Identifier *,int ) 50411264 131394915 106356855 2 void **syscall StringTable::search(char const *,unsigned ) 50409329 341960075 105532978 2 Dsymbol *syscall DsymbolTable::lookup(Identifier *) 50409329 236427096 105037393 2 StringValue *syscall StringTable::lookup(char const *,unsigned ) 12602340 61389061967393753 5 Dsymbol *syscall Scope::search(Loc ,Identifier *,Dsymbol **) 12602178 69391519766918360 5 void *cdecl scope_search_fp(void *,char const *) 25204505 46135292052529164 2 Dsymbol *syscall Module::search(Loc ,Identifier *,int ) 504121372503847425038474 0 unsigned cdecl Dchar::calcHash(char const *,unsigned ) 3520 1428323068203493755781 void *cdecl spellerX(char const *,void *cdecl (*)(void *,char const *),void *,char const *,int ) 12602664 6811916 6811916 0 syscall Identifier::Identifier(char const *,int ) 12602178 6299089 6299089 0 void cdecl Module::clearCache() 12602183 6151175 6151175 0 Module *syscall Module::isModule() 1600 113294261 2 StringValue *syscall StringTable::update(char const *,unsigned ) -
Re: dmd 1.060 and 2.045 release
On Thu, 06 May 2010 17:07:12 -0400, Walter Bright wrote: Steven Schveighoffer wrote: That can't be it. The identifier shown by Alex is only 33 characters. O(n^2) is not that slow, especially for smaller variables. There must be other factors you're not considering... I recompiled dmd with the profiler (-gt switch) which confirmed it. So a single unknown symbol (from Alex's example) which can be checked against each existing symbol in O(n^2) time, takes 40 seconds on a decent CPU? How many other symbols are there? 33^2 == 1089, if there are 1 symbols, that's 10 million iterations, that shouldn't take 40 seconds to run, should it? Are there more symbols to compare against? Do you use heuristics to prune the search? For example, if the max distance is 2, and the difference in length between two strings is >2, you should be able to return immediately. -Steve
Re: dmd 1.060 and 2.045 release
Steven Schveighoffer wrote: That can't be it. The identifier shown by Alex is only 33 characters. O(n^2) is not that slow, especially for smaller variables. There must be other factors you're not considering... I recompiled dmd with the profiler (-gt switch) which confirmed it.
Re: dmd 1.060 and 2.045 release
Steven Schveighoffer, el 6 de mayo a las 07:17 me escribiste: > On Wed, 05 May 2010 23:45:50 -0400, Walter Bright > wrote: > > >Walter Bright wrote: > >>Alex Makhotin wrote: > >>>It takes ~40 seconds 50% load on the dual core > >>>processor(CentOS 5.3 kernel 2.6.32.4), to get the actual error > >>>messages about the undefined identifier. > >> Definitely there's a problem. > > > >The problem is the spell checker is O(n*n) on the number of > >characters in the undefined identifier. > > That can't be it. The identifier shown by Alex is only 33 > characters. O(n^2) is not that slow, especially for smaller > variables. There must be other factors you're not considering... Run a profiler. -- Leandro Lucarella (AKA luca) http://llucax.com.ar/ -- GPG Key: 5F5A8D05 (F8CD F9A7 BF00 5431 4145 104C 949E BFB6 5F5A 8D05) -- No existiría el sonido del mar si faltara en la vida oreja y caracol. -- Ricardo Vaporeso. Cosquín, 1908.
Re: dmd 1.060 and 2.045 release
On 2010-05-05 23:45:50 -0400, Walter Bright said: Walter Bright wrote: Alex Makhotin wrote: It takes ~40 seconds 50% load on the dual core processor(CentOS 5.3 kernel 2.6.32.4), to get the actual error messages about the undefined identifier. Definitely there's a problem. The problem is the spell checker is O(n*n) on the number of characters in the undefined identifier. That's an algorithm that can't scale then. Checking the Levenshtein distance for each known identifier within a small difference in length would be a better idea. (Clang is said to use the Levenshtein distance, it probably does something of the sort.) http://en.wikipedia.org/wiki/Levenshtein_distance -- Michel Fortin michel.for...@michelf.com http://michelf.com/
Re: dmd 1.060 and 2.045 release
Hello Walter, Walter Bright wrote: Alex Makhotin wrote: It takes ~40 seconds 50% load on the dual core processor(CentOS 5.3 kernel 2.6.32.4), to get the actual error messages about the undefined identifier. Definitely there's a problem. The problem is the spell checker is O(n*n) on the number of characters in the undefined identifier. How about switch algos for long identifiers: you could bucket the knows by length and compare histograms on things of similar length. Or maybe just turn it off for long names. -- ... <
Re: dmd 1.060 and 2.045 release
On Wed, 05 May 2010 23:45:50 -0400, Walter Bright wrote: Walter Bright wrote: Alex Makhotin wrote: It takes ~40 seconds 50% load on the dual core processor(CentOS 5.3 kernel 2.6.32.4), to get the actual error messages about the undefined identifier. Definitely there's a problem. The problem is the spell checker is O(n*n) on the number of characters in the undefined identifier. That can't be it. The identifier shown by Alex is only 33 characters. O(n^2) is not that slow, especially for smaller variables. There must be other factors you're not considering... -Steve
Re: dmd 1.060 and 2.045 release
Alex Makhotin wrote: Walter Bright wrote: Walter Bright wrote: Alex Makhotin wrote: It takes ~40 seconds 50% load on the dual core processor(CentOS 5.3 kernel 2.6.32.4), to get the actual error messages about the undefined identifier. Definitely there's a problem. The problem is the spell checker is O(n*n) on the number of characters in the undefined identifier. Is there a way to disable it? Currently, no.
Re: dmd 1.060 and 2.045 release
Walter Bright wrote: Walter Bright wrote: Alex Makhotin wrote: It takes ~40 seconds 50% load on the dual core processor(CentOS 5.3 kernel 2.6.32.4), to get the actual error messages about the undefined identifier. Definitely there's a problem. The problem is the spell checker is O(n*n) on the number of characters in the undefined identifier. Is there a way to disable it? -- Alex Makhotin, the founder of BITPROX, http://bitprox.com
Re: dmd 1.060 and 2.045 release
Walter Bright wrote: Alex Makhotin wrote: It takes ~40 seconds 50% load on the dual core processor(CentOS 5.3 kernel 2.6.32.4), to get the actual error messages about the undefined identifier. Definitely there's a problem. The problem is the spell checker is O(n*n) on the number of characters in the undefined identifier.
Re: dmd 1.060 and 2.045 release
Alex Makhotin wrote: It takes ~40 seconds 50% load on the dual core processor(CentOS 5.3 kernel 2.6.32.4), to get the actual error messages about the undefined identifier. Definitely there's a problem.
Re: dmd 1.060 and 2.045 release
Robert Clipsham wrote: On 05/05/10 20:36, Robert Clipsham wrote: On 04/05/10 20:43, Walter Bright wrote: Yes, do it! http://dwarfstd.org/ShowIssue.php?issue=100504.1 Please feel free to comment on it/make corrections :) "This has been assigned issue #100504.1. We are not accepting extension proposals for DWARF Version 4, but will consider this for the next version." Looks like we'll have to wait a while for its inclusion. In the mean time could I suggest we move the DW_TAG's of dmd's extensions to the area of the spec specified for language specific extensions? I'm not sure if this was specified in DWARF 2 which dmd uses, or what the range is (I don't have the spec to hand), but it would be good not to conflict with DWARF 4. Good idea. Another good idea could be to decide how else DWARF could be extended to help with debug info so we have more chance of getting some useful things for debugging into DWARF 5.
Re: dmd 1.060 and 2.045 release
Walter Bright wrote: This is to fix the unittest and dwarf screwups in the last release. http://www.digitalmars.com/d/1.0/changelog.html http://ftp.digitalmars.com/dmd.1.060.zip http://www.digitalmars.com/d/2.0/changelog.html http://ftp.digitalmars.com/dmd.2.045.zip Hi, The fixes are good but when I compile, for example, the following simplified code: import std.stdio; import core.sync.condition; import core.sync.mutex; import std.contracts; import std.regex; import std.date; void foo(int x, int y) { currentPipedProcessWhateverYouLike = new PipedProcessWhateverYouLike(); } It takes ~40 seconds 50% load on the dual core processor(CentOS 5.3 kernel 2.6.32.4), to get the actual error messages about the undefined identifier. As I said this example is simplified and I need to wait more in the real project, while compiling the BDE itself, eventually I kill the never ending dmd process and go back to the v2.043 which has no such problem. I guess the problem is a spell checker. It would be much better to make this feature optional(if this is the problem), if this cannot be done in other way. Alex Makhotin, the founder of BITPROX, http://bitprox.com
Re: dmd 1.060 and 2.045 release
On 05/05/10 20:36, Robert Clipsham wrote: On 04/05/10 20:43, Walter Bright wrote: Yes, do it! http://dwarfstd.org/ShowIssue.php?issue=100504.1 Please feel free to comment on it/make corrections :) "This has been assigned issue #100504.1. We are not accepting extension proposals for DWARF Version 4, but will consider this for the next version." Looks like we'll have to wait a while for its inclusion. In the mean time could I suggest we move the DW_TAG's of dmd's extensions to the area of the spec specified for language specific extensions? I'm not sure if this was specified in DWARF 2 which dmd uses, or what the range is (I don't have the spec to hand), but it would be good not to conflict with DWARF 4. Another good idea could be to decide how else DWARF could be extended to help with debug info so we have more chance of getting some useful things for debugging into DWARF 5.
Re: dmd 1.060 and 2.045 release
On 04/05/10 20:43, Walter Bright wrote: Yes, do it! http://dwarfstd.org/ShowIssue.php?issue=100504.1 Please feel free to comment on it/make corrections :)
Re: dmd 1.060 and 2.045 release
Walter Bright, el 4 de mayo a las 10:52 me escribiste: > Robert Clipsham wrote: > >Still a long way to go though, various (much!) smaller issues that > >need fixing... If no one else gets to them I'll go on a debug > >fixing spree at some point in a couple of months and see if we > >can't get bug #4044 (debugging tracker) closed :) > > I agree that getting all the gdb issues sorted out will be a nice win. Specially now that GDB will support D natively! -- Leandro Lucarella (AKA luca) http://llucax.com.ar/ -- GPG Key: 5F5A8D05 (F8CD F9A7 BF00 5431 4145 104C 949E BFB6 5F5A 8D05) -- Los sueños de los niños son especialmente importantes en su etapa de formación; si un niño no sueña es que será un pelotudo toda la vida. -- Ricardo Vaporeso
Re: dmd 1.060 and 2.045 release
Yay that was fast :) On Wed, May 5, 2010 at 1:30 AM, Walter Bright wrote: > This is to fix the unittest and dwarf screwups in the last release. > > http://www.digitalmars.com/d/1.0/changelog.html > http://ftp.digitalmars.com/dmd.1.060.zip > > > http://www.digitalmars.com/d/2.0/changelog.html > http://ftp.digitalmars.com/dmd.2.045.zip > -- -Arth
Re: dmd 1.060 and 2.045 release
On 04/05/10 20:43, Walter Bright wrote: Yes, do it! I have submitted a proposal, I'm currently awaiting confirmation that it has been received. I'll let you know how/if it progresses and paste links if I can so you can track it yourself :)
Re: dmd 1.060 and 2.045 release
Robert Clipsham wrote: On 04/05/10 19:50, Walter Bright wrote: That's the problem with D extensions; unless they get officially adopted they conflict with future changes to the spec. We need to get them officially adopted. Too late for this, DWARF 4 has introduced conflicts with them already. We could try and get them pushed for dwarf 4, but if they get in the values of the tags will have to change (not really a problem, as only one debugger supports them I think, and to my knowledge it isn't widely used for D yet... So we'd be able to talk to the developers and work around this). I'd be willing to take action to get them pushed for DWARF 4 or (if it's too late for that) DWARF 5, if that's OK with you? Yes, do it!
Re: dmd 1.060 and 2.045 release
On Tue, 4 May 2010, Robert Clipsham wrote: > On 04/05/10 19:50, Walter Bright wrote: > > That's the problem with D extensions; unless they get officially adopted > > they conflict with future changes to the spec. We need to get them > > officially adopted. > > Too late for this, DWARF 4 has introduced conflicts with them already. We > could try and get them pushed for dwarf 4, but if they get in the values of > the tags will have to change (not really a problem, as only one debugger > supports them I think, and to my knowledge it isn't widely used for D yet... > So we'd be able to talk to the developers and work around this). I'd be > willing to take action to get them pushed for DWARF 4 or (if it's too late for > that) DWARF 5, if that's OK with you? I've lost track of the details of the extensions that are in use. I assume they're to cover the array and dictionary layouts. I wouldn't be suprised if there wasn't coverage for them in the current dwarf formats, or close enough to make them work. Another potential path, that would obviously be gdb specific, is to include some scripts that help display the more complex types. I saw something about gdb/python integration. A little bit of scripting can go a long long way. Later, Brad
Re: dmd 1.060 and 2.045 release
On 04/05/10 19:50, Walter Bright wrote: That's the problem with D extensions; unless they get officially adopted they conflict with future changes to the spec. We need to get them officially adopted. Too late for this, DWARF 4 has introduced conflicts with them already. We could try and get them pushed for dwarf 4, but if they get in the values of the tags will have to change (not really a problem, as only one debugger supports them I think, and to my knowledge it isn't widely used for D yet... So we'd be able to talk to the developers and work around this). I'd be willing to take action to get them pushed for DWARF 4 or (if it's too late for that) DWARF 5, if that's OK with you?
Re: dmd 1.060 and 2.045 release
Robert Clipsham wrote: You'd be ok with, for example: -g add symbolic debug info -gcadd symbolic debug info, pretend to be C++ Instead of C then? Or some other language that debuggers support? I say this as C++ supports more of D's features, so we'd be able to give better debugging info for debuggers without explicit D support. Yes. There was another point in that post, about the D extensions to DWARF... I think it is unlikely that patches to support D's extensions to DWARF would be accepted into gdb, particularly as the values for the DW_TAG's conflicts with things in the DWARF4 spec. I think there should be a way to act like D but without these extensions. Ideally the solution to this is to try and get the extensions officially into the DWARF spec, which I'd be willing to push for if possible. That's the problem with D extensions; unless they get officially adopted they conflict with future changes to the spec. We need to get them officially adopted.
Re: dmd 1.060 and 2.045 release
On 04/05/10 19:03, Walter Bright wrote: Basically, I don't think the switches need to change, just: -g: debugger can handle D data types and extensions -gc: make it work with whatever the debugger can handle, i.e. for debuggers that don't know about D You'd be ok with, for example: -g add symbolic debug info -gcadd symbolic debug info, pretend to be C++ Instead of C then? Or some other language that debuggers support? I say this as C++ supports more of D's features, so we'd be able to give better debugging info for debuggers without explicit D support. There was another point in that post, about the D extensions to DWARF... I think it is unlikely that patches to support D's extensions to DWARF would be accepted into gdb, particularly as the values for the DW_TAG's conflicts with things in the DWARF4 spec. I think there should be a way to act like D but without these extensions. Ideally the solution to this is to try and get the extensions officially into the DWARF spec, which I'd be willing to push for if possible.
Re: dmd 1.060 and 2.045 release
Robert Clipsham wrote: While you're reading, is there any chance you could take a look at my post on the dmd-internals ML and give me some feedback? I noticed you agreed with Brad in a bug report, but to what extent? When I get around to fixing all those bugs I'd like to know where you stand so I can work on some enhancements to debug info as well as just bug fixes :) Basically, I don't think the switches need to change, just: -g: debugger can handle D data types and extensions -gc: make it work with whatever the debugger can handle, i.e. for debuggers that don't know about D
Re: dmd 1.060 and 2.045 release
On 04/05/10 18:52, Walter Bright wrote: I agree that getting all the gdb issues sorted out will be a nice win. I fixed all the killer ones (for ELF systems at least), you can now set breakpoints and get backtraces etc, the remaining issues are to do with line numbers being a bit off under certain circumstances, code listing not working in gdb (I've never used this feature, hehe) and a few enhancements. This said, debugging on OS X is currently impossible, I should be able to take a look at that some time soon. While you're reading, is there any chance you could take a look at my post on the dmd-internals ML and give me some feedback? I noticed you agreed with Brad in a bug report, but to what extent? When I get around to fixing all those bugs I'd like to know where you stand so I can work on some enhancements to debug info as well as just bug fixes :)
Re: dmd 1.060 and 2.045 release
Robert Clipsham wrote: Still a long way to go though, various (much!) smaller issues that need fixing... If no one else gets to them I'll go on a debug fixing spree at some point in a couple of months and see if we can't get bug #4044 (debugging tracker) closed :) I agree that getting all the gdb issues sorted out will be a nice win.
Re: dmd 1.060 and 2.045 release
On 04/05/10 18:30, Walter Bright wrote: This is to fix the unittest and dwarf screwups in the last release. http://www.digitalmars.com/d/1.0/changelog.html http://ftp.digitalmars.com/dmd.1.060.zip http://www.digitalmars.com/d/2.0/changelog.html http://ftp.digitalmars.com/dmd.2.045.zip Thanks a lot for prioritizing this and getting it fixed :) Debugging D on linux is now bearable, and even a pleasant experience at times =) Still a long way to go though, various (much!) smaller issues that need fixing... If no one else gets to them I'll go on a debug fixing spree at some point in a couple of months and see if we can't get bug #4044 (debugging tracker) closed :)
Re: dmd 1.060 and 2.045 release
On Tue, May 04, 2010 at 10:30:36AM -0700, Walter Bright wrote: > This is to fix the unittest and dwarf screwups in the last release. Yay, it seems to have fixed the weird endless loop I got in the last one.
dmd 1.060 and 2.045 release
This is to fix the unittest and dwarf screwups in the last release. http://www.digitalmars.com/d/1.0/changelog.html http://ftp.digitalmars.com/dmd.1.060.zip http://www.digitalmars.com/d/2.0/changelog.html http://ftp.digitalmars.com/dmd.2.045.zip