my experience with ghci debugger extensions

Peter Hercek Thu, 05 Feb 2009 01:06:08 -0800

Hi users of ghci debugger,

This post is going to be a bit longer. Here are a few cookies to motivate you to go on:

* you will probably like it more than printf debugging in a lot of cases

* it can provide a way of implementing Claus Reinke's idea of breakpoints with a list of identifiers which should be available at the breakpoint location, and doing it without recompilation; here is the link to Claus' message:
  http://permalink.gmane.org/gmane.comp.lang.haskell.glasgow.user/15900

* it gives some idea to the ghc team about the importance of ghci debugger related tickets (and whether to implement them just the way they were proposed)

A note to ghc developers: Stepping a program in the ghci debugger sometimes purges top level bindings and sometimes does not. I am not sure whether this is a bug or a feature (whether there is some logic in it). I do not have a simple example where it purges the bindings, but I did not really look for one.

I would probably post later, but some ghci bugs and missing features are badly limiting my progress. There is not much more I can think of to investigate. Maybe somebody will have better ideas how to solve the obstacles I'm hitting. I'm also posting in the hope that people will find this interesting and that the ghc team will fix some of the critical bugs and add the most critical features, especially if nobody has better debugging tips.


You can get my extensions here:
http://www.hck.sk/users/peter/pub/ghciext-0.1.tar.gz

The extensions are no longer in a single .ghci file. The new .ghci file needs a library to be installed. The reason is that without the library it is not manageable any more.

And here are arch linux packaging instructions for my custom ghc (if you are an arch linux user just download and run makepkg :) ):

http://www.hck.sk/users/peter/pub/ghc-custom-6.10.1.tar.gz

The custom ghc is just the same as ghc 6.10.1 with two more patches: t2740.patch and loglocal.patch. The first one fixes ticket 2740 and you will get it in 6.10.2. The second one adds the :loglocal command to ghci. You can extract the patches from the tar file.

If you have already read the ghci scripting tutorial from Claus Reinke then you will know how to customize ghciext (that is, if you feel like doing so). The tutorial is here:

http://www.haskell.org/pipermail/haskell-cafe/2007-September/032260.html

Here is the list of custom commands in ghciext package:
:defs                     -- list user-defined commands
:. <file>                 -- source commands from <file>
:redir <var> <cmd>...     -- execute <cmd> redirecting stdout to <var>
:grep <pat> <cmd>...      -- filter lines matching <pat> from <cmd> output
:* <count> <cmd>...       -- run <cmd> <count> times
:x <cmd>...               -- run <cmd> with stdout suppressed

:bp <bpArgs>              -- put breakpoint at location <bpArgs> (adds hit count)
:inject <cc> <c> <sc> <b> -- at location <b> execute <c> if <cc>, and stop if <sc>
:monitor ["<c>"] <vs> <b> -- show comma separated variables at location <b> if <c>

:watch <cond> <bpArgs>    -- break at location <bpArgs> when <cond> is True
:count (_|<N>) [<bpArgs>] -- count/query/stop execution at location <bpArgs>
:find <var> <cmd>...      -- step with <cmd> until <var> is found
:findex <str> <cmd>...    -- step with <cmd> until <str> is found

:next [lazy]              -- step over; lazy version forces only the top constructor

:xout                     -- redirect ghci stdout to /dev/null
:out                      -- redirect ghci stdout back to console

:defs, :., :redir, and :grep are the same as the commands in Claus' tutorial. The only differences I recall now are:

* grep pattern can be in quotation marks (easier search for spaces)
* grep merges :browse output more nicely

* redir can accept :step, :steplocal etc; i.e. also the commands which sometimes remove top level bindings

* the commands do not pollute top level bindings so much
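
For readers who have not seen Claus' tutorial: commands like these are built on ghci's :def, which takes a Haskell function of type String -> IO String and runs the returned text as ghci input. The following is only a minimal sketch of that mechanism for a command in the spirit of :* above. The names expandRepeat and repeatCmd are made up for this illustration and are not taken from ghciext.

```haskell
module Main where

import Data.Char (isDigit, isSpace)

-- Hypothetical helper (not from ghciext): expand "<count> <cmd>..."
-- into <count> copies of <cmd>, one per line.
expandRepeat :: String -> String
expandRepeat args =
  case break isSpace (dropWhile isSpace args) of
    (n, rest)
      | not (null n) && all isDigit n ->
          unlines (replicate (read n) (dropWhile isSpace rest))
    -- Anything else: return a ghci statement that prints a usage hint.
    _ -> "putStrLn \"usage: <count> <cmd>...\""

-- The shape :def expects: the argument text goes in, and the text that
-- comes out is executed by ghci as if it had been typed at the prompt.
repeatCmd :: String -> IO String
repeatCmd = return . expandRepeat

main :: IO ()
main = repeatCmd "3 :step" >>= putStr
```

With the module loaded, something like ":def rep repeatCmd" would register it under the (again made up) name :rep, so that ":rep 3 :step" steps three times.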

The rest will describe my custom commands and how they relate to tickets in the ghc trac. If you want to check the tickets mentioned here then the easiest way is probably selecting them from this list:

http://hackage.haskell.org/trac/ghc/search?q=phercek&noquickjump=1&ticket=on

The initial big problem was how to limit the amount of information the ghci debugger spits at you when breakpoints with custom scripts are used. This is also mentioned in the fourth point of the "unknown" section of ticket #1377:

/We can disable a breakpoint with ":set stop N :continue", but this still prints out the breakpoint info when we stop. Should we print the info only if there were no commands?/

So I say: yes, do it! Just disable any ghci output related to a breakpoint when the breakpoint has custom code attached. We can disable the output ourselves, but then we disable all the output (including the output of the debugged program). People who are debugging console applications are busted (especially if the applications are interactive). This is not an issue for me since my application hardly uses the console at all.

I'm solving the problem by prefixing commands like /:main/ and /:continue/ with my command /:x/. This makes output disabled by default, and ghciExt enables it just before breakpoint code is run. If the breakpoint continues execution, it disables the output again. If the debugged function finishes, the output is enabled by /:x/ itself. A small problem happens when you forget to use /:x/, e.g. with your /:main/. Then you do not get a prompt when the program finishes. It's time to notice it and use /:out/ to switch it on. This is the only minor disadvantage for gui application debugging.

/:xout/ is not that useful (it is mostly used internally by /:x/). It could be a hidden command (i.e. not registered in defs).
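
For the curious, muting and unmuting stdout from Haskell can be done roughly like this. It is a sketch of the general technique using hDuplicate and hDuplicateTo from GHC.IO.Handle, not the actual ghciext code. The name withStdoutMuted is made up, and /dev/null assumes a unix-like system.

```haskell
module Main where

import Control.Exception (finally)
import GHC.IO.Handle (hDuplicate, hDuplicateTo)
import System.IO (IOMode (WriteMode), hClose, hFlush, openFile, stdout)

-- Hypothetical helper: run an action with stdout pointed at /dev/null,
-- then point stdout back at wherever it was writing before.
withStdoutMuted :: IO a -> IO a
withStdoutMuted action = do
  hFlush stdout
  saved <- hDuplicate stdout                 -- remember the real stdout
  devNull <- openFile "/dev/null" WriteMode  -- assumption: unix-like system
  hDuplicateTo devNull stdout                -- stdout now goes to /dev/null
  hClose devNull
  action `finally` (hFlush stdout >> hDuplicateTo saved stdout >> hClose saved)

main :: IO ()
main = do
  withStdoutMuted (putStrLn "this line should not show up")
  putStrLn "output is back"
```

Note that this has exactly the limitation described above: everything written to stdout is muted, including the output of the debugged program.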

Now that we have ghci "muted" we can get to the real goodies: /:monitor/, /:watch/, and /:count/.

If you like printf debugging, then /:monitor/ can help you. It can monitor only free variables in the selected expression (on which a breakpoint is hit) but that was always enough for me during the debug sessions I had. Write a script file (let's say it is named *ghciInit*; I'll also call it this way later on in this text) which is like this example:

/:monitor var1,var2 ModuleName 23
:monitor var3 ModuleName 40
:x :main mainArgument/

Then open ghci with your program and run *:. ghciInit*
And you will get a nice log like this:
/(0): var1 = <value01>
var2 = <value02>
(1): var3 = <value03>
(0): var1 = <value11>
var2 = <value12>
(1): var3 = <value13>
... etc/
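
To make "free variables in the selected expression" concrete, here is a small made-up program (it is not part of ghciext and the names are only for illustration). If a breakpoint lands on the expression marked in the comment, only acc and scaled are bound there, so those are the names a /:monitor/ line for that location could list.

```haskell
module Main where

-- Hypothetical function to try :monitor on.
step :: Int -> Int -> Int
step acc x =
  let scaled = x * 2
  in acc + scaled   -- free variables of this expression: acc, scaled

main :: IO ()
main = print (foldl step 0 [1, 2, 3])
```

Note that x is not free in "acc + scaled", so it could not be monitored at that location even though it is an argument of the same function.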

Moreover /:monitor/ allows a condition in quotation marks to be specified as the first argument. If you do so, the variable values will be printed only when the condition is True. Of course the condition can contain only free variables in the selected expression. Mostly it is not a problem.

/:watch/ is a conditional breakpoint which stops when the specified condition is True.

/:count/ has 3 forms:
/:count _ ModuleName 23/
  This never breaks, it just counts the number of times the execution reached position ModuleName 23
/:count 5 ModuleName 23/
  This breaks when we reach position ModuleName 23 for the fifth time
/:count 0/
  This tells how many times breakpoint number 0 was hit (it can report the number of hits for any breakpoint created with /:bp/, /:inject/, /:monitor/, /:watch/, and /:count/).

The first form of /:count/ is interesting when you want to stop before something bad happens so that you can see why the bad thing happened. Put the first form of /:count/ at the start of the function with the bug and then put a break in the function which is hit when the bad thing happens, maybe with /:watch/. When you stop at the bug place, check the hit count at the start of the function. Add the hit count to your /:count/ breakpoint in your *ghciInit* file, restart, and you can use /:steplocal/ or /:loglocal/ to find out what went wrong.
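
The counting logic behind the first two forms can be pictured like this. It is a sketch with made-up names (CountMode, onHit), not the ghciext implementation: each hit bumps a counter, and the breakpoint either never stops or stops exactly on the N-th hit.

```haskell
module Main where

-- Hypothetical model of the two counting modes:
--   CountOnly  corresponds to  :count _ <bpArgs>
--   StopAt n   corresponds to  :count n <bpArgs>
data CountMode = CountOnly | StopAt Int

-- Given the mode and the hits seen so far, return the new hit count
-- and whether execution should stop at this hit.
onHit :: CountMode -> Int -> (Int, Bool)
onHit mode hits = (hits', stop)
  where
    hits' = hits + 1
    stop = case mode of
      CountOnly -> False
      StopAt n  -> hits' == n

main :: IO ()
main = do
  print (onHit CountOnly 4)   -- counts the fifth hit, does not stop
  print (onHit (StopAt 5) 4)  -- fifth hit, stops
  print (onHit (StopAt 5) 5)  -- sixth hit, does not stop again
```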

/:loglocal/ is implemented directly in ghc source code. It does exactly the same as /:steplocal/ but makes sure that the trace history does not contain anything outside of the function we step with /:steplocal/. The problem is that /:steplocal/ works as if the code was traced while it is executing. Mostly the result is that your trace history is loaded with crap outside of the scope you are interested in. I'll return to /:loglocal/ again later.

/:inject/ is there when you need something special (/:monitor/, /:watch/, and /:count/ are implemented with something very like /:inject/). E.g. when you want to monitor a value but do not want the associated breakpoint number printed.
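
The effect on the trace history can be pictured as a filter over source spans, which is also the shape of the change to GHC.resume described near the end of this post (an extra SrcSpan->Bool argument). The sketch below uses a simplified stand-in type, Span, instead of ghc's SrcSpan. The names encloses and localHistory are made up for the illustration.

```haskell
module Main where

-- Simplified stand-in for a source span: a file plus a line range.
-- (ghc's real SrcSpan is richer; this is only for illustration.)
data Span = Span
  { spanFile  :: FilePath
  , startLine :: Int
  , endLine   :: Int
  } deriving (Eq, Show)

-- True when the second span lies entirely inside the first one.
encloses :: Span -> Span -> Bool
encloses outer inner =
  spanFile outer == spanFile inner
    && startLine outer <= startLine inner
    && endLine inner <= endLine outer

-- Keep only the steps which fall inside the function being stepped.
localHistory :: Span -> [Span] -> [Span]
localHistory fn = filter (encloses fn)

main :: IO ()
main = do
  let fn = Span "BL/Logic.hs" 20 30
      steps =
        [ Span "BL/Logic.hs" 21 21   -- inside the function
        , Span "Util.hs" 5 5         -- other module
        , Span "BL/Logic.hs" 40 41   -- same module, other function
        , Span "BL/Logic.hs" 23 24   -- inside the function
        ]
  mapM_ print (localHistory fn steps)
```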


/:find/ and /:findex/ are there primarily to search the trace history.
/:find var1 :back/
  will find the variable var1 in your trace history by back stepping it
/:find var1 :step/
  will single step forward till variable var1 is in the list of free variables
/:findex BL/Logic.hs:23 :loglocal/
  will fill in your trace history with all the local breakpoints till location BL/Logic.hs:23 is hit. Having the trace history filled in with the right stuff is useful for checking out why you got bad results later.
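
Conceptually /:find/ is a linear search: repeat the stepping command until the wanted name shows up among the free variables. A rough model over a list of history entries follows. The Entry type and the stepsUntil function are made up for this sketch and say nothing about how ghciext really does it.

```haskell
module Main where

import Data.List (findIndex)

-- Hypothetical trace history entry: a location plus the names of the
-- free variables bound there.
data Entry = Entry
  { location :: String
  , freeVars :: [String]
  }

-- Index of the first entry in which the variable is bound, i.e. how
-- many times the stepping command must be repeated after the first entry.
stepsUntil :: String -> [Entry] -> Maybe Int
stepsUntil var = findIndex (elem var . freeVars)

main :: IO ()
main = do
  let history =
        [ Entry "BL/Logic.hs:23" ["x"]
        , Entry "BL/Logic.hs:25" ["y", "var1"]
        , Entry "BL/Logic.hs:27" ["var1"]
        ]
  print (stepsUntil "var1" history)
  print (stepsUntil "nope" history)
```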

Now let's return to Claus' idea of breakpoints with a list of identifiers which should be available at the breakpoint location. You can make sure the identifiers are available with the first form of /:count/. It never stops but it puts records into the trace history. So the trace history will contain free variables at the locations where you put /:count/. This will be even more useful when automatic search of the trace history is built in (see ticket #2737). So a weaker form of Claus' idea can be implemented by carefully selecting what should get into the trace history. Why a weaker form only? Well, in some cases, the variable instances in the trace log may not be the expected ones (they may be from a different lexical scope). Experience with my code indicates this should be rare.

/:next/ is an idea of how to implement a kind of step over. That is, if by step over you mean something other than steplocal. The non-lazy form of /:next/ forces _result and does a /:step/. The lazy form forces only the top level constructor of _result before the step. Hey, I even had a case when it worked just like I expected. But typically it does not work because of bug #1531. _result is not correctly bound to the result of the selected expression in most of the practical cases. This bug is also critical for all the forms of conditional breakpoints. It would be cool if we could specify the condition based on _result or some part of it. The implementation of ghciExt conditional breakpoints would need to be extended to support conditions on _result (in particular the breakpoint would need to be disabled during the condition execution) but that is easy to do. An even more worrying thing about bug #1531 is that it has the milestone set to _|_.
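
The difference between the two forms is the usual one between forcing a value completely and forcing it only to its top constructor. In plain Haskell (not ghci commands) it can be shown like this. The helper names are made up, and a list with an undefined element stands in for a result whose inner parts are expensive or failing.

```haskell
module Main where

import Control.Exception (SomeException, evaluate, try)

-- Force only the top constructor (weak head normal form).
forceTop :: [a] -> ()
forceTop xs = xs `seq` ()

-- Force every element of a list of Ints.
forceAll :: [Int] -> ()
forceAll = foldr seq ()

-- Evaluate a unit value and report whether doing so raised an exception.
outcome :: () -> IO String
outcome v = do
  r <- try (evaluate v) :: IO (Either SomeException ())
  return (either (const "failed") (const "ok") r)

main :: IO ()
main = do
  let xs = [1, undefined, 3] :: [Int]
  outcome (forceTop xs) >>= putStrLn . ("top constructor only: " ++)
  outcome (forceAll xs) >>= putStrLn . ("full forcing: " ++)
```

Forcing only the top constructor never touches the undefined element, while full forcing runs into it. This is why a lazy variant can be the safer choice when the result is only partially needed.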

It is easy to add /:enable/ and /:disable/ to support enabling and disabling breakpoints. I just did not need it yet. Here is what a GhciExt breakpoint looks like:

*Main> :show breaks

[0] Main a.hs:4:2-8 ":cmd return$GhciExt.getStopCode 0 (True) "putStr\"(0): \"\n:force x" "False""

*Main>

Just replace getStopCode with getDisabledStopCode and you have it disabled. Change it back to enable. Yeah, and implement getDisabledStopCode, which will just continue.

I added /:loglocal/ mostly to simulate how /:tracelocal/ in ticket #2737 would help. I was also trying out how much full tracing helps. In both cases the answer is: full tracing almost never helps. /:tracelocal/ from ticket #2737 as proposed originally would rarely help. The problem is that the trace log gets overwhelmed with crap when we cannot control what can be saved in it and what cannot be saved. My idea is that the user should be able to specify what can go in it and also what should not go in it. Here is an alternative solution to the ones I proposed in tickets #2737 and #2946. I think this one would be best. The command to control the tracing should look like:

-- should everything be traced?
:set trace (True|False)
-- scopes which should be traced (or should not be traced when ! is present)
:set trace ( (!)? scopeid )*
-- add/remove individual scopeids to/from the trace specification
:set trace (+|-) (!)? scopeid
where scopeid = ( conid . )* ( varid . )* varid

Notice how scopeid looks. It can have a sequence of varids at the end. The reason is so that the user can leave out the scope of a function which is defined in a where clause. The scope specification is similar to the proposal in ticket #3000. E.g. for this code:

fn s = 'a' : add s
 where add = (++"z")
it could look like
:set trace Main.fn !Main.fn.add
meaning trace the whole scope of *fn* but not the stuff in the scope of *add*.

Order should not be important; requests for not tracing should have precedence over requests to trace. The scopes which we typically want to exclude are the ones which contain loops. The loop content often fills in the trace log, forcing the interesting stuff out of it. It is better to investigate functions with loops separately in a nested context.
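
To show how such a specification could be interpreted, here is a small sketch. It is only my reading of the proposal and not existing ghci code. All names (Rule, parseSpec, isTraced) are made up, and matching by scope prefix is an assumption. A scope is traced when some include rule covers it and no exclude rule covers it, so the order of the rules does not matter.

```haskell
module Main where

import Data.List (isPrefixOf)

-- A scope is a dotted path such as Main.fn.add, split into components.
type Scope = [String]

data Rule = Include Scope | Exclude Scope

splitDots :: String -> Scope
splitDots s = case break (== '.') s of
  (a, [])     -> [a]
  (a, _ : bs) -> a : splitDots bs

-- A leading '!' marks a scope which should not be traced.
parseRule :: String -> Rule
parseRule ('!' : s) = Exclude (splitDots s)
parseRule s         = Include (splitDots s)

parseSpec :: String -> [Rule]
parseSpec = map parseRule . words

-- Exclusion has precedence over inclusion, independent of rule order.
isTraced :: [Rule] -> Scope -> Bool
isTraced rules scope = included && not excluded
  where
    included = or [p `isPrefixOf` scope | Include p <- rules]
    excluded = or [p `isPrefixOf` scope | Exclude p <- rules]

main :: IO ()
main = do
  let spec = parseSpec "Main.fn !Main.fn.add"
  print (isTraced spec ["Main", "fn"])         -- True
  print (isTraced spec ["Main", "fn", "add"])  -- False
  print (isTraced spec ["Main", "other"])      -- False
```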

Notice that there is a bit of a difference between this proposal for controlling trace content and the one in ticket #2737. #2737 assumes usage of breakpoint arguments to specify a scope. The breakpoint arguments give the ability to define scopes at a finer level but there is no option to define exclude scopes, which I find important now.

The summary is: the trace log is only as useful as your control over what can get into it. The :trace command looks to me like an error. It is better to control it by allowing/disallowing scopes.

I also changed my opinion a bit about ticket #2945. :mergetrace would be better than a global trace history. Being able to investigate something separately in a nested context is useful.

If I had to order the ghci debugger related tickets then the order would be like this (more important first):

#1531 (_result can get bound to the wrong value in a breakpoint)

#2737 and #2946 (add :tracelocal to ghci debugger... and add command :mergetrace...)

#3000 (:break command should recognize also nonexported top level symbols...)

#2803 (bring full top level of a module in scope when a breakpoint is hit in the ...)

#1377 (the task: "We should print breakpoint related info only if breakpoint has no commands set") but people debugging interactive console applications would like to have this one at the very top; *IIRC* this may be easy to do, looks like all the printing is done in one function (something like afterCmd???); also #2950 looked trivial to do (like 15 mins without the compile time???)

And the last thing: my first time experiences hacking /:loglocal/ into ghc. I cannot tell much, I spent only one long Sunday afternoon on it, but here are my two points:

* I needed to extend the ghc interface. The type of function GHC.resume changed from:

   resume :: GhcMonad m => SingleStep -> m RunResult
   to:
   resume :: GhcMonad m => (SrcSpan->Bool) -> SingleStep -> m RunResult

... plus the corresponding implementation change. The added argument is a filtering function to limit the source spans which can be recorded in the trace history.

* It would be cool if ghci had its own dir in the source tree where only the ghci source files are. It would encourage people to hack on it more since it would be easier to maintain private patches and merge upstream changes. It would also be easier to make sure one modifies only ghci source code so that it works with other ghc releases.


Hopefully this helps somebody,
 Peter.

_______________________________________________
Glasgow-haskell-users mailing list
[email protected]
http://www.haskell.org/mailman/listinfo/glasgow-haskell-users
