Granular code coverage tool?

jan · July 27, 2021, 3:19pm

I guess color(true) (false) is the most sensible. As for other formats, I do not know. If there exists a format that is used by IDEs, that surely makes sense. Note that disabling coloring is as easy as either temporary switching off the color flag or do not not pretend the output is a tty.

peter.ludemann · July 27, 2021, 3:19pm

If your code contains :-use_module(library(apply_macros)), then simply remove it (or comment it out). Note that apply_macros contains a definition for user:goal_expansion/2, so every directive that loads apply_macros needs to be removed (see the description of goal_expansion/2 for details).

jan · July 27, 2021, 4:18pm

I guess emitting HTML is a different beast. That quite naturally would imply using color, as why else would you want HTML? I doubt emitting HTML makes much sense unless you either combine it with the HTML pretty printer in the pldoc package. That means more dependencies My motto is that simple is better …

swi · July 27, 2021, 7:30pm

ahh, I thought you were talking about a generic way, not just about library(apply_macros).

jan · July 28, 2021, 7:07am

There is a remark in the code to make the transformation depend on the optimise flag. Is that something we should do?

swi · July 28, 2021, 6:25pm

I don’t think it is worth it to spend time on this.

Better would be to provide an option listing predicates to ignore in the coverage test.

This would allow better accuracy in the numbers in case term_expansion can’t be handled.

EricGT · July 29, 2021, 4:11pm

@jan

I am thinking about how to capture the relation, think Prolog fact database, between a test case and a clause used during the test.

Currently I run show_coverage/2 with the goal run_tests/0 which is working great but I don’t know

How in show_coverage to identify that tests are running. I know I could check if the show_coverage goal is run_tests but I don’t know all of the ways to kick off a test. I also could not find a hook in the code in plunit.pl.
How to capture the values used for the specific test. In other words I want the relationship to record not only the test name but the bindings of variables used.

Your thoughts are welcome.

EDIT

Peter noted below to use message_hook/3 which is working as advertised.

peter.ludemann · July 29, 2021, 4:40pm

When I ran the coverage tests, I did this, and it displayed the usual messages about which tests were run (my tests are in two separate files; they all ran with run_tests/0).

['../test_protobufs'].
[test_interop].
show_coverage(run_tests, [dir(cov), annotate(true), ext('.cov'), modules([protobufs])]).

I did this from the command line: https://github.com/SWI-Prolog/contrib-protobufs/blob/137a9d87ccc990d29e7ae337bb4a97ad8af05982/interop/Makefile#L174

EricGT · July 29, 2021, 5:22pm

Are you referring to the messages displayed back to the top level in green when test are run? I.e.

% PL-Unit: examples ....... done
% All 7 tests passed

That is what I normally see.
If so those don’t have the details I need and I can’t access those easily in code.

Code for example. (Click arrow to expand)

File: examples.pl

:- module(examples,
    [
        example_001/3
    ]).

% -----------------------------------------------------------------------------

:- load_test_files([]).

% ----------------------------------------------------------------------------

:- use_module(library(http/html_write)).

% -----------------

% For use with test_case 07

:- multifile
    html_write:expand/3.

html_write:expand(my_term(Term)) -->
   html_write:html_quoted(Term).

% -----------------

example_001(Input,TokenizedHtml,HTML) :-
    phrase(html(Input),TokenizedHtml,[]),

File: examples.plt

:- begin_tests(examples).

:- use_module(examples).

test_case(01,success,
   [],
   [],
   ""
   ).

test_case(04,success,
   ['text'],
   [text],
   "text"
   ).

test_case(05,success,
   p('This ~a use of ~s/~d'-[makes,"format",3]),
   [nl(2),<,p,>,nl(1),"This makes use of format/3",</,p,>],
   "\n\n<p>\nThis makes use of format/3</p>"
   ).

test_case(06,success,
   \[a,b],
   [a,b],
   "ab"
   ).

test_case(07,success,
   [my_term(hello)],
   [hello],
   "hello"
   ).

test_case(02,error,_).
test_case(03,error,[_]).

% -------------------------------------

test(example_001,[true,forall(test_case(_,success,Input,TokenizedHtml,HTML))]) :-
   example_001(Input,TokenizedHtml,HTML).

test(example_001,[forall(test_case(_,error,Input)),error(instantiation_error,_)]) :-
      example_001(Input,_,_).

:- end_tests(examples).

Top level query:

show_coverage(run_tests,[dir('./annotated files'),modules([html_write])]).

peter.ludemann · July 29, 2021, 5:34pm

You should be able to get the information programmatically by adding a message hook.

EricGT · July 29, 2021, 5:36pm

Ahh, so you are saying the trick is to hook the messages and not the code. Thanks will take a look.

EDIT

Works as advertised. Now to see if I can interweave the incoming data and assertz/1 as facts.

EDIT

Using both message_hook/3 for messages from plunit and prolog_trace_interception/4 for the trace callbacks will not work. The problem, as I see it, is that prolog_trace_interception/4 is in a test case when the information is needed from message_hook/3 but message_hook/3 has not sent the message needed because prolog_trace_interception/4 traps on every goal.

Since it looks like the information needed to identify if a test is running, what is the predicate indicator for the test and the arguments for goal (predicate indicator) are in the stack frames, the code will have to check for such information in the stack frames of every prolog_trace_interception/4 call. It will be slow but it should be correct. And as I note often, get the code working correctly first then go after optimizations.

jan · August 1, 2021, 7:45am

If we want programmatic access I guess we should provide a proper API for that. I’m not against that if there is a good use case. I don’t really see it though. You typically use this tool to figure out how good your test suite is or to get some insight in what is actually used in a big code base. If you want the coverage of a specific test, simply run show_coverage/2 for only that test. What is wrong with that?

Keep it simple …

EricGT · August 1, 2021, 10:01am

One concept that I am currently exploring with proof of concept code is to wrap a Prolog level flight recorder around prolog_trace_interception/4 , think something like Java flight recorder, that just dumps raw facts.

event(Sequence_number,Frame_id,Port,Clause_id,Choice_id).
frame(Frame_id,Frame_ref,Clause_id,Parent_frame_id,Level).
frame_argument(Frame_id,Argument_id).
argument(Argument_id,Argument).
clause(Clause_id,File,Line_number,Context_module,Predicate,Argument_count).
choice(Choice_id,Parent_choice_id,Frame_id,Type).

A user can then run queries to generate

Text reports
Static visual graphs (Graphviz)
Interactive HML pages
Interactive visual graphs (cytoscape.js or d3.js)
etc.

Granted as it stands now it is not something one would want to run with code that runs in production. Also I don’t plan to create PRs for this or even publish the code at present, it is a proof of concept.

Topic		Replies	Views
Update version of coverage analysis Resources	3	493	February 7, 2024
Any programatic way to obtain rules used to prove a goal? Help!	18	1152	February 18, 2020
Coverage info mysterioulsy disapear Help! bug	12	505	September 16, 2022
PLunit How-To / teaching example Help!	2	981	December 2, 2019
Show_coverage/1 and incremental tabling resulting in odd behavior? General	3	360	October 14, 2022

Granular code coverage tool?

Related topics