maria-developers team mailing list archive

Re: [GSoC] Accepted student ready to work : )

Hi, Pablo!

On Apr 27, Pablo Estrada wrote:
> In any case, I already have a decent idea of how the data is arranged. I
> can see how it is difficult to trace test results back to code changes
> directly, but it might be possible.

From the historical data that you have it is strictly impossible.
One can, for example:
  + do some changes
  * bzr commit -> revno: 1234 (for example), revid: something@unique-1
  * bzr push
  + buildbot builds and tests revno 1234
  * bzr uncommit
  + do more changes
  * bzr commit -> revno: 1234 (for example), revid: something@unique-2
  * bzr push
  + buildbot builds and tests revno 1234

Note that two *different* revisions got the same revno! And the changes
from the first revision are completely and totally lost; there is no way
to retrieve them from anywhere.

Revision-id is the only unique identifier for a revision, unfortunately,
it's not logged in these tables. I believe we'll change buildbot so that
revid would be logged in the future. But so far it wasn't needed, and
this is one of the defects in the data.

In my testing data I've generated fake revision ids - using md5 over,
perhaps, the revno, the set of changed files, and timestamps; I don't
remember the details. I've assumed that when this is put in production,
buildbot will log real revids.

> I still haven't figured out how to recognize what code changes pertain
> to the same 'history', or 'line' (I think that you were not
> considering this in your experiments... and it might turn out to not
> be relevant at all, but I still want to consider it, to try to make a
> more complete model).

To a certain extent you can get this from revnos (but not from revids).
See any complex history graph in "bzr viz". For example this:
http://mhall119.com/2010/04/bzr-is-amazing/
(I've just googled it out with image search for "bzr viz")

Note that changes in the same "line" have the same revno prefix, and
only the last component of the revno is incremented. On that
screenshot, you can recognize these "lines" of changes:

black:   28 -> 29 -> 30 -> 31 -> 32
blue:    19.3.1 -> 19.3.2
green:   19.1.5 -> 19.1.6 -> 19.1.7 -> 19.1.8 -> 19.1.9 -> 19.1.10 -> 19.1.11
magenta: 19.4.1
red:     19.5.1
yellow:  19.6.1 -> 19.6.2
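Grouping revnos into such lines is mechanical; a small sketch (Python, not
part of any existing tool, with the revnos above as sample data):

```python
def line_of(revno):
    """Return the dotted prefix of a revno, i.e. its line of development.

    A branch revno like "19.3.2" belongs to line "19.3"; a plain
    mainline revno like "30" has no dotted prefix.
    """
    parts = revno.split(".")
    return ".".join(parts[:-1]) or "mainline"

revnos = ["28", "29", "19.3.1", "19.3.2", "19.1.5", "19.1.6", "19.4.1"]
lines = {}
for r in revnos:
    lines.setdefault(line_of(r), []).append(r)
print(lines)
```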

If you'd like, you can install bzr, branch lp:maria, and run "bzr viz"
or "qbzr".

> So I have just two questions that should be easy to answer by email:
> 1. In your graphs, exactly, what do you mean by Precision? What about
> Recall? And cutoff?

For many years I was very interested in Information Retrieval (I'm the
author of the MySQL full-text search feature). These terms come from IR:
http://en.wikipedia.org/wiki/Precision_and_recall

Think of it this way:
* There's a revision we want to test in buildbot.
* There are NNN (more than 20,000) tests to run in total (taking into
  account all combinations, test variants, and builders).
* We only want to run N (much less than NNN) tests.
* We sort all NNN tests in some specific order, and take first N
  tests. N is the "cutoff".
* Now, I check the history - what tests out of all NNN have failed and
  succeeded.
* Ideally, the first N tests should've failed, and all other NNN-N tests
  should've succeeded.
* Precision measures how many of the first N tests actually failed:
  100% means that all N selected tests failed, i.e. no succeeded test
  was selected.
* Recall measures how many failed tests were left outside of the first
  N: at 100% there are no failed tests outside of the first N, i.e. no
  failed test was missed.

In IR, precision and recall should always be considered together. It's
trivial to optimize only one of these values. E.g. for N=0 you have
perfect precision (you never select a succeeded test) but very bad
recall: all failed tests were missed. Alternatively, for N=NNN, you
have perfect recall and bad precision.

In my results (see the readme from v1) I realized that I don't care
about precision as such; I am only interested in recall and the total
testing time. Although precision and time are related, time is much
more meaningful here. So I optimize a {Time, Recall} pair, not
{Precision, Recall}.
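A toy illustration of that {Time, Recall} trade-off, with invented
durations, ranking, and outcomes: as the cutoff N grows, recall rises
but so does the total time spent running the selected tests.

```python
durations = {"t1": 30, "t2": 5, "t3": 120, "t4": 10, "t5": 60}  # seconds
ranked = ["t1", "t2", "t3", "t4", "t5"]   # some hypothetical ranking
failed = {"t1", "t3"}                     # historically failing tests

for n in range(len(ranked) + 1):
    selected = ranked[:n]
    time_spent = sum(durations[t] for t in selected)
    recall = len(set(selected) & failed) / len(failed)
    print("N=%d time=%ds recall=%.1f" % (n, time_spent, recall))
```

Here N=3 already reaches full recall at 155 seconds, while running
everything (N=5) buys nothing more for an extra 70 seconds.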

> 2. I looked at your launchpad profile but I can't seem to find the repo
> with your tests (if you even pushed it to lp at all). Is it available?

Uhm. I've never published it. It was pretty much write-only code in
very hairy Perl. I don't know if it makes any sense to look at it at
all.

But ok, I've just pushed it, here you are:
bzr+ssh://bazaar.launchpad.net/~maria-captains/mariadb-tools/mtr-prob-tmp

Regards,
Sergei
