randgen team mailing list archive

Thread
Date

Re: Question about Comparison vs. Stress

To: randgen@xxxxxxxxxxxxxxxxxxx
From: John Embretsen <johnemb@xxxxxxxxx>
Date: Tue, 27 Aug 2013 08:19:35 +0200
In-reply-to: <BLU178-W398472A63B7F3127B1DBE0CB490@phx.gbl>
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:17.0) Gecko/20130329 Thunderbird/17.0.5

Hi Joel,

On 08/26/2013 08:49 PM, Joel Epstein wrote:

Hello,

I am currently trying to run some of the RQG tests for correctness (e.g.
comparison) as opposed to using the suite to test for stress failures.
  Is there anything I should look for within the grammar files or, in
general, that might indicate what tests are not really set up to be run
for correctness (e.g. comparison)?   On one of the tests, I had to an
ORDER BY clause to get the Update, Delete, Insert statements to validate
the test results.  Otherwise, divergence would quickly occur.

There is so far no systematic separation of grammars that are suitablefor comparison testing from other grammars (as far as I know). On theother hand, I have used the RQG for correctness testing myself for along time, and it works pretty well, with some caveats.

Sometimes the grammar author has added some comments to the top of thegrammar file, such as "This grammar is not suitable for comparisonbecause..." or "In order to use this for result comparison, you needto..." or something to that effect. So I recommend to look for this first.

In most grammars there are no such comments. So it is not easy to tellwhich grammars are suitable or not. I can tell from experience thatmost grammars in the optimizer category are somewhat suitable for suchtesting, but not without some effort with post-processing of theresults, in order to separate real issues from false positives. The sameis likely true for several other grammars.

Then it is the question how you do the actual testing. There are severalways to test for correctness, for example:


 - Use a 2-way or 3-way result comparator validator such as
   ResultsetComparatorSimplify, to compare results from two
   (or three) servers directly.

https://github.com/RQG/RQG-Documentation/wiki/RandomQueryGeneratorValidators#wiki-Optimizer_Comparison_Testing_and_Data_Validation

 - Use the Transformer validator to compare results from a single
   server using equivalent alternative queries that are "transformed"
   from the original queries produced by the grammar.

https://github.com/RQG/RQG-Documentation/wiki/RandomQueryGeneratorTransforms

I am not sure what you mean by using "the Update, Delete, Insertstatements" to validate results. If the Transformer validator is used,the order of the rows does not matter in most cases, as long as the sameset of rows is returned for both queries, due to hints like"TRANSFORM_OUTCOME_UNORDERED_MATCH", which makes the Transformervalidator ignore the order when comparing. In other contexts theordering may pose a challenge.

In any case, lack of ORDER BY can be an issue in some special casesregardless of the comparison method used. For example, lack of fullORDER BY in combination with LIMIT can result in non-deterministicresults, making result comparison difficult. The same is often the casewith "hidden GROUP BY/ORDER BY", or combining aggregates andnon-aggregates in the SELECT list without including all non-aggregatesin GROUP BY, or using HAVING without GROUP BY, or non-deterministicstatistical functions, or non-sensical aggregates (e.g. SUM(VARCHAR)), etc.

Some of these issues can be avoided by includingsql_mode=ONLY_FULL_GROUP_BY in the server settings for MySQL. Otherdatabase vendors may have something similar, or simply reject more ofthe non-deterministic query types, so your mileage may vary.

So, there are some traps and pitfalls, but with some patience and someexperience I think the RQG is useful also for correctness testing.


I hope this helps,


--
John

References

Question about Comparison vs. Stress
From: Joel Epstein, 2013-08-26