maria-developers team mailing list archive

Thread
Date

DBT-3/TPC-H RQG tests are now available

To: "maria-developers" <maria-developers@xxxxxxxxxxxxxxxxxxx>
From: "Philip Stoev" <pstoev@xxxxxxxxxxxx>
Date: Tue, 14 Dec 2010 11:45:04 +0200
Organization: Monty Program AB

Hello,

In response to popular demand, the DBT-3 dataset will be used for testingalong with a new set of grammars that generate queries against that dataset.

While I personally doubt that the realism of the DBT-3, it being 99% random,here is what we have so far:


1. DBT-3 datasets for scales 0.1 0.01 and 0.001

2. RQG grammars that implement the following:

- a grammar on the range optimizer via single-table queries against thelineitem database. The WHERE clause consists of nested AND and ORexpressions of varying depth, where each individual expression involves anindexed column and is generated to be realistic. E.g. for a date column, wegenerate expressions that filter records for a specific month or a specificyear.

- a grammar for general join tests - realistic multi-table joins aregenerated by observing the star structure of the dataset. The WHERE andHAVING conditions are generated to be realistic with respect to the columnsbeing queried or filtered out. GROUP BY is used so that it matches theONLY_FULL_GROUP_BY mode.

- (forthcoming) - subquery tests where subqueries that return a currencyvalue are used in various locations and expressions within the query where acurrency value would generally be expected;

In addition, I have reviewed the actual queries from the benchmark and Ithink the RQG grammars more or less cover the scenarios described in thespecification.

From the test runs that have been performed so far, no new bugs are being

discovered as compared to the existing RQG grammars that use purely randomqueries against purely random data.

Philip Stoev