pbxt-discuss team mailing list archive

Thread
Date

Re: goal 0 of embedde pbxt reached || (was: Re: PBXT: Embedded Database (library))

To: Martin Scholl <ms@xxxxxxxxxxxxx>
From: Paul McCullagh <paul.mccullagh@xxxxxxxxxxxxx>
Date: Thu, 11 Feb 2010 23:45:25 +0100
Cc: PBXT Discuss <pbxt-discuss@xxxxxxxxxxxxxxxxxxx>
In-reply-to: <418c6ccf1002110812k5464747cud59e0bc97b440e0d@mail.gmail.com>

Hi Martin,

On Feb 11, 2010, at 5:12 PM, Martin Scholl wrote:

As this is my first posting, I'd like to say "hello" to you all.
On Thu, Feb 11, 2010 at 4:19 PM, Paul McCullagh <paul.mccullagh@xxxxxxxxxxxxx> wrote:
[snip]
I guess the next step would be to defined an interface, and beginthe implementation.
Beneath the missing bloody THD- and codeset / charset-relatedstuff... :-)Even more, you will notice a lot of assert(0)s in the currentversion...To be clear: the current state of the code is solely to "make itcompile" instead of "be maintable" or even "be clean". Please keepthis in mind when reading my "code".


Understand, but it is a good start...

This raises the question of whether to use the MySQL handlerinterface, or to go in and replace ha_pbxt.cc altogether.
I'd propose to skip ha_pbxt.cc altogether and stick with a dedicated(and pbly simplified) embedded API. ha_pbxt.cc is not part of thecurrent build-set anyways. :-)


Yes, OK.

One of the things I like the most about libraries like Tokyo Cabinetis its straight-forward API. I would love to see embedded PBXT beeasy like this as well.

Absolutely agree. As few API calls as possible, and they should beeasy to understand.

Embedded InnoDB's API might be a good start and reference for an APIsketch: http://www.innodb.com/doc/embedded_innodb-1.0/


Yes, I read that through again. Most of it could be taken over 1 to 1.

IMHO what is open and where I would really appreciate yourfeedback / comments:
- what language should the API be in? C or C++?

Well, C has the advantage that it is easy to put a C++ wrapper aroundif you want to, the other way around is tricky. So unless there is agood reason, I would recommend a C API.

- In which format shall we store the table/db definitions? protobuffmaybe? Afair Drizzle does so, so we could borrow some code there...


protobuf may be an overkill for the initial implementation.

How are you planning to do create table? The innodb API does it bybuilding a create table structure with various API calls.

By submitting a CREATE TABLE statement as text, you can save a lot ofAPI routines.

PBXT already has a parser for CREATE (and ALTER) table statements. So,you could accept the text and feed the parser.

Then, you could actually store the table definition as a CREATE TABLEstatement. When the table is loaded you just invoke the parser. TheCREATE TABLE text could be stored in a separate file, like the .frmfile, for each table.

Alternatively the text could be stored in the header of the .xtd file,where I already store the foreign key information (the foreign keyinformation is actually stored as SQL text).

However, this may be going too far with the integration of theembedded code and PBXT itself.

Basically, what would be cool is if the embedded wrapper code controlsthe following:


1. The types of data stored.
   - We can start with a few very basic types.
2. The format of a record in RAM

- This is the same format that PBXT uses on disk, as long as therecords are fixed length- For variable length records it uses a simple serializationmethod (as I mentioned before)

3. The format of index records

- with an interface to get and set data in a row, the engine doesnot need to actual format

4. The comparison of data types
   - the wrapper provides routines to compare data types.

- These are mostly methods which are part of the data dictionaryin RAM

5. The format of the data dictionary on disk, and in RAM
   - the wrapper reads and writes this data.

This will give us great flexibility to add data types and othercomplexities later.

It is also pretty much the division of work between MySQL code andPBXT today. However, the division is not so clear in the code.

- else, should table serialization / deserialization be pluggable oreven be purely programmatic? I am fine with this, too, as it is an_embedded_ library and I'd guess most people will control pbxprogrammatically anyways

Although I spoke mainly about the textual interface above, I am reallyflexible on this. I think both solutions have there advantages.

Use whichever is best and easiest for you at the moment, which may besimply writing your own stuff! :)

- library naming: are you fine with libembpbxt?


Yup, that sounds good.

If you use the handler interface, then you will have to continue tosimulate MySQL, which may not suite the API (you will have to callthe handler functions in the same order that MySQL does).
If you replace ha_pbxt, then you will have to nevertheless includesome of the functionality in this code. For example, you shouldtake over the init and shutdown code.
What you need to keep is the "cursor" type paradigm.

What I mean is, to do and index or table scan you do the following:

- open a cursor for a table
 * which means grap an XTOpenTable from the table pool
- call init
 * Initialize the scan.
- Call search and next in a loop.
- call exit
 * Free resources
- close the cursor
 * which means return the open table to the pool

All such actions need to be enclosed in a:

- begin transaction
...
- commit/rollback transaction
The transaction is per thread, and all relevant information isstored in the XTThread structure.
Ok, a lot of open questions are answered by this. Thank you, Paul!


[snip]

Martin
P.S.: I will set up a TODO file to make it easier to track embeddedPBXT's progress


OK, great.

--
Paul McCullagh
PrimeBase Technologies
www.primebase.org
www.blobstreaming.org
pbxt.blogspot.com

Follow ups

Re: goal 0 of embedde pbxt reached || (was: Re: PBXT: Embedded Database (library))
From: Stewart Smith, 2010-02-12
Re: goal 0 of embedde pbxt reached || (was: Re: PBXT: Embedded Database (library))
From: Martin Scholl, 2010-02-11

References

Re: goal 0 of embedde pbxt reached || (was: Re: PBXT: Embedded Database (library))
From: Paul McCullagh, 2010-02-11
Re: goal 0 of embedde pbxt reached || (was: Re: PBXT: Embedded Database (library))
From: Martin Scholl, 2010-02-11