pbxt-discuss team mailing list archive

Thread
Date

Re: Row buffers and Object (was Re: free_table_share() != drizzle)

To: Brian Aker <brian@xxxxxxxxxxx>
From: Paul McCullagh <paul.mccullagh@xxxxxxxxxxxxx>
Date: Tue, 25 May 2010 23:55:09 +0200
Cc: PBXT Discuss <pbxt-discuss@xxxxxxxxxxxxxxxxxxx>
In-reply-to: <57A15D1C-1DFC-40A3-B5D4-6F6001B7E1B1@tangent.org>

Hi Brian,

On May 22, 2010, at 2:41 AM, Brian Aker wrote:

On May 21, 2010, at 2:31 AM, Paul McCullagh wrote:
My problem is that the val_ methods are hard-coded to referencerecord[0].
My plan is to add a second set of Field that will correspond to thesecond row (got any handy names!). That way we won't need a hack inorder to access them.
The cost of generating those for user generated Tables is verysmall, so there is no reason not too. For "internal" tables it is abit different, but then the pattern of access there is really muchmore confined.

As I said in previous e-mails. I think you should to the record[]array into an array of Row objects.


The we would simply access table->record[1]->val_....()

PBXT uses the MySQL record format internally if the record has afixed length (similar to how MyISAM).
Are you though? MyISAM doesn't strictly use the format. It rereadsfor the format and packs in for cases where blobs, varchar, etcexist. Most of the time, assuming that most tables are dynamic, youhave to cycle through it so you are reading each bit. The onlyunaltered use is when you have a row that is entirely made up ofconstants (aka... numbers, etc).

Yes, this is true. And this is the only time that PBXT actually usesthe row exactly as is, unaltered.

Actually, PBXT also uses this "fixed length" format, when VARCHAR,CHAR and other variable types are included (but not BLOBs), as long asthe fields maximum size is not too large.


PBXT uses a different, variable length, format in all other cases.

Which format PBXT uses depends on the estimated average row length(which can be set explicitly using AVG_ROW_LENGTH attribute of thetable). If the fixed length of a row is smaller than the estimatedaverage row length, then it uses the fixed length format.

It MySQL the "byte" type is even more confusing since it is encodeddifferently in the raw format.
Instead, I suggest a "Row" object, into which I can plug the startof the row buffer. Then we would have:
doInsertRecord(Row *row_object)
The Row object knows everything about the row and can be used toaccess the fields.
This sounds fine, and essentially what we have at the moment (but noknowledge of row2).
I hate calling them row1 and row2 though. Any ideas on that? Do youwant them handed to you? Should we encapsulate so that you can'taccess them unless they are passed in?

Basically I think, yes. The engine should not know, and should notcare whether record[0] or record[1] has been passed to doInsertRecord().

One thing I would disagree on though is the setBuffer(), thatremoves all encapsulation on the fields (which is bad, since overtime those do change). We have one format for Decimal right now, ifwe change that and have no version information we take the chance wemight push data back and forth which would not be compatible.

I am OK with this. But, it means we have to create a Row object forevery row buffer we want to use.

This may be OK for the front end, which just needs to create record[0]and record[1] for each handler. But, would not like to have to do thisinside the engine for every row I instantiate.


Best regards,

Paul

Cheers,
	-Brian
So the Row object is a "per thread" object. You call row_object->setBuffer(u_char *buf) to set the row buffer and then you use anarray of fields to get and set the data in the buffer.
So then the record[] array would become an array of Row objects.And an engine could create Row object to manipulate row buffersinternally (which is exactly what I am doing with the internalTable object - or TableShare object).
The record[0] object would be passed to doInsertRecord(buf), as itis today.
But now, the engine will not have to know that: buf == record[0]!
(Which is a correspondence that is far from obvious, but is almostessential to know if you are implementing a storage engine today!)
Stewart: would something like this also work for yourimplementation of the Embedded InnoDB?
On May 20, 2010, at 8:16 PM, Brian Aker wrote:
Hi!
I'd unpack the row by using the Field object and not go with theoffset (longterm we won't support touching the record[] directlysince it creates a big problem for abstraction). Via val_ methodsyou can request the individual members. The offset was originallyadded so that two sets of field objects would be needed (aka onefor recored[0] and one for record[1]). Our cost for building thisstuff is pretty low so we can just give folks a set of fields forboth images so that you aren't stuck trying to figure out theunderlying contents.
Cheers,
	-Brian

On May 20, 2010, at 1:04 AM, Paul McCullagh wrote:
Hi Brian,

Just one problem maybe you have a quick suggestion:

How do I get the offset of a field into the row buffer?

When using the Table object I did this as follows:

field->offset(field->table->record[0])

But this does not work with TableShare.

On May 17, 2010, at 6:07 PM, Brian Aker wrote:
Hi!

On May 17, 2010, at 4:21 AM, Paul McCullagh wrote:
Are you suggesting I create a TableShare on the stack wheneverI need it?I don't think this would work because AFAIK I have to callopen_table_def(), which loads the table definition. So callingthis each time I want to copy data in and out of the row wouldbe too slow.
What may work is to use a TableShare object instead of a Tableobject.
That is what I was suggesting, just create an object and use it.
Question is, can I do something like:
share = new TableShare();
share->init(db_name, 0, name, path);
error = open_table_def(&thd, *ident, share);
Yes you can do this, hell, we can probably make it simpler thenthis as well.
and then later simply:
delete share;
to remove.
Yep. If you look in createTable() you can even see how we dothis during that operation.
Cheers,
	-Brian
On May 14, 2010, at 9:03 PM, Brian Aker wrote:
Hi!

On May 14, 2010, at 9:37 AM, Paul McCullagh wrote:
Here the engine will follow the "table" pointer to the"field" array, where it uses the offsets of the data in arecord, in order to copy data in and out of the record.
So what you need is the Field** that is in share (aka, youdon't even need Table, you just need TableShare).
You could just do this:

TableShare my_share(<share_key>,....);
That way you have your own object and you never pass throughany of the locking system/dealing with any of the objectcounting.
Cheers,
	-Brian
--
Paul McCullagh
PrimeBase Technologies
www.primebase.org
www.blobstreaming.org
pbxt.blogspot.com
--
Paul McCullagh
PrimeBase Technologies
www.primebase.org
www.blobstreaming.org
pbxt.blogspot.com
--
Paul McCullagh
PrimeBase Technologies
www.primebase.org
www.blobstreaming.org
pbxt.blogspot.com




--
Paul McCullagh
PrimeBase Technologies
www.primebase.org
www.blobstreaming.org
pbxt.blogspot.com

Follow ups

Re: Row buffers and Object (was Re: free_table_share() != drizzle)
From: Brian Aker, 2010-05-26

References

free_table_share() != drizzle
From: Brian Aker, 2010-05-08
Re: free_table_share() != drizzle
From: Paul McCullagh, 2010-05-10
Re: free_table_share() != drizzle
From: Brian Aker, 2010-05-11
Re: free_table_share() != drizzle
From: Paul McCullagh, 2010-05-14
Re: free_table_share() != drizzle
From: Brian Aker, 2010-05-14
Re: free_table_share() != drizzle
From: Paul McCullagh, 2010-05-14
Re: free_table_share() != drizzle
From: Brian Aker, 2010-05-14
Re: free_table_share() != drizzle
From: Paul McCullagh, 2010-05-17
Re: free_table_share() != drizzle
From: Brian Aker, 2010-05-17
Re: free_table_share() != drizzle
From: Paul McCullagh, 2010-05-20
Re: free_table_share() != drizzle
From: Brian Aker, 2010-05-20
Row buffers and Object (was Re: free_table_share() != drizzle)
From: Paul McCullagh, 2010-05-21
Re: Row buffers and Object (was Re: free_table_share() != drizzle)
From: Brian Aker, 2010-05-22