pbxt-discuss team mailing list archive

Thread
Date

Re: Row buffers and Object (was Re: free_table_share() != drizzle)

To: Stewart Smith <stewart@xxxxxxxxxxxxxxxx>
From: Paul McCullagh <paul.mccullagh@xxxxxxxxxxxxx>
Date: Tue, 25 May 2010 23:41:55 +0200
Cc: PBXT Discuss <pbxt-discuss@xxxxxxxxxxxxxxxxxxx>
In-reply-to: <8739xif0a0.fsf@willster.local.flamingspork.com>

Hi Stewart,

On May 24, 2010, at 5:26 AM, Stewart Smith wrote:

On Fri, 21 May 2010 11:31:20 +0200, Paul McCullagh <paul.mccullagh@xxxxxxxxxxxxx> wrote:

My problem is that the val_ methods are hard-coded to reference
record[0].


You can get around that by using Field::move_field_offset.

something like:

 ptrdiff_t row_offset= buf - table->record[0];
 (**field).move_field_offset(row_offset);
 (do things with field)
 (**field).move_field_offset(-row_offset);

if buf is record[1] then we'll be doing things with it.

Because I need to access row buffers internally, this does not workfor me, because the Table object is shared amongst threads, so Icannot set the ptr inside the fields.

I have also now switched to the TableShare object, instead of theTable object, because the are easier to create and destroy.

TableShare has no record[], instead of record[0] you have to share->use default_values.

So I have a problem when the row buffer is an engine internal buffer
(not record[0]). In this case I need the offset to calculate wherethe
field data starts in the internal row buffer.
the above should work... although it would be way better if we fixit up
to be an absolute pointer instead of doing (faulty) pointer math.

As I say, this does not work if your table object is shared amongstthreads. I don't really want to have to create one of these objectsper thread.

From this point of view, expanding the dependency on record[] would
be the wrong way to go.

Instead, I suggest a "Row" object, into which I can plug the start of
the row buffer. Then we would have:

doInsertRecord(Row *row_object)

The Row object knows everything about the row and can be used to
access the fields.

I've been thinking of a Tuple object instead, so we don't alwayshave tohave the entire row buffer around if only a subset of rows areoperated

on (this could be more advantageous for column based engines).


I guess you mean a "subset of columns".

I guess by "tuple" you just mean an object where I can get and setfield values but don't know how they are stored. Or do you meansomething else?

I'm also thinking that Tuple would just be an interface, and engines
could provide their own implementation.

With accessor methods, then the main server code could examine rows in
their native format instead of always having to convert from engine to
server formats. We would then only convert rows to over-the-wire
(server) format when they go over the wire.

This would mean the engine would supply the storage for the row. Thismay indeed be more efficient.

The engine already provide storage for the BLOBs, so I guess it wouldnot be a problem if the tuple has the same scope as BLOBs today (whichmeans the tuple is valid until the next row is retrieved on the cursor).


But, how would you then handle doInsertRow()?

The natural way to handle this would be for the front-end to providethe tuple to be inserted.

Would you add a call: getTuple(), which returns an empty tuple, andthe call setField() for each column on the tuple and then calldoInsertRow(tuple)?


Seems rather awkward to me. Or do you have a different idea?

So the Row object is a "per thread" object. You call row_object-

setBuffer(u_char *buf) to set the row buffer and then you use an

array of fields to get and set the data in the buffer.

We could do RowBuffer(row_buffer, ptr) instead, so that it'simpossible

to ever forget to set the buffer back to something.

I am not sure what you mean here. Is RowBuffer() a constructor? Whatis row_buffer, and what is ptr in this case?

So then the record[] array would become an array of Row objects. And

an engine could create Row object to manipulate row buffersinternally

(which is exactly what I am doing with the internal Table object - or
TableShare object).


I think a Row object is just formalising the existing interface a bit
more... which could be a good intermediate step.


Yes, this is true.

I like the Tuple idea a
bit better for a number of reasons, most of which are just different

ways of expressing "things more compact in memory" and "saveconverting

row/tuple formats".

One thing to think about is that Field is not part of record[], and it
would sort of have to be part of Row to be sane and be able to have
different Row objects accessed concurrently (or wrap Field calls to
get/set the offset).

Actually I see Field as part of record[] today by the fact that 'ptr'references the record[0] buffer.


Getting a field to reference any other buffer is a fudge at the moment.

But otherwise, you are right, my suggestion would mean that thisbinding between row buffer and Field becomes even more close.

Although, with setBuffer() as I propose above, all fields can be setto reference a different buffer.

This is the same as move_field_offset(), but instead of changing onefield, you change all fields in the row at once.


So there are 2 possibilities:

1. Keep one set of fields, as it is at the moment, and switch fromrecord[0] to record[1] as required, or2. Turn record[] into an array of Row object where each row referencea different row buffer (then no switching is required).

There is a 3rd possibility, which will probably require a lot of codechange:

- Provide the row buffer, on each call to a Field method. In thiscase, the Row object would be buffer independent, and could be used bymultiple threads.

The record[0] object would be passed to doInsertRecord(buf), as it is
today.

But now, the engine will not have to know that: buf == record[0]!


mostly :)

That's the annoying bit - it's *mostly* true :)

Yup, and it is therefore just as well that I have never relied onthis. I have always used the field offset, and the buffer pointer toget the field data.

Stewart: would something like this also work for your implementation
of the Embedded InnoDB?
I'd prefer the Tuple way of doing things, as when operating on asubset
of columns, it would save a bunch of effort.

This is fine, as long as Drizzle provides some sort of interface to dothe following:


- Get the collation sequence of a column.

- Do string comparison operations with field data, and the collationsequence

- Do comparison of decimal encoded values, and other special data types


--
Paul McCullagh
PrimeBase Technologies
www.primebase.org
www.blobstreaming.org
pbxt.blogspot.com

Follow ups

Re: Row buffers and Object (was Re: free_table_share() != drizzle)
From: Stewart Smith, 2010-05-26

References

free_table_share() != drizzle
From: Brian Aker, 2010-05-08
Re: free_table_share() != drizzle
From: Paul McCullagh, 2010-05-10
Re: free_table_share() != drizzle
From: Brian Aker, 2010-05-11
Re: free_table_share() != drizzle
From: Paul McCullagh, 2010-05-14
Re: free_table_share() != drizzle
From: Brian Aker, 2010-05-14
Re: free_table_share() != drizzle
From: Paul McCullagh, 2010-05-14
Re: free_table_share() != drizzle
From: Brian Aker, 2010-05-14
Re: free_table_share() != drizzle
From: Paul McCullagh, 2010-05-17
Re: free_table_share() != drizzle
From: Brian Aker, 2010-05-17
Re: free_table_share() != drizzle
From: Paul McCullagh, 2010-05-20
Re: free_table_share() != drizzle
From: Brian Aker, 2010-05-20
Row buffers and Object (was Re: free_table_share() != drizzle)
From: Paul McCullagh, 2010-05-21
Re: Row buffers and Object (was Re: free_table_share() != drizzle)
From: Stewart Smith, 2010-05-24