openstack team mailing list archive

Thread
Date

Several questions about HOW SWIFT WORKS

To: openstack@xxxxxxxxxxxxxxxxxxx
From: Alejandro Comisario <alejandro.comisario@xxxxxxxxxxxxxxxx>
Date: Tue, 03 Jan 2012 14:32:51 -0300
User-agent: Mozilla/5.0 (X11; Linux i686; rv:8.0) Gecko/20111105 Thunderbird/8.0

Hi everyone !

Since we are using swift for a time now, we would like to know a fewthings in a deep way about how some things actually works in SWIFT.


Imagine the setup where im putting all the doubts is as follow :
+ 2 proxyNodes
+ 10 dataNodes ( 5 zones )

So, lets get down to business.

# 1 we have memcache service running on each proxy, so as far as weknow, memcache actually caches keystone tokens and object paths as therequest ( PUT , GET) enters the proxy, but for example, if we restartone proxy server, so the memcached service is empty, is the restartedproxy node going to the neighbor memcache on nex request, lookup forwhat it needs, and cache the answer on itself so the next query issolved locally ?

# 2 the documentation says regarding "For each request, it will look upthe location of the account, container, or object in the ring (seebelow) and route the request accordingly" in what way the proxy actuallydoes the look-up regarding WHERE is an object / container in the cluster? does it connect to any datanode asking for an object location ? doesthe proxy have any locally sotarge data ??

# 3 Maybe it has to do with the previous question but, every dataNodeknows everything that is stored on the cluster (container service) oronly knows the object that has itself, and the replicas of its objects?

# 4 We are building a production cluster of 24 datanodes, having 6drives each (144 immediate drives) we know, that a good default numberof partitions per drive is 100, so the math for creating the ring willbe (24 nodes * 6 drives * 100 partitions) but we know the at the end ofthe year, the amount of datanodes (and drives also) could be 2x or 3xmore. So, for the initial setup, can we build the RING with our 144drives and 100 partitions per drive so we can modify the ring /partitions later and rebalance? or is safer to think about futureinfrastructure increase, and build the ring with those numbers in mind ?

# 5 We put a new object into the cluster, the proxy decides where towrite the object (is it in a round-robin manner ?) is the proxy servergiving a "Created" response when the 1st replica is actually writen andput into the account and container SQLite databases ? or there is and okjust when the OBJECT service actually wrote the data on disc ?


Hope, we can shed some lights regarding this doubts.
Thanks !

Cheers.

--
Alex <www.mercadolibre.com>

Follow ups

Re: Several questions about HOW SWIFT WORKS
From: Chmouel Boudjnah, 2012-01-04
Re: Several questions about HOW SWIFT WORKS
From: John Dickinson, 2012-01-03