graphite-dev team mailing list archive

[Question #673695]: better loadbalancing of connections between duplicator, relay and cache

 

New question #673695 on Graphite:
https://answers.launchpad.net/graphite/+question/673695

Hi,

I'm running a Graphite setup with two servers. One receives all metrics and all queries. The other serves as a backup and receives only copies of all metrics. Each server runs haproxy, 16 duplicators, 8 relays and 16 caches.

As the system is under high load (CPU and disk), the queues of the duplicators and relays are often full.
I've now noticed that the load is not evenly distributed between the duplicators and relays. Some have a full queue for hours while others are basically empty all the time.

haproxy is currently configured with leastconn, but I assume that doesn't help much, since connections are long-lived and some send many metrics while others send just a few.
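
The suspicion about leastconn can be illustrated with a minimal Python sketch. It is a toy model, not haproxy's actual implementation: each new long-lived connection goes to the backend that currently holds the fewest connections, and connections never close. The sender rates and the backend count are made-up values for illustration only.

```python
def assign_leastconn(rates, n_backends):
    """Toy model: place each long-lived connection on the backend
    with the fewest connections, ignoring how much each one sends."""
    conn_count = [0] * n_backends   # connections per backend
    load = [0] * n_backends         # metrics/second per backend
    for rate in rates:
        # Ties go to the lowest-numbered backend.
        target = conn_count.index(min(conn_count))
        conn_count[target] += 1
        load[target] += rate
    return conn_count, load


# Hypothetical senders (metrics/second): two heavy, six light.
rates = [50000, 100, 100, 100, 50000, 100, 100, 100]
conn_count, load = assign_leastconn(rates, 4)
print("connections per backend:", conn_count)
print("metrics/s per backend:  ", load)
```

In this model every backend ends up with the same number of connections, yet one backend carries almost all of the metric volume, which would match queues that stay full on some processes and empty on others.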

Any ideas on how to balance the load better across the relays and duplicators?
Or other ways to bring down the queue length?

Cheers,
Felix


-- 
You received this question notification because your team graphite-dev
is an answer contact for Graphite.