What does this solve that something like Rabbitmq doesn't? Am I missing some key...

aeyes · on Nov 11, 2020

Everything listed under "Features", "System guarantees" and "Lambda requirements"?

Dropbox using Python, the real question is what Celery didn't solve for them. My guess would be scalability.

solumos · on Nov 11, 2020

It seems that Nextdoor also had issues with celery[0].

"Scalability" is a great scapegoat for making dubious decisions, but my guess here would be the "task priority" requirement.

[0] https://engblog.nextdoor.com/nextdoor-taskworker-simple-effi...

aeyes · on Nov 12, 2020

Hm, this post is from 2014 so many of the arguments presented don't stand anymore.

Today you can containerize and autoscale Celery workers, so having task specific queues and worker groups will solve your resource utilization problems. For even better utilization, you can have gevent and process based workers depending if the task is resource or I/O heavy.

In years I have never seen Celery workers hang, we process millions of tasks (hundreds of task types) every day. Since the post is very old, I'm sure that they had to deal with a few nasty bugs back then.

However, Celery does have real scalability issues: If you run over 1000 worker nodes Celery gossip alone will be well over 1000 messages per second. Having 1000s of gossip queues isn't ideal either given that RabbitMQ is not easily scalable. You might end up with multiple clusters which can be a bit of a pain in a single codebase. I'm not sure if running Celery on SQS solves some of these issues.

sateesh · on Nov 12, 2020

One more reason could be, prioritising the tasks to pick. In celery you can create multiple task queues and prioritise by allocating fewer resource to consume from a lower priority queue (say) and allocate more resources to consume from high priority queue. This configuration is static and is a configuration option. But if you want to dynamically change the priority how you consume tasks from each queue, say by having a scheduling code picking tasks from different queues and handing it over to your application code that would be difficult. I don't think introducing a scheduling layer between celery and the application code is possible unless you modify the celery code.

Also I am not sure how scalable RabbitMQ is, redis as broker scales quite well but you lose durability when you use redis.

Thaxll · on Nov 11, 2020

Rabbitmq does not solve this problem, Rabbitmq offer a solution for message passing that's it, it does not offer a framework to execute tasks etc ...