
Create a database for each Django Model instance

I have one "master Model" with lots of "child Models". Since I know there will be a lot of data (through the children), I would like to dynamically store each new instance of that MasterModel in a separate database (along with all its children):

class MasterModel(models.Model):
    name = models.CharField(max_length=100)
    # db_connection_name = 'mastermodel_1_db'

class ChildModelA(models.Model):
    mastermodel = models.ForeignKey(MasterModel, on_delete=models.CASCADE)

class ChildModelB(models.Model):
    mastermodel = models.ForeignKey(MasterModel, on_delete=models.CASCADE)
    children = models.ManyToManyField(ChildModelA)

class ChildModelC(models.Model):
    ...

There are a lot of children and relationships, but never between MasterModel objects.

From here, I guess that for each new MasterModel instance I have to (by overriding the save() method):

  1. dynamically update the settings.DATABASES dictionary to add a new database: 'mastermodel_1_db', 'mastermodel_2_db', ...
  2. run syncdb (to create the schema/tables) on that database
  3. then use a custom DatabaseRouter to manage database transactions

like this:

class MyDatabaseRouter(object):
    def db_for_read(self, model, **hints):
        # For any model, return the database of the related
        # mastermodel object. The router receives the model
        # class, so the instance has to come in via hints.
        instance = hints.get('instance')
        if instance is not None and hasattr(instance, 'mastermodel'):
            return instance.mastermodel.db_connection_name
        return None

    db_for_write = db_for_read
Am I on the right track?


This sounds like a massive case of premature optimization. Think about what you're doing. You're setting up a system with a huge number of moving parts and likely race conditions, one that is complicated and hard to maintain. All of this before you've even seen whether the data will present performance issues.

If you are really convinced (and maybe have some numbers to back it up) that the size of the data you're dealing with will actually tax your system, these are the steps you should take:

  1. Better hardware. Hardware is way cheaper than developer time.
  2. Partitioning
  3. Correctly done sharding (what you're proposing is unlikely to be a correct sharding approach)

It's important to note that 1 and 2 together should (under most circumstances) be able to get you to tables with billions of rows.

You should have a read over at http://www.flounder.com/optimization.htm, particularly the last lines:

Optimization matters only when it matters. When it matters, it matters a lot, but until you know that it matters, don't waste a lot of time doing it. Even if you know it matters, you need to know where it matters. Without performance data, you won't know what to optimize, and you'll probably optimize the wrong thing.

The result will be obscure, hard to write, hard to debug, and hard to maintain code that doesn't solve your problem. Thus it has the dual disadvantage of (a) increasing software development and software maintenance costs, and (b) having no performance effect at all.
