开发者

Django Models: preserve object identity over foreign-key following

Django's ORM (version 1.2.3) does not preserve identity when following foreign keys back and forth. This is best explained with an example:

class Parent(models.Model):
    pass

class Child(models.Model):
    parent = models.ForeignKey(Parent)

parent = Parents.objects.get(id=1)
for child in parent.child_set.all():
    print id(child.parent), "=!", id(parent)

So, for each child the parent is re-fetched from the database, even though we know the parent at the moment we fetch the child. This is counterintuitive to me.

In my case this also leads to performance issues, since I do some heavy operations at the parent level which I'd like to cache at object instance level. However, since the results of these calculations are accessed via the child => parent link, this caching at the parent level is useless.

Any ideas on how to solve this?

I've gotten as far as figuring开发者_运维知识库 out there's a ForeignRelatedObjectsDescriptor and a ReverseSingleRelatedObjectDescriptor.


There are a number of possible solutions to this.

Perhaps the easiest is to keep track of the parent yourself:

parent = Parents.objects.get(id=1)
for child in parent.child_set.all():
    child._parent_cache = parent

_FOO_cache is the way Django keeps track of items fetched via ForeignKey, so if you pre-populate that object on the child with the parent you already have, Django won't fetch it again when you reference child.parent.

Alternatively, you could look into one of the third-party libraries that attempt to fix this - django-idmapper or django-selectreverse are two I know of.


Django's ORM does not follow "reverse" relationships. This means that every time you access child.parent it makes a new database call.

One way to solve this in some (but not all) situations is to filter Child objects and use select_related() while doing so. This will reduce the number or database calls as child and parent tables are joined at query execution time and no separate query is fired when child.parent is accessed.

For e.g.

from django.db import connection

parent = Parents.objects.get(id=1)
print parent
print len(connection.queries) # say, X

children = Child.objects.select_related().filter(parent = parent)
for child in children:
    print child.parent

print len(connection.queries) # should be X + 1

The Python object id of the parent and child.parent won't be the same but you will see that no additional queries are fired when you access child.parent.

0

上一篇:

下一篇:

精彩评论

暂无评论...
验证码 换一张
取 消

最新问答

问答排行榜