MongoDB: What’s the most efficient way to store a chromosome/position

2023-01-16 11:06 问答作者：

I want to store some genomic positions (chromosome,position) using MongoDB.

something like:

{
chrom:"chr2",
position:100,
name:"rs25"
}

I want to be able to quickly find all the records in a given segment (chrom , [posStart - posEnd]). What would be the best key/_id to be used ?

a chrom , position object ?

db.snps.save({_id:{chrom:"chr2",position:100},name:"rs25"})

a padded string ?

db.snps.save({_id:"chr02:00000000100",chrom:"chr2",position:100,name:"rs25"})

an auto-generated id with an index on chrom and position ?

db.snps.sa开发者_如何转开发ve({chrom:"chr2",position:100,name:"rs25"})

other ?

???

thanks for your suggestion(s)

Pierre

PS: (this question was cross posted on biostar: http://biostar.stackexchange.com/questions/2519 )

I believe the two-column index will offer the fastest access path, because it will be the most compact index.

However, it will be an additonal index (since you already have the _id index, which you are not using), so the first two options are nice in that they eliminate the extra index.

The padded string is shorter than the complex object solution, shorter means less memory use, hence faster the scan. I'd only go for complex object, if flattening/padding is not possible. Also, since the complex object keys need to be encoded into the index (not the case with other indexes), choose shorter key names (c and p).

So, I'd go for two-column index (if you do not mind "wasting" the id index) or padded string. You could even go padded binary (saving a few bytes on encoding the integer), but that is probably not worth the hassle.

继续阅读：bioinformatics database indexing mongodb position

MongoDB: What’s the most efficient way to store a chromosome/position

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？