Twitter Live Search

2023-01-14 19:08 问答作者：

I was trying to reverse engineer Twitter-Live Search. Maybe we could discuss it here. I am talking about the feature where Tweets are shown even latest to "1 sec ago" etc. Trying to understand how the following might happen -

There must be some layer between when the user tweets &开发者_开发知识库 when the index (updates) happen. Is this layer MySQL or some other caching layer (memcached, cassandra)? Maybe...
Indexing - How might the index updates be happening? They can't possibly build a new index from scratch?
Indexing - There must be a distributed index here. How to update all the Indexes without having to serve stale data from one index & latest data from the other?
Indexing - Or does it matter if something like this happens? Honestly I don't think so :) Which user would notice...

Anybody have anything interesting to add/discuss. I am just trying to understand...

Interesting indeed, but I guess it's more of an "architecture" question, and not really a programming question.

But FYI there's a lot of information at high scalability: posts tagged with twitter

Do they keep all tweets? My guess is they just throw them away after a while, and surely they don't need ACID properties? ..

And I wouldn't trust those timestamps if I where you :)

继续阅读：architecture live search search-engine twitter

Twitter Live Search

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？