Is it possible to get the matching document and all its ancestors in one query?

2023-01-15 14:15 问答作者：

To illustrate my requirements consider the following directory structure:

C:\Dev

C:\Dev\Projects

C:\Dev\Projects\Test Project

C:\Dev\Projects\Test Project\Test.cs

C:\Dev\Projects\Foo

C:\Dev\Projects\Foo\foo.cs (containing the word test)

The basic document will have id, type, name and content fields, where开发者_Python百科 type will be file or folder and name will be ether file name or folder name.

When searching for "test" I should get:

C:\Dev (ancestor of a result)

C:\Dev\Projects (ancestor of a result)

C:\Dev\Projects\Test Project (result)

C:\Dev (ancestor of a result)

C:\Dev\Projects (ancestor of a result)

C:\Dev\Projects\Test Project (ancestor of a result)

C:\Dev\Projects\Test Project\Test.cs (result)

C:\Dev (ancestor of a result)

C:\Dev\Projects (ancestor of a result)

C:\Dev\Projects\Foo (ancestor of a result)

C:\Dev\Projects\Foo\foo.cs (result)

Even better if it possible to avoid duplications:

C:\Dev (ancestor of a result)

C:\Dev\Projects (ancestor of a result)

C:\Dev\Projects\Test Project (result)

C:\Dev\Projects\Test Project\Test.cs (result)

C:\Dev\Projects\Foo (ancestor of a result)

C:\Dev\Projects\Foo\foo.cs (result)

When searching for "project" I should get:

C:\Dev (ancestor of a result)

C:\Dev\Projects (ancestor of a result)

C:\Dev\Projects\Test Project (result)

When searching for "foo" I should get:

C:\Dev (ancestor of a result)

C:\Dev\Projects (ancestor of a result)

C:\Dev\Projects\Foo (result) C:\Dev\Projects\Foo\foo.cs (result)

Thanks for any help

If you generate your index once or have a very small number of writes you could set up a solution in the indexing of the documents.

So for each document you would save another field called "path" and have it hold a tokenized list of all words from the sub elements of the path:

name: C:\Dev\Projects
path: C:, Dev, Projects, Test, Test Project, Test.cs, Foo, Foo.cs (use whatever tokenizer you want)

then index the field as INDEXED:true STORED:false and use it for searching for matches:

query: +path:"Foo"

Should return all the documents that have Foo as a child element. Keep in mind this solution is very costly for writes and may be impractical for a very large tree structure where you have many thousands of leafs.

继续阅读：lucene lucene.net

Is it possible to get the matching document and all its ancestors in one query?

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集 河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？