Document Management System Recommendation [closed]
I have an application that generates around 10000 printed pages per month. Each report (around 2000/month) is archived as PDF on a simple network file share. I am searching for a Document Management System meeting the following requirements:
- watch the archive folder and update the index either on regular basis or when changes are detected
- provide an Intranet Webpage where 开发者_如何学JAVAusers can search documents based on filenames, timespans and other relevant file attributes
- fulltext search
- can handle large/substantially growing archives
To be clear, I am searching for a pre-built solution here, commercial products are accepted.
Sounds like Microsoft Search Server 2008 Express would be a good candidate. Free and installs in a couple of minutes.
I can suggest you google docs. AFAIK It can handle all your requirements.
This is a very vague question and I'm not quite sure how to respond.
It looks like you want a way to index all your files and ensure that the information is kept up to date in the database. What I can suggest is you look into some search servers like:
Sphinx
Solr
These both take some setup but they handle all your requirements: They can easily be setup to watch a folder and keep your index up to date, they provide great fulltext search, they can be accessed via an intranet webpage if you setup a page to search your database, and they are used for enormous operations so large archives shouldn't be a problem.
If you're looking for a pre-built solution, I'm not sure what to mention.
Plone could work pretty well for your needs. It has plugins for indexing PDF content, and you can customize the metadata. Also, it has a fantastic web interface with built-in search. The best part is that it's free and easy-to-use, and if your needs grow, you can pay for support.
My only recommendation (at first glance) is that you store your content on the file system and not in the Zope OO database. You should only store your metadata and index data in the database. This is a pretty common way of storing large amounts of content in the document management world.
Hope that helps!
Tom Purl
As Tom said Plone does to what you describe. It has build in full text search that relies on the commandline programm pdftotext for pdfs to be in the path. There are several Extension you may me interested in:
- Reflecto - Watches a part of the
filesystem and alows searching and
displaying it inside Plone:
See reflecto on the plone.org/products - TextIndexNG 3 - Indexing extentions written for a publishing house
http://www.zopyx.com/projects/TextIndexNG3/textindexng3-the-leading-fulltext-indexing/
or
collective.solr - use the search enging "solr" to drive the catalog:
See collective.solr on plone.org/products
(Sorry, missing links due to stackoverflows new user policy)
精彩评论