Can anyone recommend a synthetic data generator? [closed]
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 6 years ago.
Improve this questionDid anyone come across a free data generator that can create synthetic data to load into databases for testing purposes?
Looking at other questions I stumbled on Benerator, http://databene.org/databene-benerator.html. I haven't tried it, but:
apparently comes with these JDBC drivers http://databene.org/databene-benerator/112-installing-the-benerator-distribution.html • jTDS Driver (MS SQL Server or Sybase) • MySQL Connector • HSQL DB • Derby Client • PostgreSQL • Jaybird (Firebird DB) you can of course install more (e.g. Oracle) yourself...
I don't know of any free products to recommend, but if you are willing to purchase a commercial license, I can recommend ExactData's Dynamic Data Generator. It attempts to produce large scale, synthetic, realistic, and engineered data sets. It allows you to model the data sets for your tests, customize the output format (CSV, for instance), and then generate an large numbers of internally consistent data records.
I can recommend the DDG because I am a developer on this product.
As you know your data better than anyone outside your organization, you would probably get the best results by writing code yourself to do this. It would be fairly straightforward to write a small program in C#, Python, etc to generate test data.
You could take a look at my FOSS data generator - this is kind of a proof of concept at the moment, but is useable.
Redgate has a tool to do exactly this. I fooled around a little with the trial and thought it had potential. It does cost $$ so it may not suit your current needs.
I have tried and used the Redgate tool, it is part of the Redgate Toolbelt bundle, and I can recommend it. Very nice tool with lots of possibilities. It however costs money as peacedog pointed out, if you are doing database development on consistent basis it is very much worth it.
to generate synthetic transactional datasets you can use IBM Synthetic data generator. but I suggest you to download Iranshamim Synthetic dataset generator, because it has GUI and generates more accurate results.
精彩评论