Skip to content

Automatically exported from code.google.com/p/fake-data-generator (including wiki pages)

Notifications You must be signed in to change notification settings

thinkh/fake-data-generator

Folders and files

NameName
Last commit message
Last commit date

Latest commit

author
Holger Stitz
May 7, 2015
3196da6 · May 7, 2015

History

29 Commits
Mar 15, 2012
Mar 12, 2012
Dec 22, 2011
May 7, 2015

Repository files navigation

fake-data-generator

Automatically exported from code.google.com/p/fake-data-generator

Testing data mining is hard. If all you have are real-world data sets, then you don't actually know what you're looking for. If you did, you probably wouldn't be using a data mining algorithm.

This tool-set is designed to create data sets with known properties and relations of varying complexity. It produces a full listing of the model it used to generate a data set, so the results of using a machine learning algorithm to try to develop a model of the data can be compared to the real model, since a real model exists. Varying amounts of noise and bias can be introduced, to simulate real-world imperfection of data.

License

GNU GPL v2

About

Automatically exported from code.google.com/p/fake-data-generator (including wiki pages)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages