Skip to content

badgermind/tdda

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Test-Driven Data Analysis (Python TDDA library)

Installation

The simplest way to install all of the TDDA Python modules is using pip:

pip install tdda

But note that it is not particularly easy to run through the examples using a PyPi binary installation like that. The full set of sources, including all examples, will be downloadable from PyPi with:

pip download --no-binary :all: tdda

The sources are also publicly available from Github:

git clone git@github.com:tdda/tdda.git

Documentation is available at http://pythonhosted.org/tdda/.

Level 0:

The tdda.referencetest library is used to support the creation of reference tests, based on either unittest or pytest.

These are like other tests except:

  1. They have special support for comparing strings to files and files to files.
  2. That support includes the ability to provide exclusion patterns (for things like dates and versions that might be in the output).
  3. When a string/file assertion fails, it spits out the command you need to diff the output.
  4. If there were exclusion patterns, it also writes modified versions of both the actual and expected output and also prints the diff command needed to compare those.
  5. They have special support for handling CSV files.
  6. It supports flags (-w and -W) to rewrite the reference (expected) results once you have confirmed that the new actuals are correct.

For usage details:

>>> from tdda import referencetest
>>> help(referencetest)

For more details from a source distribution or checkout, see the README.md file and examples in the referencetest subdirectory.

An older implementation of these ideas is available as a unittest wrapper class, WritableTestCase, in writabletestcase.py. This can be imported directly with from tdda.writabletestcase import WritableTestCase. Examples of using this are currently available in the deprecated subdirectory. This older version will be fully deprecated and removed soon.

Level 1:

The tdda.constraints library is used to 'discover' constraints from a (Pandas) DataFrame, write them out as JSON, and to verify that datasets meet the constraints in the constraints file.

For usage details:

>>> from tdda import constraints
>>> help(constraints)

For more details from a source distribution or checkout, see the README.md file and examples in the constraints subdirectory.

Resources

Resources on these topics include:

All examples, tests and code should run under Python2 and Python3.

The tdda repository also includes rexpy, a tool for automatically inferring regular expressions from a single field of data examples.

For usage details:

>>> from tdda import rexpy
>>> help(rexpy)

About

Test-Driven Data Analysis Functions

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 94.6%
  • HTML 5.3%
  • Shell 0.1%