Skip to content

Releases: mhahsler/stream

stream_2.0-2

21 Apr 22:03
Compare
Choose a tag to compare

stream 2.0-2 (04/19/24)

New Features

  • Added DSF_FeatureSelection.

Bugfix

  • Fixed exports for get_macroclusters.DSC_DStream, get_macroweights.DSC_DStream
    and recluster.DSC
  • predict for DSC now honors the formula in the DSC.

Changes

  • Improved manual pages
  • proxy is now imported and not attached globally.

stream 2.0-1 (02/28/23)

New Features

  • Added DST_SlidingWindow to call functions on a sliding window.
  • Added DSClassifier_SlidingWindow, DSRegressor_SlidingWindow, and DSC_SlidingWindow.
  • Added get_model() for DST.
  • DSF can now be used with DST interface (update(), etc.).
  • DSD now also provides update().

Changes

  • transitioned from C++11 to C++17.
  • update() gained parameter return and lost assignment for DSC.
  • DST_WriteStream and write_stream now flush after the write.
  • DST_WriteStream gained close_stream().
  • Added tests for DSD. get_points with n = 0 and n = -1 produce now consistent results.
  • write_stream has now no default for n and accepts n = -1.

stream_2.0-0

03 Sep 17:38
Compare
Choose a tag to compare

stream 2.0-0 (09/01/22)

New Features

  • We now support %>% and defining pipelines with the new class DST_Runner.
  • New base class DSF for data stream filters with implementations for
    DSF_Downsample, DSF_Convolve, DSF_Func, DSF_dplyr
  • New DST_Multi to run multiple tasks on a stream.
  • New DSD_NULL
  • New DSOutlier: DSOutlier_DBSTREAM, DSOutlier_DStream
  • DSD_ReadCSV gained parameter col.names
  • Added DSD_ReadStream as an alias for DSD_ReadCSV.
  • Added DST_WriteStream to write streams using update().
  • Added DSD_Mixture to combine streams.
  • update() now returns information like cluster assignments as a data.frame.
  • DSC implementations now have a formula argument to decide what variables should be used for
    clustering.
  • DSD_ReadDB has now close_stream().
  • DSD_Memory, DSD_ReadDB and DSD_ReadStream have now a parameter called outofpoints to handle
    the situation that get_points requests more points than available.

Changes

  • stream now implements a standard predict function (get_assignment() is now deprecated).
  • class information and extra information are now stored as columns starting with '.'
    instead of as attributes. See get_points().
  • DSD_ScaleStream is now DSF_Scale and DSD_ScaleStream is deprecated.
  • DSO is now called DSAggregate.
  • The NAMESPACE is now managed using roxygen.
  • evaluate is now a generic.
  • noise and outliers are now the same concept. DS_Gaussian can make sure that noise points are
    separated from clusters.
  • evaluate() is now called evaluate_static() and evaluate_cluster is evaluate_stream(), both are now generics.
  • plot for DSC now automatically finds micro and macro-clusters to plot.

stream_1.5-1

10 May 19:01
Compare
Choose a tag to compare

Changes

  • Removed registry in favor of using R Studio auto-complete.
  • Outlier detectors are now in class DSOutlier class.
  • We use now roxygen2 for man pages.
  • Abstract classes now have constructors.

Bug Fixes

  • Fixed typo in BIRCH interface: treshold -> threshold (by dinarior)

stream_1.5.0

07 Sep 19:15
Compare
Choose a tag to compare

New Features

  • DSC implementations are now registered using DSC_registry.

Bug Fixes

  • Fixed get_assignment.DSC_TwoStage when new data is available (reported by
    ozlempoyraz).

stream_1.4-0: Single phase + outlier clusterer support and other additions (#14)

02 Dec 19:36
f42a2d3
Compare
Choose a tag to compare

New Features

  • Added additional features to the DSD_Gaussians, now capable of using Mahalanobis distance and
    generating outliers.
  • Updated evaluation procedure, now capable of performing external indices calculation using callbacks
  • Added support for single-pass clusterers and outlier detectors
  • Added outlier correctness assessment indices

stream_1.3-2

04 May 17:50
Compare
Choose a tag to compare

stream 1.3-2 (05/04/20)

Bug Fixes

  • DBSTREAM: Fixed array index bug (reported by MatthiasCarnein)
  • BIRCH: Fixed C++ this pointer problem.

stream_1.3.1

10 Jun 01:22
Compare
Choose a tag to compare

New Features

  • Added DSC_evoStream and DSC_EA. Code by Matthias Carnein.

Changes

  • Package animation is now only suggested since it requires package magick
    which may need the imagemagick++ libraries installed.

stream_1.3-0

02 Jun 13:51
Compare
Choose a tag to compare
  • Added DSC_BIRCH. Code and Interface by Dennis Assenmacher and Matthias Carnein.
  • Added DSC_BICO. Code by Hendrik Fichtenberger, Marc Gille, Melanie Schmidt,
    Chris Schwiegelshohn, Christian Sohler and Interface provided by Matthias
    Carnein and Dennis Assenmacher.
  • DSD_ReadCSV: Fixed bug with streams that have no class/cluster label
    (reported by Matthias Carnein).
  • animate_cluster: noise now accepts "class" or "exclude" ("ignore" is deprecated).

stream_1.2-4

26 Feb 17:29
Compare
Choose a tag to compare
  • Use dbFetch in DSD_ReadDB (new version of RSQLite).
  • Register native C routines.

stream_1.2-3

08 Aug 00:15
Compare
Choose a tag to compare

Changes in version 1.2-3 (08/07/16)

  • fixed saveDSC for DBStream.
  • fixed handling of data with d=1 (reported by Ilana Lichtenstein).
  • plot now automatically determines if the data supports a class attribute.

Changes in version 1.2-2 (10/28/15)

  • evaluate now reports noise information.

Changes in version 1.2-1 (09/08/15)

  • fixed problem with failing test under Windows.

Changes in version 1.2-0 (09/06/15)

  • generic and methods for description() added to exact descriptions from
    DSD, DSC and DSO objects.
  • write_stream() gained parameter append and now throws an error if it
    would overwrite a file.
  • DSC objects can now be saved and loaded using saveDSC and readDSC.
  • we use now DBSCAN from package dbscan.
  • DSC_DBSTREAM gained parameter metric and now also supports
    Manhattan and Maximum norm.
  • DSC_DBSTREAM gained parameter assignments and function
    get_cluster_assignments() to retrieve the MC assignment of the
    clustered data points.
  • cleaned up interface for animate_cluster() and animate_data().
  • DSD_ReadCSV was completely rewritten to be more reliable. Lost argument d
    which is now figured out automatically.
  • write_stream has now an argument called header (former name was col.names)
    to be consistent with DSD_ReadCSV.

Changes in version 1.1-5 (07/02/15)

  • NAMESPACE now imports non-standard packages correctly.
  • DSC_DBSTREAM uses now Cm instead of noise.
  • fixed iterator bug for DSC_DBSTREAM.
  • evaluate gains argument noise to control if noise is ignored

Changes in version 1.1-4 (05/24/15)

  • evaluate checks if DSD has cluster labels for external
    evaluation measures.
  • DSD_mlbenchmarkGenerator now shuffles data points.
  • DSC_ReadCSV gains arguments skip and header.
  • DSC_DStream: was reimplemented in C++ (Rcpp),
    number of grids N can now be fixed by the user.
  • DSC_tNN was renamed DSC_DBSTREAM. Uses now SOM-style micro-cluster update
    and was reimplemented in C++ (Rcpp).

Changes in version 1.1-1 (01/15/15)

  • DSC_DStream: fixed bug with removing too many sporadic grids
  • DSD_ReadCSV now uses readLine so it can read properly from URLs
  • updated vignette

Changes in version 1.1-0 (12/18/14)

  • update now directly dispatches
  • DSC_Memory replaces DSD_Wrapper
  • DSD_ReadCSV replaces DSD_ReadStream. Improved handling of blocking and
    end of stream.
  • added DSD_ReadDB (DBI interface)
  • get_points can now produce cluster and class information

Changes in version 1.0-3 (07/14/14)

  • Fixed precision and recall calculation
  • Added DSC_TwoStage

Changes in version 1.0-2 (06/16/14)

  • Warning for reclusterers removed.
  • plot can now show micro-cluster assignment areas using assignment=TRUE

Changes in version 1.0-1 (06/12/14)

  • Improved documentation
  • Improved DSD_MG
  • plot gained a dim argument to plot only selected dimensions
  • get_assignment gained a threshold argument
  • DSC_Window added
  • DSC_Sample gained a biased argument for biased sampling
  • DSC_Wrapper can now wrap matrix-like objects (e.g., from package ff and
    bigmemory)

Changes in version 1.0-0 (5/24/14)

  • added D-Stream (with attraction)
  • improved support for creating animations
  • tnn: new decay models, tNN without shared density now reclusters
    using density reachability
  • plot gained the type "both" that plots micro and macro-clusters
  • DSC_Hierarchical and DSC_Kmeans gained min_weight to filter low weight
    micro-clusters before reclustering
  • removed default radius, etc. for most clustering algorithms
  • Added DSD_MG for simulating streams with concept drift
  • moved MOA related code to streamMOA
  • suspended DSC_BIRCH because of memory issues
  • reset_stream gained a pos argument

Changes in version 0.2-0 (2/21/14)

  • major restructuring

Initial version 0.1-1 (8/16/13)