Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Analyse Data Report not displaying correct range of values #64

Closed
VinnyOp opened this issue Apr 21, 2023 · 2 comments
Closed

Analyse Data Report not displaying correct range of values #64

VinnyOp opened this issue Apr 21, 2023 · 2 comments
Labels
bug Something isn't working

Comments

@VinnyOp
Copy link

VinnyOp commented Apr 21, 2023

The analysis data report can display an inaccurate range. In the below screen capture you can see the range of 09/08/2022-09/30/2022 but the distinct values displayed show something different. The only reason I see this is because I changed the default setting for unique values from 15 to 35, otherwise I would have thought the range was correct.

Note: The data is in the format of MM/DD/YYYY

20: Date_Field
DataTypes : datetime (5106 = 99.9%)
Width range : 10 characters
DateTime range : 09/08/2022 ~ 09/30/2022
-- Unique values (30) --
n=175 : 09/01/2022
n=174 : 09/02/2022
n=156 : 09/03/2022
n=122 : 09/04/2022
n=115 : 09/05/2022
n=112 : 09/06/2022
n=160 : 09/07/2022
n=183 : 09/08/2022
n=172 : 09/09/2022
n=158 : 09/10/2022
n=110 : 09/11/2022
n=170 : 09/12/2022
n=178 : 09/13/2022
n=174 : 09/14/2022

BdR76 added a commit that referenced this issue Apr 22, 2023
Incomplete fix for #64 and SkipLines minimum=0
@BdR76
Copy link
Owner

BdR76 commented Apr 22, 2023

I also noticed this issue with the kinesology.csv file, which incorrectly reports:

----------------------------------------
7: Date
DataTypes      : datetime (20 = 100.0%)
Width range    : 10 characters
DateTime range : 06/14/2022 ~ 06/10/2022

while it should be DateTime range : 06/08/2022 ~ 08/23/2022. This has to do with guessing the datetime format when a column has values like '09/08/2022' vs '08/09/2022' and the day/month order is unknown.

In the current version the plug-in when the dateformat is still undetermined, it defaults to assuming D-M-Y order. I've patched it so that it now makes a distiction when the date separator is / then is initially assumes M-D-Y order, which will probably fix your specific case too.

That part of the detection algorithm is clunky and needs to be rewritten to be more robust, because even with this patch it will still report an incorrect range in some cases.

@BdR76 BdR76 added the bug Something isn't working label Apr 22, 2023
@BdR76 BdR76 closed this as completed in ccf92cf May 26, 2023
@BdR76
Copy link
Owner

BdR76 commented Jun 4, 2023

This issue is fixed in the latest version v0.4.6.5, see the releases page. You can download it manually and it will be available in the next Notepad++ update in the Plugin Manager.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants