Releases: allenyllee/DataTag_helper
v0.9.0
v0.8.9
- Support duplicate content by using title_hash - content_hash as TextID in excel file
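A minimal sketch of how such a TextID could be built. The release note only says the ID is `title_hash - content_hash`; the hash function (SHA-1), the 8-character truncation, and the helper names `short_hash` and `make_text_id` are assumptions for illustration, not the project's actual implementation.

```python
import hashlib


def short_hash(text: str, length: int = 8) -> str:
    """Return the first `length` hex digits of the SHA-1 digest of `text`.

    SHA-1 and the truncation length are assumed here; the release note
    does not say which hash the tool uses.
    """
    return hashlib.sha1(text.encode("utf-8")).hexdigest()[:length]


def make_text_id(title: str, content: str) -> str:
    """Join the title hash and the content hash with a hyphen."""
    return f"{short_hash(title)}-{short_hash(content)}"


# Two rows with identical content but different titles get distinct IDs.
id_a = make_text_id("First post", "same body text")
id_b = make_text_id("Second post", "same body text")
print(id_a)
print(id_b)
```

Because the title takes part in the ID, rows whose content is duplicated no longer collide, while the shared content hash still shows up as a common suffix.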
v0.8.8
- add excel to json support
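In the excel file:
Title is the article title,
Poster is the poster,
Gender is the poster's gender,
Date is the date (year-month-day),
Time is the time (hour:minute:second),
Content is the article content.
The exported json file has four fields: Title, Content, Author, Time.
Poster and Gender are merged into Author.
Date and Time are merged into Time.
As shown below: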
"Title": "Irure incididunt ipsum velit anim amet occaecat elit esse cillum amet excepteur.", "Content": "Ex veniam ad cupidatat nostrud excepteur in ipsum. Cupidatat Lorem duis ipsum laboris sit ex sunt ea dolore cupidatat sint consectetur duis. Dolor dolore esse ullamco excepteur consectetur eiusmod sit. Irure reprehenderit pariatur et laboris. Dolor incididunt nulla voluptate magna enim duis occaecat mollit dolor Lorem enim anim pariatur.", "Author": "匿名/女", "Time": "2019-07-02/16:45:17"
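The column merge described above can be sketched as follows. The function name `row_to_record` and the plain-dict row are assumptions for illustration; only the field names and the `/` separator are taken from the sample output above.

```python
import json


def row_to_record(row: dict) -> dict:
    """Convert one excel row (six columns) into the four-field json record."""
    return {
        "Title": row["Title"],
        "Content": row["Content"],
        # Poster and Gender are merged into Author, separated by "/".
        "Author": f'{row["Poster"]}/{row["Gender"]}',
        # Date and Time are merged into Time, separated by "/".
        "Time": f'{row["Date"]}/{row["Time"]}',
    }


# Hypothetical input row; the values mirror the sample record above.
row = {
    "Title": "Example title",
    "Poster": "匿名",
    "Gender": "女",
    "Date": "2019-07-02",
    "Time": "16:45:17",
    "Content": "Example content",
}

record = row_to_record(row)
# ensure_ascii=False keeps the Chinese text readable in the output.
print(json.dumps(record, ensure_ascii=False, indent=2))
```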
v0.8.7
- add .docx support
- add .pdf support
- Support duplicate content by using title_hash - content_hash as TextID
v0.8.6
exported title in the json file now uses a relative path
v0.8.5
- rename title to DataTag Helper (標註轉檔小幫手)
- set default selection to the first option, and move input dir that contains .txt to the first option.
v0.8.3
Rename to DataTag_Helper
v0.8.2
- fix "Missing optional dependency 'xlrd'." error, just use openpyxl. see: pandas-dev/pandas#38424
- call the monkey patch for gooey gui components only right before entering GUI mode (to avoid a "module not found" error when importing in non-gui mode)
v0.8.1
add multiple input file support with MultiFileChooser
v0.8.0
- add second_upload function
- add second labeled json cleaner and converter
- fix issue where unicode files were opened with the cp950 codec on windows
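For context on the cp950 item: on Windows with a Traditional Chinese locale, Python's `open()` falls back to the locale codec (cp950) when no encoding is given, so UTF-8 files can fail to decode. A common remedy, shown here as a sketch and not necessarily the exact change made in this release, is to pass the encoding explicitly. The helper name `read_text` is hypothetical.

```python
import os
import tempfile


def read_text(path: str) -> str:
    """Read a text file as UTF-8 regardless of the platform's default codec."""
    with open(path, encoding="utf-8") as f:
        return f.read()


# Round-trip a file containing non-ASCII text to show the explicit encoding.
with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "sample.txt")
    with open(path, "w", encoding="utf-8") as f:
        f.write("標註轉檔小幫手")
    text = read_text(path)

print(text)
```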
v0.7.2
adding SerialID column