train_with_summary.txt train_tokenized.txt #4

world4jason · 2020-07-04T08:35:49Z

有沒有這兩個的連結?
參考一下格式
謝謝

ccs96307 · 2020-11-03T03:36:19Z

Maybe you can refer： https://zhuanlan.zhihu.com/p/113869509

The data format is similar to the following example:
{"summarization": "xxxxxxxxx", "article": "aaaaaaaaa"}

You can use json.dumps() to convert data to string data type and save it, using '\n' to split data. (Because the source code is using json.loads() to load the training data)

libhot mentioned this issue Jul 29, 2020

Segmentation fault段错误 #8

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

train_with_summary.txt train_tokenized.txt #4

train_with_summary.txt train_tokenized.txt #4

world4jason commented Jul 4, 2020

ccs96307 commented Nov 3, 2020

train_with_summary.txt train_tokenized.txt #4

train_with_summary.txt train_tokenized.txt #4

Comments

world4jason commented Jul 4, 2020

ccs96307 commented Nov 3, 2020