RELEASE NOTES FOR HICUP-Plus v1.0.3 (8 NOVEMBER 2023)
HiCUP+ v1.0.3 is a minor update incorporating the changes described below:
- adds support for the STAR aligner
- updated logs for reporting results
- updated documentation to describe supported aligners and Docker images
-------------------------------------------------
RELEASE NOTES FOR HICUP-Plus v1.0.2 (2 NOVEMBER 2022)
HiCUP+ v1.0.2 is a minor update incorporating the changes described below:
- updated checks in the HiCUP modules to check for SSH connections
- updated Dragen call to run via SSH if hostname does not include "dragen"
-------------------------------------------------
HiCUP+ v1.0.1 is a minor update incorporating the changes described below:
- implements built-in timing reported to stdout
- bug fixes to post-processing in pairing step of hicup_mapper:
ensures that reads are not sorted in intermediate SAM files
and that reads are correctly paired in order. Index numbers
are used for Bowtie, Bowtie2, and HiSAT2 in stream inputs.
Matching strings in Dragen output is implemented without
assuming FASTQ header format.
RELEASE NOTES FOR HICUP-Plus v1.0.0 (15 FEBRUARY 2022)
-------------------------------------------------
HiCUP+ v1.0.0 is a major update incorporating the changes described below:
- refactors error handling and warnings without changes to results from
Bowtie or Bowtie2 aligners
- initialises reverse read for pairing and migrates filtering to use
matches in header strings rather than index numbers
- adds calls to the HiSAT2 aligner (Kim et al., 2019) using stream
inputs and parameters equivalent to the Bowtie2 parameters
- adds a call to the proprietary Dragen (Illumina Inc., 2021) aligner
via a system call with the error handling and log outputs adjusted
for compatibility
- updates pairing and reporting scripts for compatibility with
the Dragen aligner; these changes run only if this aligner is chosen
- updates the documentation to describe how to configure HiSAT2 or
Dragen as the aligner
- updates R graphical devices to PDF (removing need for Cairo graphics in SVG format)
- updates Maintainer of this fork to S. Thomas Kelly
**simonthomas.kelly [at] hugp [dot] com** (Bioinformatics Team,
H.U. Group Research Institute G.K., Tokyo, Japan; SRL Inc., Tokyo, Japan)
This release is a fork of the original HiCUP pipeline. See the release notes
for previous versions below and the GitHub repository for more details:
https://github.com/StevenWingett/HiCUP
This fork has been renamed to HiCUP-Plus (HiCUP+) to avoid it being
mistaken for the original implementation.
Full details of changes can be viewed on GitHub:
https://github.com/hugp-ri/hicup-plus/compare/test-original...master#diff-HEAD?w=1
RELEASE NOTES FOR HICUP v0.8.3 (11 January 2022)
-------------------------------------------------
HiCUP v0.8.3 is a minor update incorporating the change described below:
- Updated contact details
RELEASE NOTES FOR HICUP v0.8.2 (13 JULY 2021)
-------------------------------------------------
HiCUP v0.8.2 is a minor update incorporating the change described below:
- The get_captured_reads script has been renamed to hicup_capture. The
script is found in the Misc folder and is used to identify "captured"
di-tags in a Capture Hi-C experiment. The renamed script now works in
both the Misc directory or in the same directory as hicup_module (which
is useful for Conda installs or when creating symbolic links to the
hicup_capture script).
RELEASE NOTES FOR HICUP v0.8.1 (16 FEBRUARY 2021)
-------------------------------------------------
HiCUP v0.8.1 is a minor update incorporating the change described below:
- HiCUP previously could not process Gzipped files when run on a Mac
computer. While HiCUP is designed for Linux systems, a simple fix was
made on how Gzipped files are uncompressed and consequently HiCUP should
now work on Mac systems.
RELEASE NOTES FOR HICUP v0.8.0 (08 OCTOBER 2020)
------------------------------------------------
HiCUP v0.8.0 is a major update incorporating the changes described below:
- HiCUP has a new script named hicup_reporter that collates the summary
reports from each step of the pipeline into a new report.
Note: This hicup_reporter script can be run on its own to collate summary
results files generated previously by HiCUP.
- The pipeline generates a new HTML summary report using the script
r_scripts/hicup_reporter.rmd (this script is executed by hicup_reporter).
Note: this requires the R packages Tidyverse and Plotly to be installed.
RELEASE NOTES FOR HICUP v0.7.4 (18 June 2020)
---------------------------------------------
HiCUP v0.7.4 is a minor update incorporating the changes described below:
-Fixed bug when specifying a cut-site containing an N (any nucleotide).
The script hicup_truncater used to incorporate such Ns into the FASTQ
truncated read. This is now fixed and no longer occurs.
-HiCUP now uses CIGAR string information when positioning reads to
restriction fragments and during the de-duplication process.
-Added scripts for GitHub Actions unit testing.
-Added a HiCUP Singularity recipe to the Misc folder.
RELEASE NOTES FOR HICUP v0.7.3 (13 May 2020)
--------------------------------------------
HiCUP v0.7.3 is a minor update incorporating the changes described below:
-Added hicup2juicer to make HiCUP output compatible with Juicer.
-Modification to hicup_deduplicator output so files strictly adhere to
SAM/BAM format.
-Fixed bug preventing --nofill option from working.
-No longer reports Bowtie2 message concerning gzbuffer changes.
RELEASE NOTES FOR HICUP v0.7.2 (15 January 2019)
------------------------------------------------
HiCUP v0.7.2 is a minor update incorporating the changes described below:
-Fixed bug causing output files to not strictly adhere to SAM/BAM format.
RELEASE NOTES FOR HICUP v0.7.1 (12 November 2018)
-------------------------------------------------
HiCUP v0.7.1 is a minor update incorporating the changes described below:
-Added the option --arima to HiCUP Digester, to generate digest files
compatible with the Arima protocol.
RELEASE NOTES FOR HICUP v0.7.0 (26 October 2018)
------------------------------------------------
HiCUP v0.7.0 is a major update incorporating the changes described below:
HiCUP can now process restriction enzymes that cut at different sites.
Such sequences contain an "N" within the restriction enzyme recognition
site to denote the four bases (AGCT).
Thanks to Rola Dali, Edouard Henrion and Mathieu Bourgey (McGill University)
for adding this feature.
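As a rough illustration of the "N" notation described above, a recognition
site containing N can be expanded into a pattern that matches any of the four
bases. This is a sketch in Python, not HiCUP's own implementation; the
function names are hypothetical and GANTC is used only as an example site.

```python
import re

def site_to_pattern(site):
    """Turn a recognition site such as 'GANTC' into a compiled regex.

    Assumption for this sketch: the site contains only A, C, G, T and N,
    and N stands for any one of the four bases.
    """
    site = site.upper()
    if not set(site) <= set("ACGTN"):
        raise ValueError(f"unexpected character in site: {site}")
    return re.compile(site.replace("N", "[ACGT]"))

def find_sites(site, sequence):
    """Return 0-based start positions of every match, overlaps included."""
    pattern = site_to_pattern(site)
    # A zero-width lookahead reports overlapping occurrences as well.
    lookahead = re.compile(f"(?={pattern.pattern})")
    return [m.start() for m in lookahead.finditer(sequence.upper())]

if __name__ == "__main__":
    print(find_sites("GANTC", "TTGAATCAAGACTCGG"))
```

The cut position within the site is not modelled here; only the matching of
the recognition sequence is shown.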
RELEASE NOTES FOR HICUP v0.6.1 (10 May 2018)
----------------------------------------------
HiCUP v0.6.1 is a minor update incorporating the changes described below:
-Bowtie2 may now align reads using more than one thread
(while preserving the input/output read order), thereby reducing
processing times. The number of threads allocated to Bowtie2 will be
the number of threads specified by the user divided by the number of
files processed.
-Fixed bug causing HiCUP Mapper to try to read beyond the end of a mapped
reads SAM file, consequently causing the script to crash.
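The thread allocation described above can be sketched as follows. Rounding
down and always granting at least one thread are assumptions of this sketch;
the notes only state that the user's thread count is divided by the number of
files. The function name is hypothetical.

```python
def bowtie2_threads_per_file(total_threads, n_files):
    """Threads handed to each Bowtie2 process.

    Assumptions: integer division, with a floor of one thread per file.
    """
    if total_threads < 1 or n_files < 1:
        raise ValueError("total_threads and n_files must be positive")
    return max(1, total_threads // n_files)

if __name__ == "__main__":
    # e.g. 8 user-specified threads shared between 2 input files
    print(bowtie2_threads_per_file(8, 2))
```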
RELEASE NOTES FOR HICUP v0.6.0 (23 April 2018)
----------------------------------------------
HiCUP v0.6.0 is a major update incorporating the changes described below:
-Parameters adjusted for HiCUP mapper in determining what constitutes a
multi-mapping read, when using Bowtie2 as the aligner.
RELEASE NOTES FOR HICUP v0.5.10 (26 October 2017)
-------------------------------------------------
HiCUP v0.5.10 is a minor update incorporating the changes described below:
-Fixed bug causing hicup_mapper to not pair a small proportion of valid
reads.
RELEASE NOTES FOR HICUP v0.5.9 (15 September 2016)
--------------------------------------------------
HiCUP v0.5.9 is a minor update incorporating the changes described below:
-Modified HiCUP Deduplicator so it may process genomes comprising
hundreds of chromosomes (e.g. genome assemblies)
-Added script Misc/get_captured_reads for identifying on/off target
di-tags in capture Hi-C datasets
-Fixed bug when running hicup_truncater independently, causing the script
to interpret NoFill:0 in the config file as effectively NoFill:1
RELEASE NOTES FOR HICUP v0.5.8 (19 October 2015)
------------------------------------------------
HiCUP v0.5.8 is a minor update incorporating the changes described below:
- Fixed bug preventing an odd number of files being specified when using
a configuration file with hicup_digester, hicup_filter or
hicup_deduplicator.
RELEASE NOTES FOR HICUP v0.5.7 (18 June 2015)
---------------------------------------------
HiCUP v0.5.7 is a minor update incorporating the changes described below:
- Fixed bug preventing whole pipeline running to completion if --outdir
is set to a folder other than the current working directory.
RELEASE NOTES FOR HICUP v0.5.6 (09 June 2015)
---------------------------------------------
HiCUP v0.5.6 is a minor update incorporating the changes described below:
- Fixed bug preventing, when multiple samples are processed, the
generation of HTML summary reports and the text file auditing the
progress of read pairs through the entire pipeline.
RELEASE NOTES FOR HICUP v0.5.5 (20 May 2015)
--------------------------------------------
HiCUP v0.5.5 is a minor update incorporating the changes described below:
- HiCUP produces a text file auditing the progress of read pairs through
the entire pipeline for all samples processed.
- Added the 'Conversion' folder, containing scripts for converting HiCUP
BAM/SAM output into formats compatible with GOTHiC, Homer, Hicpipe and
Fit-Hi-C.
- Fixed bug preventing users specifying two --re1 restriction enzymes
(i.e. the enzymes that create the Hi-C ligation junctions).
RELEASE NOTES FOR HICUP v0.5.4 (01 April 2015)
----------------------------------------------
HiCUP v0.5.4 is a minor update incorporating the changes described below:
- HiCUP Digester checks the first line of each input file begins with a
valid FASTA header.
- Fixed bug causing HiCUP to ignore choice of aligner.
- HiCUP output compatible with SamTools release (v1.2), which is more
strict with regard to the PG headers in a BAM/SAM file.
- Fixed bug causing files to be compressed if zip:0 selected in a
configuration file.
- Fixed bug causing --re2 option sequence (i.e. not --re1 sequence) to
be used to truncate reads prior to mapping.
- Previously, if one or more samples contained no valid di-tags, the
pipeline produced no HTML summary files, even for samples containing valid
di-tags. This no longer happens; instead, the script displays a warning
message reporting samples not progressing through the pipeline and produces
HTML for samples containing valid di-tags.
- Improved documentation by creating Markdown files. These were added to
the 'Documentation' folder. The HiCUP Manual and QuickStart
PDF files were deleted.
RELEASE NOTES FOR HICUP v0.5.3 (31 December 2014)
-------------------------------------------------
HiCUP v0.5.3 is a minor update incorporating the changes described below:
- HiCUP attempts to intelligently name files as the pipeline proceeds.
Please note: avoid passing HiCUP input files with identical names
prior to the filename extension. For example, the files 'sample.fa' and
'sample.fastq' would produce files with identical names as the pipeline
progresses. This problem could be overcome by renaming one of the files to
'sample2.fa'. To minimise inconvenience, HiCUP will immediately
produce a warning message and not run if the input filenames are too
similar.
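A check of the kind described above might look like the following sketch.
It is written in Python for illustration and is not HiCUP's own code; the
function name is hypothetical, and treating everything after the first dot as
the extension is an assumption.

```python
from collections import defaultdict
from pathlib import PurePath

def find_name_clashes(paths):
    """Group input files that share a name before the filename extension.

    Assumptions: the directory part is ignored, and the extension is taken
    to start at the first dot (so 'sample.fastq.gz' has the base 'sample').
    """
    groups = defaultdict(list)
    for path in paths:
        base = PurePath(path).name.split(".", 1)[0]
        groups[base].append(path)
    # Only bases used by more than one file are a problem.
    return {base: files for base, files in groups.items() if len(files) > 1}

if __name__ == "__main__":
    clashes = find_name_clashes(["sample.fa", "sample.fastq", "other.fastq"])
    for base, files in clashes.items():
        print(f"Input filenames too similar ({base}): {', '.join(files)}")
```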
RELEASE NOTES FOR HICUP v0.5.2 (22 November 2014)
-------------------------------------------------
HiCUP v0.5.2 is a minor update incorporating the changes described below:
- When running the whole pipeline, a folder may be specified to which all
intermediate files are written. These files are deleted after the
hicup_deduplicator script has terminated.
- FASTQ format now specified differently, valid options are:
Sanger, Solexa_Illumina_1.0, Illumina_1.3 or Illumina_1.5
- All configuration files kept in the 'config_files' folder.
- hicup_digester may be passed lowercase characters denoting the
restriction enzyme recognition site.
- Pipeline no longer terminates if SAM Tools is not installed.
RELEASE NOTES FOR HICUP v0.5.1 (02 November 2014)
-------------------------------------------------
HiCUP v0.5.1 is a minor update incorporating the changes described below:
- hicup_deduplicator now accepts --threads as an argument.
RELEASE NOTES FOR HICUP v0.5.0 (01 October 2014)
-------------------------------------------------
HiCUP v0.5.0 is a major update incorporating the changes described below:
- HiCUP compatible with Bowtie2.
- To reduce mis-mapping, HiCUP no longer uses --strata as a parameter
when aligning with Bowtie.
- Perl module GD::Graph is no longer a dependency since the pipeline now
uses R to generate figures.
- The scripts ‘hicup’, ‘hicup_truncater’ and ‘hicup_mapper’ all require
file pairs as input. Such pairs should now be placed on adjacent lines
in their respective configuration files. (Previous versions of HiCUP
required paired files to be placed on the same line, separated using
the pipe (‘|’) character. HiCUP still accepts filename pairs in this
deprecated format.)
- Command line arguments can be used in conjunction with a configuration
file. Parameters set using the command line will override those set
using a configuration file.
- Fixed bug that restricted HiCUP to processing files in the
current working directory.
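The two configuration behaviours described above (file pairs on adjacent
lines, with the pipe-separated form still accepted, and command-line values
overriding configuration-file values) can be sketched as below. This is an
illustrative Python sketch, not HiCUP's parser: the function names are
hypothetical, and the input is assumed to be the filename lines only, with
'#' marking comments.

```python
def read_file_pairs(lines):
    """Collect (forward, reverse) filename pairs.

    Accepts pairs on adjacent lines and the deprecated 'fileA|fileB' form.
    """
    pairs = []
    pending = None  # first file of a pair still waiting for its partner
    for raw in lines:
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        if "|" in line:
            if pending is not None:
                raise ValueError(f"unpaired file before: {line}")
            first, second = (part.strip() for part in line.split("|", 1))
            pairs.append((first, second))
        elif pending is None:
            pending = line
        else:
            pairs.append((pending, line))
            pending = None
    if pending is not None:
        raise ValueError(f"file without a partner: {pending}")
    return pairs

def merge_settings(config, cli):
    """Command-line values (when given, i.e. not None) override config values."""
    merged = dict(config)
    merged.update({key: value for key, value in cli.items() if value is not None})
    return merged

if __name__ == "__main__":
    print(read_file_pairs(["a_1.fq", "a_2.fq", "b_1.fq|b_2.fq"]))
    print(merge_settings({"threads": 1, "zip": 1}, {"threads": 4, "zip": None}))
```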
RELEASE NOTES FOR HICUP v0.4.2 (27 November 2013)
-------------------------------------------------
HiCUP v0.4.2 is a minor update incorporating the changes described below:
- Pipeline produces di-tag length (prior to filtering) distribution
plots in the HTML document and as a separate GD::Graph line plot.
- Pipeline produces an HTML report when following the double-digest
protocol.
- HTML file reports percentage unique di-tags following de-duplication.
- HiCUP version number and pipeline parameters are printed in the
headers of the SAM/BAM files.
- User can specify Hi-C ligation sequences directly via hicup.conf
- Fixed bug causing HTML reports not to be written if a single sample
did not pass completely through the pipeline.
- Fixed bug preventing 'longest:' or 'shortest:' config option being
left blank in hicup.conf
RELEASE NOTES FOR HICUP v0.4.1 (23 September 2013)
--------------------------------------------------
HiCUP v0.4.1 is a minor update incorporating the changes described below:
- Fixed bug causing results from different datasets to become mixed in
the HTML report.
- Improved HTML report layout.
RELEASE NOTES FOR HICUP v0.4.0 (10 September 2013)
--------------------------------------------------
HiCUP v0.4.0 is a major update incorporating the changes described below:
- A di-tag is defined by each of its constituent reads and each read is
now defined by 1) the position in the reference genome to which the first
base of the sequence read (putative sonication cut site) maps; and 2) the
orientation of the sequence read with respect to the reference genome.
The read end position is no longer considered.
- Fixed bug in hicup_deduplicator preventing it identifying all
duplicates if the sequencing run was performed with a different number
of cycles.
- Added --nofill option to hicup_truncater (and consequently hicup)
allowing users to specify that a fill-in of sticky ends was not
performed.
- Made slight adjustment to hicup_truncater to prevent division by zero
errors when working out the %truncation rate.
- hicup_truncater now truncates upstream of the restriction enzyme
recognition site, not at the restriction enzyme cut site.
- hicup_mapper now reports the number of reads that were too short to
be sent to Bowtie. The mapping efficiency will refer to:
(100 * unique alignments) /
(reads too short to send to Bowtie + reads sent to Bowtie).
- hicup_filter now discriminates between same-fragment dangling ends
and same-fragment internal.
- HiCUP master script edited so that it processes all the reads in a
file to determine FASTQ quality scores.
- Added a check in the pipeline to confirm the number of forward and
reverse reads is the same in the FASTQ file pairs (some users appear
to filter reads in some way prior to passing them to HiCUP).
- Users can now specify an output directory to which output files are
written.
- All summary files now display the same datestamp.
- HiCUP sonication protocol now generates an HTML report summarising the
results of each file processed.
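Two of the changes above lend themselves to a short sketch: the di-tag
definition (first-base position plus orientation of each read) and the
mapping efficiency formula. The Python below is illustrative only and is not
HiCUP's implementation. The function names are hypothetical, positions are
assumed to be 1-based leftmost alignment coordinates, and treating the two
reads of a di-tag as unordered is an assumption of this sketch.

```python
def read_key(chromosome, leftmost_pos, aligned_length, strand):
    """Describe a read by where its first sequenced base maps, and its strand.

    For a reverse-strand read the first sequenced base lies at the rightmost
    aligned position, so the read end position is never part of the key.
    """
    if strand == "+":
        first_base = leftmost_pos
    elif strand == "-":
        first_base = leftmost_pos + aligned_length - 1
    else:
        raise ValueError(f"strand must be '+' or '-': {strand}")
    return (chromosome, first_base, strand)

def ditag_key(read_a, read_b):
    """Key for a di-tag built from its two read keys (order-independent)."""
    return tuple(sorted((read_a, read_b)))

def mapping_efficiency(unique_alignments, reads_too_short, reads_sent):
    """(100 * unique alignments) / (reads too short + reads sent to Bowtie)."""
    total = reads_too_short + reads_sent
    if total == 0:
        raise ValueError("no reads to report on")
    return 100 * unique_alignments / total

if __name__ == "__main__":
    a = read_key("chr1", 100, 50, "+")
    b = read_key("chr2", 200, 40, "-")
    print(ditag_key(a, b) == ditag_key(b, a))
    print(mapping_efficiency(80, 10, 90))
```

Two reads of different lengths that start at the same position on the same
strand receive the same key, which is consistent with the fix for runs
sequenced with a different number of cycles.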
RELEASE NOTES FOR HICUP v0.3.0 (02 November 2012)
-------------------------------------------------
HiCUP v0.3.0 is a major update incorporating the changes described below:
- hicup_sorter removed from the pipeline. Users should now de-multiplex
samples using other software.
- The pipeline determines automatically the FASTQ format (e.g. Sanger)
adopted if this was not specified by the user. It may not always be
possible to determine this unambiguously and in such circumstances the
pipeline selects the most likely option.
- The pipeline will try to determine the path to Bowtie if this is not
specified by the user. Also, fixed a bug in how HiCUP identifies the
location of SAMtools.
- Improved how the pipeline checks Bowtie indices have been specified
correctly by the user.
RELEASE NOTES FOR HICUP v0.2.2 (03 August 2012)
-----------------------------------------------
HiCUP v0.2.2 is a minor update adding the functionality described below:
- The mapping process is now less memory intensive, with Bowtie
option --chunkmbs lowered from 1024 to 512. Also, reads shorter than
20 bp are no longer passed to Bowtie.
- Fixed bug preventing HiCUP processing files in a separate folder from
the hicup.conf configuration file.
- The hicup master script terminates immediately if another pipeline
script dies.
RELEASE NOTES FOR HICUP v0.2.1 (19 July 2012)
---------------------------------------------
HiCUP v0.2.1 is a minor update adding the functionality described below:
- The hicup_filter script now calculates the number of di-tags whose
alignment to the reference genome suggests they were created by a single
restriction fragment circularizing and ligating to itself, and then
following sonication forms a linear molecule containing the modified Hi-C
restriction site. These di-tags were previously classified as
'Same Fragment' di-tags, but this category now comprises all di-tags
in which both reads map to the same restriction fragment except those
classified as derived from circularized molecules.
- The hicup_filter summary file now reports the number of di-tags
falling within each category, not the percentage summary.
RELEASE NOTES FOR HICUP v0.2.0 (26 June 2012)
---------------------------------------------
HiCUP v0.2.0 is a major update providing extra functionality in addition
to the removal of several bugs. The most significant changes are
described below:
- Addition of a new script ('hicup_deduplicator') at the end of the
pipeline for removing duplicate di-tags (most probably PCR artefacts).
The script generates a text file summarising the results, along with a
pie chart (*.png format) of the cis/trans ratio of di-tags present in
the final dataset.
- hicup_mapper and hicup_pairer combined into a single script called
'hicup_mapper'. In addition to reporting Bowtie mapping efficiencies,
the script also reports the percentage read pairs in which both
forward and reverse reads could be aligned to the reference genome.
- hicup_filter, when processing Hi-C data generated using the Hi-C
sonication protocol, now rejects di-tags on the basis of size AFTER
all other filters have been passed.
- hicup_filter produces a less verbose summary file and produces pie
charts of the results (*.png format).
- hicup_filter modified so when following the sonication protocol it
identifies and rejects di-tags containing re-ligated fragments, not
simply those on adjacent fragments. It does this by checking that the
fragments are adjacent (as before) and in the same orientation - which
means the forward and reverse reads will be on opposite strands
(+ and -).
- hicup_truncater now reports the average length of a truncated
sequence.
- Fixed a bug causing hicup_digester to only process the last
chromosome in a file containing multiple chromosomes.
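The re-ligation test described above for hicup_filter (adjacent fragments,
with the forward and reverse reads on opposite strands) can be sketched
roughly as follows. This is a simplified Python illustration, not the
script's actual logic: the function name and the (chromosome, fragment index,
strand) representation are hypothetical, and the check on which read lies
upstream is deliberately left out.

```python
def looks_religated(read1, read2):
    """Flag a di-tag whose reads sit on adjacent fragments, opposite strands.

    Each read is a tuple of (chromosome, fragment_index, strand), where
    fragment_index numbers restriction fragments along a chromosome.
    """
    chrom1, frag1, strand1 = read1
    chrom2, frag2, strand2 = read2
    adjacent = chrom1 == chrom2 and abs(frag1 - frag2) == 1
    opposite_strands = {strand1, strand2} == {"+", "-"}
    return adjacent and opposite_strands

if __name__ == "__main__":
    print(looks_religated(("chr1", 10, "+"), ("chr1", 11, "-")))
    print(looks_religated(("chr1", 10, "+"), ("chr1", 11, "+")))
```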
RELEASE NOTES FOR HICUP v0.1.1
------------------------------
HiCUP v0.1.1 is an alpha release and as such is still a work in progress.
Although the core functionality of the program should all now be
present, you should bear this in mind when running the software.