net.cpp now allows zero-sized batches #2053
Conversation
@jeffdonahue I see that somebody is tagging PRs as "ready for review". Can you tag this one as well?
I also have a use case for this PR, so I vote for making Caffe aware of this. However, force-reshaping all the top blobs doesn't quite seem right to me; it seems like this would be better handled when Reshape gets called on each layer, right before the forward pass. That way, each layer can decide for itself whether it wants all of its top blobs to have zero batch size. Forward/backward would then be skipped only if all bottom and top blobs have zero batch size. Another potential improvement to this PR would be to make the solvers aware of backwardIsAllowed() and skip updating the associated parameters; right now, it seems like momentum and weight decay would still be applied.
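A minimal sketch of that per-layer idea, written against Caffe's `Layer<Dtype>::Reshape` interface (the `ZeroAwareLayer` class and its `num_output_` field are hypothetical; only the `Blob`/`Layer` signatures are Caffe's):

```cpp
#include <vector>

#include "caffe/blob.hpp"
#include "caffe/layer.hpp"

namespace caffe {

// Hypothetical layer that decides for itself what a zero-sized batch
// means: the batch axis is propagated as-is (possibly 0), while the
// layer still sets the non-batch axes it owns -- so the top shape need
// not mirror the bottom shape.
template <typename Dtype>
void ZeroAwareLayer<Dtype>::Reshape(const vector<Blob<Dtype>*>& bottom,
    const vector<Blob<Dtype>*>& top) {
  vector<int> top_shape(2);
  top_shape[0] = bottom[0]->shape(0);  // batch axis: may legitimately be 0
  top_shape[1] = num_output_;          // layer-specific output size
  top[0]->Reshape(top_shape);
}

}  // namespace caffe
```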
Force-pushed from 47169e3 to 40f4f6c
Rebased on master.
@jeffdonahue This is the last one of the triplet for filter_layer. Can you take a pass at this one?
Ping.
Agreed with @cdoersch -- this is a bit too aggressive in its assumptions about what each layer might want to do in the event of size-0 batches (for example, the output shouldn't necessarily have the same shape as the input, as assumed here, and often doesn't), and should probably operate at the level of individual layers rather than the net. Furthermore, the net itself doesn't and shouldn't have any global notion of batch size, and the assumption that "batch size" is the 0th axis of each blob in the net isn't valid (at least, not anymore). On a more mundane note, code in …
A more general solution would be to skip blobs with 0 entries, since any blob with 0 along its first axis has 0 total entries. One open question would be how to resize all the blobs after the 0-sized blob, if at all. Another thought: …
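A sketch of what that skip could look like inside `Net<Dtype>::ForwardFromTo` in net.cpp, assuming the rule discussed above -- run a layer only if some attached blob is non-empty (the `AllBlobsEmpty` helper is hypothetical; `Blob::count()` and the `bottom_vecs_`/`top_vecs_`/`layers_` members are Caffe's):

```cpp
#include <vector>

#include "caffe/blob.hpp"

namespace caffe {

// Hypothetical helper: true iff every blob in the vector has zero
// entries. count() == 0 catches a 0 along *any* axis, so no global
// "batch size is axis 0" assumption is needed.
template <typename Dtype>
bool AllBlobsEmpty(const vector<Blob<Dtype>*>& blobs) {
  for (int i = 0; i < blobs.size(); ++i) {
    if (blobs[i]->count() > 0) { return false; }
  }
  return true;
}

// Inside Net<Dtype>::ForwardFromTo's layer loop, the existing call
//   loss += layers_[i]->Forward(bottom_vecs_[i], top_vecs_[i]);
// would then be guarded roughly like this:
//   if (!(AllBlobsEmpty(bottom_vecs_[i]) && AllBlobsEmpty(top_vecs_[i]))) {
//     loss += layers_[i]->Forward(bottom_vecs_[i], top_vecs_[i]);
//   }

}  // namespace caffe
```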
Could anyone tell me how empty top/bottom blobs are handled right now? Is forward/backward prevented for 0-size batches? All I know is that when I feed empty bottoms (as produced by the FilterLayer) into a loss layer (SoftmaxWithLoss), Caffe crashes with a cuDNN bad-param error.
@hyojinie unlikely to resume this |
This is a new PR based on the old #1484, which was no longer mergeable.
Old description: