
Removing layers with layers.pop() doesn't work? #2371

Closed

aristid opened this issue Apr 17, 2016 · 40 comments

Comments

@aristid

aristid commented Apr 17, 2016

I'm trying to run https://github.com/iamaaditya/VQA_Demo, which uses a pre-trained VGG16 model (https://github.com/iamaaditya/VQA_Demo/blob/master/models/CNN/VGG.py). The last 2 layers are removed with layers.pop(); this doesn't seem to work, however. The message "ValueError: could not broadcast input array from shape (1000) into shape (4096)" is displayed, and I can see the layers were not removed when I run plot(image_model, to_file='model_vgg.png', show_shapes=True). Thank you.

@joelthchao
Contributor

If you want to fine-tune this model, you should add a new dense layer after popping the original one.

@aristid
Author

aristid commented Apr 17, 2016

The original author does that; here is the notebook: https://github.com/iamaaditya/VQA_Demo/blob/master/Visual_Question_Answering_Demo_in_python_notebook.ipynb

Image features are supposed to be merged with language features, but it doesn't come to that because of the error I mentioned. The issue is, the model still uses the popped layers, which can be verified by plotting it.

If I comment out "model = model.model" (line 21 of keras/utils/visualize_util.py), the right model without the last layers is plotted. But I don't know what it means or how to let Keras know the layers have changed.

Additionally, I just verified this doesn't happen with Keras 0.3.1; the layers were dropped correctly there.

@somewacko
Contributor

somewacko commented Apr 17, 2016

I actually ran into this recently, and I believe it's an issue with how Model was rewritten to be more graph-based. In addition to the layers property, models also have an outputs property, so when you pop layers off, your last layer is left dangling in the model's output (which is why your model still thinks its output size is 1000).

A method to pop layers off and manage all the links correctly is needed, but until then I believe this would work (my model compiles, but I haven't tried training it yet):

# ... Load pre-trained VGG16 model

model.layers.pop() # Get rid of the classification layer
model.layers.pop() # Get rid of the dropout layer
model.outputs = [model.layers[-1].output]
model.layers[-1].outbound_nodes = []
@aristid
Author

aristid commented Apr 18, 2016

Thank you. I downgraded to 0.3.1 where it works, so I can't try your fix right now. I'm not sure that's all there is to it, because if I'm not mistaken plot() would still draw all the popped layers (I tried popping several layers and they all re-appeared), so maybe a copy of model.layers is stored somewhere.

As a side note, model.layers.pop() looked like a hack anyway. We did not push the layers there and should not expect to pop them without side effects. Maybe something like model.remove() would make more sense as a counterpart to model.add().

@albertomontesg

@fchollet Would it be possible to add to the Sequential model the ability to pop the last layer, updating outputs and outbound_nodes?

@joelthchao
Contributor

joelthchao commented Apr 19, 2016

@albertomontesg like this?

def pop(self):
    '''Removes the last layer in the model (the top of the layer stack).
    '''
    if not self.outputs:
        raise Exception('Sequential model cannot be popped: model is empty.')
    else:
        self.layers.pop()
        if not self.layers:
            self.outputs = []
            self.inbound_nodes = []
            self.outbound_nodes = []
        else:
            self.layers[-1].outbound_nodes = []
            self.outputs = [self.layers[-1].output]
        self.built = False
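
A quick usage sketch (hypothetical, assuming this pop() is added to Sequential; layer sizes are made up):

from keras.models import Sequential
from keras.layers import Dense

model = Sequential([Dense(4, input_dim=10), Dense(1)])
model.pop()                 # removes the Dense(1) layer and rewires model.outputs
print(len(model.layers))    # 1
print(model.outputs)        # the output tensor of the remaining Dense(4)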
@albertomontesg

@joelthchao Yes. That was exactly what I was thinking.

@aristid
Author

aristid commented Apr 19, 2016

@joelthchao seems to work fine, indeed. Thanks!

P.S. This particular example runs into #2386 after that, so it works under 0.3.1 only.

@iamaaditya
Contributor

@joelthchao Thanks for the idea.
@aristid I have changed the code to work with Keras version 1+.

This issue can be considered closed.

@shaayaansayed

Can confirm I'm still having this issue on Keras 1.0.6.

@m1sk

m1sk commented Jul 27, 2016

I'm having a related problem #3310.

@keunwoochoi
Contributor

Same.

@SlimFrikha

SlimFrikha commented Jan 6, 2017

I encountered the same problem and I managed to pop the layers like this:

from convnetskeras.convnets import AlexNet
alexnet = AlexNet(weights_path="weights/alexnet_weights.h5")
alexnet.layers.pop() # Get rid of the softmax classification layer
alexnet.outputs = [alexnet.layers[-1].output]
alexnet.output_layers = [alexnet.layers[-1]] # added this line in addition to zo7's solution
alexnet.layers[-1].outbound_nodes = []

Hope this helps!
(tested on Keras version 1.2.0)

@alyato

alyato commented Mar 7, 2017

@joelthchao I want to use this pretrained VGG19.

x = Flatten(name='flatten')(x)
x = Dense(4096, activation='relu', name='fc1')(x)
x = Dense(4096, activation='relu', name='fc2')(x)
x = Dense(1000, activation='softmax', name='predictions')(x)
model = Model(img_input, x)
return model

I have only 8 labels to classify.
But this model is built with the Keras functional API, not the Sequential model.
How can I use model.layers.pop()?
Like this?

x = Dense(1000, activation='softmax', name='predictions')(x)
model = Model(img_input, x)
model.layers.pop()
x = Dense(8, activation='softmax', name='predictions')(x)
# load weights
return model

Could you give me some advice? Thanks

@joelthchao
Contributor

@alyato No need to pop; just find the layer you want to connect to, attach the new layer there, and build a new model with the correct input and output.

@joelthchao
Contributor

Snippet for your question:

from keras.models import Model
from keras.layers import Dense, Input

# Assume we have a pretrained model
net_input = Input(shape=(10,))
net = Dense(4)(net_input)
net_output = Dense(1)(net)
model1 = Model(net_input, net_output)

# Get input
new_input = model1.input
# Find the layer to connect
hidden_layer = model1.layers[-2].output
# Connect a new layer on it
new_output = Dense(2)(hidden_layer)
# Build a new model
model2 = Model(new_input, new_output)

And this is the result

model1:
____________________________________________________________________________________________________
Layer (type)                     Output Shape          Param #     Connected to
====================================================================================================
input_1 (InputLayer)             (None, 10)            0
____________________________________________________________________________________________________
dense_1 (Dense)                  (None, 4)             44          input_1[0][0]
____________________________________________________________________________________________________
dense_2 (Dense)                  (None, 1)             5           dense_1[0][0]
====================================================================================================

model2:
____________________________________________________________________________________________________
Layer (type)                     Output Shape          Param #     Connected to
====================================================================================================
input_1 (InputLayer)             (None, 10)            0
____________________________________________________________________________________________________
dense_1 (Dense)                  (None, 4)             44          input_1[0][0]
____________________________________________________________________________________________________
dense_3 (Dense)                  (None, 2)             10          dense_1[0][0]
====================================================================================================
@somewacko
Contributor

Also, there's an available method model.get_layer(name=None, index=None) for easily getting intermediate layers out of a model.

https://keras.io/models/model/#methods
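
For example (a rough sketch; the layer name 'fc2' is hypothetical, and model is your existing model):

from keras.models import Model
from keras.layers import Dense

# fetch an intermediate layer's output by name (or by index)
hidden = model.get_layer('fc2').output
# hidden = model.get_layer(index=-2).output  # equivalent, by index
new_output = Dense(8, activation='softmax')(hidden)  # example 8-class head
model2 = Model(model.input, new_output)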

@alyato

alyato commented Mar 9, 2017

@joelthchao Thanks, it runs well. But I don't know how to get the index of each layer.
Should I use model.get_config() to count the index of every layer?

@joelthchao
Contributor

@alyato Sort of. I use [x.name for x in model.layers] to find the index manually.
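
e.g. to see the indices next to the names:

# print each layer's index and name to pick the right one
for i, layer in enumerate(model.layers):
    print(i, layer.name)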

@srv902

srv902 commented Mar 9, 2017

@joelthchao

Hi, I am trying to append VGG16 to an existing ResNet model, but after executing the pop() you wrote, the output shape shows as "multiple". How do I resolve this?

@joelthchao
Contributor

@srv902 Is it a Sequential model or a functional API model?

@srv902

srv902 commented Mar 9, 2017

@joelthchao It is a functional API model.

@joelthchao
Contributor

@srv902 If the layer you want to connect to has multiple outputs, then you can only choose one output to connect to. (I bet you are trying to connect to a residual block.) BTW, you could provide a code snippet to help others understand your problem.
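
A rough sketch, assuming model is your existing functional model and the target layer was called more than once (so layer.output is ambiguous and you pick a tensor by node index):

from keras.models import Model
from keras.layers import Dense

layer = model.layers[-2]          # the layer with multiple output tensors
out = layer.get_output_at(0)      # choose one of them explicitly
new_head = Dense(10)(out)         # placeholder head
model2 = Model(model.input, new_head)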

@OnlyBelter

Still having this problem!

@OnlyBelter

The method below works:

from keras.models import Model
from keras.layers import Dense,Flatten
from keras.applications import vgg16
from keras import backend as K

model = vgg16.VGG16(weights='imagenet', include_top=True)

model.input

model.summary(line_length=150)

model.layers.pop()
model.layers.pop()

model.summary(line_length=150)

inp = model.input
out = model.layers[-1].output

model2 = Model(inp, out)  # create a new model which doesn't have the last two layers in VGG16
model2.summary(line_length=150)

reference here

@stale

stale bot commented Sep 13, 2017

This issue has been automatically marked as stale because it has not had recent activity. It will be closed after 30 days if no further activity occurs, but feel free to re-open a closed issue if needed.

@hlfshell
Copy link

hlfshell commented Sep 18, 2017

I can confirm that in Keras 1.1.0, model.layers.pop() does not remove the last layer.

Found this thread due to running into the issue myself.

Code with the error:

vgg = Vgg16()

vgg.model.summary()

vgg.model.layers.pop()

vgg.model.summary()

Edit:

I got the pop to work by calling vgg.model.pop() instead of vgg.model.layers.pop(), as it is a Sequential model.
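
i.e. roughly, assuming vgg.model is a Sequential:

vgg.model.pop()        # Sequential.pop() also updates model.outputs, unlike layers.pop()
vgg.model.summary()    # the last layer is now gone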

@winzee

winzee commented Oct 27, 2017

I think I have a similar problem. I am running Keras 2.0.6 with Python 3.6.2 and tensorflow-gpu 1.3.0.

I run this code after having hand-built a VGG16 architecture and loaded the weights:

        model = self.model
        model.pop()
        for layer in model.layers: layer.trainable=False
        model.add(Dense(num, activation='softmax'))
        self.compile()

And when I check the graph in TensorBoard, I see (top left of the attached picture) dense_3 connected to dropout_2 but dangling by itself. And then next to it I see dense_4, also connected to dropout_2.

Curiously, I get low accuracy when running this on the Kaggle cats vs. dogs competition, where I hover around 90%, whilst others running this code (it's adapted from fast.ai) on top of Theano easily get 97%. Perhaps my accuracy problem comes from somewhere else, but I still don't think dense_3 should be dangling there, and I am wondering if this could be the source.

Secondary question: how could I definitively disconnect and remove dense_3?

[image: model graph]

@AllardJM

@winzee

Have you tried this:

model = self.model
model.pop()

#add these
#https://github.com/flyyufelix/cnn_finetune/blob/master/vgg16.py
model.layers[-1].outbound_nodes = []
model.outputs = [model.layers[-1].output]

for layer in model.layers: layer.trainable=False
model.add(Dense(num, activation='softmax'))
self.compile()

@mrgloom

mrgloom commented Mar 21, 2018

@zo7 Can you comment on outbound_nodes?

i.e., why is this code not sufficient?

model.layers.pop() 
model.outputs = [model.layers[-1].output]
@dlhocker

I am having a similar issue trying to pop the last layer of a ResNet50 model and add a new classifier, using Keras version 2.2.0 with the TensorFlow backend. The previous fix in this thread for dealing with the outbound nodes and output produces an error that I haven't seen mentioned yet. Is there a smarter way to pop the classification layer and add a new one, or a way to deal with this attribute error?

from keras.models import Model
from resnet50 import ResNet50

model = ResNet50(weights='imagenet', include_top=True)
model.layers.pop() # Get rid of the classification layer
model.outputs = [model.layers[-1].output]
model.output_layers = [model.layers[-1]] # added this line in addition to zo7's solution
model.layers[-1].outbound_nodes = []

This is the output

AttributeError                            Traceback (most recent call last)
<ipython-input-3-8d9365679847> in <module>()
     16 model.outputs = [model.layers[-1].output]
     17 model.output_layers = [model.layers[-1]] # added this line in addition to zo7 solution
---> 18 model.layers[-1].outbound_nodes = []

AttributeError: can't set attribute

As an alternative, I know that the include_top=False option will remove the classifier (and the Flatten layer), but I don't know how to correctly attach the output of this form of ResNet to a new classifier (e.g., using model.layers[-1].output?). Any advice on this?
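
For example, would something like this be the right way to attach a new head? (A rough sketch; the 8-class Dense is just a placeholder.)

from keras.applications.resnet50 import ResNet50
from keras.layers import Dense, GlobalAveragePooling2D
from keras.models import Model

base = ResNet50(weights='imagenet', include_top=False)
x = GlobalAveragePooling2D()(base.output)  # pool the conv feature maps
out = Dense(8, activation='softmax')(x)    # new classifier head (placeholder)
model = Model(base.input, out)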

@juanklopper

pop() still does not work (Win 10, GPU Keras on the CNTK backend, newest as of 2018-06-20). To be more specific, it does not work when the model is a Sequential copy of a known model. However, pop() does work on the original model, so use it there to remove the last layer before creating the new Sequential model from the original layers, i.e. vgg16.

@WilliamHoo

Please fix this problem as soon as possible, I need this!

@mrgloom

mrgloom commented Nov 27, 2018

https://stackoverflow.com/a/49403298/1179925

For some reason I need to rebuild the model with Model after popping the layer, before adding new layers, to make things work.

# imports assumed for the snippet below; N_CLASS is your number of target classes
from keras.applications.vgg16 import VGG16
from keras.layers import Dense, GlobalAveragePooling2D
from keras.models import Model
from keras.optimizers import Adadelta

def pop_layer(model):
    if not model.outputs:
        raise Exception('Sequential model cannot be popped: model is empty.')

    model.layers.pop()
    if not model.layers:
        model.outputs = []
        model.inbound_nodes = []
        model.outbound_nodes = []
    else:
        model.layers[-1].outbound_nodes = []
        model.outputs = [model.layers[-1].output]
    model.built = False

def get_model():
    #Fully convolutional part of VGG16
    model = VGG16(include_top=False, weights='imagenet')

    #Remove last max pooling layer
    pop_layer(model)

    #Freeze pretrained layers
    for layer in model.layers:
        layer.trainable = False

    model = Model(inputs=model.inputs, outputs=model.outputs)

    print('len(model.layers)', len(model.layers)) #
    print(model.summary()) #

    x = GlobalAveragePooling2D()(model.output)
    head = Dense(N_CLASS, activation='softmax')(x)

    model = Model(inputs=model.inputs, outputs=head)

    model.compile(optimizer=Adadelta(), loss='categorical_crossentropy', metrics=['accuracy'])

    print('len(model.layers)', len(model.layers)) #
    print(model.summary()) #

    return model

@amandeep25

Hi,
I am using the pre-trained VGG16 model and wanted to remove the last layer, which VGG16 uses for classification.
For my code this worked:

model._layers.pop()
model.summary()

@llealgt

llealgt commented Aug 1, 2020

I have a similar case (https://stackoverflow.com/questions/63202651/how-to-remove-layers-from-a-keras-model-in-order-to-use-as-baseline-for-creating). Using model._layers.pop() as @amandeep25 posted seems to work (model.summary() returns the desired result), but when checking TensorBoard or exporting the model to .pb (the final goal is to compile it for deployment on a Raspberry Pi), the original VGG16 model is still there. Any suggestions?

@hafiz031

hafiz031 commented Dec 1, 2020

@somewacko is there any way to pop a range of layers using indices? I mean, without using a loop or calling the same pop() function multiple times?

@somewacko
Contributor

I believe model.layers is just a list, so you can probably do:

model.layers = model.layers[:-3]  # everything but the last 3 layers
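
Though per the earlier discussion, you'd probably also want to reset the model's outputs afterward so they stay in sync, e.g.:

model.outputs = [model.layers[-1].output]  # as suggested above for popped layers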