Gradient computation fails while training 'HRNet_FeatureExtractor' due to an inplace operation #308

mohammadalihumayun · 2024-08-06T05:05:19Z

With the latest torch version, when I try to train HRNet_FeatureExtractor from modules.feature_extraction, I get the following error:

one of the variables needed for gradient computation has been modified by an inplace operation: [torch.FloatTensor [4, 512, 4, 50]], which is output 0 of ReluBackward0, is at version 1; expected version 0 instead. Hint: the backtrace further above shows the operation that failed to compute its gradient. The variable in question was changed in there or anywhere later.
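For context, this class of error can be reproduced outside the model. The following is a minimal sketch, not taken from the HRNet code: it assumes only that a ReLU output is edited in place before `backward()` is called. The function names are hypothetical and exist only for illustration.

```python
import torch


def backward_fails_after_inplace_edit() -> bool:
    """Return True if autograd rejects an in-place edit of a ReLU output."""
    x = torch.randn(4, 8, requires_grad=True)
    y = torch.relu(x)  # ReLU saves its output for use in the backward pass
    y += 1.0           # in-place edit bumps the saved tensor's version counter
    try:
        y.sum().backward()
    except RuntimeError as err:
        # The message matches the one reported above.
        return "inplace operation" in str(err)
    return False


def backward_succeeds_out_of_place() -> bool:
    """Same computation, but the addition allocates a new tensor."""
    x = torch.randn(4, 8, requires_grad=True)
    y = torch.relu(x)
    y = y + 1.0        # out-of-place: the ReLU output itself is left untouched
    y.sum().backward()
    return x.grad is not None


if __name__ == "__main__":
    print("in-place edit fails:", backward_fails_after_inplace_edit())
    print("out-of-place edit works:", backward_succeeds_out_of_place())
```

If the feature extractor contains a pattern like the first function (for example an in-place addition or an in-place activation applied to a ReLU output), rewriting it out of place is the usual fix. Whether that is what happens inside HRNet_FeatureExtractor is not confirmed here.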

The following is the model that uses the feature extractor.
Please note that the dataset used as input is a list of tuples, each containing an image as a numpy array and a label as a text string.
```python
import torch.nn as nn
from modules.feature_extraction import HRNet_FeatureExtractor

# BidirectionalLSTM, num_class and device are defined elsewhere (not shown here).


class new_Model(nn.Module):
    def __init__(self, input_channel=3,
                 output_channel=32,
                 FeatureExtraction='HRNet',
                 SequenceModeling='DBiLSTM',
                 Prediction='CTC',
                 batch_max_length=100,
                 hidden_size=256,
                 imgH=32,
                 imgW=400):
        super(new_Model, self).__init__()
        self.stages = {'Feat': FeatureExtraction,
                       'Seq': SequenceModeling,
                       'Pred': Prediction}

        self.FeatureExtraction = HRNet_FeatureExtractor(input_channel, output_channel)
        self.FeatureExtraction_output = output_channel
        self.AdaptiveAvgPool = nn.AdaptiveAvgPool2d((None, 1))  # Transform final (imgH/16-1) -> 1
        self.SequenceModeling_output = hidden_size
        self.SequenceModeling = nn.Sequential(
            BidirectionalLSTM(self.FeatureExtraction_output, hidden_size, hidden_size),
            BidirectionalLSTM(hidden_size, hidden_size, hidden_size))
        self.Prediction = nn.Linear(self.SequenceModeling_output, num_class)

    def forward(self, input, text=None, is_train=True):
        visual_feature = self.FeatureExtraction(input)
        visual_feature = self.AdaptiveAvgPool(visual_feature.permute(0, 3, 1, 2))  # [b, c, h, w] -> [b, w, c, h]
        visual_feature = visual_feature.squeeze(3)  # [b, w, c, 1] -> [b, w, c]
        contextual_feature = self.SequenceModeling(visual_feature)
        prediction = self.Prediction(contextual_feature.contiguous())
        return prediction


model = new_Model()
model = model.to(device)
model.train()
```

However, please note that the same code runs fine when I use another feature extractor, e.g. just by replacing
`self.FeatureExtraction = HRNet_FeatureExtractor(input_channel, output_channel)`
with
`self.FeatureExtraction = DenseNet_FeatureExtractor(input_channel, output_channel)`
within the model.
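Since swapping the extractor makes the error go away, one diagnostic worth trying is to switch off in-place activations inside the extractor before training. The helper below is a hedged sketch: `disable_inplace_activations` is a hypothetical name, and it assumes the offending operation is a module exposing an `inplace` flag (as `nn.ReLU` does). It will not catch in-place arithmetic such as `out += residual` written directly in a `forward` method.

```python
import torch.nn as nn


def disable_inplace_activations(model: nn.Module) -> list:
    """Set inplace=False on every submodule that has inplace=True.

    Returns the names of the submodules that were changed, so the
    result also serves as a report of where in-place ops were found.
    """
    changed = []
    for name, module in model.named_modules():
        # nn.ReLU, nn.LeakyReLU, nn.Dropout, etc. expose an `inplace` attribute.
        if getattr(module, "inplace", False) is True:
            module.inplace = False
            changed.append(name)
    return changed


if __name__ == "__main__":
    # Toy stand-in for a feature extractor; not the real HRNet.
    toy = nn.Sequential(
        nn.Conv2d(3, 8, kernel_size=3, padding=1),
        nn.ReLU(inplace=True),
        nn.ReLU(inplace=True),
    )
    print(disable_inplace_activations(toy))
```

With the model from this issue, the call would be `disable_inplace_activations(model.FeatureExtraction)` before the training loop. If the returned list is empty and the error persists, the in-place edit is more likely an explicit tensor operation inside the extractor's `forward`.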
