How could I use multi-gpu #58

It seems that the model only runs on a single GPU no matter how many GPUs are available. If the model takes more memory than a single GPU has, training fails with an out-of-memory (OOM) error. I can train the model on a single GPU with the default configuration, but once I double the batch size and use two GPUs, I get OOM errors. How can I use multiple GPUs in this case?
Comments
This sadly doesn't work out of the box in TensorFlow; you would need to adjust the code quite a bit for this to work. For example, you could start by taking a look at this example. This is not something we are planning to do, though.
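For orientation, a minimal sketch of the classic TF1 "tower" pattern that such an adjustment would involve. This is not this repo's code: `build_loss(images, labels)` is a hypothetical function that builds the network on a sub-batch and returns a scalar loss.

```python
import tensorflow as tf

def multi_gpu_train_op(images, labels, num_gpus, build_loss):
    """Classic TF1 tower pattern: split the batch, build one tower per GPU
    with shared variables, then average the per-tower gradients."""
    opt = tf.train.AdamOptimizer(1e-4)
    image_splits = tf.split(images, num_gpus, axis=0)
    label_splits = tf.split(labels, num_gpus, axis=0)
    tower_grads = []
    for i in range(num_gpus):
        with tf.device('/gpu:%d' % i):
            # Reuse variables after the first tower so all GPUs share weights.
            with tf.variable_scope('model', reuse=(i > 0)):
                loss = build_loss(image_splits[i], label_splits[i])
                tower_grads.append(opt.compute_gradients(loss))
    averaged = []
    for grads_and_vars in zip(*tower_grads):  # iterate variable by variable
        grads = tf.stack([g for g, _ in grads_and_vars], axis=0)
        averaged.append((tf.reduce_mean(grads, axis=0), grads_and_vars[0][1]))
    return opt.apply_gradients(averaged)
```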
I have the same problem. Did you solve it?
@chris20181220 Yes, I reimplemented it in PyTorch, and my implementation supports multi-GPU training.
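For readers landing here, a minimal sketch of the usual PyTorch route (the backbone below is a stand-in, not the actual re-implementation): `nn.DataParallel` splits each batch along dimension 0 across all visible GPUs and gathers the outputs back onto the default device.

```python
import torch
import torch.nn as nn

# Stand-in embedding network, just to make the sketch runnable.
backbone = nn.Sequential(
    nn.Conv2d(3, 64, kernel_size=3, stride=2),
    nn.AdaptiveAvgPool2d(1),
    nn.Flatten(),
    nn.Linear(64, 128),
)
model = nn.DataParallel(backbone).cuda()

images = torch.randn(32, 3, 256, 128).cuda()  # a P*K batch, e.g. P=8, K=4
embeddings = model(images)  # forward pass is split across all GPUs
# The gathered embeddings live on one device, so the triplet loss and
# mining see the full batch no matter how many GPUs ran the forward pass.
```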
@CoinCheung If I need to use TF, do you know how to fix it?
@chris20181220 As the author said, it would be quite tedious and a lot of code would need to be modified. I do not think I can do it now; sorry I cannot help.
@CoinCheung OK, thank you all the same. I will try to modify it.
You should also be aware that there is the question of how to do the triplet mining in the batch: mine on each GPU's batch independently, or gather all batch outputs onto one fixed GPU and mine there in the large, complete batch. There are trade-offs, and it is not clear which is best. Note: I have linked your re-implementation in our README, as it could be useful for others. Let me know if you don't want this.
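To make the two options concrete, here is a hedged PyTorch sketch of batch-hard mining with a hinge margin; names like `emb_parts` are illustrative placeholders for the per-GPU forward outputs, not anything from this repo.

```python
import torch

def batch_hard(embeddings, labels, margin=0.3):
    """Batch-hard triplet loss over one batch of embeddings."""
    dist = torch.cdist(embeddings, embeddings)           # (B, B) pairwise distances
    same = labels[:, None] == labels[None, :]            # positive-pair mask
    pos = (dist * same.float()).max(dim=1).values        # hardest positive
    neg = (dist + same.float() * 1e6).min(dim=1).values  # hardest negative
    return torch.relu(pos - neg + margin).mean()

# Option A: gather all sub-batch outputs onto one GPU and mine once there.
# emb = torch.cat([e.to('cuda:0') for e in emb_parts])
# lab = torch.cat([l.to('cuda:0') for l in lab_parts])
# loss = batch_hard(emb, lab)

# Option B: mine on each GPU's sub-batch independently and average the losses.
# loss = sum(batch_hard(e, l) for e, l in zip(emb_parts, lab_parts)) / len(emb_parts)
```

Option A sees harder (cross-GPU) triplets but pays a communication cost; Option B is cheaper but each GPU can only mine within its own sub-batch.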
Also keep in mind what you do with batch normalization. When you split the batch, it could pay off to split it specifically into two P×K/2 batches, instead of two P/2×K batches, unless you specifically sync your batch normalization across GPUs.
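A small sketch of that split, assuming the batch is laid out identity-major as P blocks of K consecutive images (the function and argument names are made up for illustration):

```python
import torch

def split_along_k(images, labels, P, K, num_gpus=2):
    """Split a P*K batch into `num_gpus` sub-batches of shape P x (K/num_gpus),
    so every GPU still sees all P identities in its batch-norm statistics."""
    assert K % num_gpus == 0
    imgs = images.view(P, K, *images.shape[1:])
    labs = labels.view(P, K)
    sub_batches = []
    for img_chunk, lab_chunk in zip(imgs.chunk(num_gpus, dim=1),
                                    labs.chunk(num_gpus, dim=1)):
        sub_batches.append((img_chunk.reshape(-1, *images.shape[1:]),
                            lab_chunk.reshape(-1)))
    return sub_batches
```

Splitting along K keeps all P identities on every GPU, so each device's batch-norm statistics stay comparable to the single-GPU case; splitting along P would halve the identity diversity each GPU sees.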