-
Notifications
You must be signed in to change notification settings - Fork 501
Pull requests: pytorch/FBGEMM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Enable proper override of AVX512_256 flag
cla signed
fb-exported
#3382
opened Nov 15, 2024 by
efiks
Loading…
[fbgemm_gpu] Re-enable cache tests for ROCm
ciflow/rocm
cla signed
module: rocm
#3380
opened Nov 15, 2024 by
q10
Loading…
Add support for
int32_t
indices in TBE training (2H/N)
cla signed
fb-exported
#3379
opened Nov 15, 2024 by
q10
Loading…
Add support for
int32_t
indices in TBE training (2G/N)
cla signed
fb-exported
#3377
opened Nov 14, 2024 by
q10
Loading…
Add support for
int32_t
indices in TBE training (2F/N)
cla signed
fb-exported
#3376
opened Nov 14, 2024 by
q10
Loading…
Add support for
int32_t
indices in TBE training (2E/N)
cla signed
fb-exported
#3375
opened Nov 14, 2024 by
q10
Loading…
Add support for
int32_t
indices in TBE training (2D/N)
cla signed
fb-exported
#3374
opened Nov 14, 2024 by
q10
Loading…
Add support for
int32_t
indices in TBE training (3/N)
cla signed
fb-exported
#3372
opened Nov 14, 2024 by
q10
Loading…
Add support for
int32_t
indices in TBE training (2B/N)
cla signed
fb-exported
#3371
opened Nov 14, 2024 by
q10
Loading…
Optimzed backward pass for ROCm devices
cla signed
module: rocm
#3367
opened Nov 13, 2024 by
avbokovoy
Loading…
Adjust EmbeddingSpMDMAutovec API
cla signed
fb-exported
#3366
opened Nov 13, 2024 by
MatzeB
Loading…
open-source SLL jagged_dense_elementwise_mul_jagged_out
cla signed
fb-exported
#3354
opened Nov 12, 2024 by
TroyGarden
Loading…
open-source SLL jagged2_to_padded_dense
cla signed
fb-exported
#3352
opened Nov 12, 2024 by
TroyGarden
Loading…
Introduce sve function for matrix multiplication
cla signed
fb-exported
#3348
opened Nov 11, 2024 by
Nicoshev
Loading…
Add manual loop unroll for rocm devices in fwd pass (#3309)
cla signed
fb-exported
module: rocm
#3345
opened Nov 8, 2024 by
leitian
Loading…
Add new optimizer state
row_counter
for Adam [Backend]
cla signed
fb-exported
#3342
opened Nov 8, 2024 by
spcyppt
Loading…
Fix global namespace pollution in ATen/Dispatch.h
cla signed
fb-exported
#3334
opened Nov 6, 2024 by
slyfox3
Loading…
Add support for
int32_t
indices in TBE training (2/N)
cla signed
fb-exported
#3326
opened Nov 5, 2024 by
q10
Loading…
Add template info into generated files
cla signed
fb-exported
#3325
opened Nov 5, 2024 by
q10
Loading…
Update benchmark test for
Int32_t
Indicies
cla signed
fb-exported
#3317
opened Nov 3, 2024 by
q10
Loading…
Unitifed Prefetching API for CPU TBE
cla signed
fb-exported
#3314
opened Nov 2, 2024 by
excelle08
Loading…
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.