Pull requests: karpathy/llm.c

Pull requests list

Adamw thread coarsening kernel
#753 opened Sep 3, 2024 by saladpalad
implement rmsnorm in C
#750 opened Aug 28, 2024 by Jake-Song
Fix sizing typo in train_gpt2_fp32.cu
#748 opened Aug 25, 2024 by gajanan-choudhary
log with LINE and FILE for better addressing.
#746 opened Aug 22, 2024 by NEWPLAN
check libnccl instead of nccl to be more reliable
#742 opened Aug 14, 2024 by dengl11
[WIP] initial curand implementation for model init
#741 opened Aug 13, 2024 by ngc92
multi-threaded model initialization
#737 opened Aug 12, 2024 by ngc92
Add external KV to LLaMA 3
#734 opened Aug 10, 2024 by gordicaleksa
Add SwiGLU support
#718 opened Jul 29, 2024 by gordicaleksa
Add RoPE positional encoding
#714 opened Jul 28, 2024 by gordicaleksa
Add high perf mode
#708 opened Jul 23, 2024 by gordicaleksa
Add KV cache for inference
#707 opened Jul 22, 2024 by gordicaleksa
Major FP32 llm.c improvements/refactoring/etc.
#696 opened Jul 18, 2024 by ademeure
Update README.md with prerequisite of libomp
#691 opened Jul 17, 2024 by nzhang
FP8 work in progress
#678 opened Jul 12, 2024 by ademeure Draft
Recompute mlp
#676 opened Jul 11, 2024 by ngc92
Add option to remove biases
#675 opened Jul 10, 2024 by gordicaleksa
Store RNG seed instead of shuffled indices
#673 opened Jul 9, 2024 by ngc92