Pull requests: antirez/ds4
Pull requests list
DSpark B2 rejection sampling + adaptive block sizing
#482
opened Jun 30, 2026 by
machiabeli
feat: add headless browser support with curl fallback for web tools
#479
opened Jun 29, 2026 by
J3rr1ck
CUDA: make DeepSeek-V4-Pro correct on the indexed-attention path (top_k 512→1024) + enable decode LUT gate for in_dim>4096
#478
opened Jun 29, 2026 by
slackarea
CUDA: scale q8->f16 cache reserve on >=112 GiB cards (fixes session OOM on large models)
#472
opened Jun 28, 2026 by
slackarea
Fix slow decodes "poisoning" sleep times when using power throttling
#464
opened Jun 27, 2026 by
omnomburp
CUDA: batch gate/up/down uploads for selected expert cache misses
#460
opened Jun 26, 2026 by
fmolara
Add served model name option for server discovery
#456
opened Jun 25, 2026 by
RiccardoFiorentini
Metal: keep selected-address SSD prefill opt-in by default
#454
opened Jun 25, 2026 by
andreaborio
Draft
Fix ROCm Q8->F16 cache reserve starving session tensors on large models (q4q2)
#446
opened Jun 23, 2026 by
alantsev
Contributor
AGENTS.md rename (and server performance improvements?)
#443
opened Jun 21, 2026 by
OPS-NeoRetro
Add reverse distributed topology with coordinator-owned output suffix
#430
opened Jun 16, 2026 by
lobanov
Fix: ds4-server rejects HTTP requests using Transfer-Encoding: chunked
#423
opened Jun 16, 2026 by
moritzburgard
agent: reject edit calls whose new= text contains [upto]
#421
opened Jun 16, 2026 by
aledesogusbusiness-hue