-
Notifications
You must be signed in to change notification settings - Fork 146
Pull requests: llm-d/llm-d-inference-scheduler
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
perf: optimize sidecar proxy hot path for high-concurrency P/D routing
#746
opened Mar 21, 2026 by
tlrmchlsmth
•
Draft
2 of 3 tasks
feat: add pending-tokens-scorer for token-aware load balancing
#745
opened Mar 21, 2026 by
KaveeshKhattar
Loading…
feat: precise-prefix-cache-scorer consumes tokenizer plugin
#744
opened Mar 20, 2026 by
RishabhSaini
Loading…
Remove UDS tokenizer image build from inference scheduler repo
#739
opened Mar 19, 2026 by
elevran
Loading…
deps(go): bump github.com/llm-d/llm-d-kv-cache from 0.6.1-0.20260317211430-786d9c8cd8f6 to 0.6.1 in the go-dependencies group across 1 directory
dependencies
Pull requests that update a dependency file
release-note-none
release notes not required
#738
opened Mar 19, 2026 by
dependabot
bot
Loading…
test: add disruption e2e tests for scheduler failure scenarios
#735
opened Mar 18, 2026 by
hexfusion
Loading…
[DO_NOT_MERGE] Tracker PR for testing GIE
v1.4.0-rc.3
#734
opened Mar 18, 2026 by
Gregory-Pereira
•
Draft
Unified Disaggregate Handler
hold
PRs that are blocked on design, other features, release cycle, etc.
lgtm
"Looks good to me", indicates that a PR is ready to be merged.
#732
opened Mar 17, 2026 by
roytman
Loading…
Basic implementation of dynamic LoRA adapters placement, based on shuffle sharding algorithm
do-not-merge/work-in-progress
Indicates that a PR should not merge because it is a work in progress.
#720
opened Mar 15, 2026 by
dmitripikus
•
Draft
fix(test): Increase test coverage for prefix_based_pd_decider.go
#715
opened Mar 12, 2026 by
gyliu513
Loading…
Optimize TTFT: send first token immediately after prefill for streaming
#701
opened Mar 10, 2026 by
RishabhSaini
Loading…
refactor: improve config validation in precise-prefix-cache-scorer
#690
opened Mar 9, 2026 by
lisperz
Loading…
Align repository with llm-d repo template
do-not-merge/hold
Indicates that a PR should not merge because someone has issued a /hold command.
lifecycle/stale
#649
opened Feb 24, 2026 by
InfraWhisperer
Loading…
🌱 Standardize governance workflows, pre-commit, and remove legacy CI
lifecycle/rotten
#633
opened Feb 18, 2026 by
clubanderson
Loading…
3 tasks
add cookie-based affinity support
lgtm
"Looks good to me", indicates that a PR is ready to be merged.
#600
opened Feb 5, 2026 by
roytman
Loading…
ProTip!
Add no:assignee to see everything that’s not assigned.