-
Notifications
You must be signed in to change notification settings - Fork 739
Pull requests: PaddlePaddle/FastDeploy
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[BugFix] Fix real token exceeding max_batched_tokens limit
#7438
opened Apr 16, 2026 by
freeliuzc
Collaborator
Loading…
5 tasks done
[Cherry-Pick][Feature] support blackwell gemm in ll
#7435
opened Apr 16, 2026 by
lizhenyun01
Collaborator
Loading…
5 tasks
[BugFix][Cherry-Pick] Fix race condition in async RL control request(#7430)
#7433
opened Apr 16, 2026 by
jackyYang6
Contributor
Loading…
2 of 5 tasks
[Feature] consolidate cache, worker_process, and paddle logs into unified files
#7432
opened Apr 16, 2026 by
xyxinyang
Collaborator
Loading…
5 tasks done
[XPU] get_infer_param use inplace copy, remove block_tables abundant d2h copy
XPU
#7431
opened Apr 16, 2026 by
RuohengMa
Contributor
Loading…
5 tasks
[Bugfix][RL] fix control request timeout in async update weights pipe…
#7430
opened Apr 16, 2026 by
jackyYang6
Contributor
Loading…
1 of 5 tasks
[Feature] Support MOE Cutlass backend for latent MOE
#7428
opened Apr 16, 2026 by
chang-wenbin
Collaborator
Loading…
5 tasks
[FDConfig] Add is_bidirectional property to ModelConfig for EB5 models
#7427
opened Apr 16, 2026 by
kevincheng2
Collaborator
Loading…
2 of 4 tasks
⚡ Bolt: Optimize single element list appends
#7423
opened Apr 15, 2026 by
google-labs-jules
bot
Loading…
[DataProcessor] Remove legacy vl_processor directories and deprecated files
#7422
opened Apr 15, 2026 by
luukunn
Collaborator
Loading…
5 tasks
[KVCache] Mooncake storage register local buffer by chunk
cherry-pick: release/2.6
#7416
opened Apr 15, 2026 by
juncaipeng
Collaborator
Loading…
5 tasks
[XPU] async gpu speculate_set_stop_value_multi_seqs
#7409
opened Apr 15, 2026 by
cmcamdy
Collaborator
Loading…
5 tasks
Optimize scheduler for chunk prefill
#7408
opened Apr 15, 2026 by
rainyfly
Collaborator
Loading…
5 tasks
[Speculative Decoding] [BugFix]fix shape mismatch while cuda graph closed
#7406
opened Apr 15, 2026 by
huicongyao
Contributor
Loading…
3 of 5 tasks
[CI] Add pytest failure log collection and persistence
#7405
opened Apr 14, 2026 by
EmmonsCurse
Collaborator
Loading…
5 tasks done
[Models] support MLA gate attention
#7403
opened Apr 14, 2026 by
chang-wenbin
Collaborator
•
Draft
5 tasks
[Scheduler] Only decode req can be preempted
#7396
opened Apr 14, 2026 by
juncaipeng
Collaborator
Loading…
5 tasks
[Optimize] fetch requests from local scheduler continous
#7392
opened Apr 14, 2026 by
rainyfly
Collaborator
Loading…
5 tasks
[Optimization] Preempt decoding requests to prioritize chunked prefill requests
#7391
opened Apr 14, 2026 by
liyonghua0910
Collaborator
Loading…
5 tasks
[RL][Feature] R3 Support CPU PrefixCache
#7390
opened Apr 14, 2026 by
gongshaotian
Collaborator
Loading…
5 tasks
[Optim] Fetch requests from local scheduler continously
#7389
opened Apr 14, 2026 by
rainyfly
Collaborator
Loading…
5 tasks
[Benchmark]Support metrics for stream and no-stream response && Return usage info according to env params
#7388
opened Apr 14, 2026 by
juncaipeng
Collaborator
Loading…
5 tasks
[Others] print evictable blocks in console log
cherry-pick: release/2.5
cherry-pick: release/2.6
#7384
opened Apr 14, 2026 by
liyonghua0910
Collaborator
Loading…
5 tasks
Previous Next
ProTip!
Adding no:label will show everything without a label.