Skip to content

Navigation Menu

Explore
By size
By industry
By use case
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

ROCm / vllm Public

forked from vllm-project/vllm

Notifications You must be signed in to change notification settings
Fork 21
Star 36

Code
Pull requests 19
Actions
Projects
Security
Insights

Additional navigation options

Code
Pull requests
Actions
Projects
Security
Insights

Pull requests: ROCm/vllm

Labels 10 Milestones 0

Labels 10 Milestones 0

New pull request New

19 Open 181 Closed

19 Open 181 Closed

Author

Filter by author

Loading

Label

Filter by label

Loading

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Loading

Milestones

Filter by milestone

Loading

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Loading

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Fixing P3L incompatibility with cython.

#200 opened Sep 20, 2024 by Alexei-V-Ivanov-AMD

Loading…

1

Update run-amd-test.sh

#192 opened Sep 17, 2024 by Alexei-V-Ivanov-AMD

Loading…

1

multi-gpu fused_moe tuning support

#143 opened Aug 16, 2024 by divakar-amd

Loading…

4

[DO NOT MERGE] Vinayak/moe final hashem

#127 opened Aug 11, 2024 by carlushuang

Loading…

[DO NOT MERGE] patch wvSpltK_fused_moe from https://github.com/amd-hhashemi/vllm/tre…

#126 opened Aug 10, 2024 by carlushuang

Loading…

Add max-batch-size to benchmark_throughput.py

#122 opened Aug 7, 2024 by dllehr-amd

Loading…

Add truncate to all files after json dump

#117 opened Aug 2, 2024 by jpvillam-amd

Loading…

[Misc] Use main triton branch

#115 opened Aug 1, 2024 by binarman

Loading…

1

Adding SHM broadcast to ROCm/vllm

#113 opened Jul 31, 2024 by Lzy17

Loading…

optimizations for process output step

#104 opened Jul 25, 2024 by sanyalington

Loading…

Update QueueLLM

#97 opened Jul 22, 2024 by gyulaz-htec

Loading…

Add benchmark_latency_batched.py

#96 opened Jul 22, 2024 by dllehr-amd

Loading…

2

New LLM for MLPerf Server scenario serving

#94 opened Jul 19, 2024 by gyulaz-htec

Loading…

Update LLM and AsyncLLM to expose more functionality

#91 opened Jul 18, 2024 by gyulaz-htec

Loading…

Update LLM and AsyncLLM to expose more functionality

#90 opened Jul 18, 2024 by gyulaz-htec

Loading…

Add VLLM_SCHED_PREFILL_KVC_FREEPCT

#89 opened Jul 18, 2024 by sanyalington

Loading…

1

Torchrun api server

#71 opened Jun 27, 2024 by gshtras

Loading…

Use tgemm for mi300 only

#48 opened Jun 13, 2024 by ppalaniappan-amd

Loading…

1

Update on naive_attn module

#21 opened May 28, 2024 by seungrokj

Loading…

ProTip! What’s not been updated in a month: updated:<2024-08-19.

Footer

© 2024 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.