Logo
Explore Help
Register Sign In
20231088/vllm
1
0
Fork 0
You've already forked vllm
Code Issues Pull Requests Actions 1 Packages Projects Releases Wiki Activity
vllm/tests/compile
History
Luka Govedič e1744502c2
[FP8] Refactor apply_fp8_linear and apply_fp8_linear_generic into an object (#14390)
Signed-off-by: luka <luka@neuralmagic.com>
2025-03-07 05:20:16 +00:00
..
piecewise
Update deprecated Python 3.8 typing (#13971)
2025-03-02 17:34:51 -08:00
__init__.py
[torch.compile] register allreduce operations as custom ops (#8526)
2024-09-16 22:57:57 -07:00
backend.py
[torch.compile] Fix RMSNorm + quant fusion in the non-cutlass-fp8 case, rename RedundantReshapesPass to NoopEliminationPass (#10902)
2025-02-28 16:20:11 -07:00
test_basic_correctness.py
Update deprecated Python 3.8 typing (#13971)
2025-03-02 17:34:51 -08:00
test_full_graph.py
[Misc] Add SPDX-License-Identifier headers to python source files (#12628)
2025-02-02 11:58:18 -08:00
test_functionalization.py
[torch.compile] Fix RMSNorm + quant fusion in the non-cutlass-fp8 case, rename RedundantReshapesPass to NoopEliminationPass (#10902)
2025-02-28 16:20:11 -07:00
test_fusion.py
[FP8] Refactor apply_fp8_linear and apply_fp8_linear_generic into an object (#14390)
2025-03-07 05:20:16 +00:00
test_pass_manager.py
[Misc] Add SPDX-License-Identifier headers to python source files (#12628)
2025-02-02 11:58:18 -08:00
test_wrapper.py
[Misc] Add SPDX-License-Identifier headers to python source files (#12628)
2025-02-02 11:58:18 -08:00
utils.py
Consolidate Llama model usage in tests (#13094)
2025-02-13 22:18:03 -08:00
Powered by Gitea Version: 23.0.0 Page: 251ms Template: 4ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API