https://github.com/vllm-project/vllm
https://github.com/allenai/OLMo
https://github.com/hiyouga/LLaMA-Factory
https://github.com/QwenLM/Qwen2.5
https://github.com/QwenLM/Qwen2
https://github.com/QwenLM/Qwen
https://github.com/modelscope/modelscope-classroom
https://github.com/modelscope/swift
https://github.com/opendatalab/MinerU
https://github.com/rasbt/LLMs-from-scratch
https://github.com/HazyResearch/flash-attention
https://github.com/zhanshijinwat/Steel-LLM
https://github.com/NVIDIA/TensorRT-LLM
https://github.com/UKPLab/sentence-transformers
https://github.com/jerryjliu/llama_index
https://github.com/huggingface/text-generation-inference TGI
https://github.com/NVIDIA/TransformerEngine
https://github.com/NVIDIA/apex
https://github.com/NVIDIA/NeMo