-
Notifications
You must be signed in to change notification settings - Fork 2.4k
Description
Checklist
- 1. I have searched related issues but cannot get the expected help.
- 2. The bug has not been fixed in the latest version.
- 3. Please note that if the bug-related issue you submitted lacks corresponding environment info and a minimal reproducible demo, it will be challenging for us to reproduce and resolve the issue, reducing the likelihood of receiving feedback.
- 4. If the issue you raised is not a bug but a question, please raise a discussion at https://github.com/sgl-project/sglang/discussions/new/choose Otherwise, it will be closed.
- 5. Please use English, otherwise it will be closed.
Describe the bug
When prompting Gemma3-27B-it it outputs and unrelated answer to the prompt
Reproduction
python3 -m venv .venv
source .venv/bin/activate.fish
pip install --upgrade pip
pip install uv
uv pip install "sglang[all]>=0.4.5" --find-links https://flashinfer.ai/whl/cu124/torch2.5/flashinfer-python
CUDA_VISIBLE_DEVICES=0,2,3,4 python -m sglang.launch_server --model-path /mnt/llms/models/gemma-3-27b-it --tp 4 --port 5000 --host 0.0.0.0 --context-length 25000
curl -X POST "http://192.168.10.115:5000/v1/chat/completions" \
-H "Content-Type: application/json" \
-d '{
"model": "/mnt/llms/models/gemma-3-27b-it",
"messages": [
{
"role": "user",
"content": "write python code to check the price of BTC"
}
],
"temperature": 0.8,
"max_tokens": 4096
}'
Environment
Ubuntu 22.04
Python3.10 using venv
CPU: Epyc 7402
Motherboard: Supermicro H12SSL-i
RAM: 512GB RAM
GPUs:
Device 0 = 3090
Device 1 = 5090
Device 2 = 3090
Device 3 = 3090
Device 4 = 3090
The 5090 is in slot pcie 1 (the top one) but is recognised as the device 1 instead of 0, not sure why. I have CUDA_DEVICE_ORDER=PCI_BUS_ID set.
I'm using CUDA_VISIBLE_DEVICES=0,2,3,4 so the 5090 is not used as I believe is not supported yet
nvcc --version
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2025 NVIDIA Corporation
Built on Fri_Feb_21_20:23:50_PST_2025
Cuda compilation tools, release 12.8, V12.8.93
Build cuda_12.8.r12.8/compiler.35583870_0

python -m pip list
Package Version
------------------------- -------------------
aiohappyeyeballs 2.6.1
aiohttp 3.11.16
aiosignal 1.3.2
airportsdata 20250224
annotated-types 0.7.0
anthropic 0.49.0
anyio 4.9.0
asttokens 3.0.0
async-timeout 5.0.1
attrs 25.3.0
certifi 2025.1.31
cffi 1.17.1
charset-normalizer 3.4.1
click 8.1.8
cloudpickle 3.1.1
compressed-tensors 0.9.3
cuda-bindings 12.8.0
cuda-python 12.8.0
datasets 3.5.0
decorator 5.2.1
decord 0.6.0
dill 0.3.8
diskcache 5.6.3
distro 1.9.0
einops 0.8.1
exceptiongroup 1.2.2
executing 2.2.0
fastapi 0.115.12
filelock 3.18.0
flashinfer-python 0.2.3+cu124torch2.5
frozenlist 1.5.0
fsspec 2024.12.0
h11 0.14.0
hf_transfer 0.1.9
httpcore 1.0.7
httpx 0.28.1
huggingface-hub 0.30.1
idna 3.10
importlib_metadata 8.6.1
interegular 0.3.3
ipython 8.34.0
jedi 0.19.2
Jinja2 3.1.6
jiter 0.9.0
jsonschema 4.23.0
jsonschema-specifications 2024.10.1
lark 1.2.2
litellm 1.65.4.post1
llguidance 0.7.13
MarkupSafe 3.0.2
matplotlib-inline 0.1.7
modelscope 1.24.1
mpmath 1.3.0
multidict 6.3.2
multiprocess 0.70.16
nanobind 2.6.1
nest-asyncio 1.6.0
networkx 3.4.2
ninja 1.11.1.4
numpy 2.2.4
nvidia-cublas-cu12 12.4.5.8
nvidia-cuda-cupti-cu12 12.4.127
nvidia-cuda-nvrtc-cu12 12.4.127
nvidia-cuda-runtime-cu12 12.4.127
nvidia-cudnn-cu12 9.1.0.70
nvidia-cufft-cu12 11.2.1.3
nvidia-curand-cu12 10.3.5.147
nvidia-cusolver-cu12 11.6.1.9
nvidia-cusparse-cu12 12.3.1.170
nvidia-ml-py 12.570.86
nvidia-nccl-cu12 2.21.5
nvidia-nvjitlink-cu12 12.4.127
nvidia-nvtx-cu12 12.4.127
openai 1.70.0
orjson 3.10.16
outlines 0.1.11
outlines_core 0.1.26
packaging 24.2
pandas 2.2.3
parso 0.8.4
partial-json-parser 0.2.1.1.post5
pexpect 4.9.0
pillow 11.1.0
pip 22.0.2
prometheus_client 0.21.1
prompt_toolkit 3.0.50
propcache 0.3.1
psutil 7.0.0
ptyprocess 0.7.0
pure_eval 0.2.3
pyarrow 19.0.1
pycountry 24.6.1
pycparser 2.22
pydantic 2.11.2
pydantic_core 2.33.1
Pygments 2.19.1
pynvml 12.0.0
python-dateutil 2.9.0.post0
python-dotenv 1.1.0
python-multipart 0.0.20
pytz 2025.2
PyYAML 6.0.2
pyzmq 26.4.0
referencing 0.36.2
regex 2024.11.6
requests 2.32.3
rpds-py 0.24.0
safetensors 0.5.3
sentencepiece 0.2.0
setproctitle 1.3.5
setuptools 59.6.0
sgl-kernel 0.0.8
sglang 0.4.5
six 1.17.0
sniffio 1.3.1
soundfile 0.13.1
stack-data 0.6.3
starlette 0.46.1
sympy 1.13.1
tiktoken 0.9.0
tokenizers 0.21.1
torch 2.5.1
torchao 0.9.0
tqdm 4.67.1
traitlets 5.14.3
transformers 4.51.0
triton 3.1.0
typing_extensions 4.13.1
typing-inspection 0.4.0
tzdata 2025.2
urllib3 2.3.0
uv 0.6.12
uvicorn 0.34.0
uvloop 0.21.0
wcwidth 0.2.13
xgrammar 0.1.17
xxhash 3.5.0
yarl 1.19.0
zipp 3.21.0
Output:
curl -X POST "http://192.168.10.115:5000/v1/chat/completions" \
-H "Content-Type: application/json" \
-d '{
"model": "/mnt/llms/models/gemma-3-27b-it",
"messages": [
{
"role": "user",
"content": "write python code to check the price of BTC"
}
],
"temperature": 0.8,
"max_tokens": 4096
}'
{"id":"5b109e7c2cd2440d875a635ee6e9ab99","object":"chat.completion","created":1744017128,"model":"/mnt/llms/models/gemma-3-27b-it","choices":[{"index":0,"message":{"role":"assistant","content":"Tali Capung (Dragonfly String) is a type of traditional Indonesian handicraft made from woven fibers, creating a delicate and intricate string. It's known for its vibrant colors and decorative designs, and it's often used in a variety of applications. Here's a breakdown of what you need to know about Tali Capung:\n\n**1. Materials and Production:**\n\n* **Fiber Source:** Traditionally, Tali Capung is made from the fibers of the *Ronca* plant (also known as *Cabuk*), a type of shrub common in Indonesia, particularly in Java. Newer versions may utilize synthetic fibers, but the authentic kind focuses on the natural material.\n* **Processing:** The Ronca plant stalks are harvested, the fibers are extracted, dyed with natural or synthetic dyes to achieve vibrant colors, and then meticulously woven together. This weaving is done by hand, making each piece unique. The fibers are very thin and require skill to work with.\n* **\"Capung\" (Dragonfly) Name:** The name \"Tali Capung\" comes from the resemblance of the finished string to the delicate wings of a dragonfly. The intricate weaving creates a light, airy texture.\n\n**2. Uses & Applications:**\n\n* **Decoration:** This is the most common use. Tali Capung is used to decorate:\n * **Batik:** It's sewn onto Batik clothing, bags, and other textiles as an embellishment. It's very popular to add edging or motifs.\n * **Home Decor:** Used for trim on cushions, curtains, tablecloths, wall hangings, and other decorative items.\n * **Gift Wrapping:** Tali Capung adds a beautiful, handcrafted touch to gift packaging.\n * **Accessories:** Used in jewelry making (bracelets, necklaces, earrings), keychains, and embellishments for hats and clothing.\n* **Traditional Ceremonies:** In some areas, Tali Capung is used in traditional ceremonies and rituals as a decorative element.\n* **Fashion:** Designers incorporate Tali Capung into clothing designs, adding a unique Indonesian element to their creations.\n* **Craft Projects:** A popular material for crafting and DIY projects.\n\n**3. Characteristics:**\n\n* **Delicate & Lightweight:** Tali Capung is very thin and light.\n* **Vibrant Colors:** It comes in a wide range of bright and saturated colors.\n* **Intricate Designs:** The weaving creates a variety of patterns and textures.\n* **Handmade:** Being handmade, variations in color and pattern are common and contribute to its unique charm.\n* **Texture:** It has a slightly rough, natural texture.\n* **Durability:** While delicate, when cared for properly, it can be quite durable.\n\n\n\n**4. Where to Find It:**\n\n* **Indonesia:** You will find Tali Capung in traditional markets (pasar tradisional), craft shops, and Batik shops throughout Indonesia, particularly in Java.\n* **Online Marketplaces:** Etsy, eBay, and specialized Indonesian handicraft websites sell Tali Capung.\n* **Indonesian Shops Abroad:** Some Indonesian stores located in other countries may carry it.\n* **Craft Fairs:** Look for vendors specializing in Indonesian crafts at craft fairs.\n\n\n\n**5. Care Instructions:**\n\n* **Hand Wash:** Gentle hand washing is recommended. Avoid harsh detergents or bleach.\n* **Air Dry:** Lay flat to air dry. Avoid direct sunlight, which can fade the colors.\n* **Ironing:** Iron on a low setting, if necessary, with a pressing cloth.\n* **Storage:** Store in a cool, dry place to prevent mildew or damage.\n\n\n\nTali Capung is a beautiful example of Indonesian artistry and a versatile material that adds a touch of culture and handcrafted charm to any project or item. Its delicate beauty and vibrant colors make it a popular choice for those seeking unique and authentic Indonesian crafts.","reasoning_content":null,"tool_calls":null},"logprobs":null,"finish_reason":"stop","matched_stop":106}],"usage":{"prompt_tokens":1,"total_tokens":857,"completion_tokens":856,"prompt_tokens_details":null}}
