env:
| Field | Value |
| --- | --- |
| machine IP | 10.0.0.31 |
| type | 910b |
| test date | 2026.6.8 |
| test fep | flagos-ai/community#4 |
| docker image | harbor.baai.ac.cn/flagtree/flagtree-ascend-910c-py311-torch2.6.0-cann8.5.0-ubuntu22.04-aarch64:202603 |
Reproduction commands:
1: set up docker
docker run -dit --name svt-ascend \
    --privileged --network=host --ipc=host --shm-size=64g \
    --device=/dev/davinci0 --device=/dev/davinci1 \
    --device=/dev/davinci2 --device=/dev/davinci3 \
    --device=/dev/davinci4 --device=/dev/davinci5 \
    --device=/dev/davinci6 --device=/dev/davinci7 \
    --device=/dev/davinci_manager --device=/dev/hisi_hdc \
    --volume /usr/local/sbin:/usr/local/sbin \
    --volume /usr/local/Ascend/driver:/usr/local/Ascend/driver \
    --volume /public-flash:/public-flash \
    --volume /etc/ascend_install.info:/etc/ascend_install.info \
    harbor.baai.ac.cn/flagtree/flagtree-ascend-910c-py311-torch2.6.0-cann8.5.0-ubuntu22.04-aarch64:202603 \
    bash
2:
cd /home
git clone https://github.com/flagos-ai/KernelGenBench.git
cd /home/KernelGenBench
3:
git checkout 0.1.0-rc2
pip install -r requirements/requirements_ascend.txt
pip install -e .
4:
root@bm-ctyun-wq-910b-64g-0-31:/home/KernelGenBench# python scripts/generate_kernel_and_verify.py \
    --op-name aten::add \
    --single-test \
    --server-type openai \
    --model-name your-model-name \
    --max-rounds 3
(Note: this run only checks whether the installation succeeded. The model name is left as the placeholder and Claude Code is not configured, so any error related to Claude Code configuration counts as a pass. Any other error needs its cause resolved.)
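Step 3 runs pip inside an image whose tag advertises torch 2.6.0, so it can be useful to record which torch and torch_npu builds are actually installed before running step 4. The sketch below reads both versions from package metadata without importing either package, then compares their major.minor parts. It assumes torch_npu releases are numbered after the torch release they are built against, which this report does not confirm; the helper names `installed_version`, `major_minor`, and `versions_match` are hypothetical.

```python
import re
from importlib import metadata
from typing import Optional, Tuple


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def major_minor(version: str) -> Tuple[int, int]:
    """Extract (major, minor) from a version string such as '2.6.0.post3'."""
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError(f"unrecognised version string: {version!r}")
    return int(match.group(1)), int(match.group(2))


def versions_match(torch_version: str, npu_version: str) -> bool:
    """Assumption: a compatible pair shares the same major.minor numbers."""
    return major_minor(torch_version) == major_minor(npu_version)


# Read metadata only; neither package is imported, so this works even when
# `import torch` itself fails.
torch_v = installed_version("torch")
npu_v = installed_version("torch_npu")
print(f"torch={torch_v} torch_npu={npu_v}")
if torch_v is not None and npu_v is not None:
    print("major.minor match:", versions_match(torch_v, npu_v))
```

Running it once before and once after step 3 would show whether the pip commands changed either package.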
error info:
Traceback (most recent call last):
  File "/usr/local/python3.11.13/lib/python3.11/site-packages/torch_npu/__init__.py", line 39, in <module>
    import torch_npu.npu
  File "/usr/local/python3.11.13/lib/python3.11/site-packages/torch_npu/npu/__init__.py", line 127, in <module>
    from torch_npu.utils import _should_print_warning
  File "/usr/local/python3.11.13/lib/python3.11/site-packages/torch_npu/utils/__init__.py", line 1, in <module>
    from torch_npu import _C
ImportError: /usr/local/python3.11.13/lib/python3.11/site-packages/torch_npu/lib/libtorch_npu.so: undefined symbol: _ZNK3c1010TensorImpl20is_contiguous_customENS_12MemoryFormatE

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/usr/local/python3.11.13/lib/python3.11/site-packages/torch/__init__.py", line 2833, in _import_device_backends
    entrypoint = backend_extension.load()
                 ^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/python3.11.13/lib/python3.11/importlib/metadata/__init__.py", line 202, in load
    module = import_module(match.group('module'))
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/python3.11.13/lib/python3.11/importlib/__init__.py", line 126, in import_module
    return _bootstrap._gcd_import(name[level:], package, level)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "<frozen importlib._bootstrap>", line 1204, in _gcd_import
  File "<frozen importlib._bootstrap>", line 1176, in _find_and_load
  File "<frozen importlib._bootstrap>", line 1147, in _find_and_load_unlocked
  File "<frozen importlib._bootstrap>", line 690, in _load_unlocked
  File "<frozen importlib._bootstrap_external>", line 940, in exec_module
  File "<frozen importlib._bootstrap>", line 241, in _call_with_frames_removed
  File "/usr/local/python3.11.13/lib/python3.11/site-packages/torch_npu/__init__.py", line 41, in <module>
    from torch_npu.utils._error_code import ErrCode, pta_error
  File "/usr/local/python3.11.13/lib/python3.11/site-packages/torch_npu/utils/__init__.py", line 1, in <module>
    from torch_npu import _C
ImportError: /usr/local/python3.11.13/lib/python3.11/site-packages/torch_npu/lib/libtorch_npu.so: undefined symbol: _ZNK3c1010TensorImpl20is_contiguous_customENS_12MemoryFormatE

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "/home/KernelGenBench/scripts/generate_kernel_and_verify.py", line 11, in <module>
    from kernelgenbench.dataset import TorchOpsLoader, APIInfo
  File "/home/KernelGenBench/src/kernelgenbench/__init__.py", line 2, in <module>
    import torch
  File "/usr/local/python3.11.13/lib/python3.11/site-packages/torch/__init__.py", line 2878, in <module>
    _import_device_backends()
  File "/usr/local/python3.11.13/lib/python3.11/site-packages/torch/__init__.py", line 2837, in _import_device_backends
    raise RuntimeError(
RuntimeError: Failed to load the backend extension: torch_npu. You can disable extension auto-loading with TORCH_DEVICE_BACKEND_AUTOLOAD=0.
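The symbol named in the ImportError is an Itanium C++ ABI mangled name. As a reading aid, the sketch below pulls out its length-prefixed identifiers. It is not a full demangler (a tool such as c++filt would give the complete signature), and the helper name `length_prefixed_names` is hypothetical.

```python
from typing import List

SYMBOL = "_ZNK3c1010TensorImpl20is_contiguous_customENS_12MemoryFormatE"


def length_prefixed_names(symbol: str) -> List[str]:
    """Collect the <length><identifier> components of a mangled name.

    Every other character (prefixes, qualifiers, substitutions such as
    'S_', and the 'E' terminators) is skipped, so this only recovers
    the plain identifiers, not the full signature.
    """
    names = []
    i = 0
    while i < len(symbol):
        if symbol[i].isdigit():
            # Read the decimal length, then take that many characters.
            j = i
            while j < len(symbol) and symbol[j].isdigit():
                j += 1
            length = int(symbol[i:j])
            names.append(symbol[j:j + length])
            i = j + length
        else:
            i += 1
    return names


print(length_prefixed_names(SYMBOL))
# ['c10', 'TensorImpl', 'is_contiguous_custom', 'MemoryFormat']
```

Read by hand, with `K` marking a const member function and `S_` referring back to `c10`, the symbol corresponds to `c10::TensorImpl::is_contiguous_custom(c10::MemoryFormat) const`. libtorch_npu.so expects the installed torch libraries to export this method, so a plausible but unconfirmed cause is that torch_npu was built against a different torch version than the one now installed.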